Add TRT-RTX EP support, keep NvTensorRtRtx as user facing name, and force QDQ #1791
base: main
Conversation
Updated PR to keep the user-facing name as NvTensorRtRtx.
src/python/py/models/builder.py
Outdated
if __name__ == '__main__':
    args = get_args()
    if args.execution_provider == "NvTensorRtRtx":
We should handle this logic inside create_model so that users who import create_model can leverage this (e.g. Olive).
I am forcing use_qdq in check_extra_options(), which is called before create_model:
# force use_qdq for trt-rtx
if args.execution_provider == "trt-rtx":
    kv_pairs["use_qdq"] = True
I had to switch the condition to if args.execution_provider == "NvTensorRtRtx":, which should be okay, I guess?
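For reference, a rough sketch of the combined result, assuming check_extra_options builds the kv_pairs dict from the parsed CLI args (the exact signature here is an assumption for illustration, not the actual builder.py code):

# Sketch only: force the QDQ path for the TRT-RTX EP, keyed on the
# user-facing name, before create_model runs.
def check_extra_options(kv_pairs, args):
    # force use_qdq for TRT-RTX (user-facing name "NvTensorRtRtx")
    if args.execution_provider == "NvTensorRtRtx":
        kv_pairs["use_qdq"] = True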
This is a separate issue from what you are mentioning. This check to change the EP name from NvTensorRtRtx to trt-rtx will not happen for users who import create_model. Here is an example which will fail with your current PR changes:
from onnxruntime_genai.models.builder import create_model
input_dir = ""
output_dir = "./phi-4-mini"
cache_dir = "./cache_dir"
model_name = "microsoft/Phi-4-mini-instruct"
precision = "int4"
ep = "NvTensorRtRtx"
extra_options = {
    "use_qdq": "true"
}
create_model(model_name, input_dir, output_dir, precision, ep, cache_dir, **extra_options)
In the above code, create_model will not replace NvTensorRtRtx with trt-rtx. This will cause the incorrect ONNX model to get created. After moving the replacement of the EP name inside create_model, the above code should work.
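A minimal sketch of that suggestion, assuming "trt-rtx" remains the internal EP name used by the builder; the placement and comments are illustrative, not the actual builder.py implementation:

def create_model(model_name, input_path, output_dir, precision, execution_provider, cache_dir, **extra_options):
    # Map the user-facing EP name to the internal one so that importers
    # (e.g. Olive) get the same behavior as CLI users, and force the QDQ
    # path since TRT-RTX has no MatMulNBits support.
    if execution_provider == "NvTensorRtRtx":
        execution_provider = "trt-rtx"
        extra_options["use_qdq"] = True
    # ... existing model-building logic continues here ...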
src/python/py/models/builder.py
Outdated
# force use_qdq for trt-rtx
if args.execution_provider == "trt-rtx":
    kv_pairs["use_qdq"] = True
In the original implementation of this before it was removed, there was a comment which said that QDQ was used because opset 21 was desired. The default opset is now 21, but it seems that creating a QDQ model is the true intention.
I also remember there was an issue in Olive which originated from forcing use_qdq here. Could there be a future scenario where creating a TRT-RTX model without QDQ is desired?
We don't support MatMulNBits in TRT-RTX; that's why QDQ is always required. Without it, the INT4 path will break.
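For context, a hypothetical validation helper (not part of this PR) showing why the QDQ form is the only workable INT4 path on this EP; the function name and message are assumptions for illustration:

def validate_int4_options(execution_provider: str, use_qdq: bool) -> None:
    # TRT-RTX cannot run the MatMulNBits contrib op, so a non-QDQ INT4 model
    # would contain ops the EP cannot execute.
    if execution_provider == "NvTensorRtRtx" and not use_qdq:
        raise ValueError("INT4 models for NvTensorRtRtx require use_qdq=True "
                         "(MatMulNBits is not supported by TRT-RTX).")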
Will MatMulNBits be supported in the future?
No plan for that, Kunal.
Simplified the changes in the latest commit.
@anujj we will also have to revert this PR in Olive: microsoft/Olive#2169
Raised a PR for that: microsoft/Olive#2182
As per the discussion with the GenAI team, we are moving to the NvTensorRtRtx user-facing name. GenAI PR: microsoft/onnxruntime-genai#1791
raise ValueError("Both 'exclude_lm_head' and 'include_hidden_states' cannot be used together. Please use only one of them at once.")
@torch.no_grad
def create_model(model_name, input_path, output_dir, precision, execution_provider, cache_dir, **extra_options):
Ensure the Q/DQ quantization path is enabled by default.
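Assuming the EP-name handling and the QDQ default both end up inside create_model, an importer like Olive could then call it without passing use_qdq explicitly; the paths and model name below are the ones from the example earlier in this thread:

from onnxruntime_genai.models.builder import create_model

create_model(
    "microsoft/Phi-4-mini-instruct",  # model_name
    "",                               # input_path (empty, as in the earlier example)
    "./phi-4-mini",                   # output_dir
    "int4",                           # precision
    "NvTensorRtRtx",                  # execution_provider (user-facing name)
    "./cache_dir",                    # cache_dir; use_qdq assumed to be forced internally
)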