-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Comparing changes
Open a pull request
base repository: NVIDIA/TensorRT-LLM
base: v1.1.0rc2.post1
head repository: NVIDIA/TensorRT-LLM
compare: v1.1.0rc2.post2
- 16 commits
- 48 files changed
- 16 contributors
Commits on Sep 5, 2025
-
[None] [test] Add MNNVL AlltoAll tests to pre-merge (#7465)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com> Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com> Co-authored-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 1455074 - Browse repository at this point
Copy the full SHA 1455074View commit details -
[TRTLLM-7292][feat] Support multi-threaded tokenizers for trtllm-serve (
#7515) Signed-off-by: Yilin Fan <206948969+nv-yilinf@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 6a5806b - Browse repository at this point
Copy the full SHA 6a5806bView commit details
Commits on Sep 6, 2025
-
[None][fix] trtllm-serve yaml loading (#7551)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 3b024cb - Browse repository at this point
Copy the full SHA 3b024cbView commit details -
[None][ci] Increase the number of retries in docker image generation (#…
…7557) Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 5cf4f19 - Browse repository at this point
Copy the full SHA 5cf4f19View commit details -
[None][infra] update nspect version (#7552)
Signed-off-by: Yiteng Niu <6831097+niukuo@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for fcdc55b - Browse repository at this point
Copy the full SHA fcdc55bView commit details -
[None][ci] Improve SSH connection stability (#7567)
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 2b02dd7 - Browse repository at this point
Copy the full SHA 2b02dd7View commit details
Commits on Sep 7, 2025
-
[None][chore] Bump version to 1.1.0rc2.post2 (#7582)
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 72dd6b1 - Browse repository at this point
Copy the full SHA 72dd6b1View commit details -
[None][ci] Block some nodes to avoid unstable network access (#7593)
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 2d5f0e1 - Browse repository at this point
Copy the full SHA 2d5f0e1View commit details
Commits on Sep 8, 2025
-
[https://nvbugs/5498967][fix] Downgrade NCCL (#7556)
Signed-off-by: yizhang-nv <187001205+yizhang-nv@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 4658b77 - Browse repository at this point
Copy the full SHA 4658b77View commit details -
[TRTLLM-6994][feat] FP8 Context MLA integration. (#7581)
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 9938f4f - Browse repository at this point
Copy the full SHA 9938f4fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3299435 - Browse repository at this point
Copy the full SHA 3299435View commit details -
[None][ci] Fix a typo in the Slurm command
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for bc90a34 - Browse repository at this point
Copy the full SHA bc90a34View commit details -
[None][chore] Make low_precision_combine as a llm arg (#7598)
Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 75745c7 - Browse repository at this point
Copy the full SHA 75745c7View commit details
Commits on Sep 9, 2025
-
[None][fix] Update deployment guide and cherry-pick CI test fix from …
…main (#7623) Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com> Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com> Co-authored-by: bhsueh_NV <11360707+byshiue@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for d60dad6 - Browse repository at this point
Copy the full SHA d60dad6View commit details -
[None][feat] Cherry-pick Responses API and multiple postprocess worke…
…rs support for chat harmony (#7600) Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com> Co-authored-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com> Co-authored-by: Tao Li @ NVIDIA <tali@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for ac0df0a - Browse repository at this point
Copy the full SHA ac0df0aView commit details -
[None][chore] Fix kernel launch param and add TRTLLM MoE backend test (…
…#7524) Signed-off-by: Pengbo Wang <221450789+pengbowang-nv@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for ef0d06d - Browse repository at this point
Copy the full SHA ef0d06dView commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v1.1.0rc2.post1...v1.1.0rc2.post2