KEMBAR78
Comparing v1.1.0rc2.post1...v1.1.0rc2.post2 · NVIDIA/TensorRT-LLM · GitHub
Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: NVIDIA/TensorRT-LLM
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: v1.1.0rc2.post1
Choose a base ref
...
head repository: NVIDIA/TensorRT-LLM
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: v1.1.0rc2.post2
Choose a head ref
  • 16 commits
  • 48 files changed
  • 16 contributors

Commits on Sep 5, 2025

  1. [None] [test] Add MNNVL AlltoAll tests to pre-merge (#7465)

    Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
    Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
    Co-authored-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
    kaiyux and zongfeijing authored Sep 5, 2025
    Configuration menu
    Copy the full SHA
    1455074 View commit details
    Browse the repository at this point in the history
  2. [TRTLLM-7292][feat] Support multi-threaded tokenizers for trtllm-serve (

    #7515)
    
    Signed-off-by: Yilin Fan <206948969+nv-yilinf@users.noreply.github.com>
    nv-yilinf authored Sep 5, 2025
    Configuration menu
    Copy the full SHA
    6a5806b View commit details
    Browse the repository at this point in the history

Commits on Sep 6, 2025

  1. [None][fix] trtllm-serve yaml loading (#7551)

    Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
    Superjomn authored Sep 6, 2025
    Configuration menu
    Copy the full SHA
    3b024cb View commit details
    Browse the repository at this point in the history
  2. [None][ci] Increase the number of retries in docker image generation (#…

    …7557)
    
    Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
    chzblych committed Sep 6, 2025
    Configuration menu
    Copy the full SHA
    5cf4f19 View commit details
    Browse the repository at this point in the history
  3. [None][infra] update nspect version (#7552)

    Signed-off-by: Yiteng Niu <6831097+niukuo@users.noreply.github.com>
    niukuo authored and chzblych committed Sep 6, 2025
    Configuration menu
    Copy the full SHA
    fcdc55b View commit details
    Browse the repository at this point in the history
  4. [None][ci] Improve SSH connection stability (#7567)

    Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
    chzblych committed Sep 6, 2025
    Configuration menu
    Copy the full SHA
    2b02dd7 View commit details
    Browse the repository at this point in the history

Commits on Sep 7, 2025

  1. [None][chore] Bump version to 1.1.0rc2.post2 (#7582)

    Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
    yiqingy0 authored Sep 7, 2025
    Configuration menu
    Copy the full SHA
    72dd6b1 View commit details
    Browse the repository at this point in the history
  2. [None][ci] Block some nodes to avoid unstable network access (#7593)

    Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
    chzblych committed Sep 7, 2025
    Configuration menu
    Copy the full SHA
    2d5f0e1 View commit details
    Browse the repository at this point in the history

Commits on Sep 8, 2025

  1. [https://nvbugs/5498967][fix] Downgrade NCCL (#7556)

    Signed-off-by: yizhang-nv <187001205+yizhang-nv@users.noreply.github.com>
    yizhang-nv authored Sep 8, 2025
    Configuration menu
    Copy the full SHA
    4658b77 View commit details
    Browse the repository at this point in the history
  2. [TRTLLM-6994][feat] FP8 Context MLA integration. (#7581)

    Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
    yuxianq authored Sep 8, 2025
    Configuration menu
    Copy the full SHA
    9938f4f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    3299435 View commit details
    Browse the repository at this point in the history
  4. [None][ci] Fix a typo in the Slurm command

    Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
    chzblych committed Sep 8, 2025
    Configuration menu
    Copy the full SHA
    bc90a34 View commit details
    Browse the repository at this point in the history
  5. [None][chore] Make low_precision_combine as a llm arg (#7598)

    Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
    zongfeijing authored Sep 8, 2025
    Configuration menu
    Copy the full SHA
    75745c7 View commit details
    Browse the repository at this point in the history

Commits on Sep 9, 2025

  1. [None][fix] Update deployment guide and cherry-pick CI test fix from …

    …main (#7623)
    
    Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>
    Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
    Co-authored-by: bhsueh_NV <11360707+byshiue@users.noreply.github.com>
    dongfengy and byshiue authored Sep 9, 2025
    Configuration menu
    Copy the full SHA
    d60dad6 View commit details
    Browse the repository at this point in the history
  2. [None][feat] Cherry-pick Responses API and multiple postprocess worke…

    …rs support for chat harmony (#7600)
    
    Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
    Co-authored-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
    Co-authored-by: Tao Li @ NVIDIA <tali@nvidia.com>
    3 people authored Sep 9, 2025
    Configuration menu
    Copy the full SHA
    ac0df0a View commit details
    Browse the repository at this point in the history
  3. [None][chore] Fix kernel launch param and add TRTLLM MoE backend test (

    …#7524)
    
    Signed-off-by: Pengbo Wang <221450789+pengbowang-nv@users.noreply.github.com>
    pengbowang-nv authored Sep 9, 2025
    Configuration menu
    Copy the full SHA
    ef0d06d View commit details
    Browse the repository at this point in the history
Loading