Skip to content

[Klaud Cold] Update dsr1-fp4-b200-trt (+mtp) TRT-LLM image to v1.3.0rc14#1489

Open
functionstackx wants to merge 2 commits into
mainfrom
update-dsr1-fp4-b200-trt-v1.3.0rc14
Open

[Klaud Cold] Update dsr1-fp4-b200-trt (+mtp) TRT-LLM image to v1.3.0rc14#1489
functionstackx wants to merge 2 commits into
mainfrom
update-dsr1-fp4-b200-trt-v1.3.0rc14

Conversation

@functionstackx
Copy link
Copy Markdown
Collaborator

Summary

Update TensorRT-LLM image (off: v1.2.0rc6.post2 104d / mtp: v1.2.0rc6.post3 101d) to v1.3.0rc14 (latest pre-release)

  • dsr1-fp4-b200-trt: nvcr.io#nvidia/tensorrt-llm/release:1.2.0rc6.post2nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc14
  • dsr1-fp4-b200-trt-mtp: nvcr.io#nvidia/tensorrt-llm/release:1.2.0rc6.post3nvcr.io#nvidia/tensorrt-llm/release:1.3.0rc14

Test plan

  • full-sweep-enabled sweep passes.

🤖 Generated with Claude Code

Update TensorRT-LLM image (off: v1.2.0rc6.post2 104d / mtp: v1.2.0rc6.post3 101d) to v1.3.0rc14 (latest pre-release)

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@github-actions
Copy link
Copy Markdown
Contributor

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

2 similar comments
@github-actions
Copy link
Copy Markdown
Contributor

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

@github-actions
Copy link
Copy Markdown
Contributor

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

@github-actions
Copy link
Copy Markdown
Contributor

Copy link
Copy Markdown
Contributor

@claude claude Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM — mechanical image version bump with matching changelog entry.

Extended reasoning...

Overview

This PR bumps the TensorRT-LLM image for two recipes (dsr1-fp4-b200-trt and dsr1-fp4-b200-trt-mtp) from v1.2.0rc6.post2/post3 to v1.3.0rc14 in .github/configs/nvidia-master.yaml, plus a corresponding perf-changelog.yaml entry referencing this PR. Total changeset is 4 line edits.

Security risks

None. These are container image tags in a benchmark configuration file pointing to the official NVIDIA NGC registry (nvcr.io#nvidia/tensorrt-llm/release). No auth, crypto, permission, or user-input handling is touched.

Level of scrutiny

Low. This is a routine [Klaud Cold] image version bump — the same pattern as the immediately preceding merged commits (e.g., #1426, #1395, #1448). The full-sweep-enabled label indicates the performance sweep will validate the new image end-to-end, which is the meaningful gate here, not code review.

Other factors

The bug hunting system reported no findings. Both image references are updated consistently to the same target tag, and the changelog entry accurately describes the change and links back to this PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

1 participant