-
Notifications
You must be signed in to change notification settings - Fork 1.8k
[Fix][Chore][Qwen3] fix bug of using fp4 on sm120 #6065
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
/bot run |
|
PR_Github #12092 [ run ] triggered by Bot |
|
PR_Github #12092 [ run ] completed with state |
|
/bot run |
|
PR_Github #12120 [ run ] triggered by Bot |
|
PR_Github #12120 [ run ] completed with state |
WalkthroughA function determining NVFP4 output support in attention operations was updated to explicitly exclude SM version 120, in addition to versions below 100. The code comment was revised to reflect that SM 120 does not support NVFP4 output. The Qwen3 model name was corrected in documentation. A test parameter adjusting GPU memory fraction was changed, and two skip entries for specific tests were removed from the waiver list. No changes were made to function signatures or public interfaces. Changes
Suggested reviewers
Poem
📜 Recent review detailsConfiguration used: .coderabbit.yaml 📒 Files selected for processing (1)
🚧 Files skipped from review as they are similar to previous changes (1)
🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
SupportNeed help? Create a ticket on our support page for assistance with any issues or questions. Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
Documentation and Community
|
|
/bot run |
|
PR_Github #12231 [ run ] triggered by Bot |
|
PR_Github #12231 [ run ] completed with state |
|
/bot run |
|
PR_Github #12245 [ run ] triggered by Bot |
|
PR_Github #12244 [ ] completed with state |
|
PR_Github #12245 [ run ] completed with state |
|
/bot run |
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
|
/bot run |
|
PR_Github #12349 [ run ] triggered by Bot |
|
PR_Github #12350 [ run ] triggered by Bot |
|
PR_Github #12349 [ run ] completed with state |
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
|
PR_Github #12350 [ run ] completed with state |
…wen3_235b tests Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
|
/bot run --disable-fail-fast |
|
/bot run --disable-fail-fast |
|
PR_Github #12367 [ run ] triggered by Bot |
|
PR_Github #12367 [ run ] completed with state |
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com> Signed-off-by: Shreyas Misra <shreyasm@nvidia.com>
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com> Signed-off-by: Ransiki Zhang <ransikiz@nvidia.com>
Summary by CodeRabbit
Bug Fixes
Documentation
Tests