Expend compatibility check for all quantized MoE models #13465

JustinTong0323 · 2025-11-18T01:05:29Z

Motivation

For user friendliness, the compatibility check should run before the actual error occurs, rather than letting an incompatible configuration fail later. The error message should also be clear and explain what went wrong.

This update introduces a new method, check_quantized_moe_compatibility, to validate the configuration of quantized Mixture of Experts (MoE) models. It ensures that the tensor parallel size and intermediate size adhere to the required divisibility conditions, improving error handling and model configuration validation.
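For illustration, here is a minimal sketch of what such a divisibility check could look like. It is not the code merged in this PR: the standalone function signature, the parameter names (`tp_size`, `moe_intermediate_size`, `weight_block_size_n`), and the block-size condition are assumptions made for the example. Only the method name and the idea of validating tensor parallel size against intermediate size come from the description above.

```python
from typing import Optional


def check_quantized_moe_compatibility(
    tp_size: int,
    moe_intermediate_size: int,
    weight_block_size_n: Optional[int] = None,
) -> None:
    """Fail early, with a readable message, if the MoE shapes cannot be sharded.

    Hypothetical signature: all parameter names are assumptions for this sketch.
    """
    if tp_size <= 0:
        raise ValueError(f"tp_size must be positive, got {tp_size}.")

    # Each tensor-parallel rank must receive an equal slice of the
    # intermediate dimension.
    if moe_intermediate_size % tp_size != 0:
        raise ValueError(
            f"moe_intermediate_size ({moe_intermediate_size}) is not divisible "
            f"by tp_size ({tp_size}). Choose a tp_size that divides it evenly."
        )

    # Assumed extra condition for block-quantized weights: the per-rank slice
    # must be a whole number of quantization blocks.
    if weight_block_size_n is not None:
        per_rank = moe_intermediate_size // tp_size
        if per_rank % weight_block_size_n != 0:
            raise ValueError(
                f"Per-rank intermediate size ({per_rank}) is not divisible by "
                f"the quantization block size ({weight_block_size_n}). "
                f"Try a smaller tp_size."
            )


# Passes: 1536 / 4 = 384, and 384 is a multiple of 128.
check_quantized_moe_compatibility(tp_size=4, moe_intermediate_size=1536,
                                  weight_block_size_n=128)
```

Running a check like this while the configuration is being parsed means an incompatible setup is reported with the offending values in the message, instead of surfacing later as a shape error during weight loading.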

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Work with maintainers to merge your PR. See the PR Merge Process

gemini-code-assist · 2025-11-18T01:05:33Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

Signed-off-by: Xinyuan Tong <[email protected]>

JustinTong0323 requested review from Fridge003, Ying1123, hnyls2002, ispobock and merrymercy as code owners November 18, 2025 01:05

sglang-bot added the run-ci label Nov 18, 2025

Kangyan-Zhou and others added 2 commits November 17, 2025 17:08

Merge branch 'main' into minor-expend-moe-check

a6e7712

Merge branch 'main' into minor-expend-moe-check

3861d92

hnyls2002 approved these changes Nov 19, 2025

hnyls2002 merged commit a355794 into sgl-project:main Nov 19, 2025
92 of 109 checks passed

alisonshao pushed a commit that referenced this pull request Nov 19, 2025

Expend compatibility check for all quantized MoE models (#13465)

c2c5855

Signed-off-by: Xinyuan Tong <[email protected]>
