forked from sgl-project/sglang
-
Notifications
You must be signed in to change notification settings - Fork 11
Pull requests: zejunchen-zejun/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Perf] Support Qwen-Image-Edit A8W8 GEMM and fused QKV
diffusion
#282
opened May 29, 2026 by
LiuYinfeng01
Loading…
5 tasks
ci(qwen35): unify benchmark Docker image to 20260430 build
#270
opened Apr 30, 2026 by
gyohuangxin
Collaborator
Loading…
Fix full-attention layer id mapping for Hybrid models (e.g. Qwen3-Next/Qwen3.5)
#249
opened Apr 14, 2026 by
wanzhenchn
Collaborator
Loading…
4 tasks
[feat] support aiter pa for Qwen3.5 GA module in decode phase
#241
opened Apr 9, 2026 by
At1a8
Collaborator
Loading…
4 tasks
Cherry-pick fix for piecewise cuda graph in Qwen3.5
#234
opened Apr 2, 2026 by
apinge
Loading…
4 tasks
enable all2all overlap, and use rope overlap last v gemm
#173
opened Jan 22, 2026 by
ganyi1996ppo
Loading…
4 tasks
add offline generate lora qwen-image-edit script
#157
opened Jan 13, 2026 by
zhuyuhua-v
Collaborator
Loading…
4 tasks
Cuda Graph Capture WA for HIP Runtime
#147
opened Jan 9, 2026 by
sammysun0711
Collaborator
Loading…
4 tasks
[Feat] add ttft measure for qwen3vl
#128
opened Dec 31, 2025 by
ZLkanyo009
Collaborator
Loading…
4 tasks
Regulate flash_attn_varlen_fp8_pertensor_func according to precision issue
#123
opened Dec 29, 2025 by
apinge
Loading…
4 tasks
Add tuned triton MOE config for Qwen3-Omni
#105
opened Dec 19, 2025 by
sammysun0711
Collaborator
Loading…
4 tasks
Qwen3 next -- fixed conv update split q/k/v in decode phase
#87
opened Dec 10, 2025 by
IzacharyI
Loading…
4 tasks
Qwen3 next -- fixed sigmoid and mul broadcast issue
#86
opened Dec 10, 2025 by
IzacharyI
Loading…
6 tasks
[CI] Enable Qwen3-Omni Performance Benchmark
#85
opened Dec 10, 2025 by
sammysun0711
Collaborator
Loading…
4 tasks
ProTip!
Filter pull requests by the default branch with base:Qwen3.5_v0.5.9.