Skip to content

Pull requests: zejunchen-zejun/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Adapt with #20522 #26877 in mamba
#288 opened Jun 13, 2026 by IzacharyI Loading…
5 tasks
Apply pa_gluon for aiter_backend
#278 opened May 15, 2026 by apinge Loading…
5 tasks
ci(qwen35): unify benchmark Docker image to 20260430 build
#270 opened Apr 30, 2026 by gyohuangxin Collaborator Loading…
add pagged attention nhd for aiter_backend
#269 opened Apr 28, 2026 by apinge Draft
5 tasks
Fix full-attention layer id mapping for Hybrid models (e.g. Qwen3-Next/Qwen3.5)
#249 opened Apr 14, 2026 by wanzhenchn Collaborator Loading…
4 tasks
[GDN] Fused all preprocessing into one kernel
#244 opened Apr 9, 2026 by apinge Loading…
4 tasks
[feat] support aiter pa for Qwen3.5 GA module in decode phase
#241 opened Apr 9, 2026 by At1a8 Collaborator Loading…
4 tasks
Cherry-pick fix for piecewise cuda graph in Qwen3.5
#234 opened Apr 2, 2026 by apinge Loading…
4 tasks
diasble flash infer rope
#189 opened Feb 6, 2026 by LiuYinfeng01 Loading…
4 tasks
enable all2all overlap, and use rope overlap last v gemm
#173 opened Jan 22, 2026 by ganyi1996ppo Loading…
4 tasks
add offline generate lora qwen-image-edit script
#157 opened Jan 13, 2026 by zhuyuhua-v Collaborator Loading…
4 tasks
Cuda Graph Capture WA for HIP Runtime
#147 opened Jan 9, 2026 by sammysun0711 Collaborator Loading…
4 tasks
[Feat] add ttft measure for qwen3vl
#128 opened Dec 31, 2025 by ZLkanyo009 Collaborator Loading…
4 tasks
[feat] Add ROCm ATOM model impl backend
#119 opened Dec 26, 2025 by zejunchen-zejun Owner Loading…
Add tuned triton MOE config for Qwen3-Omni
#105 opened Dec 19, 2025 by sammysun0711 Collaborator Loading…
4 tasks
Qwen3 next -- fixed conv update split q/k/v in decode phase
#87 opened Dec 10, 2025 by IzacharyI Loading…
4 tasks
Qwen3 next -- fixed sigmoid and mul broadcast issue
#86 opened Dec 10, 2025 by IzacharyI Loading…
6 tasks
[CI] Enable Qwen3-Omni Performance Benchmark
#85 opened Dec 10, 2025 by sammysun0711 Collaborator Loading…
4 tasks
Increase _AITER_PARTITION_SIZE_ROCM
#84 opened Dec 10, 2025 by apinge Draft
4 tasks
CI: Debug Qwen3 Next issue
#48 opened Dec 2, 2025 by gyohuangxin Collaborator Draft
ProTip! Filter pull requests by the default branch with base:Qwen3.5_v0.5.9.