-
Notifications
You must be signed in to change notification settings - Fork 87
Pull requests: vllm-project/vllm-gaudi
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[GAUDISW-243560] Monkey-patching _get_attn_scale for the Llama4Attention layer
#760
opened Dec 24, 2025 by
rsmyrek
Loading…
[GAUDISW-243560] Monkey-patching _get_attn_scale for the Llama4Attention layer
#758
opened Dec 23, 2025 by
rsmyrek
Loading…
Prefill batching logic to handle chunked prefill/prefix caching for HPU
#753
opened Dec 23, 2025 by
hlin99
Loading…
Release Notes for v0.13.0
documentation
Improvements or additions to documentation
skip-gaudi-tests
#750
opened Dec 22, 2025 by
mhelf-intel
Loading…
[GAUDISW-244752] add dynamic scale for V-Cache on Hiddden dim
#749
opened Dec 21, 2025 by
dudilester
Loading…
[GAUDISW-244336] Add missing long ctx prompt buckets
#739
opened Dec 18, 2025 by
kfojcik-intel
Loading…
Dryrun implementation for generating command line file
#723
opened Dec 16, 2025 by
rajanintel24
Loading…
Create UBI based vLLM docker build instructions
documentation
Improvements or additions to documentation
skip-gaudi-tests
#713
opened Dec 12, 2025 by
ghandoura
Loading…
Fix the docker image path
documentation
Improvements or additions to documentation
skip-gaudi-tests
#691
opened Dec 5, 2025 by
mhelf-intel
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.