Name and Version
Commit 86a3f0f + #17750
Operating systems
Linux
GGML backends
CUDA
Hardware
NVIDIA L40S
Models
unsloth/Qwen3-VL-30B-A3B-Thinking-1M-GGUF:IQ3_XXS
Problem description & steps to reproduce
Here is an example payload that can trigger the bug. You may have to resend the request several times before the issue occurs.
request_body(14).json
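Since the bug does not show up on every attempt, a small replay loop may help. The following is only a sketch: the server URL, the endpoint path, and the attempt count are assumptions and are not stated in this report, and `post_json` and `replay` are hypothetical helper names. It sends the same JSON body several times and records the HTTP status or the error of each attempt.

```python
import json
import urllib.request
from typing import Callable, List


def post_json(url: str, body: dict, timeout: float = 600.0) -> int:
    """POST `body` as JSON to `url` and return the HTTP status code."""
    data = json.dumps(body).encode("utf-8")
    req = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        resp.read()  # drain the response so the request fully completes
        return resp.status


def replay(body: dict, url: str, attempts: int = 10,
           send: Callable[[str, dict], int] = post_json) -> List[str]:
    """Send the same payload `attempts` times and record each outcome."""
    outcomes = []
    for _ in range(attempts):
        try:
            outcomes.append(str(send(url, body)))
        except Exception as exc:  # HTTP error, connection reset, timeout, ...
            outcomes.append(f"error: {exc}")
    return outcomes


# Example usage (assumed host, port, and endpoint; adjust to your setup):
#
#   with open("request_body(14).json", encoding="utf-8") as f:
#       payload = json.load(f)
#   url = "http://localhost:8080/v1/chat/completions"
#   for i, outcome in enumerate(replay(payload, url), 1):
#       print(i, outcome)
```

The `send` parameter exists only so the loop can be exercised without a running server. Against a real server the default `post_json` is used.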
First Bad Commit
No response
Relevant log output