Skip to content

Eval bug: Qwen3-VL MoE tool calls during reasoning are often not parsed properly #17932

@aviallon

Description

@aviallon

Name and Version

Commit 86a3f0f + #17750

Operating systems

Linux

GGML backends

CUDA

Hardware

NVIDIA L40S

Models

unsloth/Qwen3-VL-30B-A3B-Thinking-1M-GGUF:IQ3_XXS

Problem description & steps to reproduce

Here is an example of payload that can trigger the bug. You may have to restart the request several times to make the issue occur.
request_body(14).json

First Bad Commit

No response

Relevant log output

N/A

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions