
Blog post: When Does Constrained Decoding Actually Help a Small VLM?#3375

Open
Arjun-Avadhanam wants to merge 3 commits into huggingface:main from Arjun-Avadhanam:smolvlm-cd

Conversation

@Arjun-Avadhanam

A 4-cell ablation study testing how LoRA fine-tuning and Outlines constrained decoding interact on SmolVLM-256M for structured receipt extraction (SROIE dataset).

Three findings:

  1. Constrained decoding is enormously valuable in the zero-shot regime, lifting schema validity from 0% to 97%
  2. Once the model is LoRA-trained, constrained decoding adds no measurable benefit (49.6% vs. 49.8%)
  3. The repetition_penalty hyperparameter that fixes degenerate loops in the untrained model causes silent failures in the trained model, through an interaction between the penalty and the FSM token mask
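The mechanism behind Finding 3 can be sketched with a toy example (this is an illustration, not the post's actual SmolVLM/Outlines code): an HF-style repetition penalty demotes the logits of already-generated tokens, but structural tokens like quotes and braces legitimately repeat in JSON. When the FSM mask is applied afterwards, a strong penalty can push the structurally required token below another FSM-allowed token, silently steering generation down a different branch. Token IDs and logit values here are made up.

```python
import math

def apply_repetition_penalty(logits, generated_ids, penalty):
    """HF-style repetition penalty: divide positive logits of already
    generated tokens by `penalty`, multiply negative ones by it."""
    out = list(logits)
    for t in set(generated_ids):
        out[t] = out[t] / penalty if out[t] > 0 else out[t] * penalty
    return out

def apply_fsm_mask(logits, allowed):
    """Constrained decoding step: tokens the FSM disallows get -inf."""
    return [l if i in allowed else -math.inf for i, l in enumerate(logits)]

def greedy(logits):
    return max(range(len(logits)), key=lambda i: logits[i])

# Toy vocabulary: 0 = '"', 1 = ',', 2 = 'x', 3 = 'y'
logits = [4.0, 3.0, 2.5, 1.0]
history = [0, 0, 0]          # the quote token has already appeared often
allowed = {0, 1}             # FSM: only '"' or ',' are valid next

# Without the penalty, the model picks the structurally required quote
assert greedy(apply_fsm_mask(logits, allowed)) == 0

# A strong penalty demotes the repeated quote below the comma, so the
# masked argmax changes branch with no error raised anywhere
penalized = apply_repetition_penalty(logits, history, penalty=2.0)
assert greedy(apply_fsm_mask(penalized, allowed)) == 1
```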

Repo: https://github.com/Arjun-Avadhanam/SmolVLM-CD
A Gradio demo is included with a rep_penalty slider to reproduce Finding 3 live.

