-
Notifications
You must be signed in to change notification settings - Fork 7
Pull requests: TomTraining/TomTest
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update tables/SUMMARY.md: accuracy overview with Qwen3-8B variants
#36
opened Apr 24, 2026 by
xujiayuan0205
Contributor
Loading…
feat: add badcase recording, LLM judge fallback, dual metrics, and fi…
#26
opened Apr 15, 2026 by
xujiayuan0205
Contributor
Loading…
ProTip!
Adding no:label will show everything without a label.