model: jina-embeddings-v5-omni models by florian-hoenicke · Pull Request #4604 · embeddings-benchmark/mteb

florian-hoenicke · 2026-05-04T16:52:30Z

Hi,

This PR adds the jina-embeddings-v5-omni nano and small base models to MTEB. The text path is parity-verified against the corresponding v5 text models, so the same task routing is used here.

ModelMeta is filled for both models
mteb.get_model_meta(...) works for both models
Results PR: add: jina-embeddings-v5-omni results results#513

Thanks!

Co-authored-by: Cursor <cursoragent@cursor.com>

florian-hoenicke · 2026-05-06T08:31:45Z

Follow-up fix pushed in 320263b: JinaV5OmniWrapper now defaults jinaai/jina-embeddings-v5-omni-nano to torch.float32, which restores text-path parity with jinaai/jina-embeddings-v5-text-nano. Verified on A2 with private HF auth via mteb.get_model(...): omni-nano loads fp32 and matched text-nano exactly on retrieval/query and classification/document probes (max_abs_diff=0.0). Small remains unchanged.

Co-authored-by: Cursor <cursoragent@cursor.com>

florian-hoenicke · 2026-05-06T08:38:30Z

Thanks, updated the prompt/task dispatch test in c42e7a9 to use MockRetrievalTask().metadata instead of a hand-rolled task metadata object.

Co-authored-by: Cursor <cursoragent@cursor.com>

florian-hoenicke · 2026-05-06T08:54:49Z

Follow-up corner-case fix pushed in 9f40546: I found the omni HF remote code was stripping text before tokenization, which made trailing-space inputs differ from the text models. The private HF repos now preserve whitespace and the MTEB metadata pins those fixed revisions. Verified on A2 through mteb.get_model(...): nano 36db6194... and small 8b4f2c44... match their text counterparts on trailing-space probes with max_abs=0.0.

Samoed · 2026-05-06T09:39:08Z

JinaV5OmniWrapper now defaults jinaai/jina-embeddings-v5-omni-nano to torch.float32, which restores text-path parity

You can just pass this in loader_kwargs without changes in __init__

Co-authored-by: Cursor <cursoragent@cursor.com>

florian-hoenicke · 2026-05-06T13:18:45Z

Updated again after the latest private HF repo fix: MTEB now pins nano 6f88a89e... and small 43affca6.... Verified on A2 against the text counterparts on trailing-space probes and all unique STSBenchmark strings: max_abs=0.0, min cosine 1.0 / 0.99999988.

Move the nano dtype default into model metadata and remove model-specific tests per reviewer guidance. Co-authored-by: Cursor <cursoragent@cursor.com>

florian-hoenicke · 2026-05-09T09:18:13Z

Update: the referenced HF repos are now public and ungated:

jinaai/jina-embeddings-v5-omni-nano: Hub API reports private: false, gated: false, disabled: false
jinaai/jina-embeddings-v5-omni-small: Hub API reports private: false, gated: false, disabled: false

I also pushed 605cb14 to address the open review feedback: nano fp32 is now passed through loader_kwargs, and the Jina-specific tests were removed. Local checks passed (test_model_meta.py: 1935 passed; ruff on edited files passed).

Samoed · 2026-05-09T10:07:25Z

I think we would wait until public release

florian-hoenicke · 2026-05-10T09:59:22Z

The public release is live now: both referenced model repos are publicly accessible and ungated on the Hub.

Hub API currently reports private: false, gated: false, disabled: false for both. Please let me know if you need any other release artifact before merging.

add: jina-embeddings-v5-omni models

bde98d0

Co-authored-by: Cursor <cursoragent@cursor.com>

florian-hoenicke mentioned this pull request May 4, 2026

add: jina-embeddings-v5-omni results embeddings-benchmark/results#513

Merged

7 tasks

Samoed reviewed May 4, 2026

View reviewed changes

Comment thread mteb/models/model_implementations/jina_models.py

Samoed added the new model Questions related to adding a new model to the benchmark label May 4, 2026

fix: route jina omni models through multimodal wrapper

efbb0b2

Co-authored-by: Cursor <cursoragent@cursor.com>

Samoed reviewed May 5, 2026

View reviewed changes

Comment thread tests/test_models/test_model_meta.py

Samoed changed the title ~~add: jina-embeddings-v5-omni models~~ model: jina-embeddings-v5-omni models May 5, 2026

fix: load jina omni nano in fp32

320263b

Co-authored-by: Cursor <cursoragent@cursor.com>

test: use mock task for jina omni prompt dispatch

c42e7a9

Co-authored-by: Cursor <cursoragent@cursor.com>

fix: pin jina omni revisions with whitespace parity

9f40546

Co-authored-by: Cursor <cursoragent@cursor.com>

fix: pin jina omni latest private revisions

86c63c5

Co-authored-by: Cursor <cursoragent@cursor.com>

fix: simplify jina omni review follow-up

605cb14

Move the nano dtype default into model metadata and remove model-specific tests per reviewer guidance. Co-authored-by: Cursor <cursoragent@cursor.com>

Samoed approved these changes May 9, 2026

View reviewed changes

Samoed merged commit 5ff08fa into embeddings-benchmark:main May 10, 2026
13 checks passed

Samoed mentioned this pull request May 10, 2026

MVEB Overview #4130

Open

75 tasks

florian-hoenicke mentioned this pull request May 11, 2026

fix: route MIEB/MAEB task-types to correct LoRA adapter for jina v5 omni #4656

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model: jina-embeddings-v5-omni models#4604

model: jina-embeddings-v5-omni models#4604
Samoed merged 7 commits into
embeddings-benchmark:mainfrom
florian-hoenicke:add-jina-v5-omni

florian-hoenicke commented May 4, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

florian-hoenicke commented May 6, 2026 •

edited by Samoed

Loading

Uh oh!

florian-hoenicke commented May 6, 2026 •

edited by Samoed

Loading

Uh oh!

florian-hoenicke commented May 6, 2026 •

edited by Samoed

Loading

Uh oh!

Samoed commented May 6, 2026

Uh oh!

florian-hoenicke commented May 6, 2026

Uh oh!

florian-hoenicke commented May 9, 2026

Uh oh!

Samoed commented May 9, 2026

Uh oh!

florian-hoenicke commented May 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

florian-hoenicke commented May 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

florian-hoenicke commented May 6, 2026 • edited by Samoed Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

florian-hoenicke commented May 6, 2026 • edited by Samoed Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

florian-hoenicke commented May 6, 2026 • edited by Samoed Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Samoed commented May 6, 2026

Uh oh!

florian-hoenicke commented May 6, 2026

Uh oh!

florian-hoenicke commented May 9, 2026

Uh oh!

Samoed commented May 9, 2026

Uh oh!

florian-hoenicke commented May 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

florian-hoenicke commented May 4, 2026 •

edited

Loading

florian-hoenicke commented May 6, 2026 •

edited by Samoed

Loading

florian-hoenicke commented May 6, 2026 •

edited by Samoed

Loading

florian-hoenicke commented May 6, 2026 •

edited by Samoed

Loading