model: jina-embeddings-v5-omni models#4604
Conversation
Co-authored-by: Cursor <cursoragent@cursor.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
|
Follow-up fix pushed in 320263b: |
Co-authored-by: Cursor <cursoragent@cursor.com>
|
Thanks, updated the prompt/task dispatch test in c42e7a9 to use |
Co-authored-by: Cursor <cursoragent@cursor.com>
|
Follow-up corner-case fix pushed in 9f40546: I found the omni HF remote code was stripping text before tokenization, which made trailing-space inputs differ from the text models. The private HF repos now preserve whitespace and the MTEB metadata pins those fixed revisions. Verified on A2 through |
You can just pass this in |
Co-authored-by: Cursor <cursoragent@cursor.com>
|
Updated again after the latest private HF repo fix: MTEB now pins nano |
Move the nano dtype default into model metadata and remove model-specific tests per reviewer guidance. Co-authored-by: Cursor <cursoragent@cursor.com>
|
Update: the referenced HF repos are now public and ungated:
I also pushed 605cb14 to address the open review feedback: nano fp32 is now passed through |
|
I think we would wait until public release |
|
The public release is live now: both referenced model repos are publicly accessible and ungated on the Hub.
Hub API currently reports |
Hi,
This PR adds the jina-embeddings-v5-omni nano and small base models to MTEB. The text path is parity-verified against the corresponding v5 text models, so the same task routing is used here.
mteb.get_model_meta(...)works for both modelsThanks!