Update Querit results by moshesbeta · Pull Request #501 · embeddings-benchmark/results

moshesbeta · 2026-04-30T05:42:48Z

Checklist

My model has a model sheet, report, or similar
My model has a reference implementation in mteb/models/model_implementations/, this can be as an API. Instruction on how to add a model can be found here
- No, but there is an existing PR Fix: Querit model revision and release_data mteb#4569
The results submitted are obtained using the reference implementation
My model is available, either as a publicly accessible API or publicly on e.g., Huggingface
I solemnly swear that for all results submitted I have not trained on the evaluation dataset including training splits. If I have, I have disclosed it clearly.

github-actions · 2026-04-30T05:44:53Z

Model Results Comparison

Reference models: intfloat/multilingual-e5-large, google/gemini-embedding-001
New models evaluated: Querit/Querit
Tasks: AlloprofReranking, RuBQReranking, T2Reranking, VoyageMMarcoReranking, WebLINXCandidatesReranking, WikipediaRerankingMultilingual

Results for `Querit/Querit`

task_name	Querit/Querit	google/gemini-embedding-001	intfloat/multilingual-e5-large	Max result	Model with max result	In Training Data
AlloprofReranking	0.7919	0.8177	0.6944	0.8540	Octen/Octen-Embedding-8B	False
RuBQReranking	0.7535	0.7384	0.756	0.8051	ai-sage/Giga-Embeddings-instruct	False
T2Reranking	0.6895	0.6795	0.6632	0.7315	tencent/Youtu-Embedding	True
VoyageMMarcoReranking	0.6788	0.6673	0.6821	0.8366	codefuse-ai/F2LLM-v2-14B	False
WebLINXCandidatesReranking	0.1184	0.1097	0.0778	0.2246	codefuse-ai/F2LLM-v2-8B	False
WikipediaRerankingMultilingual	0.9092	0.9224	0.8981	0.9308	jinaai/jina-reranker-v3	False
Average	0.6569	0.6558	0.6286	0.7304	nan	-

Training datasets: AskUbuntuDupQuestions, AskUbuntuDupQuestions-VN, CQADupStack, MIRACLRanking, MSMARCO, MSMARCO-Fa, MSMARCO-FaHardNegatives, MSMARCO-PL, MSMARCO-PLHardNegatives, MSMARCO-VN, MSMARCOHardNegatives, MSMARCOv2, MindSmallReranking, MrTidyRetrieval, MrTyDiJaRetrievalLite, MultiLongDocReranking, MultiLongDocRetrieval, NanoMSMARCO-VN, NanoMSMARCORetrieval, T2Reranking, ruri-v3-dataset-reranker

Note: Content truncated due to GitHub API limits. See the full report in the workflow artifacts.

KennethEnevoldsen · 2026-04-30T09:54:08Z

this is deleted but never added

this is still missing

Our model primarily focuses on multilingual reranking tasks, and it's not tested on this dataset. Is it necessary to provide results from this test set?

KennethEnevoldsen reviewed Apr 30, 2026

View reviewed changes

KennethEnevoldsen added waiting for review of implementation This PR is waiting for an implementation review before merging the results. and removed waiting for review of implementation This PR is waiting for an implementation review before merging the results. labels Apr 30, 2026

moshesbeta closed this May 14, 2026

moshesbeta force-pushed the main branch from 79f2329 to e16a7d4 Compare May 14, 2026 06:05

Samoed mentioned this pull request May 14, 2026

Add new results of Querit/Querit #540

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Querit results#501

Update Querit results#501
moshesbeta wants to merge 0 commit into
embeddings-benchmark:mainfrom
moshesbeta:main

moshesbeta commented Apr 30, 2026 •

edited by Samoed

Loading

Uh oh!

github-actions Bot commented Apr 30, 2026

Uh oh!

KennethEnevoldsen Apr 30, 2026

Uh oh!

KennethEnevoldsen Apr 30, 2026

Uh oh!

moshesbeta Apr 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

moshesbeta commented Apr 30, 2026 • edited by Samoed Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Uh oh!

github-actions Bot commented Apr 30, 2026

Model Results Comparison

Results for Querit/Querit

Uh oh!

KennethEnevoldsen Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

KennethEnevoldsen Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

moshesbeta Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

moshesbeta commented Apr 30, 2026 •

edited by Samoed

Loading

Results for `Querit/Querit`