Update Querit results#501
Conversation
Model Results ComparisonReference models: Results for
|
| task_name | Querit/Querit | google/gemini-embedding-001 | intfloat/multilingual-e5-large | Max result | Model with max result | In Training Data |
|---|---|---|---|---|---|---|
| AlloprofReranking | 0.7919 | 0.8177 | 0.6944 | 0.8540 | Octen/Octen-Embedding-8B | False |
| RuBQReranking | 0.7535 | 0.7384 | 0.756 | 0.8051 | ai-sage/Giga-Embeddings-instruct | False |
| T2Reranking | 0.6895 | 0.6795 | 0.6632 | 0.7315 | tencent/Youtu-Embedding | True |
| VoyageMMarcoReranking | 0.6788 | 0.6673 | 0.6821 | 0.8366 | codefuse-ai/F2LLM-v2-14B | False |
| WebLINXCandidatesReranking | 0.1184 | 0.1097 | 0.0778 | 0.2246 | codefuse-ai/F2LLM-v2-8B | False |
| WikipediaRerankingMultilingual | 0.9092 | 0.9224 | 0.8981 | 0.9308 | jinaai/jina-reranker-v3 | False |
| Average | 0.6569 | 0.6558 | 0.6286 | 0.7304 | nan | - |
Training datasets: AskUbuntuDupQuestions, AskUbuntuDupQuestions-VN, CQADupStack, MIRACLRanking, MSMARCO, MSMARCO-Fa, MSMARCO-FaHardNegatives, MSMARCO-PL, MSMARCO-PLHardNegatives, MSMARCO-VN, MSMARCOHardNegatives, MSMARCOv2, MindSmallReranking, MrTidyRetrieval, MrTyDiJaRetrievalLite, MultiLongDocReranking, MultiLongDocRetrieval, NanoMSMARCO-VN, NanoMSMARCORetrieval, T2Reranking, ruri-v3-dataset-reranker
Note: Content truncated due to GitHub API limits. See the full report in the workflow artifacts.
There was a problem hiding this comment.
this is deleted but never added
There was a problem hiding this comment.
this is still missing
There was a problem hiding this comment.
Our model primarily focuses on multilingual reranking tasks, and it's not tested on this dataset. Is it necessary to provide results from this test set?
Checklist
mteb/models/model_implementations/, this can be as an API. Instruction on how to add a model can be found here