Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Information Retrieval on Other tasks 14-task aggregate
Loading...
53.22
NDCG@10
LM-Cocktail10
47.1984
48.7617
50.325
51.8883
Nov 22, 2023
NDCG@10
Updated 1mo ago
Evaluation Results
Method
Method
Links
NDCG@10
LM-Cocktail10
Fine-tune on=MSMarco,...
2023.11
53.22
LM-Cocktail2
Fine-tune on=MSMarco,...
2023.11
52.71
BGE
Excluded task=MSMarco
2023.11
52
Fine-tuned model
Fine-tune on=MSMarco,...
2023.11
51.98
LM-Cocktail10
Fine-tune on=HotpotQA,...
2023.11
50.64
LM-Cocktail2
Fine-tune on=HotpotQA,...
2023.11
49.98
BGE
Excluded task=HotpotQA
2023.11
49.81
LM-Cocktail10
Fine-tune on=Quora, Ex...
2023.11
49.11
BGE
Excluded task=Quora
2023.11
48.59
LM-Cocktail2
Fine-tune on=Quora, Ex...
2023.11
48.09
Fine-tuned model
Fine-tune on=HotpotQA,...
2023.11
47.49
Fine-tuned model
Fine-tune on=Quora, Ex...
2023.11
47.43
Feedback
Search any
task
Search any
task