Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BRIGHT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Information RetrievalBRIGHT
Biology nDCG@1063.1
45
Information RetrievalBRIGHT 1.0 (test)
nDCG@10 (Avg)37.9
35
Downstream retrievalBRIGHT
Biology nDCG@520
24
RetrievalBRIGHT 12 datasets aggregate (test)
NDCG@1012.74
20
RetrievalBRIGHT
nDCG@1 (Econ)65.8
13
Passage RerankingBRIGHT
NDCG@10 (Avg)38
12
RetrievalBRIGHT v1 (leaderboard)
Average Retrieval Score46.8
12
Multi-class ClassificationBRIGHT 6class (test)
Accuracy43.4
11
Building Damage AssessmentBRIGHT
F1 (bcd)91.71
10
Information RetrievalBRIGHT v1 (test)
nDCG@10 (Bio)58.2
8
Reasoning-intensive RetrievalBRIGHT
BRIGHT Score19.3
8
Information RetrievalBRIGHT unseen 6 subsets (test)
nDCG@1011.79
7
Image ClassificationBRIGHT (test)
Accuracy0.6458
3
Showing 13 of 13 rows