Question Answering on TYDIQA (Accuracy, AVERAGE MEAN)
[Chart: Accuracy and Average Mean Score by date. Highlighted result: TRIM, Accuracy 56.62, Oct 8, 2025.]
Updated 19d ago
Evaluation Results
| Method | Method details | Links | Accuracy | Average Mean Score |
|---|---|---|---|---|
| TRIM | Base Model=LLAMA-2-7B,... | 2025.10 | 56.62 | 48.56 |
| LESS | Base Model=LLAMA-2-7B,... | 2025.10 | 56.13 | 48.27 |
| Full-data Fine-tuning | Base Model=LLAMA-2-7B,... | 2025.10 | 54 | 48.99 |
| TAGCOS | Base Model=LLAMA-2-7B,... | 2025.10 | 53.45 | 46.6 |
| S2L | Base Model=LLAMA-2-7B,... | 2025.10 | 52.74 | 45.56 |
| BM25 | Base Model=LLAMA-2-7B,... | 2025.10 | 52.7 | 45.41 |
| Random | Base Model=LLAMA-2-7B,... | 2025.10 | 52.45 | 45.14 |
| Pretrained (no Fine-tuning) | Base Model=LLAMA-2-7B,... | 2025.10 | 46.4 | 43.43 |
| RDS | Base Model=LLAMA-2-7B,... | 2025.10 | 46.1 | 42.17 |
| DSIR | Base Model=LLAMA-2-7B,... | 2025.10 | 45.62 | 42.02 |
| CLD | Base Model=LLAMA-2-7B,... | 2025.10 | 45.58 | 42.61 |