Reading Comprehension on TriviaQA
[Chart: Accuracy over time, Sep 11, 2024 to Feb 5, 2025; highlighted result: Mistral, 70.3]
Updated 1mo ago
Evaluation Results
Method                Configuration (truncated)   Date     Accuracy
Mistral               Size=7B, Tokens=?, Sho...   2024.09  70.3
Mamba                 Size=7B, Tokens=1.2T,...    2024.09  66.2
GSA                   Size=7B, Tokens=+100B,...   2024.09  65.8
Llama2                Size=7B, Tokens=2T, Sh...   2024.09  64.2
Gemma                 Size=7B, Tokens=6T, Sh...   2024.09  63.7
GSA                   Size=7B, Tokens=+20B,...    2024.09  60.7
SUPRA                 Size=7B, Tokens=+100B,...   2024.09  60.4
RWKV6                 Size=7B, Tokens=1.4T,...    2024.09  59.5
GLA                   Size=7B, Tokens=+20B,...    2024.09  57.8
LLaMA2                Model=LLaMA2, Paramete...   2025.02  45.4
RetNet                Size=7B, Tokens=+20B,...    2024.09  43
DiffuLLaMA + P2-self  Model=DiffuLLaMA, Para...   2025.02  18.8
DiffuLLaMA            Model=DiffuLLaMA, Para...   2025.02  18.5
GPT2-M                Model=GPT2-M, Paramete...   2025.02  6.7
GPT2-S                Model=GPT2-S, Paramete...   2025.02  4
DiffuGPT-M            Model=DiffuGPT-M, Para...   2025.02  3.8
DiffuGPT-S            Model=DiffuGPT-S, Para...   2025.02  2
SEDD-M                Model=SEDD-M, Paramete...   2025.02  1.8
SEDD-S                Model=SEDD-S, Paramete...   2025.02  1.5
Plaid1B               Model=Plaid1B, Paramet...   2025.02  1.2