Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on GSM8K EN
Loading...
19.71
Accuracy
Baseline
-0.362
4.849
10.06
15.271
May 26, 2025
Accuracy
Updated 2mo ago
Evaluation Results
Method
Method
Links
Accuracy
Baseline
Forgetting Language=none
2025.05
19.71
LWF
Forgetting Language=TR
2025.05
6.95
LWF
Forgetting Language=IT
2025.05
5.38
LWF-mixed
Forgetting Language=mixed
2025.05
3.45
LWF
Forgetting Language=ZH
2025.05
2.69
LWF
Forgetting Language=ES
2025.05
0.41
Feedback
Search any
task
Search any
task