Question Answering on GSM8K ZH
[Chart: Accuracy by date. Highlighted point: Baseline, 9.78 accuracy, May 26, 2025]
Updated 2mo ago
Evaluation Results
| Method    | Setting                   | Date    | Accuracy |
|-----------|---------------------------|---------|----------|
| Baseline  | Forgetting Language=none  | 2025.05 | 9.78     |
| LWF-mixed | Forgetting Language=mixed | 2025.05 | 7.77     |
| LWF       | Forgetting Language=EN    | 2025.05 | 2.35     |
| LWF       | Forgetting Language=IT    | 2025.05 | -5.42    |
| LWF       | Forgetting Language=ES    | 2025.05 | -6.24    |
| LWF       | Forgetting Language=TR    | 2025.05 | -8.49    |