Math Word Problem solving on UnbiasedMWP Chinese (test)
[Figure: Accuracy over time. Top result: ATHENA, 42 (Nov 2, 2023).]
Evaluation Results
Method          Backbone           Date      Accuracy
ATHENA          RoBERTa-large      2023.11   42
ATHENA          RoBERTa-base       2023.11   36.2
DeductReasoner  RoBERTa-large      2023.11   34.9
DeductReasoner  RoBERTa-base       2023.11   31.6
Graph-to-Tree   Random embedding   2023.11   27.2
GTS             Random embedding   2023.11   26.2
Transformer     Random embedding   2023.11   20.5
R-Transformer   RoBERTa-base       2023.11   18.3