Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Adversarial Reasoning on StrategyQA (PTR/ReAct Metrics)
Loading...
3
PTR Advancement
PTR
2.85
2.925
3
3.075
Apr 5, 2026
PTR Advancement
ReAct Advancement
Average Delta EM
Updated 12d ago
Evaluation Results
Method
Method
Links
PTR Advancement
ReAct Advancement
Average Delta EM
PTR
Average over=4 models
2026.04
3
100
0.07
ReAct
Average over=4 models
2026.04
3
100
0.07
Feedback
Search any
task
Search any
task