Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Rationale Evaluation on StrategyQA (test)

0.283RORA

GPT-4

0.02820.094350.16050.22665Feb 28, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.02
0.2830.4740.501-0.0660.1610.3162.69
2024.02
0.2530.4590.412-0.0830.2150.3112.32
2024.02
0.1190.3480.6730.145-0.406-
2024.02
0.1150.3810.121-0.0380.0240.138-
2024.02
0.10.2320.155-0.0350.0840.1211.16
2024.02
0.0610.1320.102-0.0130.0270.0710.39
2024.02
0.0430.0260.505-0.0050.1310.376-
2024.02
0.0380.0240.6730.147-0.406-