General Knowledge Reasoning on MMLU-CF
Top result: GHG-TDA, 75.9 Accuracy (Feb 10, 2026)
Evaluation Results
Method    Base Model      Date      Accuracy
GHG-TDA   Claude 3.5...   2026.02   75.9
AoT       Claude 3.5...   2026.02   75.2
GHG-TDA   GPT-4o          2026.02   75
GoT       Claude 3.5...   2026.02   74.4
AoT       GPT-4o          2026.02   74.3
ToT       Claude 3.5...   2026.02   74.1
GoT       GPT-4o          2026.02   73.2
ToT       GPT-4o          2026.02   73
CoT       Claude 3.5...   2026.02   73
CoT       GPT-4o          2026.02   72.1