Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Maximum Flow on BA and ER averaged (test)
Loading...
36.59
Accuracy
TLG-F
33.1996
34.0798
34.96
35.8402
Feb 11, 2026
Accuracy
Mean Absolute Error
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Mean Absolute Error
TLG-F
Model=gpt-4o
2026.02
36.59
0.445
L-OWL
Model=gpt-4o
2026.02
36.59
0.475
C-OWL
Model=gpt-4o
2026.02
36.59
0.443
CL-OWL
Model=gpt-4o
2026.02
36.59
0.438
TLG-A
Model=gpt-4o
2026.02
33.33
0.494
Feedback
Search any
task
Search any
task