Proof length optimization on MIL
[Chart: Improvement over time. Highlighted point: ImProver, 43.55, Oct 7, 2024. Selectable metrics: Improvement, Nonempty Improvement, Accuracy, Improved Accuracy.]
Updated 12d ago
Evaluation Results
Method    Date     Improvement  Nonempty Improvement  Accuracy  Improved Accuracy
--------  -------  -----------  --------------------  --------  -----------------
ImProver  2024.10  43.55        44.76                 100       45.94
ImProver  2024.10  30.54        56.56                 100       50
GPT-4o    2024.10  6.25         18.58                 37.5      14.42
GPT-4o    2024.10  3.7          27.38                 13.51     0