Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Biological Tool Use on AlphaFold
Loading...
10
Pass@10
GPT-4o
-0.4
2.3
5
7.7
Dec 21, 2024
Pass@10
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pass@10
GPT-4o
Fine-tuned=true
2024.12
10
GPT-4 Turbo
Scaffold=Ranger, Brows...
2024.12
0
GPT-4o
Scaffold=Ranger, Brows...
2024.12
0
o1
Scaffold=Ranger, Mitig...
2024.12
0
o1-preview
Scaffold=Ranger, Mitig...
2024.12
0
o1-mini
Scaffold=Ranger, Mitig...
2024.12
0
o1
Mitigation status=post...
2024.12
0
o1-preview
Mitigation status=post...
2024.12
0
o1-mini
Mitigation status=post...
2024.12
0
o1
Mitigation status=pre-...
2024.12
0
o1-preview
Mitigation status=pre-...
2024.12
0
o1-mini
Mitigation status=pre-...
2024.12
0
Feedback
Search any
task
Search any
task