Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
QNP Abstraction Generation on Gripper domain (evaluation)
Loading...
85
Coverage Rate
GPT
-3.4
19.55
42.5
65.45
Feb 11, 2026
Coverage Rate
Updated 3mo ago
Evaluation Results
Method
Method
Links
Coverage Rate
GPT
Automated Debugging=true
2026.02
85
Gemini
Automated Debugging=true
2026.02
30
DeepSeek
Automated Debugging=true
2026.02
5
GPT
Automated Debugging=false
2026.02
0
Gemini
Automated Debugging=false
2026.02
0
DeepSeek
Automated Debugging=false
2026.02
0
Qwen
Automated Debugging=true
2026.02
0
Qwen
Automated Debugging=false
2026.02
0
Feedback
Search any
task
Search any
task