QNP abstraction generation on Spanner domain (test)
[Chart: Coverage Rate over time; series shown: GPT; date shown: Feb 11, 2026]
Updated 3mo ago
Evaluation Results

Method     Setting                     Date      Coverage Rate
GPT        Automated Debugging=true    2026.02   0
GPT        Automated Debugging=false   2026.02   0
Gemini     Automated Debugging=true    2026.02   0
Gemini     Automated Debugging=false   2026.02   0
DeepSeek   Automated Debugging=true    2026.02   0
DeepSeek   Automated Debugging=false   2026.02   0
Qwen       Automated Debugging=true    2026.02   0
Qwen       Automated Debugging=false   2026.02   0