Language Modeling on Logic (val)
[Chart: Perplexity over time. Plotted point: MPPA, 131.95, Apr 9, 2026]
Evaluation Results
Method                    Perplexity
MPPA (2026.04)            131.95
Baseline GPT (2026.04)    132.43