Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deductive logical reasoning on ProverQA OOD hard subset 500 records (test)
Loading...
-
Error Rate
No plottable results for Error Rate (PERCENT).
Metric
Error Rate (PERCENT)
Accuracy (PERCENT)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Error Rate
Accuracy
No evaluation results found.
Feedback
Search any
task
Search any
task