SOTA Incorrect Reasoning Path Detection benchmarks and papers with code | Wizwand
Incorrect Reasoning Path Detection
Benchmarks
Dataset Name   SOTA Method   Metric             Results   Last Updated
DeepScaleR     TokUR (AU)    Accuracy: 64.24    46        5d ago
GSM8K          DeepConf      Accuracy: 98.31    46        5d ago
MATH500        TokUR (TU)    Accuracy: 94       46        5d ago