Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Research Idea Evaluation on D_point (randomly sampled 60 instances)

0.8848Rationality Win Rate

InnoEval

0.664320.721560.77880.83604Feb 16, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.02
0.88480.09220.92170.02760.93090.06910.89770.09770.9070.0783
2026.02
0.8710.11980.87560.11520.92630.06450.8710.09680.90320.0837
2026.02
0.86180.12440.86180.07370.90320.08290.88940.09680.89860.0829
2026.02
0.83410.15670.8710.09220.91240.07370.82030.16590.85710.129
2026.02
0.67280.28110.61750.32720.70510.28110.84790.14290.71890.2581