Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Recommendation on GeneralRec
Loading...
8.77
Performance Gap
PaperRepro
6.0068
24.6584
43.31
61.9616
Mar 2, 2026
Performance Gap
Updated 1mo ago
Evaluation Results
Method
Method
Links
Performance Gap
PaperRepro
2026.03
8.77
AutoReproduce
2026.03
28.34
DeepCode
2026.03
28.66
Paper2Code
2026.03
32.11
OpenHands
2026.03
58.77
ReAct
2026.03
77.85
Feedback
Search any
task
Search any
task