Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
post-votes regression on rel-stack
Loading...
0.437
R2
Entity Mean
0.07612
0.16981
0.2635
0.35719
Oct 7, 2025
R2
Updated 1mo ago
Evaluation Results
Method
Method
Links
R2
Entity Mean
Target DB in pretraini...
2025.10
0.437
RT (ours)
Target DB in pretraini...
2025.10
0.35
RT (ours)
Target DB in pretraini...
2025.10
0.339
Griffin
Target DB in pretraini...
2025.10
0.274
Griffin
Target DB in pretraini...
2025.10
0.271
Gemma
Target DB in pretraini...
2025.10
0.09
Gemma
Target DB in pretraini...
2025.10
0.09
Gemma
Target DB in pretraini...
2025.10
0.09
Feedback
Search any
task
Search any
task