Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
post-votes on rel-stack (test)
Loading...
43.7
R^2
Entity Mean
26.436
30.918
35.4
39.882
Oct 7, 2025
R^2
Updated 1mo ago
Evaluation Results
Method
Method
Links
R^2
Entity Mean
Zero-shot=true, Target...
2025.10
43.7
RT
Zero-shot=true, Target...
2025.10
35
RT
Zero-shot=true, Target...
2025.10
33.9
Griffin
Zero-shot=true, Target...
2025.10
27.4
Griffin
Zero-shot=true, Target...
2025.10
27.1
Feedback
Search any
task
Search any
task