Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Short-form generation on DiffTask Short-form generation
Loading...
0.64
PRR
MSP
-0.0776
0.1087
0.295
0.4813
Apr 13, 2026
PRR
Updated 5d ago
Evaluation Results
Method
Method
Links
PRR
MSP
Model=Gemma-2-9B
2026.04
0.64
HBO
Model=Gemma-2-9B
2026.04
0.64
MSP
Model=Llama 3.1-8B
2026.04
0.57
HBO
Model=Llama 3.1-8B
2026.04
0.57
HUQ-SATRMD
Model=Gemma-2-9B
2026.04
0.38
SAPLMA (mid)
Model=Llama 3.1-8B
2026.04
0.35
SAPLMA (mid)
Model=Gemma-2-9B
2026.04
0.3
HUQ-SATRMD
Model=Llama 3.1-8B
2026.04
0.26
HUQ-SATMD
Model=Llama 3.1-8B
2026.04
0.24
SATRMD-MSP
Model=Gemma-2-9B
2026.04
0.24
HUQ-SATMD
Model=Gemma-2-9B
2026.04
0.23
SATRMD-MSP
Model=Llama 3.1-8B
2026.04
0.2
SATMD-MSP
Model=Llama 3.1-8B
2026.04
0.15
SATMD-MSP
Model=Gemma-2-9B
2026.04
-0.05
Feedback
Search any
task
Search any
task