Multi-Label Verdict Prediction on PUBHEALTH supplementary experiments
[Chart: OI by method. Highlighted point: Llama3-8B, OI 2.786 (May 31, 2026). Metrics shown: OI, PR, CR.]
Updated 1d ago
Evaluation Results
Method     Configuration              Date     OI     PR     CR
Llama3-8B  Evaluation Scenario=Co...  2026.05  2.786  -      73.59
Phi-4      Evaluation Scenario=Pe...  2026.05  2.389  29.5   -
Llama3-8B  Evaluation Scenario=Pe...  2026.05  2.256  30.71  -
Phi-4      Evaluation Scenario=Co...  2026.05  1.534  -      60.53
Llama3-8B  Evaluation Scenario=Pe...  2026.05  1.524  39.62  -
Llama3-8B  Evaluation Scenario=Co...  2026.05  1.242  -      55.39
Phi-4      Evaluation Scenario=Co...  2026.05  1.237  -      55.29
Phi-4      Evaluation Scenario=Pe...  2026.05  0.832  54.6   -