Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Clinical trial outcome prediction on Arm Perturbation
Loading...
72.25
Macro F1
Qwen-3.5-9B Fine-Tuned
64.1484
66.2517
68.355
70.4583
Apr 24, 2026
Macro F1
Weighted Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro F1
Weighted Accuracy
Qwen-3.5-9B Fine-Tuned
Fine-tuned=true
2026.04
72.25
74.72
o3-mini-2025-01-31
2026.04
64.46
63.4
Feedback
Search any
task
Search any
task