Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Clinical trial outcome prediction on Outcome Perturbation
Loading...
75.5
Macro F1
Qwen-3.5-9B Fine-Tuned
66.764
69.032
71.3
73.568
Apr 24, 2026
Macro F1
Weighted Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro F1
Weighted Accuracy
Qwen-3.5-9B Fine-Tuned
Fine-tuned=true
2026.04
75.5
74.81
o3-mini-2025-01-31
2026.04
67.1
64.71
Feedback
Search any
task
Search any
task