Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-label biomedical classification on PubMed (val)
Loading...
75.87
Macro F1
EvoPool
-2.3692
17.9429
38.255
58.5671
Jun 1, 2026
Macro F1
Updated 1d ago
Evaluation Results
Method
Method
Links
Macro F1
EvoPool
Backbone=gpt-4o-mini
2026.06
75.87
LLM annotation
Backbone=gpt-4o-mini
2026.06
44.1
Alchemist
Backbone=gpt-4o-mini
2026.06
14.98
DataSculpt
Backbone=gpt-4o-mini
2026.06
0.64
Feedback
Search any
task
Search any
task