Multi-label Topic Classification on Cell. phone
Best Micro F1: 76.3 (AK)
[Chart: Micro F1 over time; y-axis ticks from 51.028 to 70.711; date label May 28, 2026]
Updated 2d ago
Evaluation Results

Method                               Date      Micro F1
AK: Model=LLaMA 3.3-70B, G...        2026.05   76.3
AK: Model=GPT-OSS 20B, Gra...        2026.05   75.2
Baseline: Source=Sarkar et al.,...   2026.05   52
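
For readers unfamiliar with the metric, the sketch below shows how micro-averaged F1 is commonly computed for multi-label classification: true positives, false positives, and false negatives are pooled across all documents and labels before a single F1 is taken. This is an illustration only, not the benchmark's evaluation script. The function name `micro_f1`, the toy label sets, and the assumption that leaderboard scores are reported on a 0 to 100 scale are all hypothetical.

```python
from typing import Sequence, Set


def micro_f1(gold: Sequence[Set[str]], pred: Sequence[Set[str]]) -> float:
    """Micro-averaged F1: pool TP/FP/FN over every document and label."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        tp += len(g & p)  # labels predicted and correct
        fp += len(p - g)  # labels predicted but not in the gold set
        fn += len(g - p)  # gold labels that were missed
    denom = 2 * tp + fp + fn
    # Convention chosen here: return 0.0 when there is nothing to score.
    return 2 * tp / denom if denom else 0.0


# Hypothetical toy data, not taken from the benchmark.
gold = [{"battery", "screen"}, {"price"}, {"camera", "price"}]
pred = [{"battery"}, {"price", "screen"}, {"camera", "price"}]

score = micro_f1(gold, pred)
print(f"{100 * score:.1f}")  # prints 80.0 (TP=4, FP=1, FN=1)
```

Because counts are pooled before averaging, frequent labels weigh more heavily in micro F1 than rare ones, which is worth keeping in mind when comparing rows in the table.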