Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Gpt4o

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multilabel ClassificationGpt4o 0.7
Weighted F10.474
3
Binary ClassificationGpt4o 0.7
Weighted F182.9
3
Multilabel ClassificationGpt4o 0.5
Weighted F10.489
3
Binary ClassificationGpt4o 0.5
Weighted F183.2
3
Multiclass ClassificationGpt4o 0.7
Weighted F147.6
2
Multiclass ClassificationGpt4o 0.5
Weighted F10.481
2
Showing 6 of 6 rows