Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Gpt4o

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multilabel ClassificationGpt4o 0.7
Weighted F10.474
3
Binary ClassificationGpt4o 0.7
Weighted F182.9
3
Multilabel ClassificationGpt4o 0.5
Weighted F10.489
3
Binary ClassificationGpt4o 0.5
Weighted F183.2
3
Multiclass ClassificationGpt4o 0.7
Weighted F147.6
2
Multiclass ClassificationGpt4o 0.5
Weighted F10.481
2
Showing 6 of 6 rows