Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Gpt4

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multilabel ClassificationGpt4 0.7
Weighted F160.3
3
Binary ClassificationGpt4 0.7
Weighted F185.4
3
Multilabel ClassificationGpt4 0.5
Weighted F10.609
3
Binary ClassificationGpt4 0.5
Weighted F183.8
3
Multiclass ClassificationGpt4 0.7
Weighted F10.604
2
Multiclass ClassificationGpt4 0.5
Weighted F155.9
2
Showing 6 of 6 rows