GPT series models

Benchmarks

Task Name      Dataset Name                        SOTA Result    Trend
LLM Routing    GPT series models Out of Domain     Accuracy 82    8
LLM Routing    GPT series models (In Domain)       Accuracy 97    8