Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Tonsil

Benchmarks

Task NameDataset NameSOTA ResultTrend
coding-reasoningTonsil (test)
Success Rate58.55
18
Spatial domain detectionTonsil
Accuracy72.9
7
ClusteringTonsil
ARI0.62
4
Showing 3 of 3 rows