Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Others

Benchmarks

Task NameDataset NameSOTA ResultTrend
Image ClassificationOthers
Accuracy93.91
20
Function CallingOthers (SealTools, OpenFunc, ToolAlpaca)
Overall Accuracy87.4
12
Natural Language UnderstandingOthers (WG, Yelp, SciTail, PAWS)
WG Accuracy62.3
11
Image ClassificationOthers
CIFAR-10 Accuracy98.8
10
Multiple-Choice Question AnsweringOTHERS 4-Choice
Delta Acc0
6
Showing 5 of 5 rows