Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

WG-M

Benchmarks

Task NameDataset NameSOTA ResultTrend
Commonsense ReasoningWG-M
Accuracy76.45
18
Common-sense reasoningWG-M (In-Distribution)
Accuracy83.61
12
Showing 2 of 2 rows