Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Grounded Commonsense Inference on SWAG (test)

88Accuracy

Human (5 annotations)

51.28860.81970.3579.881Oct 11, 2018
Updated 1mo ago

Evaluation Results

MethodLinks
2018.10
88
2018.10
86.3
2018.10
85
2018.10
78
2018.10
59.2
2018.10
52.7