Share your thoughts, 1 month free Claude Pro on usSee more

Commonsense reasoning on WinoGrande 1.0 (test)

0.8137Accuracy

Mistral-7B + DSIR

Updated 1mo ago

Evaluation Results

Method	Links
Mistral-7B + DSIR 2024.02		0.8137
Mistral-7B Base 2024.02		0.8122
Mistral-7B + Uniform 2024.02		0.8019
Mistral-7B + QuRating 2024.02		0.8011
Mistral-7B + AutoDS 2024.02		0.8003
LLaMA2-7B Base 2024.02		0.7585
LLaMA2-7B + DSIR 2024.02		0.7537
LLaMA2-7B + Uniform 2024.02		0.753
LLaMA2-7B + QuRating 2024.02		0.7466
LLaMA2-7B + AutoDS 2024.02		0.7451
Gemma-2B + DSIR 2024.02		0.6661
Gemma-2B + AutoDS 2024.02		0.6661
Gemma-2B + Uniform 2024.02		0.6638
Gemma-2B + QuRating 2024.02		0.6638
Gemma-2B Base 2024.02		0.6054