
Sentence completion on HellaSwag (test)

72.35 Accuracy

Coherence Boosting (GPT-3 175B)

[Chart: Accuracy over time, Oct 15, 2021 to May 24, 2025]
Updated 1mo ago

Evaluation Results

Accuracy  Date
72.35     2021.10
62.66     2021.10
59.18
47.66     2021.10
42.6      2021.10
40        2025.05
34.27     2025.05
33.2      2025.05
32.55
31.84     2025.05
31.62     2025.05
31.36     2021.10
30.99     2025.05
30.75     2021.10
28.92