Natural Language Inference on RTE (0-shot and 32-shot settings)

Best result: 84.8 accuracy (0-shot), OPT-IML 175B

[Chart: accuracy over time, Dec 22, 2022 to Oct 2, 2025]
Updated 2d ago

Evaluation Results

Date     Accuracy (0-shot)  Accuracy (32-shot)
2022.12  84.8               83.8
2022.12  83.8               73.3
2022.12  66.8               48.7
2025.10  65.34              -
2025.10  64.26              -
2025.10  63.18              -
2025.10  62.82              -
2025.10  60.65              -
2025.10  60.65              -
2022.12  60.3               71
2025.10  60.29              -
2025.10  59.93              -
2025.10  59.57              -
2022.12  58.1               61.7
2025.10  57.76              -
2025.10  57.4               -
2025.10  55.96              -
2025.10  55.6               -
2022.12  54.2               47.3
2025.10  53.79              -
2025.10  53.43              -
2025.10  53.43              -
2025.10  53.07              -
2025.10  53.07              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.71              -
2025.10  52.35              -
2025.10  46.57              -