
Natural Language Inference on RTE

Top accuracy: 93.5


[Chart: accuracy over time, Mar 21, 2022 to May 14, 2026]
Updated 5d ago

Evaluation Results

Date     Accuracy  Method  Links
2026.04  93.5
2026.04  92.8
2026.04  92.6
2026.04  92.4
2026.04  92.4
2026.04  92.1
2026.04  92.03
2026.04  91.9
2026.04  91.3
2026.04  91.3
2026.04  91.06
2026.04  91.06
2026.04  91.06
2026.04  90.6
2026.04  90.6
2026.04  90.58
2023.10  90
2026.04  89.86
2026.04  89.86
2026.04  89.86
2023.08  89.8
2026.04  89.5
2026.04  89.37
2026.04  89.13
2026.04  88.65
-        88.6
2026.04  87.68
2024.05  87
2024.05  86.64
2022.03  86.3
2024.05  86.28
2024.05  86.28
2024.05  86.28
2024.05  85.56
2024.05  85.56
2024.05  85.5
2024.05  85.2
2023.08  85.2
2024.05  85.19
2024.05  84.84
2022.03  84.8
2024.05  84.48
-        84.48
2024.05  84.12
2024.05  83.75
2024.05  83.75
2024.05  83.6
2024.05  83.6
2025.11  83.6
2026.05  83.5
2024.05  83.3
2026.05  83.3
2026.05  83.3
2026.05  83.1
2024.05  82.7
2024.05  82.31
2025.07  82.31
2025.07  82.31
2022.12  82.17
2022.12  82.1
2023.10  82
2025.07  81.95
2023.05  81.9
2023.11  81.8
2024.05  81.8
2024.05  81.5
2024.05  81.4
2026.05  81.3
2024.05  81.23
2024.05  81.2
2024.05  81.2
2024.05  81.2
2024.05  81.1
2024.05  81.1
2022.12  80.9
2022.12  80.9
2025.07  80.87
2023.02  80.83
2026.05  80.7
2024.05  80.2
2023.11  80.1
2022.12  79.9
2022.12  79.8
2025.07  79.78
2023.05  79.3
2023.11  79.3
2022.03  79.1
2023.05  78.7
2023.05  78.7
2023.11  78.7
2023.11  78.7
2025.11  78.7
2025.07  78.34
2022.12  78.3
2024.05  78.23
2024.05  78
2025.07  77.98
2025.07  77.62
2025.06  77.62
2025.06  77.13
Showing 100 of 590 rows