Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Natural Language Visual Reasoning on NLVR2 (dev)

91.51Accuracy

BEIT-3

81.858884.364486.8789.3756Jan 2, 2021Sep 24, 2021Jun 17, 2022Mar 10, 2023Dec 1, 2023Aug 23, 2024May 16, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2022.08
91.51
2023.05
91.5
2023.01
91.5
2021.11
88.62
2023.05
87.8
2023.01
87.6
2023.01
86.3
2023.01
86.2
2022.05
86.1
2022.08
86.1
2023.05
86.1
2022.12
86.1
2023.01
86.1
2023.01
85.9
2021.11
85.64
2022.06
85.64
2022.05
85.64
2022.05
85.6
2022.11
85.2
2022.11
85
2022.11
84.6
2022.06
84.59
2022.05
84.58
2022.12
84.55
2021.11
84.53
2021.08
84.53
2021.11
84.53
2022.08
84.53
2022.06
84.53
2022.05
84.5
2023.05
84.5
2021.11
84.41
2022.06
84.41
2023.05
84.41
2022.11
84.3
2023.05
84.2
2023.01
84.2
2021.11
84.16
2021.08
84.13
2021.11
84.13
2023.05
84.13
2022.05
84.13
2022.11
84.1
2022.11
83.63
2022.11
83.5
2022.10
83.3
2022.11
83.3
2023.05
82.81
2023.01
82.8
2021.11
82.77
2022.06
82.77
2022.11
82.77
2023.05
82.77
2022.10
82.77
2022.05
82.7
2022.11
82.7
2021.01
82.67
2022.01
82.67
2021.11
82.67
2021.08
82.67
2021.11
82.67
2021.11
82.67
2022.08
82.67
2022.12
82.67
2022.12
82.67
2022.05
82.67
2023.12
82.67
2022.06
82.66
2023.12
82.66
2025.05
82.61
2022.05
82.6
2022.11
82.6
2021.07
82.55
2022.01
82.55
2021.11
82.55
2021.11
82.55
2022.08
82.55
2022.11
82.55
2023.05
82.55
2023.05
82.55
2022.12
82.55
2022.05
82.55
2022.10
82.55
2023.12
82.55
2023.12
82.52
2025.05
82.52
2023.01
82.5
2022.01
82.48
2023.05
82.48
2021.11
82.33
2021.11
82.33
2022.11
82.33
2023.05
82.33
2022.05
82.33
2022.10
82.33
2023.01
82.3
2021.11
82.23
2021.11
82.23
2022.06
82.23
2023.05
82.23
Showing 100 of 307 rows