Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CharXiv

Benchmarks

Task NameDataset NameSOTA ResultTrend
Chart UnderstandingCharXiv
Reasoning Score60.2
44
Chart Understanding and ReasoningCharXiv
Score67.8
37
Chart Question AnsweringCharXiv
Reasoning Score76.4
33
Chart-based ReasoningCharXivRQ
Accuracy67.9
29
Scientific Chart UnderstandingCharXiv 1.0
RQ Score60.2
23
Scientific Figure ReasoningCharXiv
Accuracy48.9
17
Document and chart understandingCharXiv DQ
Pass@198.4
17
Visual ReasoningCharXiv (val)
Text in Chart Accuracy37.95
16
Scientific Chart InterpretationCharXiv Reasoning
Overall Score46.8
12
Chart ReasoningCharXiv
Accuracy74.2
12
Document Question AnsweringCharXiv RQ
Accuracy68.6
11
Document Question AnsweringCharXiv DQ
Accuracy91.3
11
Document UnderstandingCharXiv reas.
Accuracy0.686
11
OCR-related Understanding TasksCharXiv RQ
Accuracy60.2
11
Chart UnderstandingCharXiv RQ
Pass@177.9
10
Multimodal ReasoningCharXiv RQ
Generated Token Length6,448
9
Chart ReasoningCharXiv Reasoning Questions
Accuracy60.52
8
Scientific QACharXiv
CharXiv-D Score95.2
8
Chart ReasoningCHARXIV reasoning
Accuracy (All)64.2
7
Chart UnderstandingCharXiv DQ
Score73.9
7
Image CaptioningCharXiv
Prism37.3
7
Document and chart understandingCharXiv RQ
Pass@169.9
7
OCR-related Understanding TasksCharXiv (DQ)
Accuracy87.4
7
Visual instruction tuningCharXiv
RQ50.8
6
OCRCharXivRQ
Accuracy60.9
6
Showing 25 of 43 rows