Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning

About

Charts play a vital role in data visualization, understanding data patterns, and informed decision-making. However, their unique combination of graphical elements (e.g., bars, lines) and textual components (e.g., labels, legends) poses challenges for general-purpose multimodal models. While vision-language models trained on chart data excel in comprehension, they struggle with generalization. To address these challenges, we propose ChartAssistant, a chart-based vision-language model for universal chart comprehension and reasoning. ChartAssistant leverages ChartSFT, a comprehensive dataset covering diverse chart-related tasks with basic (e.g. bars and pies) and specialized (e.g. radars, and bubbles) chart types. It undergoes a two-stage training process, starting with pre-training on chart-to-table parsing to align chart and text, followed by multitask instruction-following fine-tuning. This approach enables ChartAssistant to achieve competitive performance across various chart tasks. Experimental results demonstrate significant performance gains over the state-of-the-art UniChart and Chartllama method, especially outperforming them on real-world chart data with zero-shot setting. The code and data are available at https://github.com/OpenGVLab/ChartAst.

Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo• 2024

Related benchmarks

TaskDatasetResultRank
Chart Question AnsweringChartQA (test)--
129
Chart UnderstandingChartX
GPT Score0.82
15
Chart-to-TableChartQA (test)
RMSF185.62
12
Factual Inconsistency DetectionCHOCOLATE LVLM
Kendall's Tau0.015
9
Factual Inconsistency DetectionCHOCOLATE LLM
Kendall's Tau0.057
9
Factual Inconsistency DetectionCHOCOLATE (FT)
Kendall's Tau0.036
9
Chart ParsingChartP-Bench Easy Subset
AP Strict0.00e+0
8
Chart ParsingChartP-Bench Hard Subset
AP Strict0.00e+0
8
Chart ParsingChartP-Bench
Average Score1.55
8
Chart ParsingPlotQA SE
AP-Strict5.18
5
Showing 10 of 12 rows

Other info

Follow for update