Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Correlation with Human Judgment on NLV and ChartLLM (171 sampled examples)
Loading...
0.73
Pearson Correlation
MatPlotBench
0.4492
0.5221
0.595
0.6679
Jan 21, 2026
Pearson Correlation
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pearson Correlation
MatPlotBench
Scope=All examples
2026.01
0.73
Vision Score
Scope=All examples
2026.01
0.71
Spec Score
Scope=Vega-Lite exampl...
2026.01
0.65
SEVQ
Scope=All examples
2026.01
0.46
Feedback
Search any
task
Search any
task