Chain-of-Thought Quality Evaluation on NIH ChestX-ray14
[Chart: scores per method on Causal Support, Spatial Localization, and Factuality. Highlighted point: CheXthought-CoT, Causal Support 4.36, Apr 29, 2026.]
Updated 1mo ago
Evaluation Results
| Method | Method details | Date | Causal Support | Spatial Localization | Factuality |
|---|---|---|---|---|---|
| CheXthought-CoT | visual attention train... | 2026.04 | 4.36 | 4 | 4.69 |
| CheXthought-VLM | visual attention train... | 2026.04 | 4.08 | 4.33 | 4.5 |
| Qwen3-VL-8B-Thinking | | 2026.04 | 3.83 | 3.89 | 3.53 |
| Synthetic-CoT | | 2026.04 | 3.78 | 3.43 | 3.05 |
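The per-metric scores above can be collapsed into a single number for a quick comparison. The following is a minimal sketch that assumes an unweighted mean over the three metrics; the benchmark page does not state any aggregation rule, so both the averaging and the helper name `rank_by_mean` are illustrative assumptions rather than part of the benchmark.

```python
from statistics import mean

# Scores copied from the Evaluation Results listing, in the order
# (Causal Support, Spatial Localization, Factuality).
SCORES = {
    "CheXthought-CoT": (4.36, 4.0, 4.69),
    "CheXthought-VLM": (4.08, 4.33, 4.5),
    "Qwen3-VL-8B-Thinking": (3.83, 3.89, 3.53),
    "Synthetic-CoT": (3.78, 3.43, 3.05),
}


def rank_by_mean(scores):
    """Return (method, mean score) pairs, highest mean first.

    ASSUMPTION: an unweighted mean of the three metrics. The benchmark
    itself does not define an overall score.
    """
    averaged = {name: mean(values) for name, values in scores.items()}
    return sorted(averaged.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for name, avg in rank_by_mean(SCORES):
        print(f"{name}: {avg:.2f}")
```

Under this assumed aggregation the ordering matches the listing order, but a single average hides per-metric differences: CheXthought-VLM scores higher than CheXthought-CoT on Spatial Localization (4.33 vs. 4) while scoring lower on the other two metrics.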