Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Disease Classification on CDDMBench
Loading...
33.7
Accuracy
Agri-CPJ (+ LLM-as-a-Judge)
3.852
11.601
19.35
27.099
Apr 26, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Agri-CPJ (+ LLM-as-a-Judge)
Model=GPT-5-Nano, Capt...
2026.04
33.7
Agri-CPJ (+ LLM-as-a-Judge)
Model=GPT-5-Nano, Capt...
2026.04
32.8
Agri-CPJ (+ Caption (Optimized))
Model=GPT-5-Nano, Capt...
2026.04
31.6
Agri-CPJ (+ Caption (Optimized))
Model=GPT-5-Nano, Capt...
2026.04
31.4
Agri-CPJ (+ Few-shot)
Model=GPT-5-Nano, Capt...
2026.04
31
Agri-CPJ (+ Few-shot)
Model=GPT-5-Nano, Capt...
2026.04
29.8
Agri-CPJ (+ LLM-as-a-Judge)
Model=Qwen-VL-Chat, Ca...
2026.04
25.39
Agri-CPJ (+ Few-shot)
Model=Qwen-VL-Chat, Ca...
2026.04
24.49
Agri-CPJ (+ LLM-as-a-Judge)
Model=Qwen-VL-Chat, Ca...
2026.04
14.8
Agri-CPJ (+ Few-shot)
Model=Qwen-VL-Chat, Ca...
2026.04
14.5
Agri-CPJ (+ Caption (Optimized))
Model=Qwen-VL-Chat, Ca...
2026.04
12.1
Zero-shot (Our Baseline)
Model=GPT-5-Nano, Capt...
2026.04
11
Zero-shot (Our Baseline)
Model=GPT-5-Nano, Capt...
2026.04
11
Agri-CPJ (+ Caption (Optimized))
Model=Qwen-VL-Chat, Ca...
2026.04
7.17
Zero-shot (Our Baseline)
Model=Qwen-VL-Chat, Ca...
2026.04
5.8
Zero-shot (Liu et al., 2024) Baseline
Model=Qwen-VL-Chat, Ca...
2026.04
5
Feedback
Search any
task
Search any
task