Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CPJ: Explainable Agricultural Pest Diagnosis via Caption-Prompt-Judge with LLM-Judged Refinement

About

Accurate and interpretable crop disease diagnosis is essential for agricultural decision-making, yet existing methods often rely on costly supervised fine-tuning and perform poorly under domain shifts. We propose Caption--Prompt--Judge (CPJ), a training-free few-shot framework that enhances Agri-Pest VQA through structured, interpretable image captions. CPJ employs large vision-language models to generate multi-angle captions, refined iteratively via an LLM-as-Judge module, which then inform a dual-answer VQA process for both recognition and management responses. Evaluated on CDDMBench, CPJ significantly improves performance: using GPT-5-mini captions, GPT-5-Nano achieves \textbf{+22.7} pp in disease classification and \textbf{+19.5} points in QA score over no-caption baselines. The framework provides transparent, evidence-based reasoning, advancing robust and explainable agricultural diagnosis without fine-tuning. Our code and data are publicly available at: https://github.com/CPJ-Agricultural/CPJ-Agricultural-Diagnosis.

Wentao Zhang, Tao Fang, Lina Lu, Lifei Wang, Weihe Zhong• 2025

Related benchmarks

TaskDatasetResultRank
Crop ClassificationCDDMBench 1.0 (test)
Accuracy63.38
16
Crop Disease Knowledge QACDDMBench 1.0 (test)
QA Score84.5
16
Disease ClassificationCDDMBench 1.0 (test)
Accuracy33.7
16
Knowledge QACDDMBench
QA Accuracy84.5
15
Crop RecognitionCDDMBench
Accuracy63.38
15
Disease RecognitionCDDMBench
Accuracy33.7
15
Showing 6 of 6 rows

Other info

Follow for update