AR-Med: Automated Relevance Enhancement in Medical Search via LLM-Driven Information Augmentation
About
Accurate and reliable search on online healthcare platforms is critical for user safety and service efficacy. Traditional methods, however, often fail to comprehend complex and nuanced user queries, limiting their effectiveness. Large language models (LLMs) present a promising solution, offering powerful semantic understanding to bridge this gap. Despite their potential, deploying LLMs in this high-stakes domain is fraught with challenges, including factual hallucinations, specialized knowledge gaps, and high operational costs. To overcome these barriers, we introduce \textbf{AR-Med}, a novel framework for \textbf{A}utomated \textbf{R}elevance assessment for \textbf{Med}ical search that has been successfully deployed at scale on the Online Medical Delivery Platforms. AR-Med grounds LLM reasoning in verified medical knowledge through a retrieval-augmented approach, ensuring high accuracy and reliability. To enable efficient online service, we design a practical knowledge distillation scheme that compresses large teacher models into compact yet powerful student models. We also introduce LocalQSMed, a multi-expert annotated benchmark developed to guide model iteration and ensure strong alignment between offline and online performance. Extensive experiments show AR-Med achieves an offline accuracy of over 93\%, a 24\% absolute improvement over the original online system, and delivers significant gains in online relevance and user satisfaction. Our work presents a practical and scalable blueprint for developing trustworthy, LLM-powered systems in real-world healthcare applications.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Medical Search Relevance Assessment | LocalQSMed v1.0 | HR Precision97.23 | 13 | |
| Medical Search Relevance Assessment | LocalQSMed Success-only v1.0 | Accuracy91.9 | 12 | |
| Medical Search Relevance Assessment | LocalQSMed HARD v1.0 | Accuracy0.6611 | 12 | |
| Relevance Classification | AR-Med | Highly Relevant Precision98.39 | 6 | |
| Search and Recommendation | Online Medical Delivery Platform Medical Channel scenario | Global Order Increase29 | 1 | |
| Search and Recommendation | Online Medical Delivery Platform Medical Main Search scenario | Global Order Increase0.35 | 1 | |
| Search and Recommendation | Online Medical Delivery Platform Medical Global scenario | UV_CXR0.16 | 1 |