RAG-Match: Retrieval-Augmented Knowledge Injection and Hierarchical Reasoning for Calibrated Semantic Relevance
2026-05-25 • Information Retrieval
Information Retrieval
AI summaryⓘ
The authors developed a new method called RAG-Match to improve how search engines decide what results are most relevant. Their approach uses three steps: adding extra knowledge, teaching the system to think through reasoning steps, and fine-tuning decisions in tricky cases. This helps the model better understand subtle meanings and background information than older methods. Tests showed that their method worked better than strong existing models on real search tasks.
semantic relevanceknowledge-intensive searchpretraininghierarchical reasoningdecision calibrationLLM baselinesranking metricspreference optimization
Authors
Hengjun Jiang, Liansheng Sun, Yan Jiang, Xiaojie Ke, Yongjin Wang, Xiangkun Liu, Cunxin Gu, Jian Xu, Guanjun Jiang
Abstract
Semantic relevance judgment for search is particularly challenging in knowledge-intensive scenarios, where accurate ranking requires not only semantic matching but also background grounding, multi-step reasoning, and well-calibrated decision boundaries. Existing relevance models mainly rely on direct label supervision or shallow semantic similarity, which limits their ability to handle implicit intent, factual equivalence, and fine-grained relevance distinctions. To address this issue, we propose \textsc{RAG-Match}, a three-stage framework that integrates knowledge-augmented pretraining, hierarchical reasoning alignment, and preference-based decision calibration for relevance modeling. The key idea is to first strengthen query-centered semantic grounding, then align the model with structured relevance reasoning, and finally correct decision-level inconsistencies in difficult boundary cases. Experimental results on a real-world search relevance benchmark show that \textsc{RAG-Match} consistently outperforms strong LLM-based baselines across multiple ranking metrics, demonstrating the effectiveness of combining knowledge injection, reasoning supervision, and preference optimization for fine-grained relevance judgment.