Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PolarMem: A Training-Free Polarized Latent Graph Memory for Verifiable Multimodal Agents

About

As multimodal agents evolve from passive observers to long-horizon decision-makers, they require memory systems that provide not just information availability but logical verifiability. A fundamental limitation of current architectures is the epistemic asymmetry inherent in probabilistic vision-language models and dense associative memories: they conflate semantic affinity with factual existence and structurally fail to encode negative constraints. To this end, we introduce PolarMem, a training-free Polarized Latent Graph Memory designed to ground agent reasoning in verifiable evidence. PolarMem transforms fuzzy perceptual likelihoods into discrete logical constraints through non-parametric distributional partitioning. Furthermore, it employs a polarized graph topology with orthogonal inhibitory connections to explicitly store verified negation as a primary cognitive state. At inference time, we enforce a logic-dominant retrieval paradigm, suppressing hallucinatory patterns that violate negative constraints. Extensive evaluation across eight frozen Vision--Language Models and six benchmarks demonstrates that PolarMem functions as a robust cognitive system, establishing a foundation for verifiable multimodal agents. Our code is available at https://github.com/czs-ict/PolarMem.

Zhisheng Chen, Tingyu Wu, Zijie Zhou, Zhengwei Xie, Ziyan Weng, Yingwei Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Hallucination RobustnessHallusionBench
Score57.8
32
Multimodal Retrieval-Augmented GenerationMRAMG
Score35
32
Multimodal Retrieval-Augmented GenerationMRAG
Score70.8
32
Visual Retrieval-Augmented GenerationVisual-RAG
Score53.4
32
General ReasoningMMMU
Overall Score74.1
32
General ReasoningMMStar
Score68.4
32
Showing 6 of 6 rows

Other info

Follow for update