CogGen: A Cognitively Inspired Recursive Framework for Deep Research Report Generation

About

The autonomous synthesis of deep research reports represents a critical frontier for Large Language Models (LLMs), demanding sophisticated information orchestration and non-linear narrative logic. Current approaches rely on rigid predefined linear workflows, which cause error accumulation, preclude global restructuring from subsequent insights, and ultimately limit in-depth multimodal fusion and report quality. We propose CogGen, a Cognitively inspired recursive framework for deep research report Generation. Leveraging a Hierarchical Recursive Architecture to simulate cognitive writing, CogGen enables flexible planning and global restructuring. To extend this recursivity to multimodal content, we introduce Abstract Visual Representation (AVR): a concise intent-driven language that iteratively refines visual-text layouts without pixel-level regeneration overhead. We further present CLEF, a Cognitive Load Evaluation Framework, and curate a new benchmark from Our World in Data (OWID). Extensive experiments show CogGen achieves state-of-the-art results among open-source systems, generating reports comparable to professional analysts' outputs and surpassing Gemini Deep Research. Our code and dataset are available at https://github.com/NJUNLP/CogGen.

Kuo Tian, Pengfei Sun, Zhen Wu, Junran Ding, Xinyu Dai• 2026

Related benchmarks

Task	Dataset	Result
Multimodal Content Generation	Human Evaluation N=20 (test)	Win Count19	8
Multimodal Report Generation	WildSeek Text-Centric Complex Queries	Organization53.89	6
Multimodal Report Generation	OWID High-Density Multimodal Reports	Organization49.72	6
Content Quality Evaluation	10 cross-domain reports (test)	Organization50	3

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord