Share your thoughts, 1 month free Claude Pro on usSee more

Insight Generation on Internal non-scientific document collections (Gemini/Claude Judged)

4.61Set-level Score (Gemini-2.5-Flash)

INSIGHTGEN

Updated 3mo ago

Evaluation Results

Method	Links
INSIGHTGEN 2026.04		4.61	3.61
INSIGHTGEN 2026.04		4.39	3.39
FAISS+CoT 2026.04		4.13	2.22
FAISS 2026.04		3.78	2.7
GPT+CoT 2026.04		3.48	1.87
FAISS+CoT 2026.04		3.26	2.35
GPT+CoT 2026.04		3.09	2.26
Direct GPT 2026.04		2.96	1.91
Direct GPT 2026.04		2.78	2.09
FAISS 2026.04		1.87	2.04