Summarization on TL;DR (Completeness, Groundedness, Relevance)

Top Completeness score: 43 (Gemini 2.5 Pro)

Updated 3mo ago

Evaluation Results

Method		Completeness		Links
Gemini 2.5 Pro 2025.12		43	42	-
Gemini 2.5 Flash 2025.12		40	40	-
GPT-OSS 120B 2025.12		40	39	-
Gemini 2.0 Flash 2025.12		39	41	-
Jury-on-Demand 2025.12		38	43	-
Claude 3.7 2025.12		37	39	-
GPT-OSS 20B 2025.12		34	29	-
Gemma 3 2025.12		14	42	-
LLAMA 3.2 2025.12		9	10	-
DeepSeek R1 2025.12		5	13	-
Phi 4 2025.12		1	11	-