SOTA Long Video Question Answering benchmarks and papers with code

Benchmarks

Dataset Name	SOTA Method	Metric	Value	Count	Last Updated
MLVU	InternVL-3-78B	M-Avg	79.5	46	1mo ago
LongVideoBench (val)	GOPAgen	Accuracy	73.2	46	1mo ago
LVBench	Deep Video Discovery	All Score	74.2	31	1mo ago
Video-MME	ToolMerge	Accuracy	73.2	30	2mo ago
MovieChat-1K Breakpoint Mode (test)	HierarQ	Accuracy	76.4	24	4mo ago
MovieChat-1K Global Mode (test)	HierarQ	Accuracy	87.5	24	4mo ago
LV-Bench (val)	Video-RAG	Overall Accuracy	66.4	20	3mo ago
EgoSchema (full set)	Dispider	Accuracy	55.6	17	4mo ago
Video-MME Long without Subtitles	Video-RAG	Overall Accuracy	62.3	16	3mo ago
VideoMME Long	GOPAgen	Accuracy (Long)	69.7	14	17d ago
Video-MME w/o subtitles		Accuracy	0.818	14	4mo ago
Video-MME (val)	Gemini-1.5-Pro	Accuracy	75	12	4mo ago
MLVU multiple-choice questions	VideoXL	Accuracy	0.649	12	4mo ago
MLVU 3–120 min	Qwen2.5-VL-7B	Accuracy	66.9	11	4mo ago
MMBench-Video (val)	InternVL2.5-78B	Score	1.97	11	4mo ago
VideoMME long 30–60 min	WAT	Accuracy	50.8	10	4mo ago
TemporalBench	GPT-4o	Binary Accuracy	73.2	9	4mo ago
TVQA Long	LLaVA-Video + OneClip-RAG	Overall Accuracy	52.1	6	4mo ago
GLVC (test)	VideoDetective	Score	69	6	4mo ago
CASTLE Challenge CVPR 2026 EgoVis Workshop		Accuracy	58	5	1mo ago
QaEgo4D	LLaVA-Video + OneClip-RAG	Score	1.71	5	4mo ago
