Share your thoughts, 1 month free Claude Pro on usSee more

Long Context Retrieval on NIAH-Multi

100Accuracy

Kimi-K2

Updated 5mo ago

Evaluation Results

Method	Links
Kimi-K2 2026.01		100
MiMo-V2-Flash 2026.01		99.9
Kimi-K2 2026.01		99.8
DeepSeek-V3.1 2026.01		99.7
Kimi-K2 2026.01		99.5
MiMo-V2-Flash 2026.01		99.3
MiMo-V2-Flash 2026.01		98.6
DeepSeek-V3.1 2026.01		98.6
DeepSeek-V3.1 2026.01		97.2
MiMo-V2-Flash 2026.01		96.7
DeepSeek-V3.2 2026.01		94.3
DeepSeek-V3.2 2026.01		85.9
DeepSeek-V3.2 2026.01		85.6