
Long-context Understanding on HELMET 2025

61.44 Accuracy (8K Context)

MInference

Dec 16, 2025
Updated 4d ago

Evaluation Results

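| Method | Date | Accuracy (8K Context) | Col 2 | Col 3 | Col 4 | Col 5 | Col 6 | Col 7 |
|---|---|---|---|---|---|---|---|---|
|  | 2025.12 | 61.44 | 56.34 | 57.96 | 55.49 | 53.69 | 48.29 | 55.37 |
|  | 2025.12 | 61.37 | 48.65 | 58.34 | 56.22 | 55.27 | 50.22 | 56.21 |
|  | 2025.12 | 61.06 | - | 58.73 | 56.57 | 55.92 | 49.79 | 56.41 |
|  | 2025.12 | 60.86 | 61.82 | 57.7 | 56.21 | 54.24 | 48.91 | 55.58 |
|  | 2025.12 | 60.76 | 47.45 | 57.8 | 55.92 | 54.75 | 49.37 | 55.72 |
|  | 2025.12 | 60.66 | 38.52 | 58.05 | 56.22 | 55.44 | 49.68 | 56.01 |
|  | 2025.12 | 59.81 | 59.8 | 57.4 | 55.55 | 53.74 | 47.74 | 54.85 |
|  | 2025.12 | 55.89 | 65.92 | 54.73 | 53.14 | 50.45 | 48.42 | 52.53 |
|  | 2025.12 | 55.1 | - | 50.54 | 48.51 | 44.85 | 39.33 | 47.67 |
|  | 2025.12 | 54.91 | 46.95 | 50.36 | 46.93 | 42.62 | 34.05 | 45.77 |
|  | 2025.12 | 53.93 | 47.31 | 49.59 | 47.1 | 44.65 | 38.19 | 46.69 |
|  | 2025.12 | 53.91 | 47.51 | 49.96 | 47.76 | 44.95 | 38.58 | 47.03 |
|  | 2025.12 | 53.13 | 60.84 | 48.62 | 46.93 | 44.22 | 38.11 | 46.2 |
|  | 2025.12 | 52.77 | 37.75 | 48.91 | 46.39 | 43.06 | 36.6 | 45.55 |
|  | 2025.12 | 52.46 | 59.08 | 47.99 | 45.63 | 43.17 | 36.92 | 45.23 |
|  | 2025.12 | 44.23 | 63.03 | 40.79 | 39.87 | 36.28 | 30.06 | 38.25 |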