Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reactive Question Answering on StreamingBench excluding PO 1.0 (test)

76.96Overall Performance (OP)

ROMA

23.847237.636151.42565.2139Jan 15, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
76.9678.9177.9282.0574.8472.982.4161.7965.9151.0640.434.850.458.837.63444.47
2026.01
76.6982.0378.8682.0574.8472.979.6359.7664.4950.5338.829.646.451.633.227.622.8
2026.01
76.1370.4974.1482.472.8670.884.7863.264.9151.6939.7530.3645.8747.235.2724.7924.8
2026.01
75.6181.2576.9782.3771.775.0881.4862.265.625039.630.859.3852.435.62826.4
2026.01
75.6180.4779.1882.3773.5875.0882.4163.165.3447.3439.628.844.451.635.628.426.4
2026.01
75.5185.7176.1978.2359.7761.0573.216059.6723.3838.82640.847.235.627.226.8
2026.01
74.9275.5374.173.0874.4459.5276.1462.9162.1645.835.4625.2638.5743.3439.6227.6533.61
39.0740.0634.4931.0545.9632.431.4834.1642.4927.8931.226.5124.13224.1929.226.55
25.8943.5724.9123.8727.3313.0818.5225.223.8748.725.9124.925.628.424.825.224.12