Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Generation on M2LONGBENCH (test)

39.7Anchor Description

DEEP-REPORTER

-1.5889.13119.8530.569Apr 12, 2026
Updated 5d ago

Evaluation Results

MethodLinks
2026.04
39.739.739.747.837.238.543.341.727.136.436.828.732.337.94,700
2026.04
38.138.938.512.210.515.410.512.224.335.234.829.230.927.23,900
2026.04
36.438.837.639.230.432.834.434.211.324.32518.619.830.54,500
2026.04
3637.836.927.520.225.225.124.59.721.522.615.817.426.34,300
2026.04
35.637.736.636.436.437.736.436.733.232.830.427.53134.84,500
2026.04
34.838.136.430.424.328.727.127.623.525.123.925.924.629.64,900
2026.04
31.636.834.29.74.97.75.7719.830.83430.428.723.34,000
2026.04
24.727.926.33021.527.128.326.71525.923.921.121.524.85,200
2026.04
23.926.325.144.136.440.143.741.117.823.922.71920.9295,100
2026.04
22.325.323.87.75.77.36.56.810.120.719.414.616.215.63,200
2026.04
19.420.219.815.88.512.210.111.614.228.327.923.923.618.43,900
2026.04
17.423.520.48.13.66.52.45.210.922.716.621.91814.54,100
2026.04
16.616.616.69.35.37.36.57.112.625.521.119.819.714.54,200
2026.04
14.219.416.88.13.66.93.65.610.523.521.918.618.613.73,900
2026.04
1314.613.88.126.13.24.98.121.917.820.717.111.93,800
2026.04
4.97.36.111.76.510.58.99.44.913.89.37.38.88.13,500
2026.04
4.16.55.311.36.19.36.58.34.58.110.55.77.26.93,400
2026.04
0216.17.97.18.57.4006.923.17.55.31,900
2026.04
01.20.66.95.76.36.76.40010.124.38.65.22,000