Large Model Performance Prediction on Benchmark-side Pattern Shift Math

46.59 Average Score

CPMF

Feb 12, 2026
Updated 4d ago

Evaluation Results

Method  Links
2026.02  46.59  36.45  -10.14  -  -  -  -  -
2026.02  46.59  36.45  -  48.63  44.55  53.89  52.63  2.82
2026.02  42.08  40.36  -1.72  -  -  -  -  -
2026.02  42.08  40.36  -  44.21  39.94  61.33  57.64  2.11
2026.02  13.01  60.66  47.65  -  -  -  -  -
2026.02  13.01  60.66  -  14.7  11.31  86.09  77.18  18.7