
Multi-task (Overall) on USIM (test)

0.0359 APE

U0

[Chart: APE by date; y-axis ticks 0.025656, 0.094803, 0.16395, 0.233097; x-axis: Oct 9, 2025]
Updated 9d ago

Evaluation Results

Date      APE
2025.10   0.0359
2025.10   0.0374
2025.10   0.0861
2025.10   0.141
2025.10   0.1496
2025.10   0.1834
2025.10   0.292