Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Coding Reasoning on Pancreas (test)

39.28Success Rate

CodeCytos + Few Shot

-1.16569.334719.83530.3353May 30, 2026
Updated 1d ago

Evaluation Results

MethodLinks
2026.05
39.2860.1864.8664.7657
2026.05
33.6758.562.7962.9355
2026.05
32.4457.2260.3260.4153
2026.05
27.153.1854.6954.749
2026.05
23.6550.4957.1657.3346
2026.05
17.6241.5846.5746.5238
2026.05
14.494351.0251.0338
2026.05
4.0215.1420.1520.0313
2026.05
4.0114.1818.2618.2813
2026.05
3.7814.8520.7620.813
2026.05
1.265.648.148.175
2026.05
1.195.27.587.575
2026.05
1.185.27.567.565
2026.05
0.712.063.315.033
2026.05
0.632.693.753.752
2026.05
0.623.165.045.013
2026.05
0.52.233.163.162
2026.05
0.391.181.973.142