Multiple Choice Question Answering on PathMMU (val)

62.98 Overall Accuracy

Patho-R1-7B

[Chart: Overall Accuracy over time, May 16, 2025 to May 22, 2026]
Updated 9d ago

Evaluation Results

Date     Overall  Per-subset scores
2025.05  62.98    82.5   63.01  41.67  63.95  64.67
2025.05  58.44    71.25  60.27  33.33  62.66  59.33
2025.05  58.16    67.5   60.96  50     59.66  53.33
2025.05  54.61    61.25  54.11  38.54  55.36  60.67
2026.05  53.6     -      -      -      -      -
2026.05  52.84    -      -      -      -      -
2026.05  50.62    -      -      -      -      -
2025.05  49.79    52.5   45.89  40.63  52.36  54
2025.05  47.8     46.25  52.05  33.33  47.64  54
2026.05  45.84    -      -      -      -      -
2025.05  45.82    46.25  42.47  35.42  47.64  52.67
2025.05  45.67    52.5   45.89  28.13  51.07  44.67
2026.05  42.18    -      -      -      -      -
2025.05  40.28    41.25  41.1   23.96  42.92  45.33
2025.05  38.44    43.75  34.93  29.17  39.91  42.67
2026.05  38.16    -      -      -      -      -
2025.05  37.45    46.25  36.3   32.29  36.05  39.33
2026.05  37.42    -      -      -      -      -
2026.05  36.18    -      -      -      -      -
2025.05  33.05    45     32.88  17.71  34.33  34.67
2026.05  32.18    -      -      -      -      -
2025.05  29.08    36.25  28.77  17.71  28.33  34
2025.05  27.23    22.5   27.4   16.67  31.33  30
2026.05  26.5     -      -      -      -      -
2025.05  25.25    36.25  26.03  7.29   27.9   26
-        25       -      -      -      -      -
2025.05  23.4     20     19.18  16.67  30.04  23.33
2025.05  17.87    22.5   20.55  7.29   18.88  18
2025.05  1.28     1.25   1.37   0      2.15   0.67