Accuracy, Time, and Token Evaluation on MedQA (Medical Reasoning)

79.3 Accuracy

RecursiveMAS

Apr 28, 2026
Updated 1mo ago

Evaluation Results

Date     Accuracy  Time   Tokens
2026.04  79.3      1,912  1,056
2026.04  78.3      1,664  1,008
2026.04  78.2      1,348    964
2026.04  77.1      3,922  3,731
2026.04  76.1      1,522  1,427
2026.04  76.1      2,745  2,609
2026.04  31.7      1,704  1,378
2026.04  31.2      1,427  1,383
2026.04  30.3      1,194  1,369
2026.04  29        1,555  2,382
2026.04  28.5      4,684  6,307
2026.04  28.3      3,097  4,436