
Biological Reasoning on BioAlchemy

52.78 ProtocolQA Accuracy

GPT-OSS-20B

[Chart: axis ticks 32.8432, 38.0191, 43.195, 48.3709; Apr 3, 2026]
Updated 1mo ago

Evaluation Results

Method  Links
52.78  18.12   7.27  72.79  55.79  41.35  2026.04
46.2   22.32  15.15  68.32  62.11  42.82  2026.04
45.83  18.73  11.52  68.09  57.89  40.41  2026.04
42.69   8.42   5.76  69     42.63  33.7
33.61   4.97  10.91  26.89   5.26  16.33