Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scientific Reasoning on Sydney Biology per-architecture breakdown (full)

8.93BLEU

RLearner-LLM

1.95163.76335.5757.3867May 6, 2026
Updated 27d ago

Evaluation Results

MethodLinks
2026.05
8.9388.0384.7254.2123.093.01424.667
2026.05
8.4487.8884.1659.2917.372.26-
2026.05
8.4487.9484.5262.0924.693.03645.804
2026.05
4.1883.5379.9582.8322.842.78-
2026.05
3.6483.6784.2662.927.742.94746.37
2026.05
3.1683.5986.263.2735.622.83764.06
2026.05
2.74-78.8964.518.373.1843-
2026.05
2.2282.4478.762.495.373.193719.001