Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multiple Choice Question Answering on MMLU 1.0 (test)

88.7Accuracy (Clinical knowledge)

Med-PaLM 2

22.76439.8825774.118May 16, 2023Jun 30, 2023Aug 15, 2023Sep 30, 2023Nov 15, 2023Dec 31, 2023Feb 15, 2024
Updated 3d ago

Evaluation Results

MethodLinks
2023.05
88.7--------------------------9284.492.395.883.2
2023.05
88.7--------------------------9284.495.295.883.2
2023.05
88.7--------------------------9785.293.897.280.9
2023.05
86.4--------------------------928093.895.176.9
2023.05
80.4--------------------------7563.783.888.976.3
2024.02
74.71--------------------------7465.9272.7972.9164.73
2024.02
63.1--------------------------63.349.957.463.457.8
2024.02
62.8--------------------------62.746.95760.656.3
2024.02
62.3--------------------------61.348.155.857.256.5
2024.02
61.3--------------------------6149.955.364.453.9
2024.02
60.9--------------------------61.749.655.156.955.5
2024.02
57--------------------------56.746.95158.650.1
2024.02
50.1--------------------------5246.247.347.945.5
2024.02
49.1--------------------------4948.463.847.243.5
2024.02
37.9--------------------------4739.334.242.630.4
2024.02
25.3--------------------------2631.916.92824.9
2021.12
--49.46071.347.85433.85041.150.94376.883.965.166.481.867.268.171.864.984.181---------
2021.12
--36.649.648.233.23927.730.931.321.433.257.357.540.341.960.654.144.74851.453.168.9---------
2021.12
--44.148.252.836.149.8413433.3-30.265.572.347.25371.749.648.358.549.364.964.4---------
2021.12
--9590959095909085-909095909095909090909590---------