Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Question Answering on PubMedQA PQA-L (test)

87.08Accuracy

Gyan-4.4

9.953629.97685070.0232Oct 19, 2022May 22, 2023Dec 24, 2023Jul 27, 2024Feb 28, 2025Oct 2, 2025May 6, 2026
Updated 27d ago

Evaluation Results

MethodLinks
2026.05
87.08418
2022.10
78.2-
2024.03
76.8-
2026.03
76.6-
2026.03
76.4-
2024.03
76.3-
2026.03
75.8-
2024.03
75.5-
2024.03
75.4-
2024.03
75.2-
2024.03
75.1-
2026.03
74.6-
2024.03
74.4-
2024.03
74.2-
2024.03
74.2-
2026.03
74.2-
2026.03
74.2-
2024.03
73.8-
2024.03
73.6-
2024.03
73.4-
2024.03
73.4-
2024.03
73.4-
2026.03
73.2-
2024.03
72.4-
2026.03
72.4-
2026.03
72.4-
2022.10
72.2-
2024.03
72.2-
2026.03
71.8-
2024.03
71.6-
2026.03
71.4-
2022.10
70.2-
2024.03
70.2-
2026.03
68.8-
2026.03
68.2-
2026.03
66.8-
2026.03
64.4-
2022.10
64.2-
2024.03
64.2-
2026.03
62-
2026.03
56.2-
2022.10
55.8-
2024.03
55.8-
2026.03
55.2-
2026.05
12.9262