Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Average Performance across 10 Task Types on 13 Datasets (test)

75.8Avg. Accuracy

PROMPTED

61.65665.3286972.672Oct 3, 2023
Updated 1mo ago

Evaluation Results

MethodLinks
2023.10
75.8
2023.10
68.8
2023.10
68.6
2023.10
67.3
2023.10
65.7
2023.10
64.1
2023.10
63.4
2023.10
62.2