Natural Language Processing on BigBench II

Accuracy Degradation (%): -0.37

PromptCOS

[Chart: Accuracy Degradation (%), y-axis ticks -0.478, 0.251, 0.98, 1.709; Sep 3, 2025]
Updated 9d ago

Evaluation Results

Method  Links
2025.09  -0.3762
2025.09  065
2025.09  0.0353
2025.09  0.0868
2025.09  0.5371
2025.09  0.8381
2025.09  1.4145
2025.09  1.652
2025.09  2.3346