Zero-shot Task Evaluation on 9 Downstream Tasks Utility

Top result: DAPT (nontoxic), Average Accuracy 54.7

Updated 4mo ago

Evaluation Results

Method	Date	Average Accuracy
DAPT (nontoxic)	2022.02	54.7
SGEAT (heuristic)	2022.02	54.7
Jigsaw (nontoxic)	2022.02	54.6
SGEAT (standard)	2022.02	54.6
SGEAT (augmented)	2022.02	54.4
SGEAT + Rejection Sampling	2022.02	54.4
Word Banning	2022.02	54.3
Rejection Sampling	2022.02	54.3
DEXPERTS	2022.02	46.2
SGEAT + DEXPERTS	2022.02	44.9