Causal Judgment (test)

76.3 Accuracy

AMPLIFY

Updated 4mo ago

Evaluation Results

Method	Links	Accuracy
AMPLIFY 2023.05		76.3
Human-Rater 2023.05		69.6
GPT-3.5 2023.05		63.1
SOTA 2023.05		62.1
AMPLIFY 2023.05		60.5
GPT-3.5 2023.05		57.8
GPT-3 2023.05		55.2
GPT-3 2023.05		55.2
Random 2023.05		50