Share your thoughts, 1 month free Claude Pro on usSee more

Dialogue Generation on DSTC-8 Reddit (test)

12.56R-L Score

SECOND THOUGHTS (AEM + VM default)

Updated 5mo ago

Evaluation Results

Method	Links
SECOND THOUGHTS (AEM + VM default) 2023.01		12.56	12.4
SECOND THOUGHTS (AEM + AIL) 2023.01		11.31	12.85
SECOND THOUGHTS (AEM Only) 2023.01		9.8	11.56
InstructGPT 2023.01		8.8	10.57
GPT-3 2023.01		7.31	11.44