Text Generation on Cornell IMDB (test)
[Chart: ROUGE-L over time. Best result: 22.47, SECOND THOUGHTS (AEM + VM default), Jan 1, 2023. Available metrics: ROUGE-L, Perplexity (PPL).]
Updated 1mo ago
Evaluation Results
Method                              Variant / Service          Date     ROUGE-L  Perplexity (PPL)
SECOND THOUGHTS (AEM + VM default)  variant=AEM + VM (defa...  2023.01  22.47    8.84
SECOND THOUGHTS (AEM + AIL)         variant=AEM + AIL          2023.01  19.6     7.31
SECOND THOUGHTS (AEM Only)          variant=AEM Only           2023.01  16.37    7.01
InstructGPT                         Service=Huge LM API se...  2023.01  12.53    8.78
GPT-3                               Service=Huge LM API se...  2023.01  11.22    8.43