Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Response Generation on ChattyChef (test)
Loading...
5.4
BLEU
ChatGPT
3.84
4.245
4.65
5.055
May 26, 2023
BLEU
BLEURT
Length
Diversity (Unigrams)
Diversity (Bigrams)
Updated 1mo ago
Evaluation Results
Method
Method
Links
BLEU
BLEURT
Length
Diversity (Unigrams)
Diversity (Bigrams)
ChatGPT
subset=evaluated on hu...
2023.05
5.4
53
64.9
12.5
45.3
GPT-J+ctr
grounded knowledge sel...
2023.05
4.7
45.9
11.7
9.3
36.6
GPT-J+cut
grounded knowledge sel...
2023.05
4.3
45.2
10.9
9.9
38.7
GPT-J+ctr+int
state-aware model=true...
2023.05
4.2
45.1
10.3
10.8
39.3
GPT-J
2023.05
4.1
44.7
11.1
9.9
37.9
GPT-J+int
incorporates=User Inte...
2023.05
3.9
45
10
10.4
38.5
Feedback
Search any
task
Search any
task