Response Generation on ChattyChef (test)
[Chart: BLEU by date. Highlighted point: ChatGPT, 5.4 BLEU, May 26, 2023. Selectable metrics: BLEU, BLEURT, Length, Diversity (Unigrams), Diversity (Bigrams).]
Updated 4d ago
Evaluation Results
Method         Notes                      Date     BLEU  BLEURT  Length  Diversity (Unigrams)  Diversity (Bigrams)
ChatGPT        subset=evaluated on hu...  2023.05  5.4   53      64.9    12.5                  45.3
GPT-J+ctr      grounded knowledge sel...  2023.05  4.7   45.9    11.7    9.3                   36.6
GPT-J+cut      grounded knowledge sel...  2023.05  4.3   45.2    10.9    9.9                   38.7
GPT-J+ctr+int  state-aware model=true...  2023.05  4.2   45.1    10.3    10.8                  39.3
GPT-J          -                          2023.05  4.1   44.7    11.1    9.9                   37.9
GPT-J+int      incorporates=User Inte...  2023.05  3.9   45      10      10.4                  38.5