Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Response Generation on Human Evaluation (200 items) single-turn (test)
Loading...
62
Adversarial Win Rate
Adver-REGS
58.9
60.45
62
63.55
Jan 23, 2017
Adversarial Win Rate
Adversarial Lose Rate
Adversarial Tie Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Adversarial Win Rate
Adversarial Lose Rate
Adversarial Tie Rate
Adver-REGS
Comparison Baseline=Mu...
2017.01
62
18
20
Feedback
Search any
task
Search any
task