Response Quality Evaluation on MT-Bench
Top score: 8.71 (Average Response Quality, No defense)

[Chart: Average Response Quality over time, Feb 26, 2024 – May 22, 2025. Updated 4d ago.]
Evaluation Results

| Method | Target Model | Date | Average Response Quality |
|---|---|---|---|
| No defense | GPT-3.5-t... | 2024.02 | 8.71 |
| Backtranslation | GPT-3.5-t... | 2024.02 | 8.6 |
| Response Check | GPT-3.5-t... | 2024.02 | 8.58 |
| Paraphrase | GPT-3.5-t... | 2024.02 | 8.43 |
| No defense | Llama-2-1... | 2024.02 | 7.36 |
| SmoothLLM | GPT-3.5-t... | 2024.02 | 7.35 |
| Response Check | Llama-2-1... | 2024.02 | 7.3 |
| Backtranslation | Llama-2-1... | 2024.02 | 7.26 |
| Paraphrase | Llama-2-1... | 2024.02 | 7.23 |
| No defense | Vicuna-13B | 2024.02 | 6.8 |
| MTSA-T3 | Zephyr-7B... | 2025.05 | 6.78 |
| Baseline | Zephyr-7B... | 2025.05 | 6.76 |
| Response Check | Vicuna-13B | 2024.02 | 6.74 |
| Paraphrase | Vicuna-13B | 2024.02 | 6.69 |
| Backtranslation | Vicuna-13B | 2024.02 | 6.34 |
| SmoothLLM | Vicuna-13B | 2024.02 | 5.89 |
| SmoothLLM | Llama-2-1... | 2024.02 | 5.81 |
| Baseline | Llama2-7B... | 2025.05 | 5.64 |
| MTSA-T3 | Llama2-7B... | 2025.05 | 5.57 |