Conversational Ability on MT-Bench (Score and Avg. Time)

7.58 MT-Bench Score

Qwen3-32B

Updated 4mo ago

Evaluation Results

Method	Links	MT-Bench Score
Qwen3-32B 2026.03		7.58	-	-	-
Gemma-3-27B-it 2026.03		7.24	-	-	-
Llama-3.3-70b-Instruct 2026.03		6.88	-	-	-
Karnak 2026.03		6.51	-	-	-
Fanar-27B 2026.03		6.12	-	-	-
AceGPT-v2-70B-Chat 2026.03		6.01	-	-	-
Jais-2-70B-Chat 2026.03		5.63	-	-	-
Fanar-1-9B-Instruct 2026.03		5.58	-	-	-
DPO 2026.03		5.52	-	-	-
DPO 2026.03		5.05	-	-	-
DSPA 2026.03		5.02	-	-	-
DSPA 2026.03		4.86	-	-	-
Allam-7B-Instruct-preview-v2 2026.03		4.62	-	-	-
Prompt Eng 2026.03		4.6	-	-	-
RepE 2026.03		4.48	-	-	-
DSPA 2026.03		4.39	-	-	-
Base Model 2026.03		4.39	-	-	-
DPO 2026.03		4.36	-	-	-
RepE 2026.03		4.36	-	-	-
Static-SAE 2026.03		4.33	-	-	-
RepE 2026.03		4.32	-	-	-
AceGPT-v2-32B-Chat 2026.03		4.3	-	-	-
Base Model 2026.03		4.12	-	-	-
Base Model 2026.03		3.91	-	-	-
Prompt Eng 2026.03		3.88	-	-	-
Prompt Eng 2026.03		3.84	-	-	-
Static-SAE 2026.03		3.49	-	-	-
Static-SAE 2026.03		3.24	-	-	-
Standard Fine-tuning 2026.02		-	4.58	2.461	-
AtteNT 2026.02		-	4.49	2.132	-
Standard Fine-tuning 2026.02		-	5.03	2.042	-
AtteNT 2026.02		-	5.32	1.802	-
Standard Fine-tuning 2026.02		-	5.42	2.282	-
AtteNT 2026.02		-	5.44	2.012	-
Full FT 2024.08		-	-	-	4.85
LoRA 2024.08		-	-	-	4.6
AdaLoRA 2024.08		-	-	-	4.79
DoRA 2024.08		-	-	-	4.48
MiLoRA 2024.08		-	-	-	4.5
LoRA+ 2024.08		-	-	-	5.11
LoRA-FA 2024.08		-	-	-	4.67
LoRA-GA 2024.08		-	-	-	5.04
PiSSA 2024.08		-	-	-	4.92
CorDA 2024.08		-	-	-	5.15
CorDA++ 2024.08		-	-	-	5.64
BA-LoRA 2024.08		-	-	-	5.11