
Conversational Language Modeling on MT-Bench GPT-4 evaluator (test)

MT-Bench Score: 6.46

FT-Adam

[Figure: MT-Bench Score chart; y-axis ticks 3.7248, 4.4349, 5.145, 5.8551; x-axis date Oct 11, 2023]
Updated 1mo ago

Evaluation Results

Date      MT-Bench Score
2023.10   6.46
2023.10   6.44
2023.10   6.3
2023.10   6.27
2023.10   6.11
2023.10   6.08
2023.10   5.95
2023.10   5.94
2023.10   5.74
2023.10   5.11
2023.10   4.69
2023.10   3.83