Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Conversation Evaluation on Proprietary Chinese 500 Multi-turn Dialogues (test)

79.4Win Rate vs GPT4

C-SFT-Empathy

7.43226.11644.863.484Sep 10, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.09
79.497.577.3
2024.09
10.2-9