Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Medical Dialogue Generation on Medical Dialogue Human Evaluation
Loading...
3.7
Fluency
Ground-truth
2.8784
3.0917
3.305
3.5183
Jun 12, 2025
Fluency
Knowledge Consistency
Overall Quality
Updated 1mo ago
Evaluation Results
Method
Method
Links
Fluency
Knowledge Consistency
Overall Quality
Ground-truth
2025.06
3.7
3.75
3.95
MedRef
2025.06
3.55
3.68
3.79
DFMed
2025.06
3.42
3.57
3.65
E-A&Cxt only
2025.06
2.91
3.05
3.14
Feedback
Search any
task
Search any
task