Clinical Generation on MTS-Dialogue
Best result: 45.87 Composite Score (ROUGE/BLEU/METEOR/BERTScore), SLERP Merge
[Chart: Composite Score (ROUGE/BLEU/METEOR/BERTScore) by date; Apr 2, 2026]
Updated 16d ago
Evaluation Results
| Method | Evaluation protocol | Date | Composite Score (ROUGE/BLEU/METEOR/BERTScore) |
| --- | --- | --- | --- |
| SLERP Merge | SFT | 2026.04 | 45.87 |
| GatorTronLlama_SFT | SFT | 2026.04 | 45.5 |
| GatorTronLlama | SFT | 2026.04 | 45.12 |
| Llama-3.1-8B-Instruct | SFT | 2026.04 | 44.41 |
| Linear Merge | SFT | 2026.04 | 43.94 |
| SLERP Merge | 0-... | 2026.04 | 27.54 |
| Linear Merge | 0-... | 2026.04 | 26.99 |
| Llama-3.1-8B-Instruct | 0-... | 2026.04 | 26.6 |
| GatorTronLlama_SFT | 0-... | 2026.04 | 25.77 |
| GatorTronLlama | 0-... | 2026.04 | 24.91 |
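
The results list both a SLERP Merge and a Linear Merge entry, but the page does not say how either merge was configured. As a rough illustration of how the two techniques generally differ, the sketch below interpolates two flattened weight vectors both ways. The function names `linear_merge` and `slerp_merge`, the interpolation factor `t`, and the toy vectors are all assumptions made for illustration; none of it reflects the actual merge settings behind the scores above.

```python
import math


def linear_merge(a, b, t):
    """Element-wise linear interpolation between two weight vectors."""
    return [(1.0 - t) * x + t * y for x, y in zip(a, b)]


def slerp_merge(a, b, t, eps=1e-8):
    """Spherical linear interpolation (SLERP) between two weight vectors.

    Falls back to linear interpolation when either vector is near zero
    or the two vectors are nearly parallel (the angle is ill-conditioned).
    """
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a < eps or norm_b < eps:
        return linear_merge(a, b, t)

    # Angle between the two vectors, clamped for numerical safety.
    cos_omega = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    cos_omega = max(-1.0, min(1.0, cos_omega))
    omega = math.acos(cos_omega)
    sin_omega = math.sin(omega)
    if abs(sin_omega) < eps:
        return linear_merge(a, b, t)

    # Standard SLERP coefficients.
    weight_a = math.sin((1.0 - t) * omega) / sin_omega
    weight_b = math.sin(t * omega) / sin_omega
    return [weight_a * x + weight_b * y for x, y in zip(a, b)]


if __name__ == "__main__":
    # Toy orthogonal unit vectors standing in for two models' weights.
    model_a = [1.0, 0.0]
    model_b = [0.0, 1.0]
    print("linear:", linear_merge(model_a, model_b, 0.5))
    print("slerp: ", slerp_merge(model_a, model_b, 0.5))
```

For unit-norm inputs, SLERP keeps the merged vector on the unit sphere, whereas linear interpolation shrinks its norm at intermediate `t`. Whether that property explains the gap between the two merge rows is not something this page establishes.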