Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Radiology Report Summarization Dataset Human Evaluation (test)
Loading...
4.62
Readability
CSTRL-T
4.5888
4.5969
4.605
4.6131
Feb 21, 2025
Readability
Factual Correctness
Informativeness
Redundancy
Completeness
Updated 1mo ago
Evaluation Results
Method
Method
Links
Readability
Factual Correctness
Informativeness
Redundancy
Completeness
CSTRL-T
Model Type=Teacher
2025.02
4.62
4.39
4.31
4.11
4.75
CSTRL-S
Model Type=Student, Di...
2025.02
4.59
4.29
4.28
4.23
4.54
Feedback
Search any
task
Search any
task