Structured report generation on SRRG-Findings (test)
[Chart: BLEU. Highlighted entry: CoGaze, BLEU 3, Mar 27, 2026. Selectable metrics: BLEU, R-L, RG, Precision, Recall, F1 Score.]
Updated 20d ago
Evaluation Results
Method                        Date     BLEU  R-L    RG     Precision  Recall  F1 Score
CoGaze (variant=Llama-3B)     2026.03  3     21.64  15.53  74.83      85.56   78.07
CoGaze (variant=DistilGPT2)   2026.03  2.8   20.23  14.23  75.82      85.61   78.32
CheXagent                     2026.03  1.8   19.65  15.41  77.12      82.56   77.9
RaDialog                      2026.03  1.28  17.53  13.82  69.48      70.12   69.76