Clinical Efficacy Scoring on MIMIC-CXR (test)
[Chart: Example-based Precision. Highlighted entry: MCA-RG, 47.2, Jul 9, 2025.]
Updated 4d ago
Evaluation Results
| Method | Details | Links | Example-based Precision | Example-based Recall | Example-based F1 | Macro-based Precision | Macro-based Recall | Macro-based F1 |
|---|---|---|---|---|---|---|---|---|
| MCA-RG | LLM=7B | 2025.07 | 47.2 | 40.6 | 40.8 | 44.3 | 30.6 | 33.5 |
| ORID | Year=2021, LLM=7B | 2025.07 | 43.5 | 29.5 | 35.2 | - | - | - |
| WarmStart | Year=2023, LLM=None | 2025.07 | 41.8 | 36.7 | 36.7 | 41.7 | 29.5 | 30.6 |
| MiniGPT-Med* | Year=2024, LLM=7B | 2025.07 | 33.5 | 24.5 | 26.4 | 23.6 | 18.8 | 17.5 |
| R2GenCMN | Year=2022, LLM=None | 2025.07 | 33.4 | 27.5 | 27.8 | 35.4 | 27.1 | 27.5 |
| R2Gen | Year=2020, LLM=None | 2025.07 | 33.3 | 27.3 | 27.6 | 29.7 | 18.9 | 19.3 |
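
The page does not say how the six metrics are computed. The sketch below shows one common reading, under these assumptions: each report is reduced to a binary vector of clinical findings (which labeler and which findings are used is not stated here), "example-based" averages per-report scores over reports, and "macro-based" averages per-finding scores over findings. The function names and the zero-division convention (undefined ratios count as 0.0) are illustrative choices, not taken from the benchmark. The scores in the table appear to be percentages, whereas this sketch returns fractions.

```python
from typing import Sequence, Tuple

Scores = Tuple[float, float, float]  # (precision, recall, F1)


def _prf(tp: int, n_pred: int, n_true: int) -> Scores:
    """Precision, recall and F1 from counts; undefined ratios become 0.0 (assumption)."""
    p = tp / n_pred if n_pred else 0.0
    r = tp / n_true if n_true else 0.0
    f = 2 * p * r / (p + r) if (p + r) else 0.0
    return p, r, f


def example_based(y_true: Sequence[Sequence[int]],
                  y_pred: Sequence[Sequence[int]]) -> Scores:
    """Score each report (row) separately, then average over reports."""
    rows = []
    for t, p in zip(y_true, y_pred):
        tp = sum(1 for a, b in zip(t, p) if a and b)
        rows.append(_prf(tp, sum(p), sum(t)))
    n = len(rows)
    return tuple(sum(col) / n for col in zip(*rows))


def macro_based(y_true: Sequence[Sequence[int]],
                y_pred: Sequence[Sequence[int]]) -> Scores:
    """Score each finding (column) separately, then average over findings.

    Transposing the matrices turns columns into rows, so the per-row
    routine above can be reused unchanged.
    """
    return example_based(list(zip(*y_true)), list(zip(*y_pred)))


if __name__ == "__main__":
    # Toy data: 2 reports, 3 hypothetical findings.
    y_true = [[1, 0, 1], [0, 1, 0]]
    y_pred = [[1, 1, 0], [0, 1, 0]]
    print(example_based(y_true, y_pred))  # (0.75, 0.75, 0.75)
    print(macro_based(y_true, y_pred))    # approximately (0.5, 0.667, 0.556)
```

The two averages can diverge noticeably: rare findings weigh as much as common ones in the macro scores, which may explain why a method's macro recall in the table sits well below its example-based recall.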