Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Scan-level report labelling on Stenosis (test)
Loading...
96.3
Balanced Accuracy
Direct Query
93.18
93.99
94.8
95.61
Oct 22, 2024
Balanced Accuracy
EER
AU ROC
F1 Score
Updated 3mo ago
Evaluation Results
Method
Method
Links
Balanced Accuracy
EER
AU ROC
F1 Score
Direct Query
Query Model=Llama3
2024.10
96.3
0
0.987
96.2
Direct Query
Query Model=GPT-4
2024.10
95.1
-
-
94.9
Direct Query
Query Model=Zephyr
2024.10
94.5
0.037
0.981
95
Summary-Query
Summary Model=Zephyr,...
2024.10
94.5
0.111
0.985
95
Summary-Query
Summary Model=Llama3,...
2024.10
94.5
0.037
0.995
95
Summary-Query
Summary Model=Z-SFT, Q...
2024.10
93.3
0.037
0.943
93.7
Feedback
Search any
task
Search any
task