Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Scan-level report labelling on Spondylolisthesis (test)
Loading...
97.4
Balanced Accuracy
Direct Query
94.072
94.936
95.8
96.664
Oct 22, 2024
Balanced Accuracy
EER
AU ROC
F1 Score
Updated 3mo ago
Evaluation Results
Method
Method
Links
Balanced Accuracy
EER
AU ROC
F1 Score
Direct Query
Query Model=GPT-4
2024.10
97.4
-
-
97.3
Direct Query
Query Model=Zephyr
2024.10
97.4
0.026
0.996
97.4
Direct Query
Query Model=Llama3
2024.10
97.4
0.026
0.993
97.4
Summary-Query
Summary Model=Llama3,...
2024.10
97.4
0.026
0.994
97.4
Summary-Query
Summary Model=Z-SFT, Q...
2024.10
94.8
0.026
0.984
94.6
Summary-Query
Summary Model=Zephyr,...
2024.10
94.2
0.023
0.985
93.8
Feedback
Search any
task
Search any
task