Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Scan-level report labelling on Herniation (test)
Loading...
0.946
Balanced Accuracy
Summary-Query
0.76088
0.80894
0.857
0.90506
Oct 22, 2024
Balanced Accuracy
EER
AU ROC
F1 Score
Updated 3mo ago
Evaluation Results
Method
Method
Links
Balanced Accuracy
EER
AU ROC
F1 Score
Summary-Query
Summary Model=Llama3,...
2024.10
0.946
0.071
0.99
0.931
Direct Query
Query Model=GPT-4
2024.10
0.935
-
-
0.915
Direct Query
Query Model=Llama3
2024.10
0.911
0.071
0.98
0.893
Direct Query
Query Model=Zephyr
2024.10
0.869
0.119
0.947
0.842
Summary-Query
Summary Model=Zephyr,...
2024.10
0.774
0.31
0.815
0.738
Summary-Query
Summary Model=Z-SFT, Q...
2024.10
0.768
0.214
0.871
0.73
Feedback
Search any
task
Search any
task