Clinical Structured Data Extraction on n2c2
[Chart: True Positives (TP) over time. Top entry: Infherno, 101 TP, Jul 16, 2025.]
Updated 1mo ago
Evaluation Results
| Method | LLM | Date | True Positives (TP) | False Positives (FP) | False Negatives (FN) | Precision (Pr) | Recall (Re) | F1 Score (F1) | Eased False Negatives (Eased FN) | Eased Recall (Eased Re) | Eased F1 Score (Eased F1) | Total Concepts | Concepts with Codes | Concepts with Correct Codes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Infherno | Gemini-2.5 Pro | 2025.07 | 101 | 0 | 66 | 100 | 60.5 | 75.4 | 6 | 94.4 | 97.1 | 104 | 104 | 104 |
| Infherno | DeepSeek V3.1 Chat | 2025.07 | 82 | 0 | 85 | 100 | 49.1 | 65.9 | 25 | 76.6 | 86.8 | 84 | 84 | 82 |
| Infherno | Claude Sonnet 4.5 | 2025.07 | 78 | 0 | 91 | 100 | 46.2 | 63.2 | 31 | 71.6 | 83.4 | 79 | 78 | 78 |
| Infherno | Qwen3-235B-A22B-2507 | 2025.07 | 76 | 0 | 92 | 100 | 45.2 | 62.3 | 32 | 70.4 | 82.6 | 77 | 76 | 76 |
| Infherno | GPT-5 | 2025.07 | 68 | 0 | 100 | 100 | 40.5 | 57.6 | 40 | 63 | 77.3 | 69 | 63 | 42 |
| Infherno | Qwen3-8B | 2025.07 | 32 | 0 | 134 | 100 | 19.3 | 32.3 | 74 | 30.2 | 46.4 | 36 | 36 | 35 |
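The percentage columns in the results above can be recomputed from the raw counts. The sketch below uses the standard definitions of precision, recall, and F1. The page does not define the "eased" variants, so the assumption here is that they reuse the same formulas with Eased FN in place of FN, which matches the reported figures for the Gemini-2.5 Pro entry. The helper name `prf` is hypothetical and not part of any benchmark tooling.

```python
def prf(tp: int, fp: int, fn: int) -> tuple:
    """Return (precision, recall, F1) as percentages rounded to one decimal."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return tuple(round(100 * x, 1) for x in (precision, recall, f1))


# Gemini-2.5 Pro entry: TP=101, FP=0, FN=66
strict = prf(101, 0, 66)   # (100.0, 60.5, 75.4)

# Assumption: eased metrics swap in Eased FN (6) for FN
eased = prf(101, 0, 6)     # (100.0, 94.4, 97.1)

print(strict, eased)
```

With zero false positives in every entry, precision is 100 throughout, so the ranking is driven entirely by recall.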