Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Clinical Diagnosis on RJUA
Loading...
11.37
EM
C-MIG
-0.4548
2.6151
5.685
8.7549
May 27, 2026
EM
KG
Average Score
Updated 6d ago
Evaluation Results
Method
Method
Links
EM
KG
Average Score
C-MIG
Backbone=Qwen2.5-7B
2026.05
11.37
27.49
19.43
AutoRefine-HardDoc
Backbone=Qwen2.5-7B
2026.05
10.9
25.88
18.39
Search-R1
Backbone=Qwen2.5-7B
2026.05
10.43
20.47
15.45
AutoRefine
Backbone=Qwen2.5-7B
2026.05
10.43
25.21
17.82
AutoRefine-Embedding
Backbone=Qwen2.5-7B
2026.05
9.48
24.83
17.16
AutoRefine-HardSearch
Backbone=Qwen2.5-7B
2026.05
9.48
23.51
16.5
AutoRefine-ICDTree
Backbone=Qwen2.5-7B
2026.05
9
21.71
15.36
IGPO
Backbone=Qwen2.5-7B
2026.05
8.53
28.63
18.58
Qwen2.5-7B
Backbone=Qwen2.5-7B
2026.05
0
0.76
0.38
Feedback
Search any
task
Search any
task