Section Hierarchy Parsing on HRDH (test)
[Chart: F1 Score and TEDS. Top result: DSHP-LLM, 93.21 F1 Score (Apr 14, 2026).]
Updated 4d ago
Evaluation Results
| Method | base_model | Date | F1 Score | TEDS |
|---|---|---|---|---|
| DSHP-LLM | Mistral-8B | 2026.04 | 93.21 | 91.99 |
| DSHP-LLM | Qwen-2.5-3B | 2026.04 | 88.56 | 86.58 |
| DSHP-LLM | Llama-3.2-3B | 2026.04 | 86.64 | 84.59 |
| DSHP-LLM | Qwen-2.5-7B | 2026.04 | 63.3 | 63.81 |
| Llama-3.2-3B | | 2026.04 | 43.89 | 49.04 |
| Mistral-8B | | 2026.04 | 34.45 | 39.74 |
| Qwen-2.5-3B | | 2026.04 | 32.99 | 37.34 |
| Qwen-2.5-7B | | 2026.04 | 29.62 | 38.07 |
| GPT-4 | | 2026.04 | 25.94 | 33.42 |