Semantic Type Annotation on T2D
Top score: 97.7 Micro-F1, ZTab-per (GPT-4o,GPT-4o), Mar 12, 2026
Updated 1mo ago
Evaluation Results
Method                                  Category          Date     Micro-F1
ZTab-per (GPT-4o,GPT-4o)                Zero-shot         2026.03  97.7
ZTab-per (GPT-4.1-mini,GPT-4.1-mini)    Zero-shot         2026.03  97.7
ZTab-per (GPT-3.5,GPT-3.5)              Zero-shot         2026.03  96.2
ZTab-per (GPT-4o-mini,GPT-4o-mini)      Zero-shot         2026.03  96.2
CENTS (GPT-4.1-mini)                    Zero-shot         2026.03  96.2
CENTS (GPT-4o)                          Zero-shot         2026.03  94.7
CENTS (GPT-4o-mini)                     Zero-shot         2026.03  92.4
Doduo (BERT)                            Supervised (R...  2026.03  91.1
GPT-3.5-based                           Zero-shot         2026.03  89.4
CENTS (GPT-3.5)                         Zero-shot         2026.03  88
ArcheTypeFT (Llama-7B)                  Supervised (R...  2026.03  88
Chorus (GPT-3.5)                        Zero-shot         2026.03  83.4