Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Document-level Information Extraction on BETTER
Loading...
36.74
F1 Score
THINKTWICE Qwen 3 (oracle)
1.7752
10.8526
19.93
29.0074
Jan 26, 2026
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
THINKTWICE Qwen 3 (oracle)
Selector=oracle, Backb...
2026.01
36.74
THINKTWICE Llama R1 (oracle)
Selector=oracle, Backb...
2026.01
34.08
THINKTWICE Qwen 3
Selector=F1 Voting, Ba...
2026.01
20.02
THINKTWICE Qwen 3
Selector=Majority, Bac...
2026.01
17.38
THINKTWICE Llama R1
Selector=F1 Voting, Ba...
2026.01
17.1
Greedy Qwen 3
Selector=X, Backbone=Q...
2026.01
16.12
Greedy Llama R1
Selector=X, Backbone=L...
2026.01
14.78
THINKTWICE Llama R1
Selector=Majority, Bac...
2026.01
3.12
Feedback
Search any
task
Search any
task