Comprehensive Document Safety Analysis on T-SROIE dense text for robustness (test)
[Chart: M-F1 over time. Best result: DocShield, 62.5 M-F1 (Apr 3, 2026).]
Updated 13d ago
Evaluation Results
Method             Date      M-F1
DocShield          2026.04   62.5
Gemini-2.5-Pro     2026.04   57
qwen3-vl-8B        2026.04   55.6
InternVL-3.5-8B    2026.04   53.7
gpt-4o-o3-0416     2026.04   52.8
FakeShield         2026.04   26.8
Qwen2.5-VL-7B      2026.04   26.2
DeepSeekVL-7B      2026.04   24.8