Image-text fact-checking on AVerImaTeC (test)
[Chart: Question Score. Top entry: HUMANE, 89 (Feb 16, 2026). Metrics: Question Score, Evidence Score, Verdict Accuracy, Justification Score.]
Updated 1mo ago
Evaluation Results

Method                              Date     Question Score  Evidence Score  Verdict Accuracy  Justification Score
HUMANE                              2026.02  89              54              55                56
AIC CTU                             2026.02  81              33              35                30
teamName                            2026.02  66              23              26                22
REVEAL                              2026.02  63              28              24                13
Baseline (type=iterative agentic)   2026.02  55              17              11                13
XxP                                 2026.02  39              27              26                20
ADA-AGGR                            2026.02  37              46              54                43
fv                                  2026.02  29              16              16                13