Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Anomaly Type Classification on M3-AD Texture Scene
Loading...
55.6
Type Score
GEMINI-2.5-FLASH-LITE
-1.496
13.327
28.15
42.973
Feb 10, 2026
Type Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Type Score
GEMINI-2.5-FLASH-LITE
Category=COMMERCIAL, P...
2026.02
55.6
GPT-5.1-MINI
Category=COMMERCIAL, P...
2026.02
52.3
QWEN-3-VL-INSTRUCT
Category=RA-MONITOR, P...
2026.02
49.7
QWEN-3-VL-INSTRUCT
Category=RA-MONITOR, P...
2026.02
49
QWEN-3-VL-THINKING
Category=THINKING, Par...
2026.02
46.4
INTERN-VL-3.5
Category=OPEN-SOURCE,...
2026.02
45.4
GPT-5.1-NANO
Category=COMMERCIAL, P...
2026.02
42.5
QWEN-3-VL-THINKING
Category=THINKING, Par...
2026.02
41.5
INTERN-VL-3.5
Category=OPEN-SOURCE,...
2026.02
40.6
LLAVA-ONEVISION-SI
Category=OPEN-SOURCE,...
2026.02
38.7
QWEN-3-VL-INSTRUCT
Category=OPEN-SOURCE,...
2026.02
34.8
QWEN-2.5-VL-INSTRUCT
Category=OPEN-SOURCE,...
2026.02
33.2
QWEN-2.5-VL-INSTRUCT
Category=OPEN-SOURCE,...
2026.02
32.7
QWEN-3-VL-INSTRUCT
Category=OPEN-SOURCE,...
2026.02
32.5
ANOMALY-R1
Category=OPEN-SOURCE,...
2026.02
32
QWEN-3-VL-INSTRUCT
Category=OPEN-SOURCE,...
2026.02
31.2
QWEN-2.5-VL-INSTRUCT
Category=OPEN-SOURCE,...
2026.02
29.2
QWEN-2-VL-INSTRUCT
Category=OPEN-SOURCE,...
2026.02
23.4
QWEN-2-VL-INSTRUCT
Category=OPEN-SOURCE,...
2026.02
0.7
Feedback
Search any
task
Search any
task