Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Failure Detection on Vision-Based Indoor Robot Navigation OOD
Loading...
50
F1 Score
Ours
21.296
28.748
36.2
43.652
Jun 6, 2025
F1 Score
Lead Time (sec)
Updated 1mo ago
Evaluation Results
Method
Method
Links
F1 Score
Lead Time (sec)
Ours
taxonomy-level context...
2025.06
50
1.21
NoContext
taxonomy-level context...
2025.06
40.5
0.76
LLM-AD
2025.06
27.2
1.38
ENet-BC
2025.06
22.4
1.01
Feedback
Search any
task
Search any
task