Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Failure Detection on Franka seen
Loading...
0.15
Brier Score
SAFE-RNN-TDQC (Ours)
0.14164
0.19807
0.2545
0.31093
Apr 22, 2026
Brier Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Brier Score
SAFE-RNN-TDQC (Ours)
VLA Model=π0-FAST
2026.04
0.15
RNN-TDQC (Ours)
VLA Model=π0-FAST
2026.04
0.204
SAFE-MLP BCE
VLA Model=π0-FAST
2026.04
0.206
SAFE-MLP-TDQC (Ours)
VLA Model=π0-FAST
2026.04
0.21
SAFE-RNN
VLA Model=π0-FAST
2026.04
0.22
RNN-BCE
VLA Model=π0-FAST
2026.04
0.237
Avg entropy
VLA Model=π0-FAST
2026.04
0.281
Avg prob.
VLA Model=π0-FAST
2026.04
0.29
Max prob.
VLA Model=π0-FAST
2026.04
0.331
Running Avg entropy
VLA Model=π0-FAST
2026.04
0.341
Running Avg prob.
VLA Model=π0-FAST
2026.04
0.359
Feedback
Search any
task
Search any
task