Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Fault detection on LEMMA
Loading...
74.1
AUROC
Full
72.956
73.253
73.55
73.847
Apr 9, 2026
AUROC
Fidelity MAE
Updated 9d ago
Evaluation Results
Method
Method
Links
AUROC
Fidelity MAE
Full
Model variant=Full
2026.04
74.1
-
ToE
Model variant=ToE, k=5
2026.04
73
0.106
Feedback
Search any
task
Search any
task