Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Behavior Abstraction Detection on DARPA TC Barephone
Loading...
97.2
Precision
SmartGuard
87.632
90.116
92.6
95.084
Jun 20, 2025
Precision
Recall
F1-score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Precision
Recall
F1-score
SmartGuard
Backbone=LLaMa2-3b
2025.06
97.2
96.8
97
SmartGuard
Backbone=OPT-1.3b
2025.06
95.9
94.5
95.2
Extractor
Method=Extractor
2025.06
88
100
93.6
Feedback
Search any
task
Search any
task