Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Interpretability Evaluation on HarM (test)
Loading...
7.18
Faithfulness
PrismAgent
6.7016
6.8258
6.95
7.0742
May 1, 2026
Faithfulness
Coherence
Depth
Rationality
Clarity
Updated 28d ago
Evaluation Results
Method
Method
Links
Faithfulness
Coherence
Depth
Rationality
Clarity
PrismAgent
2026.05
7.18
8.58
6.44
7.37
9.03
Baseline
2026.05
6.72
7.46
5.83
6.71
8.77
Feedback
Search any
task
Search any
task