Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Interpretability Evaluation on MAMI (test)
Loading...
7.61
Faithfulness
PrismAgent
6.6012
6.8631
7.125
7.3869
May 1, 2026
Faithfulness
Inference Coherence
Inference Depth
Judgment Rationality
Expression Clarity
Updated 28d ago
Evaluation Results
Method
Method
Links
Faithfulness
Inference Coherence
Inference Depth
Judgment Rationality
Expression Clarity
PrismAgent
2026.05
7.61
7.45
6.46
7.45
9.18
Baseline
2026.05
6.64
7.05
6.02
7.02
8.73
Feedback
Search any
task
Search any
task