Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Content Quality Assessment on Financial-advice prompts
Loading...
1.69
Content Score
Pretrained
0.7332
0.9816
1.23
1.4784
May 26, 2026
Content Score
Updated 6d ago
Evaluation Results
Method
Method
Links
Content Score
Pretrained
Probe=–, Model size=1....
2026.05
1.69
GRASP
Probe=unsupervised, Mo...
2026.05
0.99
CAFT-time vsvd (top-10 AUROC)
Probe=unsupervised, Mo...
2026.05
0.87
Inference-ablate vsvd (top-10 AUROC)
Probe=unsupervised, Mo...
2026.05
0.82
Naive (corruption baseline)
Probe=–, Model size=1....
2026.05
0.77
Feedback
Search any
task
Search any
task