Answer-level hallucination detection on RAGTruth Enhance
[Chart: Precision, Recall, and F1 Score by method; Mar 29, 2026]
Updated 18d ago
Evaluation Results
Method        Setting                    Date     Precision  Recall  F1 Score
SelfCheckGPT  context-only setting=true  2026.03  100        16.7    28.6
Lettuce       -                          2026.03  90.7       54.5    68.1
Vectara       context-only setting=true  2026.03  89.4       60.8    72.4
MetaQA        context-only setting=true  2026.03  85.3       10.2    18.2
RT4CHART      -                          2026.03  78.1       92      84.5
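
The F1 scores listed above can be cross-checked against the precision and recall values. The sketch below assumes the page uses the standard F1 definition (the harmonic mean of precision and recall) and that all scores are percentages; the page states neither explicitly. The names `f1_score` and `reported` are illustrative and do not come from the benchmark.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, on the same scale as the inputs."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) copied from the results above.
# Assumption: all values are percentages.
reported = {
    "SelfCheckGPT": (100.0, 16.7, 28.6),
    "Lettuce": (90.7, 54.5, 68.1),
    "Vectara": (89.4, 60.8, 72.4),
    "MetaQA": (85.3, 10.2, 18.2),
    "RT4CHART": (78.1, 92.0, 84.5),
}

for method, (p, r, f1) in reported.items():
    print(f"{method}: computed F1 = {f1_score(p, r):.1f}, reported = {f1}")
```

Under these assumptions, each computed value agrees with the reported F1 to one decimal place. SelfCheckGPT illustrates why F1 is the more informative column here: perfect precision with 16.7 recall still gives a low F1.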