Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAGognize

Benchmarks

Task NameDataset NameSOTA ResultTrend
Response-level Hallucination DetectionRAGognize
AUROC94.91
13
Token-Level Hallucination DetectionRAGognize
AUROC93.33
7
Showing 2 of 2 rows