Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Factual Grounding Evaluation on SQuAD
Loading...
92
ROC AUC
Deepchecks Grounded in Context
63.92
71.21
78.5
85.79
May 14, 2026
ROC AUC
Updated 19d ago
Evaluation Results
Method
Method
Links
ROC AUC
Deepchecks Grounded in Context
Evaluator=Proprietary SLM
2026.05
92
RAGAS Faithfulness
Evaluator LLM=GPT-4o
2026.05
83
Langsmith Answer Faithfulness
Evaluator LLM=GPT-4o
2026.05
65
Feedback
Search any
task
Search any
task