RAG Evaluation on RAG-dataset-12000
[Chart: Accuracy by date. Deepchecks: 96 (May 14, 2026).]
Updated 19d ago
Evaluation Results

Method      Configuration      Date     Accuracy
Deepchecks  -                  2026.05  96
Langsmith   LLM Model=GPT-4o   2026.05  96
RAGAS       LLM Model=GPT-4o   2026.05  65