Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Factual Grounding and Causal Reasoning

Benchmarks

Task NameDataset NameSOTA ResultTrend
Hallucination MitigationFactual Grounding and Causal Reasoning Evaluation Set
AC4.25
14
Showing 1 of 1 rows