
Hallucination Mitigation on Factual Grounding and Causal Reasoning Evaluation Set

4.25 AC

CIP

Dec 12, 2025
Updated 1mo ago

Evaluation Results

Method  Links
2025.12  4.25  0.52  0.21  0.23  -      2.33  0.37
2025.12  4.02  0.5   0.25  0.28  -      2.54  0.38
2025.12  2.96  0.4   0.48  0.42  -      2.61  0.35
2025.12  2.1   0.38  0.35  0.33  -      1.3   0.26
2025.12  1.92  0.15  -     -     -      -     -
2025.12  1.5   0.32  0.23  0.22  -      1.15  0.24
2025.12  1.48  0.12  -     -     -      -     -
2025.12  1.16  0.3   0.16  0.17  0.001  0.69  0.2
2025.12  0.8   0.25  0.18  0.16  0.002  0.6   0.18
2025.12  0.8   0.12  -     -     -      -     -
2025.12  0.47  0.1   -     -     -      -     -
2025.12  0.35  0.05  -     -     -      -     -
2025.12  0.35  0.08  -     -     -      -     -
2025.12  0.2   0.07  -     -     -      -     -