
Factuality Checking on LLM-AggreFact (test)

CNN Score: 72.5

Qwen3-30B-A3B-Instruct

May 28, 2026
Updated 5d ago

Evaluation Results

Method  Links
Date     CNN
2026.05  72.5  74.4  70.6  81.8  77    88.4  72.7  79.4  60.6  79    82.6  76.3  4
2026.05  71.7  72.9  73.6  78.2  69.8  85.7  73.2  72.7  58.1  81.2  82.1  74.5  8
2026.05  70.8  70    73.7  79.3  75.4  88.3  72.9  75.3  59.1  82.4  84    75.6  5.3
2026.05  70.1  73.9  72.3  74.3  74.2  88.4  78.5  72.1  59.1  86.7  82.3  75.6  5.8
2026.05  69.9  74.3  73.6  77.3  72.2  86.2  74.6  74.7  59    85.2  78    75    6.8
-        69.8  72.2  68.7  75.8  75.4  86.1  71.3  75.2  58.7  77.9  77.1  73.6  9.5
2026.05  68.7  74.7  69.5  78.4  76.6  85.5  67.4  78.5  58.3  79.8  82.6  74.5  7.3
2026.05  68.1  76.8  71.4  79.8  78.5  86.5  69    77.5  59.6  83.6  84.3  75.9  4.8
2026.05  68    69.8  69.8  75.3  73.1  84.6  71.5  74    57.3  77.8  81    73.1  11
2026.05  65.5  77.8  76    78.3  83    88    75.3  77.7  59.2  86.7  84    77.4  3.3
-        65.1  71.3  66.2  72.4  73    86.4  66.2  73.6  58.7  75.7  75.9  71.3  12
2026.05  64.2  71    69.3  72.7  69.4  87.3  75.6  73    58.9  83.9  78.8  73.1  10
2026.05  63.7  70.8  71.9  75.9  67.6  88.8  77.4  73.3  57.4  84.4  77.2  73.5  9.1
2026.05  63.2  72.4  66.8  73.4  68.5  84.7  65.2  70.8  57.2  73.8  75.6  70.1  13.8
2026.05  60.4  74.2  70.9  73.6  64.2  91.1  64.4  76.8  59.5  90.2  80.9  73.3  8.4
2026.05  51.5  51.6  55.2  55.4  52.1  70.4  54    61.3  52.3  58.2  57.6  56.3  16