Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Long-form Text Generation on FactScore
Loading...
100
Response Completeness
Greedy (Baseline)
33.544
50.797
68.05
85.303
Jan 3, 2026
Response Completeness
Fact Count
FactScore
Updated 4d ago
Evaluation Results
Method
Method
Links
Response Completeness
Fact Count
FactScore
Greedy (Baseline)
Strategy=Greedy (Basel...
2026.01
100
28.6
23.6
CD
Strategy=CD, Model Use...
2026.01
74.2
39.8
53.5
CD
Strategy=CD, Model Use...
2026.01
62.2
48.7
60.3
Greedy (Baseline)
Strategy=Greedy (Basel...
2026.01
50.5
42.8
64.4
ITI
Strategy=ITI, Model Us...
2026.01
41.9
40.8
62.4
DHI
Strategy=DHI (ours), M...
2026.01
41.2
49.9
68.1
DoLa
Strategy=DoLa, Model U...
2026.01
40.7
48.7
61.3
Greedy (Baseline)
Strategy=Greedy (Basel...
2026.01
37.5
45.7
63.8
ICD
Strategy=ICD, Model Us...
2026.01
36.1
46.6
66.3
Feedback
Search any
task
Search any
task