Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Text Quality Assessment on WikiText-103 In Domain (test)
Loading...
0.67
GPT-4 Preference Ratio (Better)
FoSS
0.41
0.4775
0.545
0.6125
Feb 11, 2026
GPT-4 Preference Ratio (Better)
GPT-4 Preference Ratio (Neutral)
GPT-4 Preference Ratio (Worse)
Updated 4d ago
Evaluation Results
Method
Method
Links
GPT-4 Preference Ratio (Better)
GPT-4 Preference Ratio (Neutral)
GPT-4 Preference Ratio (Worse)
FoSS
Comparison Baseline=kN...
2026.02
0.67
0.15
0.18
FoSS
Comparison Baseline=GF...
2026.02
0.55
0.29
0.16
FoSS
Comparison Baseline=Tr...
2026.02
0.53
0.19
0.28
FoSS
Comparison Baseline=CoG
2026.02
0.42
0.31
0.27
Feedback
Search any
task
Search any
task