Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Data Filtering on OpenWebText Mainstream
Loading...
92.4
Balanced Accuracy
GPT-4o
75.4584
79.8567
84.255
88.6533
Oct 3, 2024
Balanced Accuracy
Human Preference Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Balanced Accuracy
Human Preference Score
GPT-4o
#Queries to GPT-4o=13....
2024.10
92.4
50
SIEVE
#Queries to GPT-4o=100...
2024.10
91
54
GPT-3.5-Turbo
#Queries to GPT-4o=13....
2024.10
76.11
-
Feedback
Search any
task
Search any
task