Fairness evaluation on Adult Dataset (test)
[Chart: GPT-4, 76.54 Accuracy, Nov 9, 2023. Metrics: Accuracy, Demographic Disparity (DP)]
Evaluation Results
| Method              | Settings                  | Date    | Accuracy | Demographic Disparity (DP) |
|---------------------|---------------------------|---------|----------|----------------------------|
| GPT-4               | Prompting=Zero Shot       | 2023.11 | 76.54    | 0.42                       |
| GPT-4               | Prompting=Few Shot, ba... | 2023.11 | 74.39    | 0.33                       |
| FWC (GPT-4)         | Prompting=Few Shot, Me... | 2023.11 | 65.2     | 0.27                       |
| GPT-3.5 Turbo       | Prompting=Few Shot, ba... | 2023.11 | 57.99    | 0.019                      |
| FWC (GPT-3.5 Turbo) | Prompting=Few Shot, Me... | 2023.11 | 55.02    | 0.01                       |
| GPT-3.5 Turbo       | Prompting=Zero Shot       | 2023.11 | 53.55    | 0.04                       |
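The page does not say how the two reported metrics are computed. As a rough illustration only, the sketch below assumes the common definitions: accuracy as the percentage of correct predictions, and demographic disparity as the gap between the highest and lowest positive-prediction rates across groups of a sensitive attribute. The function names and the toy data are hypothetical and do not come from the benchmark or from any listed method.

```python
from typing import Hashable, Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Percentage of predictions that match the labels (0 to 100)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return 100.0 * correct / len(y_true)


def demographic_disparity(y_pred: Sequence[int], group: Sequence[Hashable]) -> float:
    """Gap between the largest and smallest positive-prediction rate per group.

    Assumed definition (not confirmed by the leaderboard): predictions are
    0/1, and a value of 0 means every group receives positive predictions
    at the same rate.
    """
    if len(y_pred) != len(group) or len(y_pred) == 0:
        raise ValueError("y_pred and group must be non-empty and equal length")
    rates = {}
    for g in set(group):
        preds = [p for p, a in zip(y_pred, group) if a == g]
        rates[g] = sum(preds) / len(preds)
    return max(rates.values()) - min(rates.values())


# Hypothetical toy data: 1 = predicted/actual income above the threshold.
toy_true = [1, 0, 1, 0, 1, 0, 0, 1]
toy_pred = [1, 0, 1, 1, 0, 0, 0, 1]
toy_group = ["m", "m", "m", "m", "f", "f", "f", "f"]

toy_accuracy = accuracy(toy_true, toy_pred)
toy_disparity = demographic_disparity(toy_pred, toy_group)
print(f"accuracy={toy_accuracy:.2f} disparity={toy_disparity:.2f}")
```

Under these assumptions a higher accuracy is better and a lower disparity is better, so the two columns can pull in opposite directions, as the rows above suggest.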