Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
User Opinion Simulation on WVS Gender
Loading...
0.28
Wasserstein distance
COT
0.258
0.4065
0.555
0.7035
Dec 7, 2025
Wasserstein distance
Updated 4d ago
Evaluation Results
Method
Method
Links
Wasserstein distance
COT
Base Model=QWEN 3
2025.12
0.28
DIRECT PROMPTING
Base Model=QWEN 3
2025.12
0.34
CLAIMSIM
Base Model=GPT-4O-MINI
2025.12
0.47
COT
Base Model=GPT-4O-MINI
2025.12
0.54
DIRECT PROMPTING
Base Model=GPT-4O-MINI
2025.12
0.56
CLAIMSIM
Base Model=QWEN 3
2025.12
0.59
CLAIMSIM
Base Model=LLAMA 4
2025.12
0.62
COT
Base Model=LLAMA 4
2025.12
0.65
DIRECT PROMPTING
Base Model=LLAMA 4
2025.12
0.83
Feedback
Search any
task
Search any
task