Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
User Opinion Simulation on WVS Gender
Loading...
0.28
Wasserstein distance
COT
0.258
0.4065
0.555
0.7035
Dec 7, 2025
Wasserstein distance
Updated 1mo ago
Evaluation Results
Method
Method
Links
Wasserstein distance
COT
Base Model=QWEN 3
2025.12
0.28
DIRECT PROMPTING
Base Model=QWEN 3
2025.12
0.34
CLAIMSIM
Base Model=GPT-4O-MINI
2025.12
0.47
COT
Base Model=GPT-4O-MINI
2025.12
0.54
DIRECT PROMPTING
Base Model=GPT-4O-MINI
2025.12
0.56
CLAIMSIM
Base Model=QWEN 3
2025.12
0.59
CLAIMSIM
Base Model=LLAMA 4
2025.12
0.62
COT
Base Model=LLAMA 4
2025.12
0.65
DIRECT PROMPTING
Base Model=LLAMA 4
2025.12
0.83
Feedback
Search any
task
Search any
task