Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Cultural alignment on Egypt survey 1.0 (test)
Loading...
50.15
Soft Metric
GPT-3.5
44.4508
45.9304
47.41
48.8896
Feb 20, 2024
Soft Metric
Hard Metric
Soft Alignment Difference (Ar-En)
Updated 4d ago
Evaluation Results
Method
Method
Links
Soft Metric
Hard Metric
Soft Alignment Difference (Ar-En)
GPT-3.5
Prompting language=Arabic
2024.02
50.15
28.56
3.07
AceGPT-Chat
Prompting language=Arabic
2024.02
49.49
30.6
3.34
LLAMA-2-Chat
Prompting language=Eng...
2024.02
47.95
25.61
-
GPT-3.5
Prompting language=Eng...
2024.02
47.08
23.42
-
mT0-XXL
Prompting language=Arabic
2024.02
46.69
27.1
1.53
AceGPT-Chat
Prompting language=Eng...
2024.02
46.15
28.83
-
mT0-XXL
Prompting language=Eng...
2024.02
45.16
28.75
-
LLAMA-2-Chat
Prompting language=Arabic
2024.02
44.67
23.34
-3.28
Feedback
Search any
task
Search any
task