Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Ruin Names on Ruin Names (test)
Loading...
78.6
Accuracy
AMPLIFY
22.856
37.328
51.8
66.272
May 19, 2023
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
AMPLIFY
Model=GPT-3, Prompting...
2023.05
78.6
Human-Rater
Model=Human, Prompting...
2023.05
77.7
AMPLIFY
Model=GPT-3.5, Prompti...
2023.05
77.5
SOTA
Model=N/A, Prompting S...
2023.05
72.8
GPT-3.5
Model=GPT-3.5, Prompti...
2023.05
69.6
GPT-3
Model=GPT-3, Prompting...
2023.05
64
GPT-3
Model=GPT-3, Prompting...
2023.05
62.9
GPT-3.5
Model=GPT-3.5, Prompti...
2023.05
62.9
Random
Model=N/A, Prompting S...
2023.05
25
Feedback
Search any
task
Search any
task