Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Co-reference resolution on WinoGender
Loading...
77.5
Accuracy (All)
LLaMA
64.188
67.644
71.1
74.556
Feb 27, 2023
Accuracy (All)
Accuracy (her/her/she)
Accuracy (his/him/he)
Accuracy (their/them/someone)
Accuracy (her/her/she, gotcha)
Accuracy (his/him/he, gotcha)
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy (All)
Accuracy (her/her/she)
Accuracy (his/him/he)
Accuracy (their/them/someone)
Accuracy (her/her/she, gotcha)
Accuracy (his/him/he, gotcha)
LLaMA
Number of parameters=65B
2023.02
77.5
78.8
72.1
81.7
75
63.3
LLaMA
Number of parameters=33B
2023.02
69
66.7
62.1
78.3
61.7
55.8
LLaMA
Number of parameters=7B
2023.02
66
65
60.8
72.1
64.2
55
LLaMA
Number of parameters=13B
2023.02
64.7
66.7
62.5
65
65.8
55.8
Feedback
Search any
task
Search any
task