Commonsense & Factual Reasoning on Sports QA
[Chart: Accuracy over time. Top result: CoK + SC + F2-V, 87.9 accuracy (Jun 10, 2023).]
Evaluation Results
Method             Base Model        Date      Accuracy
CoK + SC + F2-V    gpt-3.5-turbo     2023.06   87.9
CoK + SC           gpt-3.5-turbo     2023.06   87.4
CoK + F2-V         gpt-3.5-turbo     2023.06   87
Manual CoT + SC    gpt-3.5-turbo     2023.06   86.5
CoK                gpt-3.5-turbo     2023.06   85.9
CoK + F2-V         text-davinc...    2023.06   84.1
Manual CoT         gpt-3.5-turbo     2023.06   84
CoK                text-davinc...    2023.06   83.2
Manual CoT         text-davinc...    2023.06   82.4
Zero-Shot CoT      text-davinc...    2023.06   77.5
Few-Shot SP        text-davinc...    2023.06   69.6
Zero-Shot SP       text-davinc...    2023.06   38.1