Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Date Understanding on BIG-bench Hard (test)
Loading...
75.2
Test Accuracy
DLN-2
21.432
35.391
49.35
63.309
Jun 21, 2023
Jul 14, 2023
Aug 7, 2023
Aug 30, 2023
Sep 23, 2023
Oct 16, 2023
Nov 9, 2023
Test Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Test Accuracy
DLN-2
LLM=GPT-3
2023.06
75.2
CoT
LLM=GPT-3
2023.06
72.4
0-shot
LLM=GPT-3
2023.06
56.4
PE2
Final Prompt=Let's thi...
2023.11
56
DLN-1
LLM=GPT-3
2023.06
55.7
PE2
Final Prompt=Analyzing...
2023.11
54.4
Iterative APE
Final Prompt=Let's mov...
2023.11
48
APO
Final Prompt=Accuratel...
2023.11
48
Iterative APE
Final Prompt=Let's dis...
2023.11
46.7
APO
Final Prompt=Determine...
2023.11
45
Zero-shot CoT
Final Prompt=Let's thi...
2023.11
39.1
Zero-shot CoT
Final Prompt=Let's thi...
2023.11
36
APE
LLM=GPT-3
2023.06
32.1
APE-400
LLM=GPT-3
2023.06
23.5
Feedback
Search any
task
Search any
task