Data Analysis on DACO (test A)
Best Helpfulness: 50.79 (GPT-4), Mar 4, 2024
Metrics: Helpfulness, Entailment, BLEU
Updated 4d ago
Evaluation Results
| Method | Details | Date | Helpfulness | Entailment | BLEU |
|---|---|---|---|---|---|
| GPT-4 | # para.=175B+, Code ge... | 2024.03 | 50.79 | 4.59 | 17.77 |
| GPT-4 | # para.=175B+, Code ge... | 2024.03 | 30.43 | 3.35 | 14.9 |
| ChatGPT | # para.=20B+, Code gen... | 2024.03 | 26.51 | 2.74 | 14.22 |
| FG-RLHF | # para.=6B, Code gener... | 2024.03 | 19.42 | 3.65 | 13.13 |
| ChatGPT | # para.=20B+, Code gen... | 2024.03 | 19.31 | 3.06 | 13.22 |
| TAPAS | # para.=337M, Code gen... | 2024.03 | 19.19 | 1.96 | 11.62 |
| SFT | # para.=6B, Code gener... | 2024.03 | 18.96 | 2.3 | 14.47 |
| TAPEX | # para.=406M, Code gen... | 2024.03 | 15.08 | 3.34 | 14.6 |
| SFT | # para.=6B, Code gener... | 2024.03 | 13.73 | 2.15 | 14.88 |
| RLHF | # para.=6B, Code gener... | 2024.03 | 10.64 | 3.18 | 12.66 |