Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Open-set knowledge retrieval on T-REx (All)
Loading...
34.5
Macro-averaged EM
NPM
2.26
10.63
19
27.37
Dec 2, 2022
Macro-averaged EM
Updated 2d ago
Evaluation Results
Method
Method
Links
Macro-averaged EM
NPM
#Params=1.0x, C (Text...
2022.12
34.5
BM25 + GPT-3 175B
#Params=500x, C (Text...
2022.12
32
GPT-3 175B
#Params=500x, C (Text...
2022.12
25.7
BM25 + T5
#Params=2.2x, C (Text...
2022.12
22.2
BM25 + GPT-3 13B
#Params=37x, C (Text C...
2022.12
22.2
BM25 + T5 3B
#Params=8.5x, C (Text...
2022.12
21.6
BM25 + OPT 13B
#Params=37x, C (Text C...
2022.12
18.9
GPT-3 13B
#Params=37x, C (Text C...
2022.12
16.4
OPT 13B
#Params=37x, C (Text C...
2022.12
15
BM25 + GPT-3 6.7B
#Params=19x, C (Text C...
2022.12
14.9
BM25 + OPT 2.7B
#Params=7.6x, C (Text...
2022.12
14.8
BM25 + OPT 6.7B
#Params=19x, C (Text C...
2022.12
14.8
T5
#Params=2.2x, C (Text...
2022.12
13.3
T5 3B
#Params=8.5x, C (Text...
2022.12
12.1
OPT 6.7B
#Params=19x, C (Text C...
2022.12
11.6
OPT 2.7B
#Params=7.6x, C (Text...
2022.12
9.8
GPT-3 6.7B
#Params=19x, C (Text C...
2022.12
8.1
GPT-3 2.7B
#Params=7.6x, C (Text...
2022.12
4.4
BM25 + GPT-3 2.7B
#Params=7.6x, C (Text...
2022.12
3.5
Feedback
Search any
task
Search any
task