Website Navigation on WebLINX IID 1.0 (test)
[Chart: Overall Score over time. Best result: S-LLaMA, 37.4 (Feb 8, 2024)]
Evaluation Results
| Method | Details | Date | Overall Score |
|---|---|---|---|
| S-LLaMA | Protocol=Finetuned, Si... | 2024.02 | 37.4 |
| Llama-2 | Protocol=Finetuned, Si... | 2024.02 | 37 |
| Flan-T5 | Protocol=Finetuned, Si... | 2024.02 | 31.1 |
| Fuyu | Protocol=Finetuned, Si... | 2024.02 | 30.9 |
| GPT-3.5F | Protocol=Finetuned, Mo... | 2024.02 | 30.8 |
| MindAct | Protocol=Finetuned, Si... | 2024.02 | 25.7 |
| Pix2Act | Protocol=Finetuned, Si... | 2024.02 | 23.9 |
| GPT-4V | Protocol=Zero-shot, Mo... | 2024.02 | 12.9 |
| GPT-4T | Protocol=Zero-shot, Mo... | 2024.02 | 12.2 |
| GPT-3.5T | Protocol=Zero-shot, Mo... | 2024.02 | 10.3 |
| Llama-2 | Protocol=Zero-shot, Si... | 2024.02 | 5.6 |