Offline Action Prediction on Mind2Web Cross-Domain v1.0 (test)
[Chart: Element Accuracy. Top result: HTML-T5-XL, 50.2 (Jul 24, 2023). Metrics: Element Accuracy, Operation F1, Step Success Rate.]
Updated 4d ago
Evaluation Results
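Method               | Configuration            | Date    | Element Accuracy | Operation F1 | Step Success Rate
---------------------|--------------------------|---------|------------------|--------------|------------------
HTML-T5-XL           | Train=SL                 | 2023.07 | 50.2             | 74.9         | 48.3
MindAct (Flan-T5-XL) | Train=SL, LLM=Flan-T5-XL | 2023.07 | 42.1             | 66.5         | 39.6
MindAct (GPT-4)      | Train=ICL, LLM=GPT-4     | 2023.07 | 37.1             | 46.5         | 26.4
Synapse (GPT-3.5)    | Train=ICL, LLM=GPT-3.5   | 2023.07 | 29.4             | -            | 25.9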