Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

WebLINX

Benchmarks

Task NameDataset NameSOTA ResultTrend
HTML observation reductionWebLinx
Average Wall-Clock Time (s)0
11
Website NavigationWebLINX IID 1.0 (test)
Overall Score37.4
11
Website NavigationWebLINX OOD 1.0 (test)
IM84
11
RerankingWebLINX CandidatesReranking (test)
MAP18.02
10
Single-step action predictionWebLinx (test-iid)
Cumulative Runtime28.5
3
Showing 5 of 5 rows