Flip on URLB Walker 1.0 (test)
[Chart: Mean Score (with Standard Error) by date. Top entry: MOSS, Mean Score 729, Oct 13, 2022.]
Evaluation Results
Method        Method Category  Date     Mean Score  Standard Error
MOSS          Compet...        2022.10  729         40
CIC           Compet...        2022.10  715         40
APT           Data-b...        2022.10  596         24
AS-Alice      Data-b...        2022.10  491         20
AS-Bob        Data-b...        2022.10  475         16
SMM           Compet...        2022.10  428         8
RND           Knowle...        2022.10  412         18
ICM           Knowle...        2022.10  381         10
Proto-RL      Data-b...        2022.10  378         4
APS           Compet...        2022.10  355         18
Disagreement  Knowle...        2022.10  313         8
DIAYN         Compet...        2022.10  306         12