
Reinforcement Learning on Procgen (test)

BigFish Return: 21.61

Sparse Masked Attention Policies

[Figure: BigFish Return over time, Jun 30, 2021 to Feb 23, 2026]
Updated 4d ago

Evaluation Results

Method | Links
2026.02
21.61-8.8430.74-9.1146.38-5.414.12-6.7610.95---8.73
2023.06
18.559.427.96.37.737-5.63.59.55.33.223.36.89.86.8
2023.06
18.55.39.328.46.87.639.8-7.87.29.89.52.823.66.69.87.1
2026.02
11.96-8.2129.61-8.3938.43-4.492.74-5.544.86---2.25
2026.02
11.32-8.4230.07-8.8836.69-4.882.62-5.713.85---2.71
2023.06
11.278.927.85.98.547.2-5.12.87.48.32.314.36.610.39.8
2023.06
10.96.38.8285.56.827.9-6.32.99.66.31.88.77.28.96.9
2021.06
10.668.1296.26.636.3112.1---------
2021.06
9.75.38.528.36.4530.2----------
2021.06
9.65.68.5285.9535.3105.3---------
2023.06
9.258.627.66.24.830-6.33.59.26.34.28.36.67.86.3
2021.06
8.25.88.129.55.4533.6100---------
2021.06
6.64.46.825.66.23.826.985.6---------
2023.06
65.78.826.26.7431-8.37.485.92.95.56.67.73.6
2023.06
5.996.36--6.64---8.336.919.486.293.846.916.75--
2023.06
5.52.6--6.4---6.784.420.82.64.584.24.9--
2023.06
4.94.97.316.45.44.43.2-6.64.51.12.61.94.44.40.53
2021.06
45.18.526.75.84.924.7----------
2023.06
3.826.19--5.78---5.782.548.766.143.717.786.94--
2023.06
2.95.58.626.25.84.924.9-5.62.47.85.42.27.86.17.43.1
2026.02
2.35-8.9227.68-3.7142.56-4.691.98-5.012.19---6.12
2021.11
-------22---------
2021.11
-------34.5---------
2021.11
-------48.5---------