Share your thoughts, 1 month free Claude Pro on usSee more

Multi-Objective Reinforcement Learning on Lunar Lander 4d

1.24Hypervolume (HV)

SPFT

Updated 1mo ago

Evaluation Results

Method	Links
SPFT 2025.08		1.24	2.36	1.75	-	480,000
D3PO 2026.02		1.23	2.39	32	10	-
C-MORL 2026.02		1.12	2.35	104	20	-
C-MORL 2025.08		1.12	2.35	1.04	-	500,000
GPI-LS 2026.02		1.06	1.81	13	5	-
GPI-LS 2025.08		1.06	1.69	0.13	-	500,000
PCN 2026.02		0.78	1.44	3	7	-
Envelope 2025.08		0.43	-2.84	0.19	-	500,000