Reacher

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Target Acquisition | Reacher Quadrant (test) | Target Hit Rate (per min): 19.7 | 15 |
| Target Acquisition | Reacher Linear (test) | Hit Rate (per minute): 20 | 15 |
| Target Acquisition | Reacher Continuous (test) | Hit Rate (per min): 18.8 | 15 |
| Reinforcement Learning | Reacher | Average Return: -4.1 | 12 |
| Reaching | Reacher 3D | Success Rate: 95.1 | 10 |
| Continuous Control | reacher | Average Reward: 0.72 | 9 |
| Continuous Control | Reacher v5 | Average Episodic Reward: -3.9 | 8 |
| Actuator Inversion | Reacher H (eval-in) | AER: 291 | 8 |
| Actuator Inversion | Reacher E (Ceval-in) | AER: 582 | 8 |
| Actuator Inversion | Reacher H (train) | AER: 290 | 8 |
| Actuator Inversion | Reacher E C (train) | AER: 584 | 8 |
| Control | Reacher v2 | Median Samples: 251 | 8 |
| Continuous Control | Reacher v2 | Average Return: -4 | 7 |
| Off-dynamics Reinforcement Learning | Reacher 0.5 density v1 (test) | Reward: -11.7 | 7 |
| Off-dynamics Reinforcement Learning | Reacher broken source environment MuJoCo | Average Reward: 30 | 7 |
| Reinforcement Learning | Reacher 1.5 gravity MuJoCo | Reward: -9.5 | 7 |
| Reinforcement Learning | Reacher 0.5 gravity (test) | Average Return: -7.1 | 7 |
| Continuous Control | Reacher v1 (train) | Max Avg Return: -3.6 | 7 |
| Continuous Control (Negative Reward) | Reacher Pybullet | Mean Return: 16.8 | 6 |
| Continuous Control (Positive Reward) | Reacher Pybullet | Mean Return: 18.7 | 6 |
| Continuous Control (Negative Reward) | Reacher Mujoco | Mean Return: -6.3 | 6 |
| Continuous Control | fixedReacher | Average Reward: 0.849 | 6 |
| Reinforcement Learning | Reacher | Maximum Return: 6.48 | 5 |
| Continuous Robotic Control | Reacher normal v2 (test) | Final Performance: -2.39 | 5 |
| Extrapolative Generalization | Reacher-hard Unseen Goal | Mean Reward: 967.64 | 5 |

Showing 25 of 38 rows