Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

K2D13

Benchmarks

Task NameDataset NameSOTA ResultTrend
Offline Reinforcement LearningK2D13 1000-5-mid
NAUC0.91
4
Offline Reinforcement LearningK2D13-1000-1-mid
NAUC0.82
4
Offline Reinforcement LearningK2D13-500-1-mid
NAUC78
4
Offline Reinforcement LearningK2D13 500-5 mid
NAUC82
4
Offline Reinforcement LearningK2D13-250-5-mid
NAUC0.75
4
Reinforcement Learning target performance estimationK2D13-1000-5 late
NAUC86
4
Reinforcement Learning target performance estimationK2D13 1000-1-late
NAUC78
4
Reinforcement Learning target performance estimationK2D13-500-5-late
NAUC81
4
Reinforcement Learning target performance estimationK2D13 500-1-late
NAUC57
4
Reinforcement Learning target performance estimationK2D13-250-5-late
NAUC68
4
Reinforcement Learning target performance estimationK2D13-250-1 late
NAUC25
4
Offline In-Context Reinforcement LearningK2D13-1000-5 (complete)
NAUC93
4
Offline In-Context Reinforcement LearningK2D13-1000-1 (complete)
NAUC87
4
Offline In-Context Reinforcement LearningK2D13 1000-5-early (test)
NAUC51
4
Offline In-Context Reinforcement LearningK2D13-1000-1-early (test)
NAUC55
4
Offline In-Context Reinforcement LearningK2D13-500-5-early (test)
NAUC56
4
Offline In-Context Reinforcement LearningK2D13-500-1-early (test)
NAUC37
4
Offline In-Context Reinforcement LearningK2D13-250-1-early (test)
NAUC12
4
Showing 18 of 18 rows