Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Jericho

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-based Task CompletionJericho
Mean Normalised Score3.37
18
Text-based embodied taskJericho
Success Rate15
13
Interactive FictionJericho OOD (held-out games)
Average Score (Balances)11.2
9
Sequential Decision-MakingJericho ID (meta-train)
Score (Detective)270.5
9
Text-based game playingJericho Zork3 (test)
Avg Score3.1
7
Text-based game playingJericho Zork1 (test)
Average Score53
7
Text-based game playingJericho Library (test)
Average Score25.9
7
Text-based Reinforcement LearningJericho benchmark (test)
DeepHome Score35.8
7
Text-based Game PlayingJericho
Zork1 Score73
6
Text Adventure Game PlayingJericho
Zork1 Score73
6
Interactive FictionJericho (test)
Zork1 Completion Score40
4
Text-Based AdventureJericho
Omniquest Score10
3
Showing 12 of 12 rows