
Text Game on Frozen Lake (test)

38.3 Accuracy

OPCD

Updated 4mo ago

Evaluation Results

Method (date)	Links	Accuracy	(unlabeled metric)
OPCD 2026.02		38.3	66.7
Context Distill. 2026.02		35.2	65.4
In-Context 2026.02		31.4	-
Base Model 2026.02		6.3	67.3