Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Synthetic in-context reasoning on MAD
Loading...
-
Compress
No plottable results for Compress (PERCENT).
Metric
Compress (PERCENT)
Fuzzy Recall (PERCENT)
In-Ctx Recall (PERCENT)
Memorize TrainSet (PERCENT)
Noisy Recall (PERCENT)
Selective Copy (PERCENT)
Average Score (PERCENT)
Updated 4d ago
Evaluation Results
Method
Method
Links
Compress
Fuzzy Recall
In-Ctx Recall
Memorize TrainSet
Noisy Recall
Selective Copy
Average Score
No evaluation results found.
Feedback
Search any
task
Search any
task