Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AA-LCR

Benchmarks

Task NameDataset NameSOTA ResultTrend
Long Context ReasoningAA-LCR
Score48.3
8
General Task (Agentic Coding)AA-LCR
Score74
6
Long Context UnderstandingAA-LCR
Accuracy68
5
Long Context & Context LearningAA-LCR
Pass@158.5
4
Long Context ReasoningAA-LCR
Accuracy66.9
3
Showing 5 of 5 rows