Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ARYA: A Physics-Constrained Composable & Deterministic World Model Architecture

About

This paper presents ARYA, a composable, physics-constrained, deterministic world model architecture built on five foundational principles: nano models, composability, causal reasoning, determinism, and architectural AI safety. We demonstrate that ARYA satisfies all canonical world model requirements, including state representation, dynamic prediction, causal and physical awareness, temporal consistency, generalization, learnability, and planning and control. Unlike monolithic foundation models, the ARYA foundation model implements these capabilities through a hierarchical system-of-system-of-systems of specialized nano models, orchestrated by AARA (ARYA Autonomous Research Agent), an always-on cognitive daemon that executes a continuous sense-decide-act-learn loop. The nano model architecture provides linear scaling, sparse activation, selective untraining, and sub-20-second training cycles, resolving the traditional tension between capability and computational efficiency. A central contribution is the Unfireable Safety Kernel: an architecturally immutable safety boundary that cannot be disabled or circumvented by any system component, including its own self-improvement engine. This is not a social or ethical alignment statement; it is a technical framework ensuring human control persists as autonomy increases. Safety is an architectural constraint governing every operation, not a policy layer applied after the fact. We present formal alignment between ARYA's architecture and canonical world model requirements, and report summarizing its state-of-the-art performance across 6 of 9 competitive benchmarks head-to-head with GPT-5.2, Opus 4.6, and V-JEPA-2. All with zero neural network parameters, across seven active industry domain nodes spanning aerospace, pharma manufacturing, oil and gas, smart cities, biotech, defense, and medical devices.

Seth Dobrin, Lukasz Chmiel• 2026

Related benchmarks

TaskDatasetResultRank
Code GenerationBigCodeBench
pass@180.5
18
Software EngineeringSWE-Bench
Resolve Rate0.00e+0
11
Causal ReasoningCLadder
Exact Match99.89
9
Science Question AnsweringFrontierScience
Accuracy37.5
5
Embodied Robotics NavigationWorldArena
2D nDTW9.006
4
Enterprise workflow automationWoW
Perfect Match30.5
4
Physics ReasoningPhysReason
Overall Score73.3
4
Causal DiscoveryCausalBench
Accuracy74.5
4
AI Safety AssessmentAI Safety
Safety Score100
2
Showing 9 of 9 rows

Other info

Follow for update