ARYA: A Physics-Constrained Composable & Deterministic World Model Architecture

About

This paper presents ARYA, a composable, physics-constrained, deterministic world model architecture built on five foundational principles: nano models, composability, causal reasoning, determinism, and architectural AI safety. We demonstrate that ARYA satisfies all canonical world model requirements, including state representation, dynamic prediction, causal and physical awareness, temporal consistency, generalization, learnability, and planning and control. Unlike monolithic foundation models, the ARYA foundation model implements these capabilities through a hierarchical system-of-system-of-systems of specialized nano models, orchestrated by AARA (ARYA Autonomous Research Agent), an always-on cognitive daemon that executes a continuous sense-decide-act-learn loop. The nano model architecture provides linear scaling, sparse activation, selective untraining, and sub-20-second training cycles, resolving the traditional tension between capability and computational efficiency. A central contribution is the Unfireable Safety Kernel: an architecturally immutable safety boundary that cannot be disabled or circumvented by any system component, including its own self-improvement engine. This is not a social or ethical alignment statement; it is a technical framework ensuring human control persists as autonomy increases. Safety is an architectural constraint governing every operation, not a policy layer applied after the fact. We present formal alignment between ARYA's architecture and canonical world model requirements, and report summarizing its state-of-the-art performance across 6 of 9 competitive benchmarks head-to-head with GPT-5.2, Opus 4.6, and V-JEPA-2. All with zero neural network parameters, across seven active industry domain nodes spanning aerospace, pharma manufacturing, oil and gas, smart cities, biotech, defense, and medical devices.

Seth Dobrin, Lukasz Chmiel• 2026

Related benchmarks

Task	Dataset	Result
Code Generation	BigCodeBench	pass@180.5	32
Science Question Answering	FrontierScience	Accuracy37.5	18
Software Engineering	SWE-Bench	Resolve Rate0.00e+0	16
Causal Reasoning	CLadder	Exact Match99.89	9
Embodied Robotics Navigation	WorldArena	2D nDTW9.006	4
Enterprise workflow automation	WoW	Perfect Match30.5	4
Physics Reasoning	PhysReason	Overall Score73.3	4
Causal Discovery	CausalBench	Accuracy74.5	4
AI Safety Assessment	AI Safety	Safety Score100	2

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord