InfoAtlas: A Foundation Model for Zero-Shot Statistical Dependence Estimate
About
Measuring statistical dependency between high-dimensional random variables is a fundamental task in data science and machine learning. Neural mutual information (MI) estimators offer a promising avenue, but they typically require costly iterative optimization for each new dataset, making them impractical for real-time applications. We present InfoAtlas, a foundation model-like architecture that eliminates this bottleneck by directly inferring MI in a single forward pass. Pretrained on large-scale synthetic data with rich dependence patterns, InfoAtlas learns to identify diverse dependence structures and predict MI directly from the dataset. Comprehensive experiments demonstrate that InfoAtlas matches state-of-the-art neural estimators in accuracy while achieving $100\times$ speedup, can flexibly handle varying dimensions and sample sizes through a single unified model, and generalizes effectively to complex, real-world scenarios. By reformulating MI estimation as an inference task, InfoAtlas establishes a foundation for real-time dependency analysis.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Mutual Information Estimation | BMI benchmark | MI Estimate Score (Mn-dense 5-5-0.5)0.6 | 8 | |
| Robotic Policy Success Rate | ManiSkill 2 Pick Cube (Seen) | Success Rate94.2 | 5 | |
| Robotic Policy Success Rate | ManiSkill 2 Pick Cube (Unseen) | Success Rate82 | 5 | |
| Robotic Policy Success Rate | ManiSkill Stack Cube 2 (Seen) | Success Rate68.2 | 5 | |
| Robotic Policy Success Rate | ManiSkill 2 Peg Insertion (Seen) | Success Rate72.4 | 5 | |
| Robotic Policy Success Rate | ManiSkill 2 Peg Insertion (Unseen) | Success Rate18.3 | 5 | |
| Robotic Policy Success Rate | ManiSkill Stack Cube 2 (Unseen) | Success Rate37 | 5 |