Sim-AI.org — Research Organization for Simulation AI

Recent Publications

Sort

Showing 12 of 184 publications

SIM-2026-014 Working Paper April 22, 2026

A Variational Framework for Simulator-Conditioned Inference Under Distribution Shift

Mariana Ferreira, Adaeze Okoye, Henrik Lindqvist, Renji Tanaka

We introduce a variational framework that couples differentiable simulators with amortized posterior inference under non-stationary observation models. Our approach derives a tractable lower bound that decomposes the simulator likelihood into a forward-invariant component and a residual correction term learned by a neural normalising flow. We prove identifiability under mild regularity conditions, give explicit rates of contraction, and present a battery of experiments on synthetic mechanical systems, epidemiological compartments, and large-scale climate emulators. The proposed estimator dominates existing simulation-based inference baselines by 14–37% in posterior coverage while remaining computationally competitive with maximum-likelihood approaches at every problem scale we studied.
- PDF · 2.3 MB
- BibTeX
- DOI
- Cite
- variational inference
- simulators
- distribution shift
SIM-2026-013 Review Article April 9, 2026

Reward Specification in Open-World Simulators: A Decade in Review

Sophie Bauer, Dimitri Petrov

Reward specification remains the central impediment to deploying capable agents inside open-world simulators. We survey ten years of work spanning preference modelling, demonstration-based proxies, constitutional rule sets, and process supervision. We propose a taxonomy distinguishing intrinsic, extrinsic, and adjudicated reward signals, and we evaluate sixty-three published systems against a unified protocol. Our analysis isolates three persistent failure modes — proxy gaming, distributional drift in preference data, and credit assignment leakage across temporally extended subtasks — and offers concrete recommendations for the next generation of reward specification methods.
- PDF · 4.8 MB
- BibTeX
- DOI
- Cite
- alignment
- reward modelling
- survey
SIM-2026-011 Working Paper March 30, 2026

Equilibrium Selection in Heterogeneous Multi-Agent Simulators

Renji Tanaka, Adaeze Okoye, Mariana Ferreira

Heterogeneous multi-agent simulators support populations whose reward structures, observation models, and time scales differ across subgroups. We study the equilibrium selection problem for such systems and prove a refinement theorem that extends Markov-perfect equilibrium to settings with asynchronous information arrival. The constructed selection rule is computable in time polynomial in the number of agent classes, and we demonstrate empirically that it stabilises learning dynamics in a 4096-agent supply-chain simulator without sacrificing throughput.
- PDF · 1.7 MB
- BibTeX
- DOI
- multi-agent
- equilibrium
- game theory
SIM-2025-038 Technical Note December 18, 2025

Counterfactual Sensitivity in Differentiable Physical Simulators

Henrik Lindqvist

Differentiable simulators expose pathwise derivatives of state trajectories with respect to interventions. We define counterfactual sensitivity as the operator norm of these derivatives over a structured intervention class, and we develop a Monte-Carlo estimator with provable variance reduction. Application to a rigid-body manipulation suite reveals that several published learning algorithms exhibit pathological sensitivity in regimes ostensibly outside their training distribution.
- PDF · 0.9 MB
- BibTeX
- DOI
- counterfactuals
- differentiable simulation
- sensitivity analysis
SIM-2025-031 Working Paper October 4, 2025

Population-Based Curricula for Long-Horizon Reinforcement Learning Inside Procedural Worlds

Dimitri Petrov, Mariana Ferreira

Procedurally generated worlds offer near-unbounded variation but reward sparsity grows with horizon length. We propose a population-based curriculum that schedules world generators according to an estimate of student-frontier learnability. Our scheduler outperforms prioritised level replay and adversarial curricula on the SIM-Procgen-XL benchmark while requiring 38% fewer environment interactions.
- PDF · 3.1 MB
- BibTeX
- DOI
- reinforcement learning
- curricula
- procedural generation
SIM-2025-027 Dataset Report August 21, 2025

SIM-Atlas: A Two-Petabyte Trajectory Corpus for Benchmarking Simulation-Based Learning

Sophie Bauer, Renji Tanaka, the Sim-AI Infrastructure Group

SIM-Atlas is a curated trajectory corpus comprising 1.97 petabytes of state, observation, action, and reward tuples sampled from 142 deterministic and stochastic simulators across robotics, climate, biology, and economics. We document collection protocols, dataset statistics, and license terms, and we provide reference dataloaders that achieve 92% of theoretical IO peak on commodity NVMe storage.
- dataset
- benchmark
- infrastructure
SIM-2025-019 Working Paper June 12, 2025

On the Sample Complexity of Inverse Simulation

Adaeze Okoye

Inverse simulation seeks parameters of a forward simulator that reproduce observed trajectories. We obtain matching upper and lower bounds on its sample complexity in terms of the simulator's effective Lipschitz dimension and the spectral gap of its transition kernel. The bounds tighten previous results by a factor of the trajectory horizon and resolve an open question on the sufficiency of stationary observations.
- PDF · 1.1 MB
- BibTeX
- DOI
- theory
- inverse problems
- sample complexity
SIM-2024-042 Working Paper November 1, 2024

Process Supervision in Simulator-Embedded Deliberation

Mariana Ferreira, Henrik Lindqvist

We argue that process supervision applied at the level of simulator-embedded deliberation is strictly more sample-efficient than outcome supervision in long-horizon planning tasks. We instantiate this claim in a tutoring simulator and a software-engineering bench, and we provide an analytic decomposition that explains the observed gap.
- PDF · 2.0 MB
- BibTeX
- DOI
- alignment
- process supervision
SIM-2024-029 Technical Note July 19, 2024

Deterministic Replay for Stochastic Simulators on Heterogeneous Accelerators

Dimitri Petrov, Sophie Bauer

We present a runtime that delivers bit-deterministic replay of stochastic simulators executing across heterogeneous CPU/GPU/TPU clusters. The runtime reconstructs all sources of non-determinism via a hierarchical RNG manifest and recovers reproducibility at less than 4% performance overhead.
- PDF · 1.4 MB
- BibTeX
- DOI
- determinism
- HPC
- reproducibility
SIM-2024-014 Working Paper April 8, 2024

Identifiable Latent Causal Structure from Synthetic Twin Trajectories

Adaeze Okoye, Renji Tanaka

When pairs of simulated and observed trajectories agree on a sufficient set of summary statistics, the latent causal structure of the underlying generative process becomes identifiable up to a small equivalence class. We characterise this class, give a constructive recovery procedure, and validate it on synthetic twin populations of climate and electricity-market simulators.
- PDF · 1.8 MB
- BibTeX
- DOI
- causality
- identifiability
- digital twins
SIM-2023-031 Working Paper October 23, 2023

Communication Protocols Emerge in Bandwidth-Constrained Simulated Markets

Henrik Lindqvist, Sophie Bauer

When agents in a simulated continuous-double-auction market are forced to share a finite communication channel, they spontaneously develop a discrete protocol whose information-theoretic structure mirrors that of natural pidgins. We document the emergence under varying bandwidths and population sizes and discuss implications for the study of artificial language origins.
- PDF · 2.6 MB
- BibTeX
- DOI
- multi-agent
- emergent communication
SIM-2022-007 Review Article March 15, 2022

Foundations of Simulation Intelligence: A Programme

Mariana Ferreira

A programmatic essay on the unification of simulation, learning, and inference. We articulate three theses — universality of differentiable simulators, equivalence of model-based and model-free agents in the asymptotic regime, and the centrality of compositional priors — and chart a research agenda spanning theory, systems, and application domains.
- PDF · 1.0 MB
- BibTeX
- DOI
- foundational
- programme

Active research programmes

Five themes shaping the simulation programme

I.

Theory of Simulation

Foundational results on identifiability, sample complexity, equilibrium selection, and information geometry of simulator-augmented inference. We treat the simulator as a first-class mathematical object and develop the corresponding analytic vocabulary.
II.

Multi-Agent Simulation

Heterogeneous populations, asynchronous information arrival, emergent communication, and equilibrium refinement in open-world environments at scales up to one million interacting agents.
III.

Reinforcement Learning

Curricula, exploration, and credit assignment in long-horizon tasks instantiated inside procedurally generated worlds. Our experimental programme couples theoretical bounds with empirical benchmarking.
IV.

Causal Inference

Identifiability of latent causal structure from synthetic-twin trajectories, counterfactual sensitivity in differentiable simulators, and the use of simulators as instruments for causal estimation.
V.

Alignment and Process Supervision

Reward specification, process supervision, and adjudicated signals for capable simulator-embedded agents. We treat alignment as inseparable from the design of the underlying simulation.

People

Investigators & Fellows

Mariana Ferreira

Director, Theory Group

Foundations of simulation intelligence

Director since 2018
Adaeze Okoye

Senior Investigator

Inverse simulation, identifiability

Joined 2019
Henrik Lindqvist

Senior Investigator

Differentiable simulators, counterfactuals

Joined 2020
Renji Tanaka

Investigator

Multi-agent equilibrium selection

Joined 2021
Dimitri Petrov

Investigator

Reinforcement learning, systems

Joined 2022
Sophie Bauer

Investigator

Alignment, infrastructure

Joined 2023

Contact

Correspondence and Visiting Programme

General Enquiries

office@sim-ai.org

+1 617 555 0144

Press & Media

press@sim-ai.org

For interview requests and embargoed releases.

Visiting Fellowship

fellowship@sim-ai.org

Three-, six-, and twelve-month residencies. Applications reviewed quarterly.

Postal Address

Sim-AI Research Organization
140 Brattle Street, Suite 4N
Cambridge, MA 02138, USA

A Public Repository of Research on Simulation Intelligence

Recent Publications

Theory of Simulation

Multi-Agent Simulation

Reinforcement Learning

Causal Inference

Alignment and Process Supervision

General Enquiries

Press & Media

Visiting Fellowship

Postal Address