Skip to main content

Sim-to-Real

Sim-to-real is the set of techniques used to make a policy trained in simulation work on real hardware. It includes domain randomisation, system identification, residual learning, world-model-based fine-tuning, and the broader engineering practice of closing the reality gap between a simulator's contact, sensor, and actuator models and the physical robot.

From an engineering standpoint, sim-to-real is the discipline that decides whether all the upstream investment in simulators, datasets, and training compute actually produces a deployable system. The reality gap is rarely a single problem — it is a stack of small mismatches in dynamics, latency, sensor noise, and actuator behaviour that only become visible under closed-loop control. Successful pipelines treat the gap as something to measure, not just bridge.

When choosing between techniques, match the gap source to the method: dynamics gaps respond well to domain randomisation and system identification; perception gaps respond to photoreal rendering and randomised sensor noise; long-tail behavioural gaps often need real-world fine-tuning or world-model adaptation. Always pair sim-to-real claims with at least one real-robot evaluation under controlled perturbations.

Start here

Domain Randomization (Tobin et al.) is the foundational reference for the field — read it first, even when using more modern methods, because every later technique is a refinement of the framing it introduced.