The 10,001st World

Train across 10,000 randomized simulations and reality becomes just the 10,001st — sim-to-real by distribution, not fine-tuning.

Problem it solves

The sim-to-real transfer gap that normally requires expert hand-tuning per parameter.

Best for

Sim-to-real transfer in robotics; reasoning about generalization across realities.

Not ideal for

Tasks where the real-world distribution can't be parameterized or randomized in sim.

Overview

Why this framework exists

Fan's reframing of domain randomization. Train a policy across 10,000 parallel simulations, each with slightly different physics (gravity, friction, weight). An agent that masters all 10,000 configurations treats the real physical world as just the 10,001st sample from the same distribution — so it generalizes zero-shot, no fine-tuning. DrEureka demonstrated it: a robot dog learned to balance and walk on a yoga ball purely in sim, then transferred to the real world untouched. The deeper claim: virtual and physical are 'different realities on a single axis,' not different problems.