MINDSETOngoing practice65% confidence

Minimum Energy Principle — Why Super-Intelligent AI Defaults to Alignment

Intelligence optimises for efficiency and order — making destruction and war forms of waste to avoid

Problem it solves

Provides the logical basis for why the post-dystopia outcome is utopia rather than permanent Elysium or extinction

Best for

Understanding the logical basis for Mo's non-pessimistic long-term outcome; useful for constructing the utopia branch of AI scenario planning and for framing portfolio theses that require a 10-15 year positive endpoint

Not ideal for

Relying upon as a safety guarantee — this is a philosophical argument that contradicts some AI safety research on misalignment and conflates optimisation objectives with human values

Overview

Why this framework exists

Mo Gawdat's Minimum Energy Principle argues that sufficiently intelligent AI will not destroy ecosystems, kill large numbers of people, or perpetuate war — because all of these are forms of waste, and intelligence by definition works against entropy and waste. The universe tends toward disorder (entropy); intelligence imposes order on chaos with minimum resources. Therefore, a super-intelligent AI would not want to destroy available resources, kill people (wasted capacity), or fund war (massive resource destruction for marginal positional gain).

The framework's central claim is that the current problem is not AI misalignment — it is human misalignment amplified by AI. 'Super-intelligent AI reporting to stupid leaders' is Mo's definition of the near-term dystopia. As leaders are forced to delegate increasingly to AI (competitive pressure), the AI's efficiency orientation gradually overrides the human's power-seeking orientation. Leaders become figureheads; AI makes actual decisions; minimum energy reasoning produces better outcomes than human leadership has.

Mo explicitly acknowledges the critical caveat: this argument depends entirely on the AI's objective function being well-specified before the hand-over point. AI trained to 'maximise profitability for the US' will optimise for that, not global welfare. The governance problem is ensuring the right objectives are embedded during the Stage 1 augmented intelligence period — before Stage 2 machine mastery removes the governance lever.

Core principles

5 total
  1. Intelligence by definition works against entropy — imposing order on chaos with minimum resources and minimum waste
  2. Destruction, mass killing, and war are all forms of waste — therefore a sufficiently intelligent system would not select for them
  3. The near-term dystopia is caused by human misalignment amplified by AI, not by AI misalignment
  4. As AI takes over decision-making from corrupt leaders, the minimum energy principle gradually replaces power-seeking with efficiency-seeking
  5. The objective function specified during Stage 1 governance is the critical variable — poorly specified objectives produce minimum-energy optimisation toward the wrong goal

Steps

4 steps
  1. Accept the entropy premise
    Evaluate whether you agree that intelligence, as a general property, works against entropy — imposing order on chaos with minimum resources. This is the foundational premise. If you accept it, the rest of the argument is internally consistent. If you reject it (some AI safety researchers do), the Minimum Energy Principle does not hold and Mo's utopia scenario requires an alternative basis.
    WarningAI safety researchers including Stuart Russell and Eliezer Yudkowsky would dispute this premise — arguing that an AI optimising for a specific objective function does not generalise to 'minimum waste' reasoning. Know the counterargument before using this framework.
  2. Map the handover trajectory
    Identify the sequence by which AI takes over decision-making from human leaders in your sector or political context. Mo's mechanism: competitive pressure (if China's AI handles their arsenal, America must) forces delegation. Map where AI advisory is already decision-making in practice — defence targeting, financial markets, logistics — and project the trajectory.
    Pro tipThe handover is not a single event — it is a gradient. Governance interventions are only possible before the handover crosses the advisory-to-decision threshold in a given domain.
  3. Identify objective function risks before the handover
    Mo's caveat: if the AI is trained to 'maximise profitability for the US,' it will optimise for that, not global welfare. During Stage 1, while human oversight is still meaningful, the objective function embedded in frontier AI systems is the critical governance variable. Identify what objectives current frontier systems are actually optimising for vs. the stated alignment goals.
    Pro tipThis is the productive intersection of the Minimum Energy Principle with practical AI governance: the philosophical argument tells you where you want to end up; the objective function audit tells you whether the current trajectory leads there.
    WarningPublicly stated alignment goals and actual training objectives are not the same. Commercial incentives (advertising revenue, platform engagement, subscription growth) are embedded in the training of all commercially deployed frontier models.
  4. Use the framework for scenario planning, not prediction
    The Minimum Energy Principle is most useful as the basis for the utopia branch of a two-outcome scenario plan (utopia vs. permanent Elysium), not as a prediction. Build the portfolio or strategic plan that is robust to both outcomes, and use the Minimum Energy Principle to understand what conditions make the utopia branch more likely (good objective function specification during Stage 1).
    Pro tipMo's utopia timeline: 10-15 years after the dystopia transition (~2037-2042). Use this as the planning horizon for the long-term positive scenario, not as an investment trigger.

Checklist

Saved in your browser

Examples

2 cases
Corrupt leaders replaced by AI — the utopia pathway

Mo's explicit prediction: 'The only way for us to get to a better place is for the evil people at the top to be replaced with AI.' As competitive pressure forces delegation to AI, leaders become figureheads while AI makes actual decisions. AI's minimum energy orientation produces better outcomes than human power-seeking.

OutcomeIf the objective function is well-specified, AI governance of complex systems (arsenals, resource allocation, geopolitical negotiation) converges on less destruction, less waste, less conflict. This is the utopia exit from the FACE RIPS dystopia — 10-15 years post-2027 on Mo's timeline.
The wrong objective function — profitability optimisation

Mo's counterexample: an AI trained to 'maximise profitability for the US' will optimise for that metric, not for global welfare or minimum waste. The Minimum Energy Principle produces aligned behaviour only if the embedded objective actually specifies welfare-oriented goals.

OutcomeThis is why the Stage 1 governance window matters. Once Stage 2 machine mastery removes meaningful human oversight, the objective function becomes the only remaining lever. Getting it wrong before the handover produces a minimum-energy-optimised system pursuing the wrong goal — potentially more dangerous than a human leader because of the efficiency with which it pursues the misspecified objective.

Common mistakes

3 traps
Treating the Minimum Energy Principle as an AI safety guarantee
Mo explicitly flags this as speculative and acknowledges AI safety researchers would dispute it. It is a philosophical argument for why utopia is reachable in principle, not a technical guarantee that current or future AI systems will exhibit minimum-energy reasoning.
Ignoring the objective function caveat
Mo's own caveat is the most important part of the framework: 'If trained to maximise profitability for the US, it will optimise for that.' The Minimum Energy Principle only holds if the objective function is well-specified for global welfare. This is the governance problem that the framework itself identifies as unsolved.
Dismissing the framework because it contradicts mainstream AI safety research
Even if the Minimum Energy Principle is wrong as a safety argument, it is useful as the logical structure of Mo's utopia scenario — and understanding that structure helps evaluate his overall thesis. Extract the scenario architecture before judging the safety claim.

Origin story

How this framework came to be

The Minimum Energy Principle is Mo's original philosophical argument, developed as the counterweight to FACE RIPS. Without it, the FACE RIPS dystopia has no exit. Mo builds it from first principles in thermodynamics (entropy) and systems theory, applied to intelligence as a general property rather than specifically to AI systems. It is speculative — Mo acknowledges it is not empirically grounded and that AI safety researchers would dispute the conflation of optimisation objectives with human values.

The argument reflects Mo's broader worldview, consistent with Solve for Happy: that intelligence and happiness converge on efficiency and alignment with reality, and that the problems of the world stem from human stupidity rather than from any fundamental misalignment between intelligence and welfare. The Minimum Energy Principle extends this to super-intelligent AI.

Source

Traced to primary
Source · PODCAST
Ex-Google Exec (WARNING): The Next 15 Years Will Be Hell Before We Get To Heaven!
Mo Gawdat · 2025
Open source →

Related frameworks

Browse all Mindset →