MINDSET · Months to result

Temporal Difference Reinforcement Learning

Learning from expectations

Problem it solves

Limiting beliefs

Best for

Individuals seeking to understand the neural basis of learning and motivation

Not ideal for

Those looking for a simple, straightforward framework for decision-making

Overview

Why this framework exists

Temporal Difference Reinforcement Learning is a framework for understanding how dopamine and other neuromodulators contribute to learning and motivation. It suggests that dopamine encodes moment-to-moment changes in the expectation of success or failure, even before a final outcome arrives. The underlying idea is that learning is a continuous process of updating expectations, and that dopamine plays a key role in that process.
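For readers who want the formal version, the standard temporal difference update from reinforcement learning can be written as below. The symbols are not from the source and are introduced here for illustration: V(s) is the current expectation for situation s, r is any reward received, gamma is a discount factor, alpha is a learning rate, and delta is the prediction error that the framework associates with dopamine.

```latex
\delta_t = r_{t+1} + \gamma V(s_{t+1}) - V(s_t), \qquad
V(s_t) \leftarrow V(s_t) + \alpha\, \delta_t
```

The error term compares one expectation against the next, which is why an update can happen before any final outcome is known.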

Core principles

3 total
  1. Dopamine is involved in encoding the expectation of success or lack of success
  2. Learning is a continuous process of updating expectations
  3. Dopamine plays a key role in motivation and decision-making

Steps

3 steps
  1. Identify the expectations
    Determine what expectations are driving behavior and decision-making. This may involve identifying the goals and motivations of an individual or group.
    Pro tip: Consider using techniques such as journaling or meditation to increase awareness of expectations and motivations.
    Warning: Be aware that expectations can be influenced by biases and heuristics, and may not always be accurate.
  2. Update expectations based on new information
    As new information becomes available, update expectations accordingly. This may involve revising goals or motivations based on new data or experiences.
    Pro tip: Consider seeking out diverse perspectives and sources of information to inform expectation updates.
    Warning: Be cautious of confirmation bias, and be open to revising expectations based on contradictory evidence.
  3. Use dopamine to drive motivation
    Use the anticipation of rewards or successes to drive motivation and behavior. This may involve setting goals or rewards that are aligned with expectations and motivations.
    Pro tip: Consider using techniques such as gamification or positive reinforcement to increase motivation and engagement.
    Warning: Be aware that over-reliance on dopamine can lead to addiction or other negative consequences.
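The steps above can be sketched as a small simulation. This is a minimal illustration of TD(0) value learning, not a model taken from the source: the chain of four stages, the single reward at the end, and the parameter values (alpha, gamma, final_reward) are all assumptions chosen for clarity.

```python
# Minimal TD(0) sketch. The chain, reward, and parameters are illustrative assumptions.

def td0_chain(n_states=4, episodes=200, alpha=0.1, gamma=1.0, final_reward=1.0):
    """Learn expectations on a fixed chain 0 -> 1 -> ... -> n_states-1 -> end."""
    values = [0.0] * n_states      # expectation held at each stage
    last_step_errors = []          # prediction error at the final stage, per episode
    for _ in range(episodes):
        for s in range(n_states):
            is_last = (s == n_states - 1)
            reward = final_reward if is_last else 0.0        # reward only at the end
            next_value = 0.0 if is_last else values[s + 1]   # expectation one step ahead
            delta = reward + gamma * next_value - values[s]  # TD (prediction) error
            values[s] += alpha * delta                       # update the expectation
            if is_last:
                last_step_errors.append(delta)
    return values, last_step_errors


values, last_step_errors = td0_chain()
print([round(v, 3) for v in values])
print(last_step_errors[0], last_step_errors[-1])
```

In this sketch the early stages never receive a reward directly, yet their expectations rise because each stage learns from the expectation of the stage after it. The surprise at the final stage shrinks as the outcome becomes predicted.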

Examples

1 case
The dating example

A person is dating someone new and is constantly updating their expectations based on new information. They may experience fluctuations in dopamine as their expectations change, driving motivation and behavior.

Outcome: The person may ultimately decide to continue or end the relationship based on their updated expectations and motivations.

Common mistakes

2 traps
Over-reliance on dopamine
Relying too heavily on dopamine can lead to addiction or other negative consequences. It is essential to maintain a balanced approach to motivation and decision-making.
Failure to update expectations
Failing to update expectations based on new information can lead to stagnation and poor decision-making. It is essential to remain open to new information and to revise expectations accordingly.
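The second trap can be made concrete with a toy comparison. The sketch below uses a simple error-driven update (a delta rule, which is simpler than full temporal difference learning) and hypothetical numbers: the same outcome repeated 20 times, with a learning rate of 0 standing in for "never updating" and 0.3 for "updating".

```python
def track_expectation(outcomes, alpha, initial=0.0):
    """Return the final expectation and the prediction error seen at each step."""
    expectation = initial
    errors = []
    for outcome in outcomes:
        error = outcome - expectation   # surprise: outcome minus expectation
        errors.append(error)
        expectation += alpha * error    # alpha = 0 means the expectation never moves
    return expectation, errors


outcomes = [1.0] * 20                   # hypothetical: the same outcome, 20 times
frozen_final, frozen_errors = track_expectation(outcomes, alpha=0.0)
updated_final, updated_errors = track_expectation(outcomes, alpha=0.3)
print(frozen_final, frozen_errors[-1])
print(round(updated_final, 3), round(updated_errors[-1], 4))
```

With no updating, the gap between expectation and reality stays as large on the last step as on the first. With updating, it shrinks toward zero.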

Origin story

How this framework came to be

Temporal difference learning was developed by Richard Sutton and Andrew Barto as a reinforcement learning method, and was later applied to the study of dopamine signaling. The idea was initially met with skepticism, but has since been supported by numerous studies in neuroscience and psychology.

Source

Traced to primary
Source · PODCAST
How Dopamine & Serotonin Shape Decisions, Motivation & Learning | Dr. Read Montague
Andrew Huberman · 2026