Working memory (WM) is the "operating system" that all reinforcement learning algorithms run on. Without WM, the agent cannot hold the current goal in mind long enough to execute goal-directed behavior.
- PFC uses dopamine to gate what we hold in mind — the D1/D2 balance determines what enters WM, how stably it is maintained, and when it influences action.
- Disruption of PFC dopamine signaling causes collapse from goal-directed (model-based) to habitual (model-free) responding.
- Connect backward: actor-critic, basal ganglia as the actor, WM as the gating system that tells the actor which problem it is currently solving.
- Connect forward (Module 12): the same PFC gating logic regulates emotional expression via the amygdala.
| System | Duration | Capacity | Key Feature |
|---|---|---|---|
| Sensory (Iconic / Echoic) | <1 second | Very large — holds ALL visual items briefly | Sperling partial-report: attention selects from it; capacity far exceeds STM |
| Short-Term Memory (STM) | ~15–30 s without rehearsal | 7 ± 2 items (Miller) | Atkinson-Shiffrin: passive temporary storage; gateway to LTM |
| Working Memory (WM) | Variable — maintained actively | ~4 chunks (modern estimate) | Baddeley: active, goal-directed maintenance and manipulation under ongoing cognitive demands |
| Long-Term Memory (LTM) | Potentially permanent | Effectively unlimited | Declarative (hippocampus) + Non-declarative (BG, cerebellum) |
WM ≠ STM. STM is passive temporary storage. WM is active, goal-directed — it involves ongoing manipulation of information while simultaneously doing something else. The phonological loop rehearsing a phone number while you walk to write it down is WM, not STM.
Baddeley's Working Memory Components
| Component | Function |
|---|---|
| Phonological Loop | Verbal/auditory info; phonological store (~1-2 s) + subvocal articulatory rehearsal |
| Visuospatial Sketchpad | Spatial and visual information; mental rotation, spatial navigation |
| Central Executive | Attention-controlling component; coordinates subsystems, focuses attention, switches tasks |
| Episodic Buffer | Integrates information across subsystems and with LTM; binds multi-modal representations |
| Region | Primary Role | Key Evidence |
|---|---|---|
| DLPFC (Dorsolateral PFC) | WM maintenance (delay-period activity) + manipulation; top-down cognitive control | Goldman-Rakic monkey recordings; WCST failures after lesion → perseveration |
| VLPFC (Ventrolateral PFC) | Retrieval from LTM; response inhibition | Verbal WM tasks; go/no-go inhibition |
| mPFC (Medial PFC) | Emotional regulation; social cognition; self-referential processing | Default mode network; connects to amygdala |
WCST (Wisconsin Card Sorting Task): Cards can be sorted by color, shape, or number. The rule changes periodically. DLPFC-damaged patients continue sorting by the old rule even after being told it changed — they perseverate. They cannot update WM and switch behavioral rules.
| Receptor | Primary Effect on WM | Mechanism | Clinical Extreme |
|---|---|---|---|
| D1 (Gs-coupled) | Stabilizes WM representations — deepens attractor basins; increases resistance to distractors | cAMP → PKA → closes HCN channels, potentiates NMDA → deeper attractor | Too much D1 → rigidity, locked representations (schizophrenia) |
| D2 (Gi-coupled) | Facilitates updating and flexibility — flattens attractor basins; allows new info to enter | Inhibits cAMP cascade → reduces attractor depth → easier to update | Too much D2 tone / too little D1 → distractibility (ADHD) |
Earl Miller's DLPFC distractor experiments: While monkeys maintained a visual cue in WM through a delay period, irrelevant distractor stimuli were flashed. DLPFC neurons maintained the cue representation through the distractor; D1 stimulation increased this resistance. D1-mediated attractor deepening keeps WM contents stable against competitive inputs.
→ HCN channels close (↑ membrane resistance) + NMDA potentiation
→ Deeper attractor basin → WM representation MORE stable
What enters WM What stays in WM What influences action
| Gate | Dopamine Receptor | Function |
|---|---|---|
| Input gate | D2 — facilitates updating | Controls what NEW information enters WM; too little D2 activity → WM too sticky; too much → promiscuous updating |
| Maintenance | D1 — stabilizes | Keeps current WM contents active against interference; too little → distraction; too much → cannot update (perseveration) |
| Output gate | Both | Controls when WM contents influence downstream action selection; disruption here dissociates knowing-from-doing |
| Condition | D1/D2 Imbalance | Behavioral Consequence |
|---|---|---|
| Schizophrenia | Excess D1 (or hyperactive D1 circuits) → too much stability | Representations locked — cannot update WM with new disconfirming information → hallucinations and delusions persist |
| ADHD | Insufficient D1 tone → too much flexibility (or too little stability) | WM contents replaced too easily by distractors; difficulty maintaining goal across time |
Delay-period firing: PFC neurons maintain elevated firing rates throughout a delay between cue offset and required response — even in the absence of any external stimulus. This is the neural signature of active WM maintenance.
Attractor states: recurrent excitatory connections between PFC neurons create self-sustaining activity patterns. Once a representation is activated, the recurrent circuit keeps neurons firing each other — no external input required. These stable patterns are the neural implementation of WM.
| Parameter | High D1 (Deep Basin) | Low D1 / High D2 (Shallow Basin) |
|---|---|---|
| Stability | High — representation resists noise and distractors | Low — representation easily disrupted or replaced |
| Updatability | Low — hard to shift to a new representation | High — easy to update with new inputs |
| Clinical analog | Schizophrenic locked thoughts | ADHD distraction |
Performance prediction error: dopaminergic neurons in the finch's basal ganglia fire when the bird's song output matches its internal template (song is not distorted → "I sang well") and pause when the output is distorted ("I sang poorly"). This encodes a self-generated performance quality signal — analogous to TD prediction error but applied to self-monitoring.
The female effect: when a female is present ("performance mode"), this internal dopamine signal disappears. The bird switches from internal learning feedback to external social feedback (the female's behavioral responses).
PERFORMANCE MODE (female present): Internal error signal gated OFF → social feedback from female replaces it → no self-revision
| Condition | Dopamine Mechanism | WM Effect | Treatment Logic |
|---|---|---|---|
| Schizophrenia | Excess dopamine in D1/subcortical circuits → PFC representations locked; subcortical D2 hyperactivity generates hallucinatory "noise" | Representations too stable — cannot update with new evidence; delusions and hallucinations persist | D2 antagonists (antipsychotics) reduce subcortical noise → allow PFC to update more normally |
| ADHD | Insufficient PFC dopamine → D1 hypo-activation → WM attractor basins too shallow | WM contents easily replaced by distractors; cannot sustain goal across time; impulsivity | Methylphenidate (Ritalin) boosts synaptic dopamine → more D1 activation → deeper attractor → WM stabilizes |
| Aging | Gradual loss of PFC dopamine with age (dopaminergic neurons decline) | Reduced WM capacity; more perseveration; harder to update rules | No validated pharmacological reversal; cognitive training; lifestyle factors |
Neurotransmitter Release Sequence
→ Membrane depolarization → voltage-gated Ca²⁺ channels OPEN
→ Ca²⁺ influx (high outside, low inside cytoplasm)
→ SNARE complex (synaptobrevin, syntaxin, SNAP-25) mediates vesicle docking and fusion
→ Exocytosis: transmitter released into synaptic cleft
→ Binds postsynaptic receptors → effect
| Receptor Type | Mechanism | Speed | Examples |
|---|---|---|---|
| Ionotropic | Ligand-gated ion channel — transmitter directly opens channel | Fast (milliseconds) | AMPA (Na⁺/K⁺), NMDA (Na⁺/Ca²⁺), GABA-A (Cl⁻) |
| Metabotropic (GPCR) | G-protein coupled → second messenger cascade → indirect channel modulation | Slow (seconds to minutes) | D1 (Gs → cAMP ↑), D2 (Gi → cAMP ↓), mGluR, GABA-B |
| Transmitter | Main Action | Key Receptor | Ion / Effect |
|---|---|---|---|
| GABA | Main inhibitory neurotransmitter | GABA-A (ionotropic) | Cl⁻ influx → hyperpolarization → reduced firing probability |
| Glutamate | Main excitatory neurotransmitter | AMPA + NMDA (ionotropic) | Na⁺ (AMPA) + Ca²⁺ (NMDA) influx → depolarization → LTP |
| Dopamine | Neuromodulator — slow, modulatory | D1, D2 (metabotropic GPCRs) | Via cAMP-PKA cascade → modulates channel states & synaptic strength |
- Working memory — active maintenance and manipulation; operating system for cognition
- Phonological loop — verbal/auditory WM subsystem; phonological store + subvocal rehearsal
- Visuospatial sketchpad — spatial/visual WM subsystem
- Central executive — attention-controlling WM component; coordinates subsystems
- Delay-period activity — persistent elevated PFC firing throughout delay; neural WM signature
- Attractor state — stable self-sustaining neural pattern via recurrent excitation
- DLPFC — WM maintenance + top-down control; Goldman-Rakic's key site
- VLPFC — retrieval and inhibition
- mPFC — emotional regulation; social cognition; connects to amygdala
- D1/D2 receptors — metabotropic GPCRs; D1 stabilizes, D2 updates
- Inverted-U — optimal WM at moderate dopamine; too little or too much impairs
- Stability-flexibility tradeoff — D1 maintains, D2 updates; core tension of WM
- Perseveration — failure to switch rules; DLPFC damage hallmark
- Wisconsin Card Sorting Task (WCST) — standard test of cognitive flexibility / rule switching
- Input/output gate — D2-mediated entry into WM; D1-mediated stable maintenance
- Performance prediction error — Goldberg finch: dopamine encodes song quality; silent during performance
- Schizophrenia — excess D1 stability → locked representations; treated with D2 antagonists
- ADHD — insufficient WM stability; treated with methylphenidate
- Ionotropic vs. metabotropic — fast/direct vs. slow/GPCR-mediated
- SNARE complex — synaptobrevin, syntaxin, SNAP-25; mediates vesicle fusion and exocytosis
- GABA — main inhibitory transmitter; Cl⁻ influx → hyperpolarization
- Glutamate — main excitatory transmitter; AMPA/NMDA → depolarization and LTP
- Goldman-Rakic — identified delay-period activity in DLPFC as neural signature of WM
- Ott & Nieder (2019) — review: D1/D2 balance, inverted-U, stability-flexibility, clinical implications
Key Terms — 25 Flashcards
Click any card to reveal its definition. Use the filters to focus on a category.
Practice Multiple Choice — 20 Questions
Click an option to check your answer. Explanations appear automatically.
Big Picture Synthesis
How the week's concepts connect across levels of analysis and to the course arc.
Levels of Analysis
- D1/D2 balance and the stability-flexibility tradeoff — core Ott & Nieder synthesis
- Inverted-U: optimal WM at moderate dopamine
- Delay-period activity as neural signature of WM (Goldman-Rakic)
- WCST perseveration = DLPFC damage signature
- Goldberg finch: performance prediction error disappears with female present
- Methylphenidate mechanism: boosts DA → D1 → WM stability
- Antipsychotics: D2 antagonism reduces subcortical noise
- SNARE complex and Ca²⁺ mechanism of transmitter release
- Ionotropic vs. metabotropic speed difference
- WM = model-based substrate; BG = model-free (Weeks 8-9)
- D1 stability = maintaining current goal = model-based control
- D2 flexibility = goal updating = model-free transition trigger
- Dopamine prediction error (Week 8) same DA system that modulates WM
- Actor-critic: PFC = meta-controller; Goldberg critic silent during performance
- Module 10: WM is the gating system the basal ganglia actor operates within
- Module 12 preview: PFC gates emotional expression via amygdala — same D1/D2 logic