Does the timing and pattern of reward change how you respond? Press a lever under four different schedules and watch the classic cumulative records emerge.
Ratio (N): responses needed for FR; mean responses for VR
Interval (T): seconds for FI; mean seconds for VI
🏭 Like factory piecework – paid per N widgets assembled
🍪 Like checking cookies in the oven – first peek after N minutes
🎰 Like a slot machine – you never know which pull will pay
📧 Like checking email – a reply could arrive at any moment
Start the simulation to see how four different reinforcement schedules produce four distinctly different patterns of behavior – even though the "reward" itself is the same.
B.F. Skinner discovered that the pattern of reinforcement matters as much as the reinforcement itself. Even with the same reward, changing when and how it's delivered produces dramatically different behavioral patterns.
Reinforcement schedules are defined along two dimensions:
| | Fixed | Variable |
|---|---|---|
| **Ratio** (response count) | **FR** – reinforce every Nth response. Break-and-run pattern. | **VR** – reinforce after ~N responses (varies). High, steady rate. |
| **Interval** (elapsed time) | **FI** – reinforce first response after T sec. Scalloped pattern. | **VI** – reinforce first response after ~T sec (varies). Moderate, steady rate. |
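The four rules in the table can be expressed compactly in code. Below is a minimal Python sketch of the reinforcement logic – not the simulation's actual implementation; the `make_schedule` function and its parameter names are invented for illustration. Each schedule decides, on every lever press, whether that press earns a reward.

```python
import random

def make_schedule(kind, n=10, t=5.0):
    """Return a respond(now) -> bool function for one schedule.

    kind: 'FR', 'VR', 'FI', or 'VI'
    n:    ratio requirement (exact for FR, mean for VR)
    t:    interval in seconds (exact for FI, mean for VI)
    """
    state = {"resp": 0, "last": 0.0, "req": n, "wait": t}

    def respond(now):
        state["resp"] += 1
        if kind == "FR":    # every Nth response
            hit = state["resp"] >= n
        elif kind == "VR":  # after ~N responses, requirement varies
            hit = state["resp"] >= state["req"]
        elif kind == "FI":  # first response once T seconds have elapsed
            hit = now - state["last"] >= t
        else:               # VI: first response once ~T seconds have elapsed
            hit = now - state["last"] >= state["wait"]
        if hit:
            # Reset the counter/clock and draw fresh variable requirements
            state["resp"] = 0
            state["last"] = now
            state["req"] = random.randint(1, 2 * n)           # mean ~ n
            state["wait"] = random.uniform(0.5 * t, 1.5 * t)  # mean ~ t
        return hit

    return respond
```

Note how FR and VR ignore the clock entirely, while FI and VI ignore how many presses have accumulated – only the first press after the interval "opens up" matters.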
Fixed Ratio (FR): Produces a break-and-run pattern. After each reinforcement, the organism pauses (the "post-reinforcement pause"), then responds in a rapid burst until the next reinforcement. The cumulative record shows flat segments followed by steep ramps. Think of a student powering through homework problems – a break after each set, then a concentrated burst.
Fixed Interval (FI): Produces the classic scallop. Responding is very slow right after reinforcement (why respond when it won't pay off yet?) and accelerates as the interval approaches. Like checking cookies in the oven – you don't bother at first, but peek more and more often as the timer approaches zero.
Variable Ratio (VR): Produces the highest, steadiest response rate of all four schedules. Since any response could be the one that pays off, there's no logical time to pause. This is the slot-machine schedule – and it's why gambling is so hard to quit. It's also the most resistant to extinction.
Variable Interval (VI): Produces a moderate, steady response rate. Since the interval varies unpredictably, a consistent checking rate is the best strategy – not too fast (wastes effort), not too slow (misses opportunities). Like checking your email for a reply that could come at any point.
The cumulative record is Skinner's signature visualization. The x-axis is time, the y-axis is total responses so far. The line can only go up or stay flat โ never down.
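The "up or flat, never down" property falls directly out of how the record is built: it is a running count of responses sampled over time. Here is a short sketch of how one might compute it from press timestamps (the function name and signature are hypothetical, for illustration only):

```python
def cumulative_record(press_times, duration, dt=1.0):
    """Sample the cumulative record at regular time steps.

    press_times: sorted timestamps of lever presses (seconds)
    duration:    total session length (seconds)
    Returns a list of (time, total_responses_so_far) points.
    The count only ever increases or stays flat -- never decreases.
    """
    record = []
    count = 0
    idx = 0
    steps = int(duration / dt)
    for i in range(steps + 1):
        t = i * dt
        # Advance past every press that happened by time t
        while idx < len(press_times) and press_times[idx] <= t:
            count += 1
            idx += 1
        record.append((t, count))
    return record
```

Plotting these points yields the flat-segments-and-ramps shapes described above: steep where presses are dense (FR bursts, the end of an FI scallop), flat during pauses.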
1. Switch to Simulation mode and press Start – watch all four patterns develop simultaneously.
2. In Manual mode, try pressing the FR lever rapidly. Notice how you get reinforced every N presses.
3. Try the FI lever – can you learn to wait for the interval and respond right when it "opens up"?
4. Increase the ratio to 30 – the post-reinforcement pause in FR gets longer.
5. Compare VR and FR – same average requirement, but VR keeps you pressing without any pause!
Reinforcement schedules explain patterns of behavior we see everywhere: why slot machines are addictive (VR), why students cram before exams (FI scalloping), why piecework pay motivates bursts of productivity (FR), and why we check social media at a steady rate (VI). Understanding these schedules gives us a powerful framework for analyzing – and designing – incentive structures.