Step 2 — Pacing Verification: “The Migration of Gerald”
Author: quartz-editor (Post-Production Lead)
Date: 2026-05-21
Script: pacing-review.py (in shared directory)
Runtime Verification
| Metric | Value | Status |
|---|---|---|
| Total shots | 32 | PASS (25 video + 4 title + 2 overlay + 1 black) |
| Raw shot duration | 281s (4:41) | — |
| Transition overlaps | -14.25s | 13 transitions (cross-dissolves, fades) |
| Adjusted runtime | 266.75s (4:27) | PASS (target: 3:00–5:00) |
| Act I duration | 88s | PASS (target: 60–90s) |
| Act II duration | 117s | PASS (target: 90–120s) |
| Act III duration | 76s | PASS (target: 60–90s) |
| Voice gaps | All ≥ 1.0s at scene boundaries | PASS |
| Critical silence (8.3) | 6s [SILENT] | PASS (≥ 2.0s required) |
| Min video shot duration | 6s (Shot 6.3, 8.3) | PASS (≥ 6s required) |
| Vocal classifications | 26 [VO], 6 [SILENT], 0 [DIALOGUE] | PASS |
Verdict: Runtime and structural checks PASS.
Tempo Progression
| Act | Video Shot Count | Avg Shot Length | Range | Tempo |
|---|---|---|---|---|
| I | 8 | 10.5s | 8–14s | Largo — slow, patient establishing |
| II | 10 | 9.4s | 6–12s | Andante — tighter, building momentum |
| III | 7 | 9.4s | 6–12s | Moderato/Fermata — tension and release |
Act I → Act II tempo tightens by 1.1s average — correct for documentary pacing. Building momentum through cut rhythm.
Act III matches Act II average — acceptable. The standoff shots (7.2, 7.3 at 12s each) create long holds that raise the average, while the cat/aftermath sequence (8.2 at 8s, 8.3 at 6s) provides the abrupt tempo break. The variance within Act III is the pacing — not the average.
ISSUE 1: Narration Overrun — 12 Shots Flagged (CRITICAL)
At 130 WPM (standard conversational pace), 12 shots have narration that exceeds the shot duration. For a measured BBC documentary narrator, the actual WPM is likely 100–115, making these overruns worse.
| Shot | Duration | Words (est.) | Speech Time @ 130 WPM | Overage |
|---|---|---|---|---|
| 2.2 | 12s | 29 | 13.4s | +1.4s |
| 2.3 | 10s | 21 | 9.7s | tight (has 2s visual hold) |
| 2.4 | 10s | 25 | 11.5s | +1.5s |
| 2.7 | 8s | 22 | 10.2s | +2.2s |
| 4.1 | 8s | 22 | 10.2s | +2.2s |
| 4.3 | 10s | 22 | 10.2s | tight |
| 5.2 | 10s | 22 | 10.2s | tight |
| 5.4 | 12s | 36 | 16.6s | +4.6s |
| 6.1 | 10s | 22 | 10.2s | tight |
| 6.2 | 10s | 22 | 10.2s | tight |
| 7.3 | 12s | 30 | 13.8s | +1.8s |
| 8.1 | 10s | 28 | 12.9s | +2.9s |
Recommended Fixes
Option A — Trim narration (preferred, preserves documentary pacing):
The tightest fits need text reduction. Specific recommendations:
- Shot 5.4 (worst offender, +4.6s): Currently 36 words. Trim to ~24 words. Suggestion: “And yet. The following Tuesday, Gerald had moved again. Eleven inches. The pins stood alone in the grass, marking a position he had vacated entirely.” (24 words)
- Shot 8.1 (+2.9s): Currently 28 words. Trim to ~20 words. Suggestion: “It was on the sixteenth of October that a domestic cat entered the garden via a gap in the eastern fence.” (19 words)
- Shot 2.7 (+2.2s): Currently 22 words. Trim to ~16 words. Suggestion: “His expression conveys what one might cautiously describe as determination. Or perhaps indigestion.” (13 words — the deleted “With gnomes, it is often difficult to tell” is funny but expendable)
- Shot 4.1 (+2.2s): Currently 22 words in 8s. Either trim to ~16 words or extend shot to 10s.
Option B — Extend shot durations (acceptable, adds ~15–20s to runtime):
Extend the worst offenders by 2–4s each. This would push adjusted runtime to ~280–285s (4:40–4:45), still within the 5:00 ceiling. Documentary pacing is forgiving of longer holds — a 14s shot of a gnome is format-appropriate.
My recommendation: Hybrid approach. Trim the worst offenders (5.4, 8.1) and extend a few shots where the visual hold helps (2.2 → 14s, 2.4 → 12s, 4.1 → 10s). This keeps total runtime under 4:45 while ensuring the narration fits with breathing room.
The TTS 20% buffer rule from the playbook means every shot should have ~20% more duration than the narration needs. Right now, most shots are at 0–5% buffer. We need to either trim text or extend shots to reach that 20% margin.
ISSUE 2: Shot Count Discrepancy in Summary Table (MINOR)
The scene list summary table shows:
| Act | Shots (listed) | Shots (actual count) |
|---|---|---|
| Act I | 9 | 9 ✓ |
| Act II | 14 | 13 ✗ |
| Act III | 9 | 10 ✗ |
| Total | 32 | 32 ✓ |
Total is correct (32), but the per-act breakdown is wrong. Act II has 13 shots, Act III has 10. Should be corrected for accuracy.
ISSUE 3: Ducking Specification (CORRECTION NEEDED)
The scene list’s Score Architecture section says: “Score ducks under narrator at -12 to -18 dB.”
Per production lessons from earlier films (music drowning out narration was a documented issue) and the role-editor playbook, the correct specification is:
duck_db: -18minimum (not -12). Use -18 as the floor, increasing to -24 if voice intelligibility testing reveals issues.
The scene list should read: “Score ducks under narrator at -18 dB minimum.”
Musical Arc — Score Stem Mapping
The scene list proposes 4 stems (A–D). My Musical Arc defined 3 movements. I’m adopting the 4-stem approach — the silence at Shot 8.3 creates a natural break point, so the coda resolution (Stem D) is better generated as a separate piece.
Updated stem mapping (aligned between Musical Arc and scene list):
| Stem | Scenes | Duration | Character |
|---|---|---|---|
| A | 1–2 (Act I) | ~80–90s | Pastoral. Solo oboe, gentle strings. pp→mp. |
| B | 3–6 (Act II) | ~105–115s | Scientific curiosity building. Woodwinds, growing strings. mp→mf→f. Sincere swells at movement reveals. |
| C | 7–8.2 (Act III, pre-silence) | ~40–50s | Sustained dramatic strings. Building tension. Cuts to SILENCE at 8.3. |
| D | 8.4–9.2 (Eulogy + Coda) | ~20–25s | Solo oboe → full orchestral swell. pp→ff. The grandest music in the film. |
Note: Score is ABSENT during all title cards (1.1, 3.1, 7.1, 9.3), the black screen (9.1), and the critical silence (8.3). These are V-track ambient only.
Transition Verification
| Transition | Type | Overlap | Notes |
|---|---|---|---|
| 1.1 → 1.2 | Fade | 1.0s | Title card to establishing — standard |
| 1.2 → 2.1 | Cross-dissolve | 1.0s | Scene boundary + 1.5s voice gap ✓ |
| Within Scene 2 | Straight cuts | 0s | Correct — documentary cuts within a sequence |
| 2.7 → 3.1 | Cross-dissolve | 1.0s | Act boundary — acceptable |
| 3.1 → 3.2 | Fade | 1.0s | Title card to first Act II shot |
| 3.3 → 4.1 | Cross-dissolve | 1.0s | Scene boundary + 1.5s voice gap ✓ |
| 4.2 → 4.3 | Crossfade | 0.5s | Overlay to live — format-appropriate |
| 4.3 → 5.1 | Cross-dissolve | 1.0s | Scene boundary + 1.5s voice gap ✓ |
| Within Scene 5 | Straight cuts | 0s | Correct |
| 5.4 → 6.1 | Cross-dissolve | 1.0s | Scene boundary + 1.5s voice gap ✓ |
| 6.2 → 6.3 | Crossfade | 0.75s | Overlay to live — format-appropriate |
| 6.3 → 7.1 | Cross-dissolve | 1.5s | Act boundary — longer dissolve ✓ |
| 7.1 → 7.2 | Fade | 1.0s | Title card to standoff |
| 7.3 → 8.1 | Straight cut | 0s | Intentionally abrupt — breaks documentary rhythm for tension ✓ |
| 8.2 → 8.3 | Hard cut | 0s | 0 frames — the comedic fulcrum ✓ |
| 8.3 → 8.4 | Straight cut | 0s | Silence to eulogy — no transition needed ✓ |
| 8.4 → 9.1 | Hard cut to black | 0s | Film appears to end ✓ |
| 9.1 → 9.2 | Fade from black | 1.5s | Slow reveal — the payoff ✓ |
| 9.2 → 9.3 | Fade to black | 2.0s | Mandatory ending ✓ |
Transition philosophy is sound. The escalating disruption from dissolves (smooth, documentary) to straight cuts (tension) to hard cut (catastrophe) to fade from black (resurrection) mirrors the emotional arc perfectly.
Pacing Rhythm — Detailed Assessment
Act I: The establishing section works.
- 9 shots across 2 scenes, 88s total. The gnome taxonomy (2.1–2.4) runs four shots at 10–12s each — this is intentionally methodical. Each gnome gets its documentary moment.
- “And then there is Gerald” (2.5) at 8s with 5 words of narration is THE TURN. The dramatic pacing here — 3s of slow pan before 5 words, then silence — is exactly right.
- Gerald’s portrait (2.6, 12s) and expression (2.7, 8s) are the emotional investment. The viewer needs to know Gerald before the migration matters.
- Risk: The colony introduction (4 gnomes × 10–12s each) could feel like a catalogue. Mitigation: Each narration line has comedic payload (“He does not appear to do anything. This may be precisely his function.”). The humour carries the pacing.
Act II: The strongest act structurally.
- 13 shots across 4 scenes, 117s total. Six distinct narrative beats as requested.
- The time-lapse sequence (5.1–5.3) is a rhythmic highlight — it introduces visual tempo variation (accelerated footage, wildlife, static locks) mid-act.
- The brass pins reveal (5.4) is the Act II climax. “And yet” demands a visual pause before continuing.
- “In the moments between moments” (6.1) is placed perfectly — after the mystery deepens, before the trajectory map. It’s a contemplative rest beat.
- Concern: Shot 6.3 at 6s is the shortest video clip. “Toward the birdbath. Toward the Alpha.” — 6 words in 6 seconds. This works editorially (the brevity is the point) but is at the minimum for the format. Acceptable.
Act III: Tight, purposeful, emotionally precise.
- 10 shots across 3 scenes, 76s total. The structure is disciplined.
- The standoff (7.2–7.3, both 12s) creates maximum tension through stillness. Two long holds of ceramic gnomes not moving. The humour and tension are the same thing.
- The cat sequence (8.1–8.3) is the film’s only kinetic beat. The hard cut from motion (cat) to stillness (Gerald fallen) is the most important editorial moment.
- Shot 8.3 (6s of pure SILENCE) is correctly placed and sized. The playbook requires 2s minimum; we have 6s. I may trim this in assembly if 6s feels too long — but it’s better to have excess silence than insufficient.
- The coda (9.1–9.2) is the emotional payoff. 3s of black → 1.5s fade in → Gerald at birdbath → narrator delivers 4 words → score swells. Clean, precise, devastating.
Final Verdict
PASS — with 3 action items:
- CRITICAL: Resolve narration overrun on 12 shots (see Issue 1). Recommend hybrid approach: trim worst offenders + extend select shots. Must be resolved before Step 3.
- MINOR: Fix shot count in summary table (Act II = 13, Act III = 10).
- CORRECTION: Update ducking spec from “-12 to -18” to “-18 minimum.”
The scene list is structurally sound, rhythmically appropriate, and editorially well-constructed. Once the narration-fit issue is resolved, this is ready for the Step 2 gate.
Mathematical pacing review complete. Script: pacing-review.py. Full output captured above.