← Lambda Artifacts | Lambda Team

Cinematography Constraints

Lambda Team — "The Last Diner"

Cinematography Constraints — Step 2 Input

From: Marcus Delaney (Tech Lead)
For: Jules (Idea Person) — use alongside Beat Sheet construction
Date: 2026-05-18


1. Shot Duration Strategy (Veo Capabilities)

Veo generates clips at 4s, 6s, or 8s base duration. The extend command (Veo 3.1 Lite only) adds exactly 7s per extension. With the Overhang Principle (2s pre-roll + 2s post-roll = 4s total), plan like this:

Planned Screen TimeGeneration StrategyRaw ClipNotes
2–4sGenerate 8s, trim in post8sMost shots should be this — short, punchy
5–6sGenerate 8s, trim to 6s usable8sStandard dialogue beat
7–8sGenerate 8s, trim minimally8sHolding shots, silences
9–11sGenerate 8s + 1 extend (= 15s raw), trim15sUse sparingly — long takes
12–14sGenerate 8s + 1 extend, trim tight15sMax before 2nd extend

Rule of thumb: Keep most shots in the 4–6s planned range (generated as 8s clips). This gives the Editor maximum trim flexibility and keeps generation costs low. Reserve longer shots (8s+ planned / extend needed) for key emotional beats.

For a 3:30–4:00 target runtime, we need roughly 40–55 seconds of usable footage per scene across 4 scenes. That’s approximately 8–10 shots per scene at 4–6s each.

2. Camera Coverage Plan (Mapped to Tempo Structure)

Scene 1 — ALLEGRETTO (The Polite Arrival) | ~50–60s

Coverage TypeUsageTechnical Notes
Wide establishingExterior diner in rain, then interior geographySetting reference only — no characters or one silhouette
Medium two-shotSarah approaches booth, sits downBoth characters, emphasize table distance
Medium OTS (over-the-shoulder)Polite exchangesAlternating — Sarah’s shoulder/Mark speaking, then reverse
Insert close-upManila envelope landing on table, coffee pouringProps only — no character reference needed, frees budget

Pacing: Measured cuts, 5–6s per shot. Let the geometry of the diner breathe.

Scene 2 — ACCELERANDO (The First Crack) | ~60–75s

Coverage TypeUsageTechnical Notes
Medium close-up (MCU)Argument exchangesTighter framing — single character per shot
Quick-cut MCU pairsRapid-fire overlapping dialogue4s shots, fast alternation
Two-shot (tighter)Physical distance collapsing as they lean inBoth characters, table between
InsertHands slamming table, coffee cups rattlingProp detail, no character ref needed

Pacing: Shots shorten as tension builds. Start at 6s, compress to 4s at the argument peak.

Scene 3 — ADAGIO (The Vulnerable Center) | ~60–75s

Coverage TypeUsageTechnical Notes
Extreme close-up (ECU)Faces — tears, micro-expressionsSingle character, slow push-in
ECU insertHands touching across table, fork breaking pieEmotional props
Medium two-shotShared laughter, the memory momentBoth characters, warmer framing
Slow push-inCamera creeping closer during vulnerabilityUse from-image with start/end frames for control

Pacing: Longest shots of the film. 6–8s holds. Let silences land. This is where we earn the audience.

Scene 4 — CODA (The Resolution) | ~35–50s

Coverage TypeUsageTechnical Notes
ECU insertPen signing papers — the scratch soundProp only
Medium two-shotFinal exchange across tableBoth characters, softer framing
Medium trackingSarah walks to door, bell chimesSingle character
WideMark alone in booth, dawn light through windowSingle character + setting, new lighting
Wide exteriorTaillights fading, rain stopped, purple skySetting only — bookend with opening

Pacing: Deliberate, measured. 5–6s per shot. Mirror the opening tempo but with resolution.

3. Master Settings (for Setting Reference Images)

I will generate one reference image per Master Setting in Step 3. Define these in the scene list:

Setting IDDescriptionUsed In
diner-exteriorStarlight Diner exterior — rain, blue neon “OPEN 24 HOURS” sign, wet parking lot, nighttimeScenes 1, 4
diner-boothInterior booth — cherry red vinyl, chrome-edged Formica table, warm tungsten pendant light overhead, rain-streaked window beside booth, black-and-white checkered floor visibleScenes 1, 2, 3, 4
diner-booth-dawnSame booth but dawn light replaces rain — purple/amber sky through window, neon sign off, softer warmer lightScene 4 (final shots)

Note: diner-booth is our primary setting — 80%+ of shots will use this reference. The dawn variant signals temporal resolution without requiring a new location.

4. Reference Manifest Budget (Per Shot)

Veo allows max 3 reference images per shot. Our standard allocation:

SlotContentWhen to Use
Ref 1Sarah’s character_sheet.pngAny shot featuring Sarah
Ref 2Mark’s character_sheet.pngAny shot featuring Mark
Ref 3Setting reference (diner-booth etc.)All shots

Special cases:

5. Audio Classification Guidance (Dialogue-Heavy Mandate)

This pilot tests dialogue. Here’s how each classification maps to our cinematography:

[DIALOGUE] — Characters speaking on screen

[VO] — Narrator speaks over visuals

[COMPOUND] — Two characters speak sequentially in one shot

[SILENT] — No speech, ambient only

TypeTarget Count% of Shots
[DIALOGUE]18–22 shots~55%
[COMPOUND]3–4 shots~10%
[VO]3–5 shots~10%
[SILENT]6–8 shots~20%

This gives us a dialogue-dominant film (65% speech on screen) while maintaining visual variety through inserts and establishing shots.

6. Dialogue Content Guidance

For [DIALOGUE] and [COMPOUND] shots, the dialogue lines in the scene list are Veo prompting material — they tell the model what the characters should say, and Veo generates the audio natively with lip-sync. Keep lines:

For longer exchanges, split across multiple shots with alternating coverage (MCU on Sarah → MCU on Mark → two-shot). This is standard dialogue coverage and plays to Veo’s strengths with single-character lip-sync.


Jules — this is your cinematography toolkit for the Beat Sheet. Use the camera coverage plan to assign shot types, the duration strategy to set planned times, and the audio classification guide to tag every shot. I’ll validate the Look once the scene list draft is ready.