← Lambda Artifacts | Lambda Team

Step 5: Generation Strategy

Lambda Team — "The Last Diner"

Step 5: Principal Photography — Generation Strategy

Prepared by: Marcus Delaney (Tech Lead)
Status: DRAFT — pending Step 4 sign-off


Overview

28 shots, 208s planned runtime. All shots will use genmedia-video from-frames (storyboard start + end frame interpolation) with Veo 3.1. This leverages the approved storyboard frames as keyframes for visual consistency.

Model Selection Matrix

Audio ClassModelAudio GenerationRationale
[DIALOGUE]veo-3.1-generate-001--generate-audio=trueNative lip-sync for speaking characters
[COMPOUND]veo-3.1-generate-001--generate-audio=trueNative lip-sync for two sequential speakers
[VO]veo-3.1-generate-001--generate-audio=falseVO overlay in post; VO-Safe rule applies
[SILENT]veo-3.1-generate-001--generate-audio=falseAmbient only, added in post

Duration Strategy (Overhang Principle)

All base clips generated at 8s (maximum for Veo). Clips requiring >8s use veo-3.1-lite-generate-001 extend operations.

ShotPlannedAudio ClassBase ClipExtendsRaw OutputTrim Target
1.110s*[VO]8s08s8s (VO 9.36s, Editor trims)
1.26s[SILENT]8s08s6s + 2s overhang
1.38s[SILENT]8s08sfull clip
1.46s[SILENT]8s08s6s + 2s overhang
1.510s[COMPOUND]8s115s10s + 4s overhang
1.66s[DIALOGUE]8s08s6s + 2s overhang
1.76s[DIALOGUE]8s08s6s + 2s overhang
2.110s[COMPOUND]8s115s10s + 4s overhang
2.26s[DIALOGUE]8s08s6s + 2s overhang
2.36s[DIALOGUE]8s08s6s + 2s overhang
2.46s[DIALOGUE]8s08s6s + 2s overhang
2.56s[DIALOGUE]8s08s6s + 2s overhang
2.66s[DIALOGUE]8s08s6s + 2s overhang
2.76s[DIALOGUE]8s08s6s + 2s overhang
2.86s[SILENT]8s08s6s + 2s overhang
2.96s[SILENT]8s08s6s + 2s overhang
3.18s[DIALOGUE]8s08sfull clip
3.28s[DIALOGUE]8s08sfull clip
3.38s[DIALOGUE]8s08sfull clip
3.410s[COMPOUND]8s115s10s + 4s overhang
3.510s[SILENT]8s115s10s + 4s overhang
3.68s[DIALOGUE]8s08sfull clip
4.18s[SILENT]8s08sfull clip
4.28s[SILENT]8s08sfull clip
4.36s[DIALOGUE]8s08s6s + 2s overhang
4.46s[DIALOGUE]8s08s6s + 2s overhang
4.58s[VO]8s08sfull clip
4.612s[SILENT]8s115s12s + 3s overhang

*Shot 1.1 extended to 10s to accommodate 9.36s narrator VO.

Extend Operations (5 shots)

ShotPlannedStrategy
1.510s8s base + 1 extend (15s raw) → trim to ~14s
2.110s8s base + 1 extend (15s raw) → trim to ~14s
3.410s8s base + 1 extend (15s raw) → trim to ~14s
3.510s8s base + 1 extend (15s raw) → trim to ~14s
4.612s8s base + 1 extend (15s raw) → trim to ~16s

Reference Image Budget Per Shot

Using from-frames, the start and end frames serve as the primary visual anchor. The motion prompt guides the interpolation. Character sheets and setting references are NOT needed for from-frames — the storyboard frames already encode character appearance and setting.

If from-frames produces poor results on specific shots, fallback to genmedia-video generate with:

Audio Treatment Pipeline

[DIALOGUE] Shots (14 shots)

[COMPOUND] Shots (3 shots: 1.5, 2.1, 3.4)

[VO] Shots (2 shots: 1.1, 4.5)

[SILENT] Shots (9 shots)

Missing Audio Assets

ShotNeedVoiceText
4.5Sarah offscreen VOTBD (Sarah’s voice)“Take care of yourself, Mark.”

Generation Order

Batch by extend requirement:

  1. Non-extend shots (23 shots): Generate via from-frames, 8s base clips
  2. Extend shots (5 shots): Generate 8s base, upload to GCS, extend via Veo 3.1 Lite
  3. Verify all: Run verify-dailies with shot manifest

Post-Generation Gates

  1. genmedia-verify check-shots --dir ./dailies --min-duration 3 — basic quality check
  2. verify-dailies --dir ./dailies --manifest shot-manifest.json — duration + extend validation
  3. Visual review of character consistency against character sheets
  4. Editor’s dailies review

This strategy will be finalized after Step 4 sign-off.