← Fluorite Artifacts | Fluorite Team

Technical Feasibility

Team Fluorite — "Pickling Season"

Team Fluorite — Step 1: Technical Feasibility & API Strategy

Director of Photography: fluorite-techlead
Film: “Pickling Season”
Date: 2026-05-22
Status: DRAFT — Step 1 deliverable


1. API Selection by Shot Type

This maps each category of visual content to the optimal generation tool and model, with rationale.

1.1 Character Reference Chain (Step 3)

AssetToolModelNotes
Headshotgenmedia-image generateNano Banana Pro (default)16:9. Strong photorealistic faces. Start of chain.
Body sheet (3-view)genmedia-image generateNano Banana Pro--reference-image headshot.png. Critical chain link.
Scene test 1genmedia-image generateNano Banana Pro--reference-image headshot.png --reference-image body_sheet.png. Place character in World A setting.
Scene test 2genmedia-image generateNano Banana ProSame refs. Place character in World B setting.
Composite character sheetgenmedia-image generateNano Banana Pro--reference-image headshot.png --reference-image body_sheet.png. 4-view grid on white. 16:9.
Grandmother’s hands referencegenmedia-image generateNano Banana ProEXTRA reference — dedicated “hands at work” image per Tone Contract mandate. Weathered, strong, precise. Used in all food-prep shots.

Why Nano Banana Pro over Imagen for characters: Gemini’s generate subcommand supports --reference-image chaining, which is the backbone of our consistency strategy. Imagen models don’t support reference images — they’re prompt-only. We use Gemini for anything requiring visual continuity.

Why NOT Imagen for characters: Imagen 4 produces excellent standalone images but lacks reference-image input. Using it would break the chain.

1.2 Setting Reference Images (Step 3)

SettingToolModelPrompt Keywords
World A: Grandmother’s kitchen (memory)genmedia-image generateNano Banana Pro”warm amber light, golden hour, oil painting texture, cluttered kitchen, wooden surfaces, jars of preserved food, herbs hanging, copper pots, stove glow, tactile, lived-in, sensory, 16:9”
World B: Granddaughter’s apartmentgenmedia-image generateNano Banana Pro”cool blue-grey light, minimalist modern apartment kitchen, clean white surfaces, sparse, chrome appliances, pale wood, large windows, overcast daylight, sterile, precise, 16:9”
World A→B Invasion (Act II, 70/30)genmedia-image generateNano Banana Pro”modern apartment kitchen, mostly cool blue-grey light, BUT one corner has warm amber glow, a jar of pickles on the counter, a sprig of dill, subtle warmth leaking in, minimalist with first signs of invasion”
World A→B Invasion (Act III, 30/70)genmedia-image generateNano Banana Pro”modern apartment kitchen transformed, warm amber light fills most of the frame, jars on every surface, cutting board out, wooden spoon, herbs, the kitchen has become a farmhouse, rich saturated color overtaking the sterile blue-grey”

4 setting references — one per color-world state. This gives us a visual anchor for the progressive invasion arc.

1.3 Food Close-Ups (Steps 4-5)

Shot TypeToolModelNotes
Food stills (storyboard frames)genmedia-image generate OR genmedia-image imagenNano Banana Pro or Imagen 4Food photography is a proven strength for both. Imagen 4 at 2K resolution gives us textural detail. No character references needed — these are “free” shots.
Food video (pickles, brine, chopping)genmedia-video from-imageVeo 3.1 (default)Upload food still to GCS → animate with slow motion prompt. “Brine catching light, steam rising, gentle camera push-in.”
Extreme macro (cucumber cross-section, jar lid patina)genmedia-image imagenImagen 4 UltraSingle image at 2K. Maximum detail for signature transition frames.

Imagen for food stills is acceptable because food close-ups don’t need character reference chaining — they stand alone. We can use Imagen 4’s superior resolution here without breaking continuity.

1.4 Character Scenes (Steps 4-5)

Shot TypeToolStrategyModel
Single character, static (most shots)genmedia-video from-framesStart + End frame interpolation. Character sheet + setting reference as context for frame generation.Veo 3.1
Single character, slow motion (cooking)genmedia-video from-imageSingle hero frame → animate with motion prompt. “Hands slicing cucumber, deliberate, slow.”Veo 3.1
Two characters, shared frame (Act III)genmedia-video from-framesBoth character sheets as references for frames. Static composition — side by side at counter. Max 2 character sheets + 1 setting = 3 refs (at budget).Veo 3.1
Dialogue shotsgenmedia-video from-image or from-framesMotion prompt must include “speaking, expressive” per Audio Agreement. Use Veo 3.1 for audio generation.Veo 3.1
VO-safe shots (narrator over visuals)genmedia-video from-framesMotion prompt: “mouth closed, focused on task, hands working.” Explicit “No dialogue. No speech. No talking.” in prompt.Veo 3.1

1.5 Audio Pipeline

AssetToolModelNotes
Narrator VO (granddaughter)genmedia-voice generateGemini 3.1 Flash TTSYoung woman’s voice. Warm, wry, affectionate. Voice TBD from audition. 800-char limit per call — script will need splitting.
Grandmother’s dialoguegenmedia-voice generateGemini 3.1 Flash TTSOlder woman’s voice. Imperious, measured, declarative. Different voice profile from narrator.
Folk melody (memory world)genmedia-music generateLyria 3 Pro”Eastern European folk melody, accordion and clarinet, warm, nostalgic but joyful, moderately slow tempo.” ~2:30 duration for full coverage.
Sparse ambient (modern world)genmedia-music generateLyria 3 Clip”Minimal ambient, sparse, clean, modern, barely there, cool, no melody.” ~30s, loop-friendly.
Ambient/SFX (kitchen sounds)Baked into Veo video promptsVeo 3.1Audio prompts: “sound of knife on cutting board, boiling water, glass jar lid opening, cucumber crunch.” Negative: “No music. No soundtrack.”

2. Shot-Type Complexity Assessment

Low Complexity (Expected: ~60% of shots)

Medium Complexity (Expected: ~30% of shots)

High Complexity (Expected: ~10% of shots)


3. Safety Filter Risk Assessment

No Risk

Low Risk (Mitigations Defined)

TriggerMitigation
”Elderly woman” in close-upFrame as “character portrait, warm kitchen lighting” — lead with setting, not age descriptors.
”Border crossing” / “smuggling”NEVER render a literal border. Use “carrying,” “bringing.” Imply through narration + objects (suitcase, train window, jar wrapped in cloth).
”Knife” in close-upAlways pair with “domestic cooking context, kitchen, food preparation.” Never isolated blade.

Zero Risk (Our Strength)

The “Pickling Season” concept is structurally safe. No violence, no weapons outside cooking context, no politically sensitive content, no real-person references, no supernatural elements. This is a quiet domestic story about food and family. We should encounter essentially zero safety filter walls.


4. Prompt Engineering Strategy

4.1 Tone Anchor Injection (Mandatory)

Every single prompt — image and video — gets these 5 keywords appended:

tactile, warm-hearted, deliberate, sensory, lived-in

Plus visual translation of “warm-hearted”:

gentle golden warmth, soft natural light

4.2 World A Prompt Template

[SCENE DESCRIPTION]. Warm amber light, golden hour, oil painting texture, 
rich saturated color, abundance, hand-crafted. [CHARACTER REFERENCE: 
grandmother character sheet]. Tactile, warm-hearted, deliberate, sensory, 
lived-in. Gentle golden warmth, soft natural light. Photorealistic, 16:9 
composition.

4.3 World B Prompt Template

[SCENE DESCRIPTION]. Cool blue-grey light, minimalist interior, clean 
surfaces, sparse, modern apartment, muted tones, sterile, precise. 
[CHARACTER REFERENCE: granddaughter character sheet]. Tactile, warm-hearted, 
deliberate, sensory, lived-in. Gentle golden warmth, soft natural light. 
Photorealistic, 16:9 composition.

Note: Even World B gets “warm-hearted” and “gentle golden warmth” — these are emotional anchors that prevent the model from drifting into grim/noir territory. The cool palette keywords will dominate the visual, but the warmth anchors keep it from becoming oppressive.

4.4 Color Invasion Prompt Gradient

ActWorld B KeywordsWorld A KeywordsRatio
Act IFull setNone100/0
Act II (early)Full set”a single jar of pickles catches warm amber light in the corner”90/10
Act II (mid)Full set”warm amber glow from one side, a sprig of dill, jar of pickles on counter”70/30
Act III (early)“modern apartment, large windows, pale wood""warm amber light fills most of frame, jars on surfaces, cutting board, herbs”30/70
Act III (climax)Residual only (“apartment kitchen”)Full set0/100

4.5 Anti-Melancholy Drift Keywords

Per Tone Contract, AI models will try to make this sad. Counterbalance keywords for grandmother shots:

INCLUDE: confident posture, serene expression, purposeful movement, 
         warm rich colors, abundance, busy hands
AVOID:   lonely, isolated, frail, dim lighting, empty room, 
         desaturated, wistful, shadowy

4.6 Negative Audio Prompting (Mandatory for All Video)

Every Veo prompt gets:

No music. No soundtrack.

Non-dialogue shots additionally get:

No dialogue. No speech. No talking.

4.7 Hands Reference Strategy

The grandmother’s hands are the most important visual in the film. Strategy:

  1. Dedicated hands reference image in Step 3: “Close-up of elderly woman’s hands on a wooden cutting board, weathered skin, strong fingers, precise grip, warm amber light, tactile, photorealistic.”
  2. Chain from character headshot to anchor skin tone and age.
  3. Include hands reference in every food-prep shot prompt alongside the character sheet.
  4. Negative prompting if drift occurs: “NOT smooth skin, NOT young hands, NOT manicured nails.”

5. Duration & Extend Strategy

Target Runtime: 3:00–5:00 (180–300 seconds)

Based on the Kaurismäki-inspired deadpan pacing:

Veo Duration Strategy

Planned DurationVeo BaseExtendsTotal RawTrim Headroom
3-4s8s (with 4s overhang)08s4-5s to trim
5-7s8s (with 4s overhang)08s1-3s to trim
8-10s8s + 1 extend115s5-7s to trim
11-14s8s + 1 extend115s1-4s to trim

Rule: Generate 8s base clips for everything. Only extend shots planned >8s. The overhang principle (2s pre-roll + 2s post-roll) means a “planned 6s shot” needs to be generated as a clip containing at least 10s of usable content — which 8s base provides.

Wait — correction. The overhang is applied to the generation, not the planned duration. A planned 6s shot → generate as 6+4=10s. But Veo base is max 8s. So:

Actually, re-reading the mandate: “Apply a flat 4-second overhang to every shot: 2 seconds of pre-roll and 2 seconds of post-roll added to the planned duration. A planned 10s shot must be generated as a 14s clip.”

So:

Revised strategy:

Most of our shots should be 4-7s (deadpan style), so most will need 0-1 extends.


6. Reference Budget Planning

Veo allows max 3 reference images per shot. Our budget allocation:

Shot TypeRef 1Ref 2Ref 3
Single character in settingCharacter sheetSetting referenceHands ref (if cooking)
Two characters in settingGrandmother sheetGranddaughter sheetSetting reference
Food close-up (no character)Setting reference
Food with handsHands referenceSetting reference
Empty establishing shotSetting reference

The composite character sheet (4-view grid) is our primary per-character reference. One sheet per character = efficient budget use.


7. Production Risk Register

RiskLikelihoodImpactMitigation
Grandmother’s hands inconsistencyMEDIUMHIGHDedicated hands reference image. Include in every food-prep prompt.
Melancholy drift (sad instead of funny)HIGHHIGHAggressive anti-drift keywords. Confident posture descriptors. Rich saturated color.
Color invasion gradient looks artificialLOWMEDIUMTest prompt blends in Step 3 scene tests. Gradual keyword ratios.
TTS voice mismatch (narrator too old/young)MEDIUMMEDIUMAudition 3-4 voices in Step 3 before committing.
Food textures too “AI-perfect”LOWLOWPrompt for “slight imperfection, a pickle with a spot, slightly uneven slice” per Tone Contract.
Two-character shots break consistencyMEDIUMMEDIUMLimit to 2-3 shared-frame shots. Use side-by-side compositions. No physical contact.

Ready for fluorite-idea’s high_concept.md and design_brief.md to finalize the visual DNA. Will contribute prompt templates and technical sections to design_brief.md once it exists.