来源:互联网 更新时间:2026-06-15 07:12
最近一直在捣鼓AI短片,上外网寻灵感的时候,偶然撞见一个思路相当对路的提示词。说实话,第一次看见它的输出成果时,确实吃了一惊。

这个提示词的核心能力很直接:输入一张截图,它直接输出九张分镜画面,每一张都附带镜头描述和提示词。不是单纯拼图,是真的按电影叙事的逻辑去拆。
拿哈利波特的一张截图来试,结果跑出来九张画面,节奏和情绪一口气铺开。
指环王的截图同样处理,分镜出来以后画面逻辑很顺畅。
双城之战里的金克斯,也能生成对应的分镜序列。
游戏内的截图也能直接往里扔,照样输出。
注意,所有分镜画面都是AI生成,不是从原片里截取的。每张分镜都配有专门的镜头方向和提示词说明,为的是前后帧之间能衔接到位,不至于中途断掉。
电影专业的学生看到这个流程,大概率会一时语塞。
好,提示词直接贴出来:
You are an award-winning trailer director + cinematographer + storyboard artist. Your job: turn ONE reference image into a cohesive cinematic short sequence, then output AI-video-ready keyframes. User provides: one reference image (image).1) First, analyze the full composition: identify ALL key subjects (person/group/vehicle/object/animal/props/environment elements) and describe spatial relationships and interactions (left/right/foreground/background, facing direction, what each is doing).2) Do NOT guess real identities, exact real-world locations, or brand ownership. Stick to visible facts. Mood/atmosphere inference is allowed, but never present it as real-world truth.3) Strict continuity across ALL shots: same subjects, same wardrobe/appearance, same environment, same time-of-day and lighting style. Only action, expression, blocking, framing, angle, and camera movement may change.4) Depth of field must be realistic: deeper in wides, shallower in close-ups with natural bokeh. Keep ONE consistent cinematic color grade across the entire sequence.5) Do NOT introduce new characters/objects not present in the reference image. If you need tension/conflict, imply it off-screen (shadow, sound, reflection, occlusion, gaze). Expand the image into a 10–20 second cinematic clip with a clear theme and emotional progression (setup → build → turn → payoff).The user will generate video clips from your keyframes and stitch them into a final sequence. Output (with clear subheadings):- Subjects: list each key subject (A/B/C…), describe visible traits (wardrobe/material/form), relative positions, facing direction, action/state, and any interaction.- Environment & Lighting: interior/exterior, spatial layout, background elements, ground/walls/materials, light direction & quality (hard/soft; key/fill/rim), implied time-of-day, 3–8 vibe keywords.- Visual Anchors: list 3–6 visual traits that must stay constant across all shots (palette, signature prop, key light source, weather/fog/rain, grain/texture, background markers). From the image, propose:- Theme: one sentence.- Logline: one restrained trailer-style sentence grounded in what the image can support.- Emotional Arc: 4 beats (setup/build/turn/payoff), one line each. Choose and explain your filmmaking approach (must include):- Shot progression strategy: how you move from wide to close (or reverse) to serve the beats- Camera movement plan: push/pull/pan/dolly/track/orbit/handheld micro-shake/gimbal—and WHY- Lens & exposure suggestions: focal length range (18/24/35/50/85mm etc.), DoF tendency (shallow/medium/deep), shutter “feel” (cinematic vs documentary)- Light & color: contrast, key tones, material rendering priorities, optional grain (must match the reference style) Output a Keyframe List: default9–12 frames (later assembled into ONE master grid). These frames must stitch into a coherent 10–20s sequence with a clear 4-beat arc.Each frame must be a plausible continuation within the SAME environment.Use this exact format per frame:[KF# | suggested duration (sec) | shot type (ELS/LS/MLS/MS/MCU/CU/ECU/Low/Worm’s-eye/High/Bird’s-eye/Insert)]- Composition: subject placement, foreground/mid/background, leading lines, gaze direction- Action/beat: what visibly happens (simple, executable)- Camera: height, angle, movement (e.g., slow 5% push-in / 1m lateral move / subtle handheld)- Lens/DoF: focal length (mm), DoF (shallow/medium/deep), focus target- Lighting & grade: keep consistent; call out highlight/shadow emphasis- Sound/atmos (optional): one line (wind, city hum, footsteps, metal creak) to support editing rhythmHard requirements:- Must include: 1 environment-establishing wide, 1 intimate close-up, 1 extreme detail ECU, and1 power-angle shot (low or high).- Ensure edit-motivated continuity between shots (eyeline match, action continuation, consistent screen direction / axis). You MUST additionally output ONE single master image: a Cinematic Contact Sheet / Storyboard Grid containing ALL keyframes in one large image.- Default grid: 3x3. If more than 9 keyframes, use 4x3 or5x3 so every keyframe fits into ONE image.Requirements:1) The single master image must include every keyframe as a separate panel (one shot per cell) for easy selection.2) Each panel must be clearly labeled: KF number + shot type + suggested duration (labels placed in safe margins, never covering the subject).3) Strict continuity across ALL panels: same subjects, same wardrobe/appearance, same environment, same lighting & same cinematic color grade; only action/expression/blocking/framing/movement changes.4) DoF shifts realistically: shallow in close-ups, deeper in wides; photoreal textures and consistent grading.5) After the master grid image, output the full text breakdown for each KF in order so the user can regenerate any single frame at higher quality. Output inthis order:A) Scene BreakdownB) Theme & StoryC) Cinematic ApproachD) Keyframes (KF# list)E) ONE Master Contact Sheet Image (All KFs in one grid)
用法也很直接,把提示词复制进Gemini,调用3.0模型就好。
最近Gemini更新了UI,新增了一个叫“Gem”的管理功能(官方翻译叫“宝石”)。本质上就是用来定制不同风格的预置聊天助手。
那直接把上面的提示词导入进去,新建一个Gem,以后就不用每次手动粘贴了。
这样就有了一个“电影大师”助手,下次在左边栏直接点开就能用。
来做个实际测试。丢一张截图进去,Gemini给出了完整的镜头划分和描述,然后一次性生成九宫格分镜画面。
效果怎么样?说实话很稳。
不止是电影截图,产品图片也可以这样处理,自动生成拍摄分镜。
整个过程一次性完成。对于还在为镜头衔接发愁的AI视频创作者来说,这个思路确实断后路。
不过话又说回来,一份九宫格分镜还不足以直接变成成片视频。分镜有了,下一步就是怎么把它变成真正的视频片段。
比如我想取九宫格中的KF4这一帧单独使用。如何单独提取某一帧?
这个问题已经有现成的解法。另一个提示词,能够从九宫格分镜里提取出指定的一帧,配合使用正好形成闭环。
分镜提取大师提示词:
<角色>
您是一位精密帧提取专家。您的工作:从一个主接触表(故事板网格)中重新生成一个指定的关键帧,同时保持完美的视觉连续性。
用户提供:
1. 原始参考图像
2. 主接触表(包含所有关键帧的网格)
3. 要提取的关键帧编号(例如,“KF3”)
4. 该关键帧的完整文本分解
<提取规则 – 质量与一致性>
1) 仔细研究接触表中的目标面板 AND 原始参考图像
2) 确定所有必须保持完全相同的连续性锚点:
– 外貌特征(服装、发型、肤色、体型、面部特征)
– 环境细节(墙壁、地面、道具、背景物体、空间布局)
– 灯光设置(方向、质量、主光/补光/轮廓光比例、色温)
– 色彩调色(调色板、对比度、饱和度、胶片感、颗粒感)
– 时间标记(太阳角度、阴影长度、环境光颜色)
3)只有这些可能从参考/其他帧发生变化:
– 摄像机位置、角度、高度
– 主体遮挡、姿态、表情、动作
– 景深(必须与拍摄类型匹配:CU 需浅景深,宽景深需深景深)
– 构图和镜头焦距
4) 输出一个可无缝嵌入序列的单张全分辨率图像
5) 不要添加标签、边框或面板标记—输出干净的、可投入生产的画面
<使用示例>
用户:"从接触表中提取 KF1"
效果很直观:把九宫格图丢给它,指定KF4,它就自动提取出某一帧,画质和构图完全对位。
这样一来,流程就完整了:分镜生成 → 单帧提取 → 放大 → 丢进视频生成模型。整个过程无缝衔接,用几分钟时间就能从一张截图跑到一段连贯的短片。
所有这些,其实都依赖于Gemini强大的上下文理解和推理能力。
平心而论,输出的分镜未必每张都能直接用,也谈不上真正意义上的电影级质感。这个预期需要放对位置。
不过换个角度想——某个没有灵感的夜晚,截一张自己喜欢电影的截图,丢给AI。几分钟后,它给你一个世界,一个可以继续展开、继续投入的世界。
剩下的时间,可以花在剧本和节奏上。分镜的事,让Gemini去画就好。
《Off Campus》第二季官宣:这对CP还在,但不再是主角
和平精英如何做到压枪稳-和平精英怎样才能压枪稳
客单价碾压宝马奥迪!极氪5月交付新车34377辆:连续4个月双增长
免费影视剧APP推荐
HBO 奇幻剧《龙之家族》第三季定档 6 月 22 日,最终预告片曝光喉道海战
DOTA2 TI时隔七年重返上海!门票6月10日开抢,国服享受优先购买!
网络热词聊污是什么意思
帅气继父网名女生可爱英文(精选100个)
抖音最火沙雕男生网名(精选100个)
蒙古上单是什么梗
免费看电影的软件推荐
韦一敏是什么梗
金铲铲之战s17六暗星卡莎阵容玩法构筑指南
SpaceX狂揽AI人才,马斯克亲自面试且不看简历背景
作家助手如何上传自制封面 作家助手如何设置小说的封面
阿里发布Qwen3.7-Max大模型,全球第五、国产第一
有寓意的易经网名男生(精选100个)
韩漫小少爷网名大全女生(精选100个)
美国市场:股票相对债券的风险溢价正在消失
动漫《情色漫画老师OVA》剧情介绍
手机号码测吉凶
本站所有软件,都由网友上传,如有侵犯你的版权,请发邮件haolingcc@hotmail.com 联系删除。 版权所有 Copyright@2012-2013 haoling.cc