Create a Viral Sky Fly Video
Image Generation Overview
A total of 5 core images were generated using Nano Banana Pro. These images serve as visual foundations and reference frames for all scenes.
Upload your original photo and prompt copy paste.
Image 1 Prompt
Generate a person from photo 1 . The location is very high in the sky at a slightly pinkish sunset, a man in a
horizontal position in the effect of falling. The man's face is calm, he falls backwards. The frame is at the human
level, we see it in full growth. movie shot, slightly blurred background, beautiful color correction
The dress should be casual double shirt with black inner and red outer shirt, black jeans with red sneakers also
wearing highly realistic black and red watch on left hand, and on right hand holding key chain (on key chain "ArupFx Studio" should be written clearly)
Full body show but no extra space on left and right
Also the size should be 1080×1920
Image 2 Prompt
Take a picture from below as this man is very high in the sky, very small, slightly visible because he is very small,The camera is looking up, the sky is the same color, the angle is from bottom to top. The person has his back to us,we look up at the sky as in photo 2
person should be on top quarter area of the image.
Image 3 Prompt
Create an ultra-realistic, highly detailed close-up cinematic shot focusing only on the shoes from the provided
full-body reference image.
Change the camera angle to exactly match the reference shoe close-up: a low-angle perspective from beneath the
feet, with both shoe soles clearly visible and dominating the frame.
The shoes must remain exactly the same as in the original image — same color, texture, sole pattern, wear details,
and proportions.
Maintain the same sunset lighting conditions with warm golden-pink tones, natural shadows, and realistic light
falloff.
Use a shallow depth of field so the background is slightly blurred, while the shoes are extremely sharp, crisp, and in perfect focus.
The background should match the original environment (sky and clouds), but remain soft and unobtrusive.
Photorealistic camera quality, natural motion feel, no CGI, no illustration, no stylization, no distortion.
Output resolution: 1080×1920, vertical composition, professional cinematic photography.
Image 4 Prompt
Use Image 1 as the reference for the exact camera angle, body orientation, gravity direction, motion, lighting, and
environment.
Use Image 2 strictly for exact face identity matching only.
Create an ultra-realistic cinematic close-up by moving the camera closer to the subject, without changing the
original angle or orientation from Image 1.
The subject must appear naturally floating or falling, maintaining the same horizontal alignment as the original
full-body scene.
The face should be relaxed and calm, eyes gently closed or half-closed, with a peaceful drifting expression — no
posing, no tension.
Both hands should appear naturally floating in mid-air, slightly in front of the face, relaxed fingers, no framing or intentional gesture.
The watch must remain visible on the wrist, and the keychain must be naturally visible in the hand, exactly matching
the original accessories.
Maintain the same sunset lighting — soft golden-pink tones, natural highlights, smooth shadows, realistic skin
illumination.
Use shallow depth of field:
– Face in sharp focus
– Hands slightly soft
– Background softly blurred
Do NOT rotate the subject.
Do NOT change the angle.
Do NOT stylize or pose the hands.
Ultra-realistic photography, natural motion feel, no CGI, no illustration, no distortion.
Output: 1080×1920, vertical cinematic frame.
Image 5 Prompt
Create an ultra-realistic cinematic close-up of the watch by moving the camera closer to the subject from the original
full-body image.
Do not change the camera angle, body orientation, or gravity direction — the subject must remain in the same
floating/falling position as the original scene.
The shot should feel like a natural camera push-in, not a new pose or new angle.
The watch must be the main focus, extremely sharp and highly detailed — dial, hands, numbers, texture, reflections,
strap material, and edges must be clearly visible and realistic.
The hand wearing the watch should appear naturally floating, relaxed fingers, no posing.
Maintain the same sunset lighting — warm golden-pink tones, soft highlights, realistic shadows, and natural
reflections on the watch glass.
Use shallow depth of field:
– Watch in perfect sharp focus
– Hand slightly soft
– Background (sky and clouds) smoothly blurred
Keep the original environment and colors consistent with the full-body image.
Ultra-realistic photography, professional macro-cinematic look, no CGI, no illustration, no distortion.
Output resolution: 1080×1920, vertical cinematic frame.
Image 6 Prompt
Create an ultra-realistic cinematic close-up of the keychain by moving the camera closer from the original full-body
image.
Do not change the camera angle, body orientation, or gravity direction — the subject must remain in the same
floating/falling position as the original scene.
The shot should feel like a natural camera push-in, not a new pose or a new angle.
The keychain must be the main focus, extremely sharp and highly detailed — metal texture, edges, engraving
depth, reflections, and natural wear must be clearly visible.
The engraved text on the keychain must read exactly and clearly:
“ArupFx Studio”
(correct spelling, apostrophe, capitalization — no changes).
The hand holding the keychain should appear naturally floating, relaxed fingers, no posing or framing.
Maintain the same sunset lighting — warm golden-pink tones, soft highlights, realistic shadows, and natural
reflections on the metal surface.
Use shallow depth of field:
– Keychain in perfect sharp focus
– Hand slightly soft
– Background (sky and clouds) smoothly blurred
Keep the original environment, colors, and cinematic mood consistent with the full-body image.
Ultra-realistic photography, professional macro-cinematic look, no CGI, no illustration, no distortion.
Output resolution: 1080×1920, vertical cinematic frame.
Animation & Scene Creation (Flow Ai and Kling 2.6) Click Create a Video
Scene 1 – Start & End Frame Animation
Start Frame = Image 2
End Frame = Image 1
A smooth cinematic camera movement drifting downward through the sky during a soft pink sunset.
A man slowly descending in mid-air, body horizontal, as if gently floating through the atmosphere.
His motion feels weightless and controlled, like a dream sequence, not falling.
Calm and peaceful expression, relaxed body posture.
Camera follows him smoothly from a human eye-level perspective.
Soft cinematic depth of field with background slightly blurred.
Warm sunset light with pink and orange tones reflecting naturally on the subject.
Subtle motion in clothing and pendant caused by air movement.
Dramatic yet serene mood, realistic physics, ultra-cinematic feel.
Professional film color grading, high dynamic range.
Aspect ratio 9:16 (vertical), 4K quality
Scene 2 – Full Body Suspension
Ultra-slow and subtle movement of levitation suspended in the air as if time had completely stopped. 240 fps →
slow motion x0.125. Camera slowly pans. Hair floats very slowly. Keychain floats lightly in extreme slow motion.
Everything moves like a dream, suspended in time. Extremely minimal movement emphasizing total suspension.
Scene 3 – Sneaker Detail Shot
Ultra-subtle motion of sneakers. Red laces float minimally. Slow lateral pan around sneaker revealing texture and stitching. Luxury product-style shot. 240 fps → slow motion x0.125. Black pants blurred with bokeh. Hypnotic camera movement.
Scene 4 – Face Close-Up
Ultra-subtle facial movement. Serene expression with ultra-slow blink. Hair floats softly. Camera performs ultra-slow circular pan at 24 fps. Transparent glasses reflect light. Parallax background. Maintain exact face identity. 240 fps → slow motion x0.125.
Scene 5 – Watch Macro Shot
Extreme close-up of wrist watch. Slow lateral pan revealing dial, needles, strap texture. Luxury product feel. 240 fps → slow motion x0.125. Background blurred with cinematic bokeh.
Scene 6 – Keychain Macro Shot
Extreme close-up of keychain with text 'Amy's Visuals'. Subtle floating motion. Metal catches light with sparkles.
Slow circular camera pan. 240 fps → slow motion x0.125. Cinematic bokeh background.
🔚 Blog Post Ending
By following this guide, you can easily create ultra-realistic cinematic reels and videos using AI. From image generation to animation, camera movement, and detailed close-up shots — everything has been explained step by step in a clear and practical way.
If you are a content creator, video editor, brand promoter, or someone interested in AI visuals, this guide will be extremely helpful for you. With the right prompts, proper reference images, and patience, you can also produce professional-quality cinematic content.
If you found this post useful, don’t forget to share it, and stay connected with our website for more AI, video, and visual creation guides in the future.
Thank you ❤️






