How to Write Midjourney Prompts That Actually Look Cinematic

Six concrete levers — subject, lens, lighting, color, composition and rendering — that turn a generic Midjourney output into something worth printing.

May 22, 2026 4 min read by Admin
On this page 7

Most Midjourney prompts fail for the same reason: they describe a subject and forget that an image is also a medium. A good prompt directs a virtual film crew, not just the actor. Pull these six levers in order and you'll skip 80% of the trial-and-error.

1. Lock the subject before anything else

Start with one clear noun and one descriptor. "A fisherman" is weak. "An elderly Portuguese fisherman, weathered hands, salt-stained jacket" gives Midjourney something to commit to. Specificity collapses the search space.

2. Pick a lens

The lens word matters more than any "8k masterpiece" tag. A few that produce reliably different results:

  • 35mm — environmental, documentary feel.
  • 85mm — flattering portrait compression, shallow depth of field.
  • 24mm wide — dramatic, slightly distorted, great for landscapes and rooms.
  • macro — extreme close-up texture.

3. Name the lighting

Lighting is what separates a snapshot from a frame. Try "soft golden hour rim light", "hard north-facing window light", "single practical lamp, deep shadows". If you say nothing, Midjourney defaults to flat, even fill — the worst possible look.

4. Choose a color grade

Cinema lives on color grades. "Teal and orange" is the obvious one but try "muted earth tones", "warm sodium-vapor", "desaturated, high-contrast monochrome with a single red accent". The grade tells the renderer what mood you want.

5. Direct the composition

Add a composition cue: rule of thirds, centered symmetry, low-angle hero shot, over-the-shoulder, Dutch tilt. This is the single highest-leverage word group you can add.

6. Close with rendering style

Finish with the medium itself: shot on Arri Alexa, Kodak Portra 400, large-format film grain, ultra detailed photorealistic. Keep it to two or three tokens — more dilutes the signal.

A worked example

Bad: "a fisherman by the sea, beautiful, 8k"

Better: "cinematic portrait of an elderly Portuguese fisherman, 85mm lens, shallow depth of field, soft golden hour rim light, muted teal-and-orange grade, low-angle hero shot, shot on Arri Alexa, ultra detailed, photorealistic --ar 4:5 --style raw --v 6"

Same subject, completely different image. The trick isn't more words — it's the right six.

More to read

Keep reading