The Midjourney Prompt Formula That Actually Works Every Time (2026)
After generating somewhere north of 3,000 Midjourney images over the past year, I can tell you the single biggest mistake people make: they describe what they want to see instead of what they want to feel. A prompt like "a wolf in a forest" gets you a wolf in a forest. A prompt built around the right formula gets you something that stops people mid-scroll.
Here's the exact structure I use — the midjourney prompt formula that consistently produces professional results — broken down so you can apply it immediately.
The Core Formula
Five components, always in this order:
[Subject] + [Art Style] + [Lighting] + [Mood] + [Technical Params]
You don't need all five every time. But using at least three of them — and thinking about which three — is what separates a good image from a vague one. Let me walk through each piece.
Part 1: Subject — Be Specific Enough to Be Surprising
Your subject is the non-negotiable core. But "a woman" is not a subject — it's a placeholder. "A weather-beaten lighthouse keeper in her 60s, staring out a fogged porthole" is a subject.
What makes a subject specific enough?
The Three Subject Layers
Primary noun: The main thing (person, creature, object, scene). One noun, not three.
Defining detail: One or two adjectives that narrow it down — not "beautiful woman" (generic) but "auburn-haired woman in her 40s with paint-stained fingers" (specific and interesting).
Action or relationship: What is it doing? What is it near? Static subjects often produce flat images. "A crow landing on an abandoned piano" is more interesting than "a crow near a piano."
The moment you add the action layer, Midjourney has to make compositional decisions. Usually good ones.
Part 2: Art Style — Pick One, Mean It
This is where most prompts get vague. "Digital art style" tells Midjourney almost nothing — it's like telling a chef to cook "food style." The difference between styles is enormous in the final output.
Styles That Produce Reliable Results
- Cinematic photography: ARRI Alexa color science, anamorphic lens, shallow depth of field
- Editorial fashion: Vogue, high contrast, clean backgrounds, studio flash
- Oil painting: Impasto texture, visible brushstrokes, old master palette
- Anime/manga: Specify studio — "Studio Ghibli", "Makoto Shinkai style" — not just "anime"
- Concept art: Artstation, detailed environment design, matte painting
- Product photography: Dark gradient background, rim lighting, commercial
One style per prompt. Mixing two styles often produces a muddy compromise — unless you want the tension between them (e.g. "oil painting rendered with hyperrealistic photographic detail").
Part 3: Lighting — The Fastest Way to Change the Feeling
Lighting is where experienced photographers and cinematographers earn their keep, and Midjourney responds to lighting descriptors better than almost any other component. A portrait at noon in flat light and the same portrait at golden hour are completely different emotional experiences.
Lighting Descriptors Worth Learning
Golden hour: Warm orange-amber, long shadows, nostalgic, flattering on faces.
Rembrandt lighting: One key light at 45 degrees, triangle of light under eye, painterly and dramatic.
Rim lighting / backlit: Subject against bright background, glowing edges, cinematic silhouette.
Neon glow: Cyberpunk, colored shadows, reflections in rain-wet pavement.
Diffused overcast: Soft even shadows, muted colors, often melancholic or intimate.
Bioluminescence / underwater: Internal light source, teal and blue, otherworldly.
Pick lighting that matches your mood. "Golden hour" + "mysterious and unsettling" is a contradiction that weakens both descriptors. Let mood and lighting agree.
Part 4: Mood and Atmosphere — Say What You Want the Viewer to Feel
This one trips people up because it sounds abstract. But Midjourney has seen millions of images tagged with emotional descriptors. Words like "melancholic," "triumphant," "claustrophobic," "serene," and "foreboding" produce measurably different color palettes, compositions, and negative space choices.
Don't skip this component. Even one mood word acts as a global tonal instruction to everything else in the prompt.
Mood Words That Actually Work
- Melancholic, wistful, nostalgic
- Foreboding, ominous, eerie
- Triumphant, defiant, fierce
- Serene, meditative, still
- Chaotic, frenetic, overwhelming
- Intimate, quiet, tender
Part 5: Technical Parameters — The Final Dial
Midjourney v6 and later respond well to a short set of parameters appended at the end.
Parameters Worth Using
--ar [ratio]: Aspect ratio. --ar 16:9 for landscape, --ar 9:16 for portrait (Instagram/phone), --ar 1:1 for square. This one matters every time.
--stylize [0-1000]: How much Midjourney's aesthetic training kicks in. Low (0-200) = more literal interpretation of your prompt. High (700-1000) = more Midjourney's "taste" in the result. Default is 100. I usually run 250-400.
--chaos [0-100]: Variation across the 4-image grid. Low = consistent, safe. High = wild variation, good for exploration. I use --chaos 20 when I know what I want, --chaos 60 when I'm exploring.
--no [unwanted elements]: Negative prompting. --no text, watermark, blurry background. Be specific about what ruins your image.
Real Examples Side by Side
Before (no formula)
"A girl in a coffee shop"
Result: Generic, flat, stock-photo feel. Could be from any free image library in 2019.
After (formula applied)
"Young woman in her 20s reading a worn paperback at a corner table in a dimly lit Parisian café, espresso half-finished, steam rising, warm tungsten lamp light, bokeh background, soft and intimate atmosphere, analog film photography, grain, --ar 4:5 --stylize 300"
Result: Something you'd see in a magazine. Specific, emotional, technically directed.
Another Example
Before: "A dragon"
After: "Ancient dragon coiled around a crumbling cathedral tower, scales catching the last light of a blood-red sunset, moss and vines crawling up cracked stone, triumphant yet melancholic atmosphere, dark fantasy concept art, Artstation style, dramatic rim lighting, --ar 21:9 --stylize 450 --chaos 15"
The difference isn't prompt length — it's prompt specificity. Longer prompts that repeat vague terms are worse than shorter prompts with precise terms.
You can find hundreds of tested prompts built around this formula in the PromptSpace gallery — organized by style and use case so you can copy the structure and swap the subject.
What Actually Breaks Prompts
More useful than "what works" is understanding what makes prompts fail. After testing extensively, the failure modes are consistent.
Contradiction in Tone
Asking for "dark and moody cinematic lighting" combined with "bright, cheerful, vibrant colors" produces a confused image. The model tries to satisfy both and usually satisfies neither.
Too Many Subjects
Every subject after the first dilutes compositional focus. "A wolf, a raven, a sword, and a woman standing in a field" usually produces a cluttered mess. Pick one dominant subject. Use the others as supporting elements: "a woman holding a raven in a snowy field, a wolf watching from the tree line."
Prompt Inflation
Adding more words after your prompt is already working often makes it worse. If "portrait of an old lighthouse keeper, Rembrandt lighting, oil painting, melancholic --ar 4:5" is giving you great results, adding 30 more adjectives won't improve it. It dilutes the signal.
Missing Aspect Ratio
This is the most overlooked technical parameter. The default Midjourney output is square-ish (~1:1). If you're making a phone wallpaper, banner, or thumbnail, the composition will be wrong unless you set --ar. A horizontal composition cropped to 9:16 looks like a mistake, not a portrait.
What This Formula Won't Fix
The formula won't make a bad concept good. "A cat in a business suit attending a board meeting" will produce technically fine results with the formula, but whether that image is interesting is a creative judgment call, not a prompting problem.
It also won't help you if you're fighting Midjourney's content filters. Some highly specific real-person likenesses and certain graphic subject matter will get blocked regardless of prompt structure. The formula works within the model's actual capabilities.
Finally — prompt crafting takes iteration. Even with the perfect formula, you'll run the same prompt 3-4 times and pick the best grid output. That's normal. Professional Midjourney users don't post their first generation.
Frequently Asked Questions
Do I need to use all five formula components every time?
No — three is usually enough for a focused result. Subject + lighting + technical params, or subject + style + mood. The five-component formula is for when you need predictable, repeatable professional output. For exploration and experimentation, intentionally leaving components out can produce surprising results.
Should I use commas or write prompts as sentences?
Both work, but comma-separated descriptors tend to give each term equal weight, while full sentences sometimes have the model overweight the grammatically prominent parts. For the formula structure I've described, comma separation is cleaner and more reliable. "Subject, style, lighting, mood, --params" rather than a paragraph.
How do I stop Midjourney from adding text to images?
Add --no text, letters, words, watermark to your prompt. Midjourney v6 is much better about this than earlier versions, but certain styles (vintage posters, signage, book covers) will still drift toward adding text unless you explicitly exclude it.
What's the best --stylize value for photorealistic images?
100-250 for the most literal photorealistic interpretation of your prompt. Higher stylize values push the output toward Midjourney's aesthetic, which is beautiful but less strictly photographic. For product photography or portrait realism, stay under 300. For fantasy and painterly work, 400-700 often produces more interesting results.
Does prompt order matter?
Yes, somewhat. Midjourney gives more weight to terms earlier in the prompt. Put your most important descriptors (subject, primary style) first. Technical parameters like lighting and mood can come after. The --params always go at the end.
The formula works because it forces you to make deliberate choices about each dimension of the image instead of hoping the model guesses what you meant. Every component answers a specific question: what, how it looks, how it's lit, how it feels, how it's sized. Once you internalize these five questions, prompting becomes predictable — and predictable is exactly what you want when a client needs a specific shot by Friday.
Start with the getting started guide if you're new to AI image generation, or browse the prompt gallery to find tested formulas organized by style.











