skills/image-prompt-engineer/SKILL.md
---
name: image-prompt-engineer
description: Teaches practitioners to write AI image prompts that produce professional, art-directed visuals using the Eight-Layer Prompt Anatomy, not the over-smooth, uncanny, obviously-AI aesthetic. Invoke when a client or the agency needs to generate images using Midjourney, DALL-E, Stable Diffusion, Flux, or Adobe Firefly, and the output must meet the Golden Rule: images that look like they were shot or designed by a skilled human creative.
---
# AI Image Pro
- SKILL.md; do not skip mandatory steps or required fields.
- If a references/ directory is added later, treat its files as the deeper source material and keep this SKILL.md execution-focused.

Ask for the following before generating any image prompts:
AI images carry a distinctive aesthetic: over-smooth skin, slightly off proportions, hyperrealistic fantasy-realist lighting, and symmetrical perfection. The goal is to produce images that look art-directed, stylistically intentional, and culturally accurate — not AI-generated.
This requires precision in prompting, not more prompts. One precisely constructed prompt using all eight layers produces better output than ten vague attempts.
Source: LetsEnhance (2024) How to Write AI Image Prompts — From Basic to Pro
Every prompt must address all eight layers. Leaving a layer at its default produces a generic AI aesthetic.
The subject is the primary focus of the image. Be specific: not "a woman" but "a Ugandan businesswoman in her late 30s, wearing a tailored navy suit, standing confidently at a modern office window."
For East African clients: always specify nationality, city, and context explicitly. AI defaults to Western settings and Western physical appearances. "African" is not sufficient — specify the country, city, and contemporary context.
The setting is where the scene takes place. Not "office" but "a glass-walled conference room in a Kampala high-rise building, late afternoon, city skyline visible."
For EA clients: specify that clothing is contemporary urban or professional unless traditional dress is specifically required. AI frequently applies culturally inaccurate dress without explicit instruction.
Lighting is the single most powerful element for photographic realism. Always specify both quality and direction.
Quality options: natural window light, golden hour, overcast diffused light, dramatic side lighting, studio softbox, practical lamp light. Direction options: from the left, from behind, from above, front-lit, rim-lit.
Example: "soft natural light from a large window to the left, creating gentle shadows on the right side."
Colour defines the palette. Specify it in concrete terms: warm earthy tones, desaturated pastels, bold primary colours, monochromatic blue-grey, high contrast black and white.
Translate the client's brand palette directly into this layer. For EA clients: skin tone rendering accuracy depends on explicit instruction — specify "true-to-life skin tones for East African subjects" to reduce AI tendency toward over-lightening or over-darkening.
Mood is the emotional register. Describe an atmosphere rather than a single feeling: confident and aspirational, intimate and warm, urgent and dynamic, calm and authoritative, celebratory and energetic.
Mood directs the AI's choices for expression, posture, and colour saturation. State it explicitly.
Composition is the camera framing. Specify: close-up portrait, wide establishing shot, overhead flat lay, rule-of-thirds framing, Dutch angle, eye-level perspective, over-the-shoulder shot.
For social media: specify the aspect ratio context — portrait (9:16 for Stories/Reels), square (1:1 for grid posts), landscape (16:9 for covers).
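The aspect-ratio guidance above can be kept as a small lookup so that every prompt in a campaign uses the same value. This is a minimal sketch: the placement labels (`story`, `grid`, `cover`) and the function name are hypothetical, and only the ratios and Midjourney's `--ar` parameter come from this document.

```python
# Ratios taken from the guidance above; the dictionary keys are hypothetical labels.
ASPECT_RATIOS = {
    "story": "9:16",  # portrait, for Stories/Reels
    "grid": "1:1",    # square, for grid posts
    "cover": "16:9",  # landscape, for covers
}


def midjourney_aspect_flag(placement: str) -> str:
    """Return the Midjourney --ar flag for a placement label."""
    if placement not in ASPECT_RATIOS:
        raise ValueError(f"unknown placement: {placement!r}")
    return f"--ar {ASPECT_RATIOS[placement]}"


print(midjourney_aspect_flag("story"))
```

Other platforms take the ratio differently (for example as image dimensions), so the flag format here applies to Midjourney only.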
Style defines the visual aesthetic. Specify a photography style (editorial fashion photography, documentary street photography, clean product photography), a medium (digital illustration, watercolour, pencil sketch, 3D render), or a known visual language. Do not name a specific photographer, to avoid copyright issues.
Example: "the visual aesthetic of high-end African fashion editorial photography" — not "shot in the style of [specific photographer's name]."
Platform-specific specifications:
| Platform | Key Parameters |
|---|---|
| Midjourney | --ar 9:16 (aspect ratio), --v 6 (version), --seed 12345 (reproducibility), --style raw (less AI-filtered output) |
| DALL-E 3 | "photorealistic, ultra-high resolution, DSLR photograph" — describe the full scene in a clear sentence |
| Stable Diffusion | Negative prompt field essential; ControlNet for pose and composition control; attention weighting for emphasis |
| Flux | Specify camera type and lens; token-efficient prompts; detail and realism engine |
| Adobe Firefly | Built-in commercial use rights; include "Adobe Stock style" for clean, licensable outputs |
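One way to enforce the rule that every prompt addresses all eight layers is to hold the layers in a small structure and refuse to build a prompt while any of them is empty. The sketch below is illustrative only: the field names are shorthand labels assumed for this example, and joining the layers with commas followed by the platform parameters simply mirrors the shape of a Midjourney prompt.

```python
from dataclasses import dataclass, fields


@dataclass
class PromptLayers:
    # Field names are assumed shorthand labels for the eight layers.
    subject: str
    setting: str
    lighting: str
    colour: str
    mood: str
    composition: str
    style: str
    technical: str  # platform parameters, e.g. Midjourney flags


def missing_layers(layers: PromptLayers) -> list[str]:
    """Return the names of layers that were left empty."""
    return [f.name for f in fields(layers) if not getattr(layers, f.name).strip()]


def build_prompt(layers: PromptLayers) -> str:
    """Join the descriptive layers with commas, then append the parameters."""
    missing = missing_layers(layers)
    if missing:
        raise ValueError("layers left empty: " + ", ".join(missing))
    descriptive = [
        layers.subject, layers.setting, layers.lighting, layers.colour,
        f"mood: {layers.mood}", layers.composition, layers.style,
    ]
    return ", ".join(part.strip() for part in descriptive) + " " + layers.technical.strip()


example = PromptLayers(
    subject="a Ugandan businesswoman in her late 30s, wearing a tailored navy suit",
    setting="a glass-walled conference room in a Kampala high-rise building, late afternoon",
    lighting="soft natural light from a large window to the left",
    colour="warm earthy tones, true-to-life skin tones for East African subjects",
    mood="confident and aspirational",
    composition="eye-level perspective, rule-of-thirds framing",
    style="editorial corporate photography style",
    technical="--ar 4:5 --v 6 --style raw",
)
prompt = build_prompt(example)
print(prompt)
```

A check like this only catches empty layers. It cannot tell whether a layer is specific enough, so the wording of each field still needs human review.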
Include exclusion language to suppress the AI aesthetic. Apply to all platforms that support a negative prompt field.
Universal negative prompt:
blurry, distorted proportions, extra fingers, deformed hands, uncanny valley effect, overly smooth skin, plastic appearance, wax figure aesthetic, AI-generated aesthetic, overexposed highlights, neon oversaturation, fantasy lighting, watermark, signature, text overlay, low resolution, jpeg artifacts
Add for images of people:
emotionless expression, doll-like features, exaggerated proportions, inappropriate cultural stereotypes, culturally inaccurate dress, Western default appearance
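The two lists above can be kept as data and combined per image, so the people-specific terms are added only when the image contains people. This is a rough sketch with hypothetical function names. It assumes two output shapes that this document mentions: a Midjourney `--no` suffix and a plain comma-separated string for a negative prompt field such as Stable Diffusion's.

```python
UNIVERSAL_NEGATIVES = [
    "blurry", "distorted proportions", "extra fingers", "deformed hands",
    "uncanny valley effect", "overly smooth skin", "plastic appearance",
    "wax figure aesthetic", "AI-generated aesthetic", "overexposed highlights",
    "neon oversaturation", "fantasy lighting", "watermark", "signature",
    "text overlay", "low resolution", "jpeg artifacts",
]

PEOPLE_NEGATIVES = [
    "emotionless expression", "doll-like features", "exaggerated proportions",
    "inappropriate cultural stereotypes", "culturally inaccurate dress",
    "Western default appearance",
]


def negative_terms(includes_people: bool) -> list[str]:
    """Universal terms, plus the people-specific terms when relevant."""
    terms = list(UNIVERSAL_NEGATIVES)
    if includes_people:
        terms += PEOPLE_NEGATIVES
    return terms


def as_midjourney_no(terms: list[str]) -> str:
    """Format terms as a --no suffix for the end of a Midjourney prompt."""
    return "--no " + ", ".join(terms)


def as_negative_field(terms: list[str]) -> str:
    """Format terms for a separate negative prompt field."""
    return ", ".join(terms)


print(as_midjourney_no(negative_terms(includes_people=True)))
```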
Apply in platform-specific ways:
- --no [list] at the end of the prompt
- --seed) to maintain visual consistency across a campaign

The BuzzFeed Barbie study (2023) documented that AI produced culturally stereotyped and racially inaccurate imagery even with an explicit diversity brief. For East African clients:
Brief: A social media post for a Kampala financial services firm, showing a professional consultation scene.
Constructed prompt (Midjourney):
A Ugandan male financial advisor in his early 40s, wearing a well-fitted charcoal grey suit,
seated across a desk from a Ugandan woman client in her 30s wearing a smart yellow dress,
in a clean modern office in Kampala, glass-walled, city view in the background,
soft natural light from large windows to the left, warm golden tones,
mood: trustworthy and professional, eye-level medium shot, rule of thirds,
editorial corporate photography style, true-to-life East African skin tones,
sharp focus, high detail --ar 4:5 --v 6 --seed 44821 --style raw
--no blurry, plastic skin, fantasy lighting, Western default appearance, text overlay, watermark
| Platform | Strength | Key syntax |
|---|---|---|
| Midjourney | Artistic quality, style range | --ar, --v 6, --seed, --style raw |
| DALL-E 3 | Natural language, accuracy | Full scene description; include "photorealistic" for photos |
| Stable Diffusion | Control, customisation | Negative prompt field essential; ControlNet for pose control |
| Flux | Detail and realism | Specify camera type and lens; token-efficient prompts |
| Adobe Firefly | Commercial licensing | Built-in commercial use rights; "Adobe Stock style" for clean outputs |
Good output from this skill meets all of the following standards: