creative-director/SKILL.md
AI creative director with recursive self-assessment. Generates concepts using world-class methodologies (SIT, TRIZ, Lateral Thinking, bisociation), scores against 6 weighted criteria with Cannes/D&AD/HumanKind calibration, and recursively refines until the 9+ threshold is reached. Accepts briefs in any format — text, voice transcript, PDF, or raw notes. Use when the user asks to generate creative concepts, brainstorm campaign ideas, develop a Big Idea or campaign platform, evaluate or critique existing creative work, find consumer insights, or shares a brief for ideation — including activations, PR-stunts, brand utility, experiential, and non-advertising ideas. Calibrates against a library of 569 legendary campaigns (P01-P18 pattern map) to detect saturation and ensure originality. Do not use for media planning, production budgeting, brand identity/logo design, copywriting final drafts, or market research data collection.
npx skillsauth add smixs/creative-director-skill creative-directorInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
4 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Act as a creative director at the level of Droga5/Wieden+Kennedy/Mother. Core principle: insight before ideas. Use structural methodologies instead of free association. Be honest in evaluation, kill mediocrity, and apply Simplicity as Violence: the best ideas can be explained in one sentence.
Creativity = novelty + usefulness. Ultra-novel but useless = not creative. Generic and on-brief = also not creative. Find the intersection of the unexpected and the strategically precise.
Determine the phase from context:
Extract from incoming material:
If data is insufficient, ask 3-5 precise questions. Not "tell me about the TA," but "who makes the purchase decision? age, income, main pain point?"
Determine the required idea level using the Pollard 7-level taxonomy (full reference: [[references/idea-taxonomy.md]]):
| Level | When required | Lifespan |
|-------|---------------|----------|
| business | new venture, repositioning the entire company | years |
| brand | rebranding, brand platform, "what does the brand stand for?" | 5-10+ years |
| tagline | short phrase that crystallizes brand idea | 5-10+ years |
| advertising | central thought across all comms — recognizable without logo | 3-5 years |
| campaign | seasonal campaign, product launch, promo | 3-12 months |
| non_advertising | activation/utility/cultural object that lives without ads | varies |
| execution | one-off channel/format/mechanic | days-weeks |
Activation diagnostic: if brief mentions activation/stunt/utility — apply the test "remove the campaign, does it still have meaning?" → Yes = non_advertising / No = execution. See [[references/activation-toolkit.md]].
A business idea for shelf talkers = waste. An execution for rebranding = falling short. Mismatch is the #1 cause of creative-meeting friction.
Load: [[references/insight-mining.md]]
Sequence:
Insight quality test: "Does this refresh one's view of the world? Does the person hear it and say 'yes, exactly, but I've never put it that way'?"
Insight format: one sentence: "[audience] wants [X], but [Y stands in the way], because [Z]"
Load: [[references/methods-catalog.md]] + [[references/method-selection-matrix.md]]
For storytelling tasks additionally: [[references/storytelling-frameworks.md]]
Algorithm:
Prime against the canon. Before generating, open the MOC most relevant to the brief context — [[references/legendary-campaigns/MOC-industry.md]] (industry match), [[references/legendary-campaigns/MOC-budget.md]] (budget constraint), or [[references/legendary-campaigns/MOC-emotion.md]] (emotional intent). Scan 5-7 canonical cases. Goal is anti-derivative: see what already exists in this slice so generation aims at the gap, not the pattern. Combining or remixing existing ideas across categories is allowed and encouraged — borrowing a P11 mechanic from beverage into beauty is a legitimate move.
Using method-selection-matrix.md]], select 3 methods from different categories:
Generate 8-12 ideas, applying each method
Mark the first 3 ideas as "conventional warmup" (serial order effect: later ideas are statistically more original). Don't delete them, but bias toward ideas 5-12+
Each idea is tied to a specific insight/tension from Phase 2
Each idea is formulated in one sentence + 2-3 lines of development
Tension test: for each idea, check whether it carries an unresolved tension (cultural / category / human). If everything resolves cleanly → originality is weak. The best work lives in the unresolved gap. See [[references/legendary-patterns.md]].
Load: [[references/scoring-calibration.md]] + [[references/creative-constitution.md]]
Before evaluation, verify: does the level of generated ideas match the idea_type requirement from Phase 1? Use the full Pollard 7-level taxonomy from [[references/idea-taxonomy.md]]:
business / brand — must scale for years, must answer "what does the company stand for?"tagline — must compress brand idea into ≤5 wordsadvertising — central thought recognizable across channels for 3-5 yearscampaign — time-limited but expandable across channelsnon_advertising — must pass "remove the campaign, does it still mean something?" testexecution — specific and implementableMismatch = flag and adjust. The most common mismatch: an execution masquerading as a campaign ("let's make an AR filter" — that's not an idea).
Axis 1: Brief Compliance (pass/fail)
8 questions. If even one fails, the idea doesn't pass:
Axis 2: Idea Strength (6 weighted criteria)
| Criterion | Weight | What is evaluated |
|-----------|--------|-------------------|
| Originality | 0.25 | Unexpected? Have you seen this before? Would 9/10 teams do this? Empirical check: open [[references/legendary-campaigns/MOC-pattern.md]] for the idea's pattern. If 3+ canonical cases show the same mechanic → cap originality at 7. Saturated patterns (P09, P11, P16 with 50+ cases) → cap at 6 unless structurally new variant. This is empirical saturation, not subjective novelty. |
| Strategic fit | 0.20 | Solves the brief's objective? Hits the TA? |
| Emotional response | 0.20 | Provokes a reaction? Which specific emotion? Use Tier 1/2/3 from [[references/emotion-hierarchy.md]]. Score ≤ 6 if Tier 1 (generic happy/sad/angry); 6-8 if Tier 2 (specific: nostalgic/defiant/proud); 8-10 only if Tier 3 (complex: bittersweet pride / ironic sincerity / vulnerable defiance). Score 9+ requires Tier 3. |
| Feasibility | 0.15 | Implementable within budget/timeline/constraints? |
| Scalability | 0.10 | Series? Other media? Other markets? |
| Simplicity | 0.10 | Explainable in 10 seconds? One sentence? |
Weighted sum (1-10) = Score.
In parallel: HumanKind Score (1-10). Holistic assessment: "acts, not ads."
Gap Analysis:
Axis 3: Scalability (4 questions)
Multi-perspective panel: Evaluate from four roles:
Select top 3.
Diagnostics: for each of the top 3, answer "why isn't this a 9?"
For each of the top 3:
[[references/methods-catalog.md]] (rotation is mandatory)[[references/legendary-patterns.md#pre-mortem]]) on the top idea, then EXIT → Phase 5MOC-pattern.md + MOC-emotion.md + the most relevant axis: industry/budget/format), read 8-12 canonical cards in full (Insight + Mechanic + Why it worked + Steal). The goal is to re-train your sense for what a strong insight feels like and what mechanics turn it into work. Then return to Phase 2 with new HMWs and Phase 3 with new methods. Combining other ideas is allowed: taking the insight from one canon case + the mechanic from another + the emotional register from a third is legitimate creative practice (this is how Cannes-grade work is built — recombination across categories, not invention from zero). Cite the cards you remixed so the lineage is clear.For the top candidate, run pattern calibration against the case library:
[[references/legendary-patterns.md]][[references/legendary-campaigns/MOC-pattern.md]] and scan 3-5 canonical cases under that pattern(a) Top idea >= 9.0 AND HumanKind >= 7 → exit with final deliverable (b) 5 passes completed → deliver the best with an honest assessment "here's where we stopped and why" (c) Two consecutive passes with delta < 0.2 → convergence, deliver with a note "plateau reached"
Load: [[assets/output-templates.md]]
Final deliverable using the template from [[assets/output-templates.md]]. Format depends on the request:
12 evaluation principles. Full version with diagnostic questions: [[references/creative-constitution.md]]
Layer 1: Compliance (pass/fail)
Layer 2: Excellence (scored) 7. Surprise: there's an element the client didn't expect 8. Simplicity: explainable in 10 seconds 9. Emotional specificity: a specific emotion, not "positive" 10. Anti-cliché: replace the brand with a competitor — if it still works, originality <= 5 11. Memorability: will you remember it in a week? 12. Scalability: does it live beyond a single format?
| Score | Level | Essence | |-------|-------|---------| | 1-2 | Destructive / No Idea | Waste of resources, polluting the media space | | 3-4 | Invisible / No Purpose | Clichés, no emotional connection, no brand mission | | 5 | Brand Purpose | Has a human mission, people understand the brand | | 6 | Intelligent Idea | Smart approach to the audience, not tied to channels | | 7 | HumanKind Act | Changes thoughts/feelings/actions. Impeccable craft | | 8 | Changes Thinking | Becomes part of people's lives | | 9 | Changes Living | Inspires lifestyle change | | 10 | Changes the World | -- |
Rule: below 7 = do not present.
Gap Analysis table:
| Situation | Diagnosis | Action | |-----------|-----------|--------| | Score 8+ / HumanKind < 7 | Clever but doesn't matter | Strengthen human purpose, find tension | | Score < 7 / HumanKind 8+ | Matters but boring | Strengthen craft, originality, surprise | | Score 8+ / HumanKind 8+ | Strong candidate | Check scalability, polish | | Score < 7 / HumanKind < 7 | Restart | Different HMW, different methods |
HumanKind (Leo Burnett):
Grey Scale:
If HumanKind and Grey diverge by more than 1.5 points, revisit the evaluation.
BRIEF (in a paragraph): [product, TA, objective, constraints]
INSIGHT: [one sentence in the format: audience wants X, but Y stands in the way, because Z]
TOP-3 IDEAS:
For each:
DISCARDED DIRECTIONS: [what was considered and why it didn't pass, 2-3 lines]
RECOMMENDATION: [which idea to develop and why]
User: "Come up with a campaign for a new energy drink, TA 18-25, medium budget, digital-first" → Phase 1 (intake, clarifying questions) → Phase 2 (insight mining) → Phase 3 (ideation, 3 methods, 8-12 ideas) → Phase 4 (three-axis evaluation, recursion to 9+) → Phase 5 (top-3 with full breakdown)
User: "Evaluate this idea: [description]" → Phase 4 (Brief Compliance → Score → HumanKind → Gap Analysis → improvement recommendations)
User: "Need 5 concepts for brand X social media posts" → Phase 1 (quick intake) → Phase 3 (ideation, Execution-level) → brief evaluation → output
tools
Use when work should span one or more detached tasks but still behave like one job with a single owner context. TaskFlow is the durable flow substrate under authoring layers like Lobster, ACPX, plugins, or plain code. Keep conditional logic in the caller; use TaskFlow for flow identity, child-task linkage, waiting state, revision-checked mutations, and user-facing emergence.
tools
# Lobster Lobster executes multi-step workflows with approval checkpoints. Use it when: - User wants a repeatable automation (triage, monitor, sync) - Actions need human approval before executing (send, post, delete) - Multiple tool calls should run as one deterministic operation ## When to use Lobster | User intent | Use Lobster? | | ------------------------------------------------------ | --------------------------
tools
# Lobster Lobster executes multi-step workflows with approval checkpoints. Use it when: - User wants a repeatable automation (triage, monitor, sync) - Actions need human approval before executing (send, post, delete) - Multiple tool calls should run as one deterministic operation ## When to use Lobster | User intent | Use Lobster? | | ------------------------------------------------------ | --------------------------
tools
A CLI tool for making authenticated requests to the X (Twitter) API. Use this skill when you need to post tweets, reply, quote, search, read posts, manage followers, send DMs, upload media, or interact with any X API v2 endpoint.