skills/architect-ai-providers/SKILL.md
Integration architecture for AI model providers including OpenAI, Google Gemini, and Anthropic within Braze messaging workflows.
npx skillsauth add delta-and-beta/braze-agency architect-ai-providersInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
★ Insight ─────────────────────────────────────
The plugin-dev:skill-development skill emphasizes progressive disclosure — SKILL.md should be lean (1,500–2,000 words) with imperative writing, while heavy reference content lives in references/ files loaded only when needed. Since the user asked for just the markdown body (no frontmatter), I'll generate the body content that would go after frontmatter in a real SKILL.md.
─────────────────────────────────────────────────
This skill covers the architecture for integrating third-party AI model providers — OpenAI, Google Gemini, and Anthropic — into Braze messaging workflows. Use it when designing systems that leverage large language models (LLMs) for content generation, personalization, and intelligent automation within Braze campaigns, Canvases, and messaging pipelines.
The lens here is architectural: not just how individual API calls work, but how to structure reliable, observable, and maintainable AI-powered systems at the intersection of Braze and external AI providers.
Apply this skill when:
OpenAI's GPT model family is the most widely adopted LLM for Braze integrations. Key capabilities relevant to Braze workflows:
Primary integration patterns:
Gemini's multimodal capabilities extend AI use in Braze beyond text:
Primary integration patterns:
Claude's design emphasis on safety, instruction-following, and long-context fidelity makes it well-suited for regulated industries and high-stakes messaging:
Primary integration patterns:
Call an AI provider API directly from a Braze Connected Content block during message rendering.
Use when: Content must reflect the freshest possible user context and real-time data.
Trade-offs:
default fallbacks in Connected ContentKey design decisions:
max_tokens aggressively to bound latencyGenerate AI content in an upstream pipeline and store the output as Braze Custom Attributes or Catalog entries before the send is triggered.
Use when: Content generation requires complex reasoning, large context, or multi-step workflows that exceed Connected Content latency budgets.
Trade-offs:
Key design decisions:
generated_at timestamp attribute alongside itPre-generate parameterized content templates server-side, then inject dynamic user variables at send time via Connected Content or Liquid.
Use when: Content structure is stable but a small number of user-specific variables need real-time injection.
Example: Pre-generate a product recommendation paragraph with a {{product_name}} slot server-side, store it as a Custom Attribute, then use Liquid to inject the user's browsed product at send time without a live AI call.
| Dimension | OpenAI | Google Gemini | Anthropic (Claude) | |---|---|---|---| | Text generation quality | Excellent | Excellent | Excellent | | Instruction fidelity | High | High | Very High | | Multimodal | GPT-4o (text + image) | Native multimodal | Text-primary | | Long context | 128k tokens | 1M tokens | 200k tokens | | Structured output (JSON) | Native JSON mode | Function calling | Tool use | | Latency (p50) | Low–medium | Medium | Medium | | Regulated content use cases | Moderate | Moderate | Strong | | Fine-tuning | Yes | Yes | No (Claude) |
Design every AI integration with a deterministic fallback:
Treat prompts as versioned artifacts:
This skill synthesizes knowledge from the following topic references:
For detailed API call patterns and example Connected Content templates, consult:
references/openai-integration.md — GPT Connected Content templates, JSON mode patterns, embedding-based segmentationreferences/gemini-integration.md — Multimodal workflows, batch enrichment architecture, large-context prompt designreferences/anthropic-integration.md — Claude constraints, regulated content workflows, tool use patterns for Brazereferences/provider-comparison.md — Detailed capability matrix, latency benchmarks, cost modeling guidance★ Insight ─────────────────────────────────────
The skill body intentionally avoids deep API call specifics — those belong in references/*.md files per the progressive disclosure principle. The body stays focused on architectural decision-making (when to use each provider, which pattern to apply), which is the stable, always-useful layer. Provider API details change faster and are larger, making them ideal candidates for reference files loaded only when needed.
─────────────────────────────────────────────────
development
Cross-platform audience synchronization design across advertising platforms including Facebook, Google, TikTok, LinkedIn, and programmatic networks.
development
Defines cross-cutting API patterns for authentication, provisioning, preference management, and content delivery.
development
Covers API basics, authentication, rate limits, error codes, endpoint overview, data retention policies, and Postman collection usage.
development
Designs workspace structures, manages platform settings, rate limits, and API configurations.