plugins/salesforce-master/skills/data-cloud-2025/SKILL.md
Salesforce Data Cloud integration patterns and architecture (2025). PROACTIVELY activate for: (1) Data Cloud setup and ingestion, (2) Data Streams (cloud, mobile, web SDK, ingestion API), (3) data model objects (DMO) and source objects (DSO), (4) identity resolution and unified profiles, (5) calculated insights and segmentation, (6) activations to Marketing Cloud, advertising platforms, Salesforce CRM, (7) Bring Your Own Lake (BYOL) with Snowflake, BigQuery, Databricks, (8) zero-copy data sharing, (9) Data Cloud + Agentforce grounding, (10) consent management and compliance. Provides: data-stream selection matrix, identity resolution rules, segmentation patterns, BYOL configuration, and activation playbook.
npx skillsauth add JosiahSiegel/claude-plugin-marketplace data-cloud-2025Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
MANDATORY: Always Use Backslashes on Windows for File Paths
When using Edit or Write tools on Windows, you MUST use backslashes (\) in file paths, NOT forward slashes (/).
Examples:
D:/repos/project/file.tsxD:\repos\project\file.tsxThis applies to:
NEVER create new documentation files unless explicitly requested by the user.
Salesforce Data Cloud is a real-time customer data platform (CDP) that unifies data from any source to create a complete, actionable view of every customer. It powers AI, automation, and analytics across the entire Customer 360 platform.
Key Capabilities:
Detailed material lives in references/. Load only what the current task needs.
| Topic | File | When to load |
|-------|------|--------------|
| Data ingestion (CDC streaming, batch API, Snowflake/Databricks Zero Copy) | references/ingestion-patterns.md | Configuring data sources, importing CSV/SFTP/S3 data, setting up Zero Copy to a warehouse |
| Identity resolution & authentication | references/identity-resolution.md | Defining match rules, reconciliation, custom matching, JWT Bearer auth |
| Real-time activation (Flow, Agentforce, Reverse ETL, calculated insights, segmentation, Data Cloud SQL) | references/activation-patterns.md | Triggering downstream actions, segmentation, Agentforce grounding, SQL queries |
| Vector Database & semantic/hybrid search | references/vector-database.md | Unstructured data indexing, semantic search, Einstein Copilot Search, multi-language search |
┌──────────────────────────────────────────────────────────┐
│ Data Sources │
│ Salesforce CRM │ External Apps │ Data Warehouses │ APIs │
└────────┬─────────────────┬──────────────┬───────────┬────┘
│ │ │ │
┌────▼─────────────────▼──────────────▼───────────▼────┐
│ Data Cloud Connectors & Ingestion │
│ ├─ Real-time Streaming (Change Data Capture) │
│ ├─ Batch Import (scheduled/on-demand) │
│ └─ Zero Copy (Snowflake, Databricks, BigQuery) │
└────────────────────────┬─────────────────────────────┘
│
┌────────────────────────▼─────────────────────────────┐
│ Data Model & Harmonization │
│ ├─ Map to Common Data Model (DMO objects) │
│ ├─ Identity Resolution (match & merge) │
│ └─ Data Transformation (calculated insights) │
└────────────────────────┬─────────────────────────────┘
│
┌────────────────────────▼─────────────────────────────┐
│ Unified Customer Profile (360° View) │
│ ├─ Demographics, Transactions, Behavior, Events │
│ └─ Real-time Profile API for instant access │
└────────────────────────┬─────────────────────────────┘
│
┌────────────────────────▼─────────────────────────────┐
│ Activation & Actions │
│ ├─ Salesforce Flow (real-time automation) │
│ ├─ Marketing Cloud (segmentation/journeys) │
│ ├─ Agentforce (AI agents) │
│ ├─ Einstein AI (predictions/recommendations) │
│ └─ External Systems (reverse ETL) │
└──────────────────────────────────────────────────────┘
development
This skill should be used when the user asks to train, debug, scale, or improve ML models. PROACTIVELY activate for: (1) PyTorch, TensorFlow/Keras, JAX, Flax, Hugging Face Trainer/Accelerate training loops, (2) distributed training, DDP/FSDP/DeepSpeed, TPU/GPU setup, (3) mixed precision AMP/bf16, gradient accumulation, checkpointing, seeding, (4) overfitting, imbalance, loss functions, regularization, LR schedules, warmup, (5) memory optimization, gradient checkpointing, offloading, quantization-aware training. Provides: reproducible training best practices across deep learning and classical ML.
development
This skill should be used when the user asks to productionize, track, version, govern, monitor, or automate ML systems. PROACTIVELY activate for: (1) MLflow, Weights & Biases, Neptune, Comet, ClearML experiment tracking, (2) model registry, model versioning, artifact lineage, reproducibility, (3) Kubeflow, SageMaker Pipelines, Vertex AI Pipelines, Azure ML pipelines, Databricks workflows, (4) CI/CD, continuous training/evaluation, A/B tests, canary/shadow deployments, (5) drift detection, model monitoring, data validation, responsible AI governance. Provides: end-to-end MLOps architecture and operational safeguards.
development
This skill should be used when the user asks to optimize, export, serve, compress, or accelerate ML inference. PROACTIVELY activate for: (1) latency, throughput, p95/p99, batching, concurrency, KV cache, memory, or cost issues, (2) quantization INT8/INT4, GPTQ, AWQ, bitsandbytes, pruning, sparsity, distillation, (3) ONNX export, ONNX Runtime, TensorRT, TorchScript, torch.compile, XLA, OpenVINO, Core ML, TFLite, (4) Triton, TorchServe, TF Serving, BentoML, Seldon, KServe configuration, (5) edge deployment, CPU/GPU/TPU/Inferentia serving. Provides: hardware-aware inference optimization and safe benchmarking.
testing
This skill should be used when the user asks to tune hyperparameters, run sweeps, optimize search spaces, or use AutoML. PROACTIVELY activate for: (1) Optuna, Ray Tune, FLAML, AutoGluon, Hyperopt, Nevergrad, KerasTuner, W&B sweeps, (2) grid search, random search, Bayesian optimization, TPE, Gaussian processes, evolutionary search, (3) ASHA, Hyperband, successive halving, multi-fidelity optimization, population-based training, (4) learning-rate finder, batch-size search, early stopping, pruning, (5) reproducible sweep design and experiment analysis. Provides: budget-aware hyperparameter search strategy.