plugins/twilio-developer-kit/skills/twilio-customer-memory/SKILL.md
Store and retrieve customer context using Twilio Conversation Memory. Covers Memory Store provisioning, profile management, traits, observations, conversation summaries, and semantic Recall. Use this skill to give AI agents or human agents persistent memory of customer interactions across sessions and channels.
npx skillsauth add openai/plugins twilio-customer-memoryInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Conversation Memory gives your application persistent customer memory. Observations (what happened) and traits (who the customer is) are written automatically from conversations flowing through Conversation Orchestrator/Orchestrator — or posted directly if you run your own extraction. Retrieve relevant context via Recall before responding.
Conversation Orchestrator/Orchestrator conversation → auto-extracted observations & summaries → Memory Store
Your App → Recall → relevant context injected into LLM prompt
All Conversation Memory APIs are on memory.twilio.com. Observations, traits, profiles, summaries — everything is on the same host.
Auth: Basic Auth — TWILIO_ACCOUNT_SID and TWILIO_AUTH_TOKEN.
twilio-account-setupTWILIO_ACCOUNT_SID and TWILIO_AUTH_TOKEN — see twilio-iam-auth-setuptwilio-conversation-orchestratorDo this before setting up Conversation Orchestrator/Orchestrator. The Memory Store SID goes into your conversation service config.
Python
import os, requests
account_sid = os.environ["TWILIO_ACCOUNT_SID"]
auth_token = os.environ["TWILIO_AUTH_TOKEN"]
store = requests.post(
"https://memory.twilio.com/v1/Services",
auth=(account_sid, auth_token),
json={
"uniqueName": "my-app-memory",
"friendlyName": "My App Memory Store"
}
).json()
memory_store_sid = store["sid"]
print(memory_store_sid)
Node.js
const accountSid = process.env.TWILIO_ACCOUNT_SID;
const authToken = process.env.TWILIO_AUTH_TOKEN;
const store = await fetch("https://memory.twilio.com/v1/Services", {
method: "POST",
headers: {
"Authorization": "Basic " + btoa(`${accountSid}:${authToken}`),
"Content-Type": "application/json",
},
body: JSON.stringify({
uniqueName: "my-app-memory",
friendlyName: "My App Memory Store",
}),
}).then(r => r.json());
const memoryStoreSid = store.sid;
Use memory_store_sid when creating your Conversations Service in Conversation Orchestrator/Orchestrator. The two must be linked for automatic observation and summary extraction to work.
Profiles are created automatically when conversations flow through Conversation Orchestrator/Orchestrator — the conversation config determines how participants are resolved into profiles. You can also create or enrich profiles manually using traits.
Create a profile manually with traits:
Python
profile = requests.post(
f"https://memory.twilio.com/v1/Services/{memory_store_sid}/Profiles",
auth=(account_sid, auth_token),
json={
"traits": {
"Contact": {
"phone": "+15558675310",
"firstName": "Alyssa",
"lastName": "Mock",
"email": "[email protected]"
}
}
}
).json()
profile_id = profile["id"]
Node.js
const profile = await fetch(
`https://memory.twilio.com/v1/Services/${memoryStoreSid}/Profiles`,
{
method: "POST",
headers: {
"Authorization": "Basic " + btoa(`${accountSid}:${authToken}`),
"Content-Type": "application/json",
},
body: JSON.stringify({
traits: {
Contact: {
phone: "+15558675310",
firstName: "Alyssa",
lastName: "Mock",
email: "[email protected]",
}
}
}),
}
).then(r => r.json());
const profileId = profile.id;
Look up a profile by phone number (for inbound calls where you only have the caller's number):
Python
lookup = requests.post(
f"https://memory.twilio.com/v1/Services/{memory_store_sid}/Profiles/Lookup",
auth=(account_sid, auth_token),
json={"idType": "phone", "value": "+15558675310"}
).json()
profile_id = lookup["profiles"][0]["id"] if lookup.get("profiles") else None
Observations are extracted automatically from conversations when a conversation becomes inactive or is closed, based on your conversation config. You don't need to write them manually for Conversation Orchestrator-managed conversations.
If you run your own extraction (custom pipeline outside Conversation Orchestrator), post results directly:
Python
requests.post(
f"https://memory.twilio.com/v1/Services/{memory_store_sid}/Profiles/{profile_id}/Observations",
auth=(account_sid, auth_token),
json={
"observations": [
{
"content": "Customer asked about order #4521. Wants expedited shipping. Prefers SMS updates.",
"source": "custom_extraction",
"occurredAt": "2026-04-20T14:30:00Z",
"conversationIds": [conversation_sid]
}
]
}
)
Node.js
await fetch(
`https://memory.twilio.com/v1/Services/${memoryStoreSid}/Profiles/${profileId}/Observations`,
{
method: "POST",
headers: {
"Authorization": "Basic " + btoa(`${accountSid}:${authToken}`),
"Content-Type": "application/json",
},
body: JSON.stringify({
observations: [{
content: "Customer asked about order #4521. Wants expedited shipping. Prefers SMS updates.",
source: "custom_extraction",
occurredAt: new Date().toISOString(),
conversationIds: [conversationSid],
}]
}),
}
);
Batch up to 10 observations in one request.
Recall runs hybrid lexical + semantic search and returns the most relevant observations and summaries for an LLM prompt.
Recommended: pass a conversationId from Conversation Orchestrator/Orchestrator. Recall builds a contextually relevant query from the active conversation automatically — no need to craft one yourself.
Python
recall = requests.post(
f"https://memory.twilio.com/v1/Services/{memory_store_sid}/Profiles/{profile_id}/Recall",
auth=(account_sid, auth_token),
json={
"conversationId": orchestrator_conversation_sid,
"observationsLimit": 10,
"summariesLimit": 3,
}
).json()
observations = "\n".join(o["content"] for o in recall.get("observations", []))
summaries = "\n".join(s["content"] for s in recall.get("summaries", []))
system_prompt = f"""You are a helpful support agent.
Customer history:
{observations}
Recent summaries:
{summaries}"""
Node.js
const recall = await fetch(
`https://memory.twilio.com/v1/Services/${memoryStoreSid}/Profiles/${profileId}/Recall`,
{
method: "POST",
headers: {
"Authorization": "Basic " + btoa(`${accountSid}:${authToken}`),
"Content-Type": "application/json",
},
body: JSON.stringify({
conversationId: orchestratorConversationSid,
observationsLimit: 10,
summariesLimit: 3,
}),
}
).then(r => r.json());
const context = [
...recall.observations.map(o => o.content),
...recall.summaries.map(s => s.content),
].join("\n");
Other Recall modes:
| Mode | How | When to use |
|------|-----|-------------|
| Conversation ID (recommended) | "conversationId": orchestrator_sid | Active Conversation Orchestrator/Orchestrator conversation — query is generated from conversation context |
| Custom query | "query": "your question" | Custom pipelines outside Conversation Orchestrator, or when you need precise control over relevance |
| No query | Omit both query and conversationId | Returns most recent observations in chronological order — useful for loading history at session start |
Traits are organized into named groups. The Contact group is the standard identity anchor — its fields are promoted to profile identifiers for lookup.
| Group | Fields | Use |
|-------|--------|-----|
| Contact | phone, email, firstName, lastName | Identity anchor — always include |
| Account | accountNumber, tier, region | Business account data |
| Support | disposition, caseId, lastIssueType | Support history |
Define your own groups for domain-specific data.
Summaries are written automatically at conversation close or when a conversation goes inactive, based on your conversation config — the same trigger as observations. You can also write them manually:
Python
requests.post(
f"https://memory.twilio.com/v1/Services/{memory_store_sid}/Profiles/{profile_id}/ConversationSummaries",
auth=(account_sid, auth_token),
json={
"conversationId": conversation_sid,
"content": "Customer called about order #4521. Resolved: approved expedited upgrade.",
"source": "manual"
}
)
Summaries are returned in the summaries array of Recall results.
Retrieve memory at call start, store observations at call end. For voice AI agents on ConversationRelay.
Python (WebSocket handler)
async def handle_call(websocket):
setup = json.loads(await websocket.recv())
caller = setup.get("from", "unknown")
# Look up profile by caller phone
lookup = requests.post(
f"https://memory.twilio.com/v1/Services/{MEMORY_STORE_SID}/Profiles/Lookup",
auth=(ACCOUNT_SID, AUTH_TOKEN),
json={"idType": "phone", "value": caller}
).json()
profiles = lookup.get("profiles", [])
profile_id = profiles[0]["id"] if profiles else None
context = ""
if profile_id:
recall = requests.post(
f"https://memory.twilio.com/v1/Services/{MEMORY_STORE_SID}/Profiles/{profile_id}/Recall",
auth=(ACCOUNT_SID, AUTH_TOKEN),
json={"observationsLimit": 5, "summariesLimit": 2}
).json()
context = "\n".join(o["content"] for o in recall.get("observations", []))
system_prompt = f"You are a helpful agent.\n\nCustomer history:\n{context}" if context else "You are a helpful agent."
# ... handle conversation ...
# Store observation at end if running custom extraction
if profile_id:
requests.post(
f"https://memory.twilio.com/v1/Services/{MEMORY_STORE_SID}/Profiles/{profile_id}/Observations",
auth=(ACCOUNT_SID, AUTH_TOKEN),
json={"observations": [{"content": call_summary, "source": "voice_agent", "conversationIds": [orchestrator_conversation_sid]}]}
)
Use one Memory Store per client. The uniqueName doubles as a namespace.
# At client onboarding
store = requests.post(
"https://memory.twilio.com/v1/Services",
auth=(account_sid, auth_token),
json={"uniqueName": f"client-{client_id}", "friendlyName": client_name}
).json()
# Store store["sid"] in your tenant config — pass it to Conversation Orchestrator conversation service setup
twilio-debugging-observability.observationsLimit max 20, default 5. summariesLimit and communicationsLimit similar.twilio-conversation-orchestratortwilio-conversation-intelligencetwilio-enterprise-knowledgetwilio-voice-conversation-relaytwilio-debugging-observabilitytools
Top-level workflow skill for USD performance diagnosis and optimization. Use for slow loading, high memory, low FPS, or 'optimize my scene' requests; delegates auth/runtime setup to Phase 0 owners.
data-ai
Use when the user mentions MagicPath, designs, UI components, themes, canvas selections, or repo-to-canvas UI work; run magicpath-ai to search, inspect, install, or author components.
documentation
Use as the top-level router for Omniverse Realtime Viewer USD app requests and focused viewer reference documents.
tools
Turn Notion specs into implementation plans, tasks, and progress tracking; use when implementing PRDs/feature specs and creating Notion plans + tasks from them.