LangGraph Patterns

Patterns for building reliable, maintainable AI agent workflows with LangGraph. Graphs should have typed state, focused nodes, explicit routing, and proper error handling.

Typed State

Every graph MUST define its state as a TypedDict. The state is the single source of truth flowing through the graph.

from typing import TypedDict, Annotated
from langgraph.graph import add_messages

class AgentState(TypedDict):
    messages: Annotated[list, add_messages]  # Chat history with reducer
    context: str                              # Retrieved context
    plan: list[str]                           # Action plan steps
    current_step: int                         # Progress tracker
    error: str | None                         # Error state
    final_answer: str | None                  # Output

State Rules

Use TypedDict — never raw dicts
Use Annotated with reducers for append-only fields (like messages)
Include an error field for error propagation
Keep state flat — avoid deeply nested structures
Every field should have a clear purpose documented

Node Design

Each node is a function that takes state, performs ONE operation, and returns a state update.

async def retrieve_context(state: AgentState) -> dict:
    """Retrieve relevant context from the knowledge base."""
    query = state["messages"][-1].content

    try:
        docs = await retriever.ainvoke(query)
        context = "\n\n".join(doc.page_content for doc in docs)
        return {"context": context}
    except RetrieverError as e:
        return {"error": f"Context retrieval failed: {e}"}


async def generate_response(state: AgentState) -> dict:
    """Generate a response using the LLM with retrieved context."""
    if state.get("error"):
        return {}  # Skip if previous node errored

    prompt = RESPONSE_PROMPT.format(
        context=state["context"],
        question=state["messages"][-1].content,
    )

    response = await llm.ainvoke([
        SystemMessage(content=prompt),
        *state["messages"],
    ])

    return {"messages": [response], "final_answer": response.content}

Node Rules

Each node does ONE thing (retrieve, generate, validate, etc.)
Nodes return partial state updates (only the fields they modify)
Nodes handle their own errors and set the error field
Nodes check for previous errors and skip gracefully
Nodes are async when they do I/O
Nodes are independently testable

Graph Construction

Build graphs with explicit edges and clear flow:

from langgraph.graph import StateGraph, END

def build_rag_graph() -> StateGraph:
    """Build a RAG pipeline graph."""
    graph = StateGraph(AgentState)

    # Add nodes
    graph.add_node("retrieve", retrieve_context)
    graph.add_node("generate", generate_response)
    graph.add_node("validate", validate_response)
    graph.add_node("handle_error", handle_error)

    # Set entry point
    graph.set_entry_point("retrieve")

    # Add edges
    graph.add_edge("retrieve", "check_retrieval")
    graph.add_conditional_edges(
        "check_retrieval",
        route_after_retrieval,
        {
            "success": "generate",
            "error": "handle_error",
        },
    )
    graph.add_edge("generate", "validate")
    graph.add_conditional_edges(
        "validate",
        route_after_validation,
        {
            "valid": END,
            "invalid": "generate",  # Retry
            "error": "handle_error",
        },
    )
    graph.add_edge("handle_error", END)

    return graph.compile()

Conditional Routing

Use routing functions to direct flow based on state:

def route_after_retrieval(state: AgentState) -> str:
    """Route based on retrieval result."""
    if state.get("error"):
        return "error"
    if not state.get("context"):
        return "error"
    return "success"


def route_after_validation(state: AgentState) -> str:
    """Route based on validation result."""
    if state.get("error"):
        return "error"
    if state.get("validation_passed"):
        return "valid"
    if state.get("current_step", 0) >= MAX_RETRIES:
        return "error"
    return "invalid"

Routing Rules

Routing functions are pure — they only read state, never modify it
Return string keys that match the conditional edge mapping
Always include an error route
Add retry limits to prevent infinite loops
Log routing decisions for debugging

Tool Calling

Integrate tools through LangGraph's tool node pattern:

from langchain_core.tools import tool
from langgraph.prebuilt import ToolNode

@tool
def search_database(query: str, limit: int = 10) -> str:
    """Search the invoice database for matching records.

    Args:
        query: Search query string
        limit: Maximum number of results to return
    """
    results = db.search(query, limit=limit)
    return json.dumps([r.to_dict() for r in results])


@tool
def calculate_total(invoice_id: str) -> str:
    """Calculate the total for an invoice including tax.

    Args:
        invoice_id: The UUID of the invoice
    """
    invoice = db.get_invoice(invoice_id)
    total = sum(li.quantity * li.unit_price for li in invoice.line_items)
    tax = total * Decimal("0.08")
    return json.dumps({"subtotal": str(total), "tax": str(tax), "total": str(total + tax)})


# Create tool node
tools = [search_database, calculate_total]
tool_node = ToolNode(tools)

# Bind tools to LLM
llm_with_tools = llm.bind_tools(tools)

Tool Rules

Tools have clear, descriptive docstrings (the LLM reads them)
Tools return strings (serialized results)
Tools handle their own errors and return error messages
Tools are typed and validated
Keep tools focused — one action per tool

Checkpointing

Use checkpointing for long-running graphs and human-in-the-loop patterns:

from langgraph.checkpoint.memory import MemorySaver
from langgraph.checkpoint.postgres import PostgresSaver

# Development: in-memory checkpointing
memory = MemorySaver()
graph = build_graph().compile(checkpointer=memory)

# Production: persistent checkpointing
checkpointer = PostgresSaver.from_conn_string(DATABASE_URL)
graph = build_graph().compile(checkpointer=checkpointer)

# Invoke with thread_id for session persistence
config = {"configurable": {"thread_id": "user-session-123"}}
result = await graph.ainvoke(initial_state, config=config)

# Resume from checkpoint
result = await graph.ainvoke(None, config=config)  # Continues from last state

Checkpointing Rules

Always use checkpointing in production
Use thread_id to scope state per conversation/user
Use PostgresSaver or equivalent for persistence across restarts
Clean up old checkpoints periodically

Error Handling in Graphs

Design explicit error paths, not try/except around the whole graph:

async def handle_error(state: AgentState) -> dict:
    """Handle errors gracefully and produce a user-friendly response."""
    error_msg = state.get("error", "An unknown error occurred")

    return {
        "messages": [AIMessage(content=f"I encountered an issue: {error_msg}. Please try again.")],
        "final_answer": None,
    }

Error Handling Rules

Every graph has a dedicated error-handling node
Nodes set state["error"] instead of raising exceptions
Routing functions check for errors and redirect to error handler
Error handler produces a user-friendly message
Errors are logged with context (which node, what input)

Context Window Management

Long-running agents and multi-turn conversations will exceed the context window. Plan for this from the start — retroffitting compaction is expensive.

Message Compaction Integration

Combine LangGraph's trim_messages with Anthropic's Compaction API for layered context management:

from langchain_core.messages import trim_messages, RemoveMessage

class AgentState(TypedDict):
    messages: Annotated[list, add_messages]
    compaction_count: int          # Track compaction cycles for budget enforcement
    cumulative_tokens: int         # Estimated total tokens consumed across compactions

async def call_model(state: AgentState) -> dict:
    """LLM call with message trimming — first line of defense."""
    trimmed = trim_messages(
        state["messages"],
        max_tokens=100_000,        # Soft limit — well below context window
        strategy="last",
        token_counter=llm,
        include_system=True,
    )
    response = await llm.ainvoke(trimmed)
    return {"messages": [response]}

Compaction + Checkpointing Interplay

Checkpoints store the full state including messages. When compaction summarizes messages, the checkpoint captures the compacted version. Key rules:

Checkpoint BEFORE compaction — preserve the full state for replay/debugging
Checkpoint AFTER compaction — the compacted state becomes the new baseline
Thread-level compaction tracking — store compaction_count in state so it survives checkpoints
Budget enforcement across compactions — a 200K-token context compacted 10 times = 2M tokens billed

TRIGGER = 100_000
BUDGET = 3_000_000

async def check_budget(state: AgentState) -> dict:
    """Gate node — enforce total token budget across compaction cycles."""
    total = state.get("compaction_count", 0) * TRIGGER
    if total >= BUDGET:
        return {
            "messages": [AIMessage(content="Wrapping up — token budget reached.")],
            "error": "budget_exceeded",
        }
    return {}

Message Cleanup for Stale Tool Results

Long-running agents accumulate tool call/result pairs that are no longer relevant. Remove them to keep context lean:

async def cleanup_stale_messages(state: AgentState) -> dict:
    """Remove tool messages older than the last N turns."""
    messages = state["messages"]
    keep_after = max(0, len(messages) - 20)  # Keep last 20 messages

    removals = []
    for i, msg in enumerate(messages[:keep_after]):
        if msg.type in ("tool", "tool_call"):
            removals.append(RemoveMessage(id=msg.id))
    return {"messages": removals}

Context Window Management Rules

Any agent that can run more than 10 tool iterations MUST have a compaction strategy
Track compaction_count in state for budget enforcement
Use trim_messages before every LLM call as first defense
Clean up stale tool call/result pairs periodically
Checkpoint both before and after compaction events
System prompt MUST have its own cache_control breakpoint so it survives compaction

Prompt Management

Never hardcode prompts in node functions. Use templates with clear variables:

RETRIEVAL_PROMPT = """You are a helpful assistant answering questions about invoices.

Context from the knowledge base:
{context}

User question: {question}

Instructions:
- Answer based ONLY on the provided context
- If the context doesn't contain the answer, say so
- Cite specific parts of the context in your answer
"""

# In the node:
prompt = RETRIEVAL_PROMPT.format(
    context=state["context"],
    question=state["messages"][-1].content,
)

Prompt Rules

Prompts are module-level constants or loaded from files
Variables use {name} format for .format() substitution
Prompts include clear instructions and constraints
System prompts are separate from user prompts
Prompts are versioned alongside code

LangGraph Patterns

Patterns for building reliable, maintainable AI agent workflows with LangGraph. Graphs should have typed state, focused nodes, explicit routing, and proper error handling.

Typed State

Every graph MUST define its state as a TypedDict. The state is the single source of truth flowing through the graph.

from typing import TypedDict, Annotated
from langgraph.graph import add_messages

class AgentState(TypedDict):
    messages: Annotated[list, add_messages]  # Chat history with reducer
    context: str                              # Retrieved context
    plan: list[str]                           # Action plan steps
    current_step: int                         # Progress tracker
    error: str | None                         # Error state
    final_answer: str | None                  # Output

State Rules

Use TypedDict — never raw dicts
Use Annotated with reducers for append-only fields (like messages)
Include an error field for error propagation
Keep state flat — avoid deeply nested structures
Every field should have a clear purpose documented

Node Design

Each node is a function that takes state, performs ONE operation, and returns a state update.

async def retrieve_context(state: AgentState) -> dict:
    """Retrieve relevant context from the knowledge base."""
    query = state["messages"][-1].content

    try:
        docs = await retriever.ainvoke(query)
        context = "\n\n".join(doc.page_content for doc in docs)
        return {"context": context}
    except RetrieverError as e:
        return {"error": f"Context retrieval failed: {e}"}


async def generate_response(state: AgentState) -> dict:
    """Generate a response using the LLM with retrieved context."""
    if state.get("error"):
        return {}  # Skip if previous node errored

    prompt = RESPONSE_PROMPT.format(
        context=state["context"],
        question=state["messages"][-1].content,
    )

    response = await llm.ainvoke([
        SystemMessage(content=prompt),
        *state["messages"],
    ])

    return {"messages": [response], "final_answer": response.content}

Node Rules

Each node does ONE thing (retrieve, generate, validate, etc.)
Nodes return partial state updates (only the fields they modify)
Nodes handle their own errors and set the error field
Nodes check for previous errors and skip gracefully
Nodes are async when they do I/O
Nodes are independently testable

Graph Construction

Build graphs with explicit edges and clear flow:

from langgraph.graph import StateGraph, END

def build_rag_graph() -> StateGraph:
    """Build a RAG pipeline graph."""
    graph = StateGraph(AgentState)

    # Add nodes
    graph.add_node("retrieve", retrieve_context)
    graph.add_node("generate", generate_response)
    graph.add_node("validate", validate_response)
    graph.add_node("handle_error", handle_error)

    # Set entry point
    graph.set_entry_point("retrieve")

    # Add edges
    graph.add_edge("retrieve", "check_retrieval")
    graph.add_conditional_edges(
        "check_retrieval",
        route_after_retrieval,
        {
            "success": "generate",
            "error": "handle_error",
        },
    )
    graph.add_edge("generate", "validate")
    graph.add_conditional_edges(
        "validate",
        route_after_validation,
        {
            "valid": END,
            "invalid": "generate",  # Retry
            "error": "handle_error",
        },
    )
    graph.add_edge("handle_error", END)

    return graph.compile()

Conditional Routing

Use routing functions to direct flow based on state:

def route_after_retrieval(state: AgentState) -> str:
    """Route based on retrieval result."""
    if state.get("error"):
        return "error"
    if not state.get("context"):
        return "error"
    return "success"


def route_after_validation(state: AgentState) -> str:
    """Route based on validation result."""
    if state.get("error"):
        return "error"
    if state.get("validation_passed"):
        return "valid"
    if state.get("current_step", 0) >= MAX_RETRIES:
        return "error"
    return "invalid"

Routing Rules

Routing functions are pure — they only read state, never modify it
Return string keys that match the conditional edge mapping
Always include an error route
Add retry limits to prevent infinite loops
Log routing decisions for debugging

Tool Calling

Integrate tools through LangGraph's tool node pattern:

from langchain_core.tools import tool
from langgraph.prebuilt import ToolNode

@tool
def search_database(query: str, limit: int = 10) -> str:
    """Search the invoice database for matching records.

    Args:
        query: Search query string
        limit: Maximum number of results to return
    """
    results = db.search(query, limit=limit)
    return json.dumps([r.to_dict() for r in results])


@tool
def calculate_total(invoice_id: str) -> str:
    """Calculate the total for an invoice including tax.

    Args:
        invoice_id: The UUID of the invoice
    """
    invoice = db.get_invoice(invoice_id)
    total = sum(li.quantity * li.unit_price for li in invoice.line_items)
    tax = total * Decimal("0.08")
    return json.dumps({"subtotal": str(total), "tax": str(tax), "total": str(total + tax)})


# Create tool node
tools = [search_database, calculate_total]
tool_node = ToolNode(tools)

# Bind tools to LLM
llm_with_tools = llm.bind_tools(tools)

Tool Rules

Tools have clear, descriptive docstrings (the LLM reads them)
Tools return strings (serialized results)
Tools handle their own errors and return error messages
Tools are typed and validated
Keep tools focused — one action per tool

Checkpointing

Use checkpointing for long-running graphs and human-in-the-loop patterns:

from langgraph.checkpoint.memory import MemorySaver
from langgraph.checkpoint.postgres import PostgresSaver

# Development: in-memory checkpointing
memory = MemorySaver()
graph = build_graph().compile(checkpointer=memory)

# Production: persistent checkpointing
checkpointer = PostgresSaver.from_conn_string(DATABASE_URL)
graph = build_graph().compile(checkpointer=checkpointer)

# Invoke with thread_id for session persistence
config = {"configurable": {"thread_id": "user-session-123"}}
result = await graph.ainvoke(initial_state, config=config)

# Resume from checkpoint
result = await graph.ainvoke(None, config=config)  # Continues from last state

Checkpointing Rules

Always use checkpointing in production
Use thread_id to scope state per conversation/user
Use PostgresSaver or equivalent for persistence across restarts
Clean up old checkpoints periodically

Error Handling in Graphs

Design explicit error paths, not try/except around the whole graph:

async def handle_error(state: AgentState) -> dict:
    """Handle errors gracefully and produce a user-friendly response."""
    error_msg = state.get("error", "An unknown error occurred")

    return {
        "messages": [AIMessage(content=f"I encountered an issue: {error_msg}. Please try again.")],
        "final_answer": None,
    }

Error Handling Rules

Every graph has a dedicated error-handling node
Nodes set state["error"] instead of raising exceptions
Routing functions check for errors and redirect to error handler
Error handler produces a user-friendly message
Errors are logged with context (which node, what input)

Context Window Management

Long-running agents and multi-turn conversations will exceed the context window. Plan for this from the start — retroffitting compaction is expensive.

Message Compaction Integration

Combine LangGraph's trim_messages with Anthropic's Compaction API for layered context management:

from langchain_core.messages import trim_messages, RemoveMessage

class AgentState(TypedDict):
    messages: Annotated[list, add_messages]
    compaction_count: int          # Track compaction cycles for budget enforcement
    cumulative_tokens: int         # Estimated total tokens consumed across compactions

async def call_model(state: AgentState) -> dict:
    """LLM call with message trimming — first line of defense."""
    trimmed = trim_messages(
        state["messages"],
        max_tokens=100_000,        # Soft limit — well below context window
        strategy="last",
        token_counter=llm,
        include_system=True,
    )
    response = await llm.ainvoke(trimmed)
    return {"messages": [response]}

Compaction + Checkpointing Interplay

Checkpoints store the full state including messages. When compaction summarizes messages, the checkpoint captures the compacted version. Key rules:

Checkpoint BEFORE compaction — preserve the full state for replay/debugging
Checkpoint AFTER compaction — the compacted state becomes the new baseline
Thread-level compaction tracking — store compaction_count in state so it survives checkpoints
Budget enforcement across compactions — a 200K-token context compacted 10 times = 2M tokens billed

TRIGGER = 100_000
BUDGET = 3_000_000

async def check_budget(state: AgentState) -> dict:
    """Gate node — enforce total token budget across compaction cycles."""
    total = state.get("compaction_count", 0) * TRIGGER
    if total >= BUDGET:
        return {
            "messages": [AIMessage(content="Wrapping up — token budget reached.")],
            "error": "budget_exceeded",
        }
    return {}

Message Cleanup for Stale Tool Results

Long-running agents accumulate tool call/result pairs that are no longer relevant. Remove them to keep context lean:

async def cleanup_stale_messages(state: AgentState) -> dict:
    """Remove tool messages older than the last N turns."""
    messages = state["messages"]
    keep_after = max(0, len(messages) - 20)  # Keep last 20 messages

    removals = []
    for i, msg in enumerate(messages[:keep_after]):
        if msg.type in ("tool", "tool_call"):
            removals.append(RemoveMessage(id=msg.id))
    return {"messages": removals}

Context Window Management Rules

Any agent that can run more than 10 tool iterations MUST have a compaction strategy
Track compaction_count in state for budget enforcement
Use trim_messages before every LLM call as first defense
Clean up stale tool call/result pairs periodically
Checkpoint both before and after compaction events
System prompt MUST have its own cache_control breakpoint so it survives compaction

Prompt Management

Never hardcode prompts in node functions. Use templates with clear variables:

RETRIEVAL_PROMPT = """You are a helpful assistant answering questions about invoices.

Context from the knowledge base:
{context}

User question: {question}

Instructions:
- Answer based ONLY on the provided context
- If the context doesn't contain the answer, say so
- Cite specific parts of the context in your answer
"""

# In the node:
prompt = RETRIEVAL_PROMPT.format(
    context=state["context"],
    question=state["messages"][-1].content,
)

Prompt Rules

Prompts are module-level constants or loaded from files
Variables use {name} format for .format() substitution
Prompts include clear instructions and constraints
System prompts are separate from user prompts
Prompts are versioned alongside code

Adoption

33prime/skills/stack/langgraph-patterns

$ install --global

Security Scan Results

SKILL.md

LangGraph Patterns

Typed State

State Rules

Node Design

Node Rules

Graph Construction

Conditional Routing

Routing Rules

Tool Calling

Tool Rules

Checkpointing

Checkpointing Rules

Error Handling in Graphs

Error Handling Rules

Context Window Management

Message Compaction Integration

Compaction + Checkpointing Interplay

Message Cleanup for Stale Tool Results

Context Window Management Rules

Prompt Management

Prompt Rules

Related Skills

33prime/skills/workflows/parallel-execution

33prime/skills/workflows/module-extraction

33prime/skills/workflows/forge-orchestrate

33prime/skills/workflows/code-review

33prime/skills/stack/langgraph-patterns

$ install --global

Security Scan Results

SKILL.md

LangGraph Patterns

Typed State

State Rules

Node Design

Node Rules

Graph Construction

Conditional Routing

Routing Rules

Tool Calling

Tool Rules

Checkpointing

Checkpointing Rules

Error Handling in Graphs

Error Handling Rules

Context Window Management

Message Compaction Integration

Compaction + Checkpointing Interplay

Message Cleanup for Stale Tool Results

Context Window Management Rules

Prompt Management

Prompt Rules

Related Skills

33prime/skills/workflows/parallel-execution

33prime/skills/workflows/module-extraction

33prime/skills/workflows/forge-orchestrate

33prime/skills/workflows/code-review