skills/litellm/SKILL.md
Unified LLM gateway and SDK for calling 100+ LLM providers (OpenAI, Anthropic, Bedrock, Vertex, Azure, Ollama, etc.) with a single OpenAI-compatible interface. MANDATORY TRIGGERS: litellm, litellm proxy, litellm router, litellm.completion, llm gateway, multi-provider llm, openai-compatible proxy, model fallback, llm cost tracking, llm load balancing. Also trigger when user wants to switch LLM providers without rewriting code, route requests across multiple models, add fallbacks/retries to LLM calls, track token costs, run a self-hosted OpenAI-compatible proxy, or unify observability across providers. When in doubt about whether to use this skill for any multi-provider LLM task, use it.
npx skillsauth add abhisheksharma-17/skills-graph litellmInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Call 100+ LLMs (OpenAI, Anthropic, Azure, Bedrock, Vertex, Ollama, etc.) with the OpenAI input/output format. SDK + self-hostable proxy gateway with routing, fallbacks, caching, observability, and cost tracking.
Source: docs.litellm.ai | Python SDK: v1.52.x | License: MIT
| Reference | File | Read When |
|-----------|------|-----------|
| Overview & Quickstart | references/00-overview.md | Getting started, install, core concepts, when to use SDK vs Proxy |
| Completion API | references/01-completion-api.md | litellm.completion(), params, messages, response shape |
| Providers & Models | references/02-providers.md | Provider prefixes, OpenAI, Anthropic, Bedrock, Vertex, Azure, Ollama |
| Streaming | references/03-streaming.md | Sync/async streaming, chunk parsing, stream wrappers |
| Async & Concurrency | references/04-async.md | acompletion, async batching, throughput patterns |
| Router (SDK) | references/05-router.md | Multi-deployment load balancing, routing strategies, model groups |
| Proxy Server | references/06-proxy-server.md | LiteLLM Proxy: config.yaml, virtual keys, /chat/completions endpoint |
| Fallbacks & Retries | references/07-fallbacks-retries.md | num_retries, fallback chains, context window fallbacks, timeout policy |
| Caching | references/08-caching.md | In-memory, Redis, S3 caching, semantic caching, cache controls |
| Observability | references/09-observability.md | Callbacks, Langfuse, Langsmith, OTEL, custom loggers |
| Cost Tracking | references/10-cost-tracking.md | Token counting, cost calculation, budgets, spend tracking |
| Structured Outputs & Tools | references/11-structured-outputs.md | JSON mode, response_format, function calling, tool_choice |
| Embeddings & Other APIs | references/12-embeddings.md | embedding(), image_generation(), transcription, moderation |
# SDK only
pip install litellm
# Proxy server with extras
pip install 'litellm[proxy]'
# Run proxy
litellm --config config.yaml --port 4000
tools
Open-source AI-native vector database for semantic search, hybrid search, RAG, and agent-driven workflows with multi-tenancy and model provider integrations. MANDATORY TRIGGERS: weaviate, Weaviate, weaviate-client, near_text, near_vector, hybrid search vector database, WeaviateClient, weaviate.connect, collections.create, query.near_text, query.hybrid, query.bm25. Also trigger when user wants to build semantic search, store vector embeddings, implement RAG pipelines, combine keyword and vector search, set up multi-tenant vector storage, use named vectors, or integrate vector DB with LLM providers. When in doubt about whether to use this skill for vector database or semantic search tasks, use it.
tools
End-to-end testing and browser automation framework by Microsoft — locators, assertions, fixtures, network mocking, visual testing, tracing, codegen, API testing, and CI/CD integration. MANDATORY TRIGGERS: playwright, Playwright, @playwright/test, page.goto, page.click, page.locator, getByRole, getByText, getByTestId, expect(page), toHaveScreenshot, playwright.config, npx playwright. Also trigger when user wants to write E2E tests, automate browser interactions, test across Chromium/Firefox/WebKit, mock network requests in tests, do visual regression testing, generate tests with codegen, or set up CI test pipelines. When in doubt about whether to use this skill for browser testing or E2E automation tasks, use it.
tools
OpenAI Agents SDK for building multi-agent workflows with tools, handoffs, guardrails, streaming, MCP, sessions, and tracing. MANDATORY TRIGGERS: openai-agents, openai agents sdk, openai-agents-python, openai agents, Runner.run, function_tool, handoff, guardrail, MCPServerStdio, MCPServerStreamableHttp, HostedMCPTool, RunContextWrapper, AgentHooks. Also trigger when user wants to build multi-agent systems with OpenAI models, create agent orchestration with handoffs, add guardrails to LLM applications, integrate MCP servers with agents, implement streaming agent responses, or use OpenAI's official agent framework. When in doubt about whether to use this skill for OpenAI agent tasks, use it.
tools
AI orchestration framework for building production-ready RAG applications, autonomous agents, and multimodal search systems. MANDATORY TRIGGERS: haystack, deepset, haystack-ai, haystack pipeline, haystack agent. Also trigger when the user wants to build RAG pipelines with modular components, create tool-calling agents with Haystack, orchestrate retrieval-augmented generation, build semantic search systems, or evaluate LLM pipelines. When in doubt about whether to use this skill for RAG orchestration or AI pipeline tasks, use it.