.claude/skills/ts-chromadb/SKILL.md
Assists with storing, searching, and managing vector embeddings using ChromaDB. Use when building RAG pipelines, semantic search engines, or recommendation systems. Trigger words: chromadb, chroma, vector database, embeddings, semantic search, similarity search, vector store, rag.
npx skillsauth add eliferjunior/Claude chromadbInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
ChromaDB is an open-source vector database for storing, searching, and managing embeddings. It provides a simple API for document ingestion, semantic similarity search, and metadata filtering, supporting both Python and JavaScript/TypeScript clients with embedded, server, and cloud deployment options.
get_or_create_collection for idempotent collection setup, choose PersistentClient for development and HttpClient for production server connections.add() calls in chunks of 5,000 documents, always store source metadata (filename, URL, page number) for RAG citations, and use upsert() for incremental updates to avoid duplicates.collection.query(query_texts=..., n_results=...) for text-based search, combine metadata where filters to narrow results before semantic search, and set n_results based on the LLM's context window (5-10 for most RAG pipelines).$eq, $gt, $in with $and/$or logical operators, and combine with where_document for content-based filtering alongside semantic similarity.PersistentClient for single-node applications, Docker for server mode, or Chroma Cloud for managed hosting with multi-tenancy support.hnsw:M, hnsw:construction_ef, hnsw:search_ef) for the quality-speed tradeoff and choose cosine distance for normalized embeddings (OpenAI, Cohere).User request: "Set up a RAG pipeline with ChromaDB for answering questions about our docs"
Actions:
upsert() for idempotent ingestioncollection.query() and pass retrieved chunks as context to the LLMOutput: A semantic search pipeline that retrieves relevant document chunks for LLM-powered Q&A.
User request: "Implement product search that combines text similarity with category filters"
Actions:
query_texts with where={"category": "electronics"}$gte and $lte operatorsOutput: A filtered semantic search that narrows by metadata before ranking by text similarity.
get_or_create_collection for idempotent collection initialization; it is safe for restarts.add() calls in chunks of 5,000 documents to manage memory usage.upsert() for incremental updates to avoid duplicate documents when re-ingesting.n_results based on the LLM's context window: 5-10 results for most RAG pipelines.cosine distance for normalized embeddings (OpenAI, Cohere) and l2 for unnormalized.development
Expert guidance for Fireworks AI, the platform for running open-source LLMs (Llama, Mixtral, Qwen, etc.) with enterprise-grade speed and reliability. Helps developers integrate Fireworks' inference API, fine-tune models, and deploy custom model endpoints with function calling and structured output support.
development
Convert any website into clean, structured data with Firecrawl — API-first web scraping service. Use when someone asks to "turn a website into markdown", "scrape website for LLM", "Firecrawl", "extract website content as clean text", "crawl and convert to structured data", or "scrape website for RAG". Covers single-page scraping, full-site crawling, structured extraction, and LLM-ready output.
tools
Expert guidance for Firebase, Google's platform for building and scaling web and mobile applications. Helps developers set up authentication, Firestore/Realtime Database, Cloud Functions, hosting, storage, and analytics using Firebase's SDK and CLI.
development
When the user needs to build file upload functionality for a web application. Use when the user mentions "file upload," "image upload," "upload endpoint," "multipart upload," "presigned URL," "S3 upload," "file validation," "upload to cloud storage," or "accept user files." Handles upload endpoints, file validation (type, size, magic bytes), cloud storage integration, and upload status tracking. For image/video processing after upload, see media-transcoder.