skills/qdrant/SKILL.md
High-performance vector search engine built in Rust for AI applications. MANDATORY TRIGGERS: qdrant, qdrant-client, vector search, vector database, similarity search, nearest neighbor search, ANN search, embedding search. Also trigger when user wants to store and query embeddings, build RAG retrieval pipelines, implement hybrid sparse+dense search, set up multitenancy for vector data, configure HNSW indexing or quantization, or manage vector collections. When in doubt about whether to use this skill for vector search tasks, use it.
npx skillsauth add abhisheksharma-17/skills-graph qdrantInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
AI-native vector similarity search engine written in Rust with rich filtering, hybrid queries, and quantization.
Source: qdrant.tech v1.17.1 | Package: qdrant-client (Python) | License: Apache 2.0
| Reference | File | Read When |
|-----------|------|-----------|
| Overview & Setup | references/00-overview.md | Getting started, installation, architecture, client initialization, quickstart |
| Collections | references/01-collections.md | Creating/managing collections, vector config, named vectors, aliases, distance metrics |
| Points | references/02-points.md | Upserting vectors, payloads, batch operations, scroll, delete, update payload |
| Search & Query API | references/03-search-query.md | Similarity search, query_points, search params, grouping, scoring |
| Filtering | references/04-filtering.md | Payload filters, match, range, geo, nested, boolean clauses, special conditions |
| Indexing | references/05-indexing.md | Payload indexes, HNSW config, full-text search, tenant/principal indexes |
| Quantization | references/06-quantization.md | Scalar, binary, product quantization, memory optimization, search tuning |
| Hybrid Search | references/07-hybrid-search.md | Sparse+dense fusion, prefetch, RRF, DBSF, multi-stage retrieval |
| Recommendation & Discovery | references/08-recommendation.md | Recommend API, positive/negative examples, discovery search, context pairs |
| Optimizer & Performance | references/09-optimizer.md | Optimizer config, bulk upload, memmap, segment management, monitoring |
| Snapshots & Backups | references/10-snapshots.md | Create/restore snapshots, S3 storage, full storage snapshots, recovery |
| Multitenancy | references/11-multitenancy.md | Tenant isolation, tenant indexes, tiered multitenancy, best practices |
| Deployment | references/12-deployment.md | Docker, Kubernetes, distributed mode, Qdrant Cloud, configuration |
# Python client
pip install qdrant-client
pip install 'qdrant-client[fastembed]' # with local embeddings
# Docker (primary development method)
docker pull qdrant/qdrant
docker run -p 6333:6333 -p 6334:6334 \
-v $(pwd)/qdrant_data:/qdrant/storage qdrant/qdrant
development
High-throughput LLM inference and serving engine with PagedAttention, continuous batching, and OpenAI-compatible API. MANDATORY TRIGGERS: vLLM, vllm, LLM serving, LLM inference engine, PagedAttention. Also trigger when the user wants to serve LLMs in production, deploy models with tensor parallelism, use speculative decoding, quantize models for inference, build OpenAI-compatible API servers, or optimize LLM throughput and latency. When in doubt about whether to use this skill for LLM serving tasks, use it.
tools
Type-safe Python agent framework for building production-grade GenAI applications with Pydantic validation, structured outputs, and dependency injection. MANDATORY TRIGGERS: pydantic-ai, pydantic_ai, PydanticAI, pydantic ai agent. Also trigger when the user wants to build type-safe AI agents in Python, create structured LLM outputs with Pydantic models, implement dependency injection for agents, use tools/capabilities with LLMs, or build multi-agent systems with Python type safety. When in doubt about whether to use this skill for Python AI agent tasks, use it.
development
Durable execution platform for building fault-tolerant workflows, long-running processes, and resilient distributed applications. MANDATORY TRIGGERS: temporal, temporal.io, temporalio, durable execution, workflow orchestration engine. Also trigger when the user wants to build fault-tolerant workflows, implement saga patterns, create long-running distributed processes, orchestrate microservices with retries and timeouts, or build durable AI agent pipelines. When in doubt about whether to use this skill for workflow orchestration or durable execution tasks, use it.
tools
AI framework for building RAG pipelines, agents, workflows, and data-augmented LLM applications with 300+ integrations. MANDATORY TRIGGERS: llamaindex, llama-index, llama_index, LlamaIndex, VectorStoreIndex, SimpleDirectoryReader, LlamaHub, LlamaParse. Also trigger when the user wants to build RAG applications with LlamaIndex, create document indexing pipelines, build agentic workflows with tool calling, implement structured data extraction from documents, or connect LLMs to custom data sources. When in doubt about whether to use this skill for RAG, document indexing, or LLM data augmentation tasks, use it.