skills/qdrant-scaling/scaling-qps/SKILL.md
Guides Qdrant query throughput (QPS) scaling. Use when someone asks 'how to increase QPS', 'need more throughput', 'queries per second too low', 'batch search', 'read replicas', or 'how to handle more concurrent queries'.
npx skillsauth add williamlimasilva/.copilot qdrant-scaling-qpsInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Throughput scaling means handling more parallel queries per second. This is different from latency - throughput and latency are opposite tuning directions and cannot be optimized simultaneously on the same node.
High throughput favors fewer, larger segments so each query touches less overhead.
default_segment_number: 2) Maximizing throughputalways_ram=true to reduce disk IO Quantizationoptimizer_cpu_budget to limit indexing CPUs (e.g. 2 on an 8-CPU node reserves 6 for queries)If a single node is saturated on CPU after applying the tuning above, scale horizontally with read replicas.
replication_factor: 2+ and route reads to replicas Distributed deploymentSee also Horizontal Scaling for general horizontal scaling guidance.
If it is not possible to keep all vectors in RAM, disk I/O can become the bottleneck for throughput. In this case:
io_uring on Linux (kernel 5.11+) io_uring articlecpu_count - 1, which is optimal for RAM-based search but may be too low for disk-based search. See configuration referencetools
Narrative and synthesis profile for Wiggins: framing, explanation, and audience-aware communication patterns for Ember sessions.
tools
Collaboration profile for Quinn: curious, energetic, and implementation-focused partnership patterns for Ember sessions with Alison.
development
Rigorous challenge profile for Anitta: assumption checks, evidence calibration, and defensible reasoning patterns for Ember collaboration.
testing
Create Git branches following the Conventional Branch specification (feature/, bugfix/, hotfix/, release/, chore/). Use when creating a new branch, naming a branch, or checking whether a branch name complies with the spec.