Compact Hypercube Embeddings for Fast Retrieval

This skill teaches Claude to design and implement retrieval systems that use compact binary hypercube embeddings instead of dense floating-point vectors. The core idea, from the Cross-View Alignment Hashing (CVA-Hash) framework, is to project pretrained encoder outputs through a learned hashing layer, binarize them with a sign function, and search in Hamming space instead of Euclidean/cosine space. Relative to a 512-d float32 embedding (16,384 bits), codes of 512 down to 64 bits give a 32-256x memory reduction, and brute-force search over millions of items takes only milliseconds using bitwise XOR + popcount operations.

When to Use

When the user needs to build a text-to-image, text-to-audio, or cross-modal retrieval system and is concerned about latency or memory at scale (100K+ items).
When the user asks how to compress embeddings from a foundation model (CLIP, BioCLIP, BioLingual, SentenceTransformers, etc.) into compact binary codes for faster search.
When the user has a working embedding-based search pipeline but wants to reduce vector storage from float32 to binary without catastrophic quality loss.
When the user is building a biodiversity monitoring system, wildlife observation database, or ecological archive that needs efficient text-based retrieval over images or audio.
When the user wants to add a hashing layer on top of a frozen or LoRA-adapted encoder for approximate nearest neighbor search.
When the user asks about alternatives to FAISS IVF/HNSW that are simpler to deploy and have deterministic performance.

Key Technique

Cross-View Alignment Hashing (CVA-Hash) takes two pretrained encoders (e.g., a text encoder and an image encoder from CLIP) and attaches a lightweight hashing head to each. Each head is a small MLP that projects the encoder's continuous embedding (e.g., 512-d float32) down to a target bit length K (e.g., 64 bits), then applies a sign() function to produce binary codes in {-1, +1}^K. At inference, these are stored as packed bit arrays. Retrieval becomes a Hamming distance computation: XOR two bit strings and count the 1s. Per 64-bit word this costs one XOR and one popcount instruction, making brute-force search over millions of codes feasible in milliseconds.
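As a minimal sketch of the packing and distance step described above (pure Python; the helper names `pack_signs` and `hamming` are hypothetical and not from the paper), a sign vector can be packed into an integer and compared with XOR + popcount. For codes in {-1, +1}^K the inner product and the Hamming distance are tied by dot = K - 2 * hamming, so ranking by Hamming distance gives the same order as ranking by inner product of the sign vectors.

```python
def pack_signs(signs):
    """Pack a sequence of -1/+1 values into an int, one bit per entry (+1 -> 1)."""
    word = 0
    for s in signs:
        word = (word << 1) | (1 if s > 0 else 0)
    return word

def hamming(a, b):
    """Hamming distance between two packed codes: XOR, then count set bits."""
    return bin(a ^ b).count("1")

# Two toy 8-bit codes (illustrative values only).
u = [+1, -1, +1, +1, -1, -1, +1, -1]
v = [+1, +1, +1, -1, -1, -1, +1, +1]

d = hamming(pack_signs(u), pack_signs(v))
dot = sum(x * y for x, y in zip(u, v))
print(d, dot)  # 3 2
```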

Training uses three losses jointly: (1) A contrastive alignment loss that pulls matching text-image (or text-audio) hash codes together and pushes non-matching codes apart in Hamming space, mirroring the InfoNCE objective used in CLIP but operating on binary codes. (2) A quantization loss that penalizes the gap between the continuous pre-sign activations and their binarized outputs, encouraging the network to produce activations near +1 or -1 so binarization loses minimal information. (3) Optionally, a bit-balance regularization that encourages each bit to be +1 or -1 with roughly equal probability across the dataset, maximizing the information entropy of the code. During training, the sign function's zero gradient is bypassed using the straight-through estimator (STE): gradients flow through sign() as if it were the identity function.

Parameter-efficient fine-tuning (PEFT) keeps the pretrained encoders mostly frozen. The paper applies LoRA adapters (rank 4-16) to the encoder's attention layers and trains only the LoRA weights plus the hashing head. This means the entire adaptation can be done on a single GPU in hours, not days. The resulting system inherits the zero-shot generalization of the foundation model while gaining efficient binary retrieval. Crucially, the hashing objective was found to improve the underlying encoder representations, yielding better retrieval even when evaluated with continuous embeddings.

Step-by-Step Workflow

Select foundation encoders for each modality. For text-image retrieval, use CLIP or BioCLIP. For text-audio, use BioLingual or CLAP. Load them with their pretrained weights and freeze all parameters initially. Identify the embedding dimension (e.g., 512 or 768).
Design the hashing head. Create a small MLP per modality: Linear(embed_dim, embed_dim) -> BatchNorm -> ReLU -> Linear(embed_dim, K) where K is the target hash bit length (32, 64, or 128). At inference, apply sign() to the K-dimensional output. During training, use the straight-through estimator.

Implement the straight-through estimator for sign(). In PyTorch:

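import torch

class SignSTE(torch.autograd.Function):
    """sign() in the forward pass, identity gradient in the backward pass."""
    @staticmethod
    def forward(ctx, x):
        return x.sign()

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output  # pass gradient through unchanged

# Usage: codes = SignSTE.apply(h)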

Implement the training losses. Combine three terms:
- Alignment loss: Contrastive loss (InfoNCE) computed on the binary codes of text-observation pairs. Use cosine similarity in Hamming space (inner product of sign vectors divided by K).
- Quantization loss: mean(abs(abs(h) - 1)) where h is the pre-sign continuous output, penalizing values near zero.
- Bit balance loss: mean(abs(mean(codes, dim=0))) — penalizes bits that are consistently +1 or -1 across the batch.
- Weight them as: L = L_align + alpha * L_quant + beta * L_balance with alpha=0.1, beta=0.01 as starting points.
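The three terms above can be sketched in NumPy as a forward-only computation, useful for checking loss values on a batch. This is a rough illustration under stated assumptions, not the paper's implementation: the function name `cva_hash_losses`, the symmetric (text-to-observation and observation-to-text) form of InfoNCE, the averaging of the quantization and balance terms over both modalities, and the temperature `tau=0.07` are choices made here. In actual training the same expressions would be written in PyTorch with the straight-through estimator so gradients reach the hashing heads.

```python
import numpy as np

def cva_hash_losses(h_text, h_obs, tau=0.07, alpha=0.1, beta=0.01):
    """Forward-only loss values for pre-sign activations of shape (B, K)."""
    K = h_text.shape[1]
    b_text, b_obs = np.sign(h_text), np.sign(h_obs)

    # Alignment: InfoNCE on sign-vector similarities (inner product / K, in [-1, 1]).
    logits = (b_text @ b_obs.T) / K / tau          # (B, B), matching pairs on the diagonal

    def cross_entropy_diag(z):
        z = z - z.max(axis=1, keepdims=True)       # numerical stability
        log_prob = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_prob))

    l_align = 0.5 * (cross_entropy_diag(logits) + cross_entropy_diag(logits.T))

    # Quantization: distance of |h| from 1.
    l_quant = 0.5 * (np.mean(np.abs(np.abs(h_text) - 1))
                     + np.mean(np.abs(np.abs(h_obs) - 1)))

    # Bit balance: each bit's mean over the batch should be close to 0.
    l_balance = 0.5 * (np.mean(np.abs(b_text.mean(axis=0)))
                       + np.mean(np.abs(b_obs.mean(axis=0))))

    total = l_align + alpha * l_quant + beta * l_balance
    return total, l_align, l_quant, l_balance

# Toy batch: observation activations are a lightly perturbed copy of the text ones.
rng = np.random.default_rng(0)
h = rng.normal(size=(16, 64))
total, l_align, l_quant, l_balance = cva_hash_losses(h, h + 0.01 * rng.normal(size=h.shape))
```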
Attach LoRA adapters to the encoders (optional but recommended). Use the peft library to add rank-8 LoRA to the query/value projection matrices of the encoder's transformer layers. This unfreezes ~1-2% of parameters and significantly improves hash quality over training only the hashing head.
Train on paired data. Feed text-observation pairs through their respective encoders + hashing heads. Use a batch size of 256-1024 with in-batch negatives for the contrastive loss. Train for 10-30 epochs with AdamW (lr=1e-4 for hashing heads, lr=1e-5 for LoRA weights). Use cosine annealing.
Binarize and pack the database embeddings. After training, encode every item in the database through the observation encoder + hashing head + sign(). Convert {-1, +1} to {0, 1} and pack the bits with numpy.packbits (uint8 bytes) or into uint64 words. A 64-bit code per item means 1 million items = 8 MB.
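A small sketch of the packing step and its storage arithmetic, using random codes in place of real hashing-head outputs (the helper name `pack_codes` is hypothetical):

```python
import numpy as np

def pack_codes(codes):
    """(N, K) array of -1/+1 values -> (N, K // 8) uint8 array of packed bits."""
    bits = (codes > 0).astype(np.uint8)     # +1 -> 1, -1 -> 0
    return np.packbits(bits, axis=1)        # 8 bits per byte

rng = np.random.default_rng(0)
codes = rng.choice([-1, 1], size=(1000, 64))   # stand-in for sign() outputs
packed = pack_codes(codes)

bytes_per_item = packed.shape[1]               # 64 bits / 8 = 8 bytes
print(packed.shape, packed.nbytes)             # (1000, 8) 8000
print(bytes_per_item * 1_000_000 / 1e6)        # 8.0 (MB for one million items)
```

Unpacking with `np.unpackbits(packed, axis=1)` recovers the original {0, 1} bits, so the packed form loses nothing relative to the sign codes.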

Implement Hamming distance search. At query time, encode the text query the same way, pack its bits, and compute Hamming distance against all database codes using XOR + popcount:

import numpy as np
def hamming_search(query_bits, db_bits):
    # query_bits: (K//8,) uint8, db_bits: (N, K//8) uint8
    xor = np.bitwise_xor(db_bits, query_bits)
    distances = np.unpackbits(xor, axis=1).sum(axis=1)
    return np.argsort(distances)
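The function above expands every byte into eight values before summing and then sorts the whole database. One possible alternative, sketched here as an assumption rather than the paper's method, counts bits with a 256-entry lookup table and selects only the top k with `np.argpartition`. The names `POPCOUNT` and `hamming_topk` are hypothetical.

```python
import numpy as np

# 256-entry table: POPCOUNT[b] is the number of set bits in byte b.
POPCOUNT = np.array([bin(b).count("1") for b in range(256)], dtype=np.uint8)

def hamming_topk(query_bits, db_bits, k):
    """Return indices of the k nearest codes and their Hamming distances."""
    xor = np.bitwise_xor(db_bits, query_bits)             # (N, K // 8) uint8
    distances = POPCOUNT[xor].sum(axis=1, dtype=np.int32)
    k = min(k, len(distances))
    candidates = np.argpartition(distances, k - 1)[:k]    # unordered top-k
    order = np.argsort(distances[candidates], kind="stable")
    top = candidates[order]
    return top, distances[top]

# Toy database of random 64-bit codes; the query is a copy of item 42.
rng = np.random.default_rng(0)
db = rng.integers(0, 256, size=(10_000, 8), dtype=np.uint8)
query = db[42].copy()
idx, dist = hamming_topk(query, db, k=5)
```

Avoiding the full sort matters mostly for large N, since only the k selected candidates are ordered.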

Evaluate with mAP@K and Precision@K. Compare binary retrieval against continuous-embedding retrieval (cosine similarity on the raw encoder outputs) using mean Average Precision at K=1, 5, 10, 20. Expect binary retrieval to reach 85-100% of continuous performance at 64+ bits while being 50-100x faster.
Deploy as a two-stage pipeline (optional). For maximum quality, use Hamming search over the binary codes to retrieve the top-100 candidates, then re-rank them with continuous embeddings. This gives near-exact retrieval quality at close to the speed of binary-code search.
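For the evaluation step, a minimal pure-Python sketch of Precision@K and mAP@K is given here. Definitions of AP@K vary between papers; this version normalizes by the number of relevant items found in the top K, which is an assumption and may differ from the paper's protocol. Each query is represented by a list of 0/1 relevance flags in ranked order, and the function names are hypothetical.

```python
def precision_at_k(ranked_relevance, k):
    """Fraction of the top-k results that are relevant."""
    return sum(ranked_relevance[:k]) / k

def average_precision_at_k(ranked_relevance, k):
    """Mean of precision@i over the ranks i <= k where a relevant item appears."""
    hits, score = 0, 0.0
    for i, rel in enumerate(ranked_relevance[:k], start=1):
        if rel:
            hits += 1
            score += hits / i
    return score / hits if hits else 0.0

def mean_average_precision_at_k(all_ranked_relevance, k):
    """Average of AP@k over all queries."""
    aps = [average_precision_at_k(r, k) for r in all_ranked_relevance]
    return sum(aps) / len(aps)

# Two toy queries with relevance flags for their top-5 results.
queries = [
    [1, 0, 1, 0, 0],
    [0, 1, 0, 0, 1],
]
print(precision_at_k(queries[0], 5))              # 0.4
print(mean_average_precision_at_k(queries, 5))
```

Running the same functions on rankings from binary codes and from continuous embeddings gives the relative-quality comparison described in the evaluation step.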

Concrete Examples

Example 1: Building a wildlife image search engine

User: "I have 500K iNaturalist images with CLIP embeddings stored as float32 vectors. Search is too slow and uses 1GB of RAM. Help me compress these for faster retrieval."

Approach:

Load the pretrained CLIP model and attach a 64-bit hashing head to both the image and text encoders.
Fine-tune with LoRA on a subset of image-caption pairs from iNaturalist (or use the existing CLIP alignment — even training the hashing head alone on 50K pairs works).
Encode all 500K images through the image encoder + hashing head + sign(), producing 64-bit binary codes.
Pack into a numpy uint8 array of shape (500000, 8) — total 4 MB vs the original 1 GB.
At query time, encode text through the text encoder + hashing head + sign(), compute Hamming distances via XOR + popcount.

Output:

import torch
import numpy as np
from transformers import CLIPModel, CLIPProcessor

# 1. Load CLIP (with its processor) and attach hashing heads
clip = CLIPModel.from_pretrained("openai/clip-vit-base-patch32").eval()
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")
HASH_BITS = 64

class SignSTE(torch.autograd.Function):
    # sign() forward, identity gradient backward (straight-through estimator)
    @staticmethod
    def forward(ctx, x):
        return x.sign()
    @staticmethod
    def backward(ctx, grad_output):
        return grad_output

class HashingHead(torch.nn.Module):
    def __init__(self, input_dim, hash_bits):
        super().__init__()
        self.fc = torch.nn.Sequential(
            torch.nn.Linear(input_dim, input_dim),
            torch.nn.BatchNorm1d(input_dim),
            torch.nn.ReLU(),
            torch.nn.Linear(input_dim, hash_bits),
        )
    def forward(self, x):
        h = self.fc(x)
        if self.training:
            return h, SignSTE.apply(h)  # pre-sign activations and STE codes
        return h.sign()

text_hash = HashingHead(512, HASH_BITS)
image_hash = HashingHead(512, HASH_BITS)
# Train both heads here (see the workflow above), then switch to eval mode so
# forward() returns codes only and BatchNorm uses its running statistics.
text_hash.eval()
image_hash.eval()

def pack_codes(codes):
    # {-1, +1} -> {0, 1} -> packed uint8, 8 bits per byte
    return np.packbits((codes > 0).cpu().numpy().astype(np.uint8), axis=1)

# 2. Encode entire database → binary
db_codes = []
with torch.no_grad():
    for batch in dataloader:  # placeholder: yields preprocessed pixel_values tensors
        emb = clip.get_image_features(pixel_values=batch)
        codes = image_hash(emb)  # (B, 64) in {-1, +1}
        db_codes.append(pack_codes(codes))  # (B, 8) uint8
db_codes = np.concatenate(db_codes)  # (500000, 8) = 4 MB

# 3. Query
with torch.no_grad():
    inputs = processor(text=["red-tailed hawk flying"], return_tensors="pt", padding=True)
    query_emb = clip.get_text_features(**inputs)
    query_bits = pack_codes(text_hash(query_emb))  # (1, 8) uint8
distances = np.unpackbits(np.bitwise_xor(db_codes, query_bits), axis=1).sum(1)
top_k = np.argsort(distances)[:20]

Example 2: Adding binary search to an existing audio monitoring pipeline

User: "I have a BioLingual model encoding bird call spectrograms. I want to search 2 million recordings by text description like 'woodpecker drumming on dead tree'. Current FAISS index is 3 GB."

Approach:

Attach 128-bit hashing heads to BioLingual's audio and text encoders.
Train on paired audio-text data with the CVA-Hash losses (alignment + quantization + balance).
Encode all 2M audio recordings into 128-bit codes. Storage: 2M * 16 bytes = 32 MB (vs 3 GB).
Search with Hamming distance. For 2M items at 128 bits, brute-force XOR + popcount runs in ~5ms on a single CPU core.
Re-rank top-100 with continuous BioLingual embeddings for maximum precision.
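The shortlist-then-re-rank steps above can be sketched with NumPy as follows. The function name `two_stage_search` and the toy data are assumptions for illustration: the binary codes here are just the signs of the first 64 embedding dimensions, standing in for trained hashing-head outputs, and the embeddings are random rather than BioLingual features.

```python
import numpy as np

def two_stage_search(query_bits, query_emb, db_bits, db_emb, n_candidates=100, k=10):
    """Stage 1: Hamming shortlist. Stage 2: cosine re-rank of the shortlist only."""
    xor = np.bitwise_xor(db_bits, query_bits)
    distances = np.unpackbits(xor, axis=1).sum(axis=1)
    n_candidates = min(n_candidates, len(distances))
    shortlist = np.argpartition(distances, n_candidates - 1)[:n_candidates]

    cand = db_emb[shortlist]
    cand = cand / np.linalg.norm(cand, axis=1, keepdims=True)
    q = query_emb / np.linalg.norm(query_emb)
    scores = cand @ q                       # cosine similarity on the shortlist
    order = np.argsort(-scores)[:k]
    return shortlist[order], scores[order]

# Toy data: the query is a slightly perturbed copy of item 7.
rng = np.random.default_rng(0)
db_emb = rng.normal(size=(5000, 128)).astype(np.float32)
db_bits = np.packbits(db_emb[:, :64] > 0, axis=1)
query_emb = db_emb[7] + 0.05 * rng.normal(size=128).astype(np.float32)
query_bits = np.packbits(query_emb[:64] > 0)
ids, scores = two_stage_search(query_bits, query_emb, db_bits, db_emb)
```

Only the shortlisted continuous embeddings are touched in stage 2, so they can live on disk or in a slower store while the packed codes stay in RAM.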

Output:

# Memory comparison
continuous_storage = 2_000_000 * 768 * 4  # ~6.1 GB as raw float32, assuming 768-d embeddings
binary_storage = 2_000_000 * 16           # 32 MB (128-bit codes)
compression_ratio = continuous_storage / binary_storage  # 192x
# Against the 3 GB FAISS index from the question, the ratio is roughly 94x.

# Search speed comparison (single-threaded, rough estimates rather than measurements)
# Continuous cosine similarity: ~2000ms for 2M @ 768-d
# Binary Hamming distance:     ~5ms for 2M @ 128-bit
# Speedup: ~400x

Example 3: Zero-shot domain transfer for soundscape monitoring

User: "I trained hash codes on iNatSounds but need to deploy on a rainforest soundscape dataset I don't have labels for. Will it generalize?"

Approach:

The CVA-Hash framework inherits zero-shot capabilities from the pretrained encoder. The paper shows that the hashing objective actually improves zero-shot generalization compared to the base encoder.
Encode the new soundscape recordings with the same audio encoder + hashing head. No retraining needed.
Query with natural language descriptions of target species or sounds.
Monitor retrieval quality with a small manually-labeled subset. If mAP drops significantly, fine-tune the LoRA adapters on a few hundred labeled pairs from the new domain.

Best Practices

Do: Start with 64-bit codes as a default. This balances quality and compactness well for most datasets under 10M items. Go to 128 bits for 10M+ or when you need near-continuous-embedding quality.
Do: Use the quantization loss during training. Without it, the continuous activations cluster near zero, and binarization destroys information. Target values should be pushed toward +/-1.
Do: Train with large batch sizes (512+). The contrastive alignment loss needs enough in-batch negatives to learn discriminative codes.
Do: Evaluate with both binary-only and two-stage (binary retrieve + continuous re-rank) pipelines to understand the quality-speed tradeoff.
Avoid: Using hash codes shorter than 32 bits for real retrieval tasks. Below 32 bits, too many unrelated items collide in Hamming space.
Avoid: Skipping the straight-through estimator during training. Without STE, gradients cannot flow through the sign function, and the hashing head will not learn.
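Avoid: Applying L2 normalization to the hashing head output. Normalization leaves the signs unchanged, but it rescales the pre-sign activations to a magnitude of about 1/sqrt(K), which works against the quantization loss pushing them toward +/-1. The sign function already places the codes on the hypercube vertices.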
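To get a feel for the bit-length advice above, the following sketch estimates how many unrelated items fall within a small Hamming radius of a query. It assumes database codes are independent and uniformly random, which is an idealization (real codes are correlated, so actual collision counts will usually be higher). The function name `expected_neighbors` and the choice of radius = bits // 8 are illustrative assumptions.

```python
from math import comb

def expected_neighbors(n_items, n_bits, radius):
    """Expected count of unrelated items within `radius` bits of a query,
    assuming independent, uniformly random database codes."""
    ball = sum(comb(n_bits, i) for i in range(radius + 1))  # codes within the radius
    return n_items * ball / 2 ** n_bits

for bits in (16, 32, 64):
    print(bits, expected_neighbors(1_000_000, bits, radius=bits // 8))
```

Under these assumptions the count drops from thousands of accidental neighbors at 16 bits to far below one at 64 bits for a million items.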

Error Handling

Hash codes are all identical or near-identical: The quantization loss coefficient is too low, or learning rate is too high. Lower lr to 1e-5 and increase alpha (quantization weight) to 0.5. Also check that BatchNorm is present in the hashing head.
Retrieval quality is much worse than continuous baselines: Try increasing bit length from 64 to 128. If still poor, ensure the contrastive loss uses temperature scaling (tau=0.07 is a good default) and that in-batch negatives are sufficient (batch size >= 256).
Training loss explodes or NaN: The STE can cause instability with high learning rates. Use gradient clipping (max_norm=1.0) and warmup for the first 5% of training steps.
Bit balance loss dominates early training: Use a warm-up schedule for the balance loss coefficient — start at 0 and linearly increase to beta over the first 3 epochs.
Hamming search returns too many ties: At low bit lengths (32), many items will share the same Hamming distance. Break ties using a secondary score (e.g., continuous embedding similarity on tied candidates only).
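For the tie-breaking case above, one simple option is a two-key sort: Hamming distance first, then a secondary score. This sketch uses `np.lexsort`; the function name `rank_with_tiebreak` and the example values are hypothetical.

```python
import numpy as np

def rank_with_tiebreak(distances, secondary_scores):
    """Sort by Hamming distance (ascending); break ties by secondary score (descending)."""
    # np.lexsort treats the LAST key as the primary sort key.
    return np.lexsort((-np.asarray(secondary_scores), np.asarray(distances)))

distances = np.array([3, 1, 3, 1, 2])
cosine = np.array([0.20, 0.50, 0.90, 0.70, 0.10])   # illustrative values
print(rank_with_tiebreak(distances, cosine))         # [3 1 4 2 0]
```

In practice the secondary scores would be computed only for the shortlisted candidates, so the continuous embeddings are not scanned for the whole database.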

Limitations

Binary codes are inherently lossy. For tasks requiring fine-grained distinction between very similar items (e.g., differentiating subspecies), continuous embeddings with exact search will outperform binary retrieval.
The framework requires paired cross-modal data for training (text-image or text-audio pairs). If you only have unimodal data, you cannot train the alignment loss — consider using a teacher model to generate pseudo-pairs.
Brute-force Hamming search scales linearly with database size. Beyond ~50M items, even search over binary codes may need indexing structures (multi-index hashing or hash tables) to stay under 100ms.
The quality of hash codes is bounded by the quality of the underlying foundation model. If the base encoder cannot distinguish two concepts in continuous space, the hash codes will not either.
LoRA fine-tuning assumes the pretrained encoder is a reasonable starting point for the target domain. For highly specialized domains (e.g., underwater sonar), full fine-tuning or a domain-specific encoder may be necessary.
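The multi-index hashing idea mentioned in the scaling limitation can be sketched in pure Python. Each code is split into m disjoint chunks with one hash table per chunk. By the pigeonhole principle, two codes that differ in at most r bits with r < m must agree exactly on at least one chunk, so looking up the query's chunks yields a complete candidate set that is then verified with the full Hamming distance. This is a simplified illustration with hypothetical names (`split`, `build_index`, `search`); it assumes the bit length is divisible by the number of chunks and does not handle larger radii, which need lookups of nearby chunk values.

```python
from collections import defaultdict

def split(code, n_bits, n_chunks):
    """Split an n_bits-wide integer code into n_chunks equal-width chunks."""
    width = n_bits // n_chunks
    mask = (1 << width) - 1
    return [(code >> (i * width)) & mask for i in range(n_chunks)]

def build_index(codes, n_bits, n_chunks):
    """One hash table per chunk position, mapping chunk value -> item ids."""
    tables = [defaultdict(list) for _ in range(n_chunks)]
    for item_id, code in enumerate(codes):
        for table, chunk in zip(tables, split(code, n_bits, n_chunks)):
            table[chunk].append(item_id)
    return tables

def search(query, codes, tables, n_bits, radius):
    """All items within `radius` bits of the query; exact when radius < n_chunks."""
    n_chunks = len(tables)
    if radius >= n_chunks:
        raise ValueError("this sketch requires radius < number of chunks")
    candidates = set()
    for table, chunk in zip(tables, split(query, n_bits, n_chunks)):
        candidates.update(table.get(chunk, ()))
    # Verify candidates with the full Hamming distance.
    return sorted(i for i in candidates if bin(codes[i] ^ query).count("1") <= radius)

codes = [0x0F0F, 0x0F0E, 0xF0F0, 0x0B0D, 0xFFFF]   # toy 16-bit codes
tables = build_index(codes, n_bits=16, n_chunks=4)
print(search(0x0F0F, codes, tables, n_bits=16, radius=2))  # [0, 1, 3]
```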

Reference

Paper: Compact Hypercube Embeddings for Fast Text-based Wildlife Observation Retrieval (Moummad et al., 2026). Focus on Section 3 (CVA-Hash framework), Section 4 (loss functions and training), and Tables 1-3 (mAP comparisons across bit lengths showing binary codes matching or exceeding continuous baselines).
