Chunking strategies, metadata filters, and hybrid search (BM25 + vectors)
How to chunk your data, filter it smartly, and combine BM25 with vectors for reliable retrieval.
6 posts
How to chunk your data, filter it smartly, and combine BM25 with vectors for reliable retrieval.
Practical RAG patterns that work in production and how to evaluate them with recall, groundedness, and latency.
A practical primer on cosine similarity and common ANN indexes: HNSW, IVF, and PQ — with small, concrete examples.
Getting Started with Vector Search
Build a streaming chat endpoint with Server-Sent Events (SSE) in ASP.NET Core and wire it to Semantic Kernel for token-by-token output.
What Semantic Kernel is, how it works, which languages it supports, and where I'm using it.