9 June 2026 · 38 min read
How I built a local semantic code indexer for AI editors — covering AST chunking, hybrid vector+BM25 search, symbol graphs, and why naive approaches break on real codebases.
code-search
semantic-search
ast-parsing
embeddings
developer-tools
mcp
Read →
24 May 2026 · 40 min read
The token arithmetic behind system prompts, tool schemas, RAG retrieval, and conversation history — why the effective limit is below the advertised one, and four production strategies to manage it.
LLMs
Context Windows
Production AI
Token Budget
Deep Dive
Read →
24 May 2026 · 35 min read
A deep look at why attention dilution and positional bias make context stuffing counterproductive, and how AST chunking, hybrid BM25+dense retrieval, and RRF fusion fix it.
RAG
Code Assistants
LLMs
Production AI
Deep Dive
Read →
23 May 2026 · 90 min read
From tokenization and transformers to cosine similarity, HNSW graph search, RAG pipelines, and LLM training costs. The deepest guide on the web — with 7 interactive demos.
Embeddings
LLMs
RAG
Vector Search
Deep Dive
Read →
22 May 2026 · 30 min read
How AI and LLMs automate DPDP Act compliance — classifying data principal requests, drafting multilingual replies in 5 Indian languages, tracking 30-day SLAs, and generating audit-ready evidence trails.
DPDP
AI / LLM
India Privacy
Compliance
Open Source
Read →