14 June 2026 · 36 min read
How I benchmarked an AI code editor without fooling myself: the research vs implementation distinction that makes a +19% headline misleading, the 5 of 6 CPython tasks where re-discovery dropped, and the limitations that decide whether any of it applies to you.
benchmarking
ai-code-editors
developer-tools
llm-evaluation
mcp
cpython
Read →
11 June 2026 · 42 min read
Why /compact kills precision, what the KV cache actually is, and how I built a working memory layer that survives session boundaries — with the bugs that shaped the final design.
working-memory
llm-context
kv-cache
mcp
developer-tools
Read →
9 June 2026 · 38 min read
How I built a local semantic code indexer for AI editors — covering AST chunking, hybrid vector+BM25 search, symbol graphs, and why naive approaches break on real codebases.
code-search
semantic-search
ast-parsing
embeddings
developer-tools
mcp
Read →
24 May 2026 · 40 min read
The token arithmetic behind system prompts, tool schemas, RAG retrieval, and conversation history — why the effective limit is below the advertised one, and four production strategies to manage it.
LLMs
Context Windows
Production AI
Token Budget
Deep Dive
Read →
24 May 2026 · 35 min read
A deep look at why attention dilution and positional bias make context stuffing counterproductive, and how AST chunking, hybrid BM25+dense retrieval, and RRF fusion fix it.
RAG
Code Assistants
LLMs
Production AI
Deep Dive
Read →
23 May 2026 · 90 min read
From tokenization and transformers to cosine similarity, HNSW graph search, RAG pipelines, and LLM training costs. The deepest guide on the web — with 7 interactive demos.
Embeddings
LLMs
RAG
Vector Search
Deep Dive
Read →
22 May 2026 · 30 min read
How AI and LLMs automate DPDP Act compliance — classifying data principal requests, drafting multilingual replies in 5 Indian languages, tracking 30-day SLAs, and generating audit-ready evidence trails.
DPDP
AI / LLM
India Privacy
Compliance
Open Source
Read →