Blogs

A collection of my thoughts on AI, engineering, and more.

2026
🔤 Wordmaze: a one-shot, verifier-graded puzzle for RL
🤖 Claude Managed Agents: sharp edges and gotchas
✈️ IndiGo: India's Affordable Growth Carrier, by the Numbers
💸 A Practical Cost Checklist for Agent and Harness Engineering
🔍 Agents Can Reason. They Still Can't Really Search.
2025
📏 Bits per byte (BPB) for LLMs: tokenizer-agnostic loss
🎨 Creativity Is a Luxury
🔀 GPT-5 router: why chat apps will route across models
🧠 Instruction Aware Embeddings
🔍 How to improve RAG retrieval: recall, precision, and NDCG
2024
☁️ AWS Bedrock Converse API: one endpoint for Claude, Llama, and Mistral
💾 Database audit fields: created_at, updated_at, created_by, updated_by
2023
KV cache in GPT: how it speeds up transformer inference
2021
👨‍💻 LC contest problems summary
📝 Hugo commands