What's Really Happening When You Talk to an AI
Tokens, transformers, context window, system prompt, tools: the conceptual foundations you need to actually understand how ChatGPT, Claude, or Gemini work. No equations.
The essays lay down the principles. These articles put them into practice.
Each guide explores a technical topic in depth — RAG architecture, prompt engineering, evaluations, fine-tuning — with code examples, decision trees, and lessons drawn from production systems like WHOOP Coach and Cursor.
An inventory of the techniques that fill the window, the phenomena that degrade it, and the heuristics to master it. And along the way, the most expensive anti-pattern in production agents.
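The simplest window-management heuristic the guide's topic implies can be sketched in a few lines: keep the system prompt, then admit the most recent turns that fit a token budget. This is an illustrative toy, not the article's code — `count_tokens` is a crude word-count stand-in for a real tokenizer, and production systems would also summarize or log what they evict.

```python
def count_tokens(text: str) -> int:
    # Crude stand-in: a real system would use the model's tokenizer.
    return len(text.split())

def fit_context(system: str, history: list[str], budget: int) -> list[str]:
    # Walk the history newest-first, keeping turns until the budget
    # is exhausted; the system prompt is always retained.
    kept: list[str] = []
    used = count_tokens(system)
    for turn in reversed(history):
        cost = count_tokens(turn)
        if used + cost > budget:
            break
        kept.append(turn)
        used += cost
    return [system] + list(reversed(kept))
```

With a budget of 5 toy tokens, `fit_context("sys", ["a b", "c d e", "f"], 5)` keeps only the two most recent turns.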
The full RAG pipeline — chunking, embedding, retrieval, reranking — and the production concerns that separate prototypes from systems that work.
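The four stages named above — chunking, embedding, retrieval, reranking — can be sketched end to end. This is a dependency-free toy: the "embedding" is a bag-of-words counter standing in for a real embedding model, and fixed-size chunking stands in for semantic splitting; every name here is illustrative, not from the guide itself.

```python
import math
from collections import Counter

def chunk(text: str, size: int = 40) -> list[str]:
    # Naive fixed-size chunking; real pipelines split on semantic
    # boundaries (headings, paragraphs) instead.
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding"; a production system calls an
    # embedding model here.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    # Rank every chunk against the query and keep the top k; a
    # reranker would re-score this shortlist with a stronger model.
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]
```

The structure survives the upgrade to real components: swap `embed` for a model call and `retrieve` for a vector index, and the pipeline shape is unchanged.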
A practical guide to multi-agent patterns — orchestrator-workers, pipelines, ensembles, and swarms — and where they break.
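Of the patterns listed, orchestrator-workers is the easiest to sketch: one coordinator decomposes a task, fans the pieces out to workers in parallel, and aggregates the results. The `worker` below is a placeholder for an LLM call, and the decomposition is hard-coded — a real orchestrator would let a model decide the split.

```python
from concurrent.futures import ThreadPoolExecutor

def worker(subtask: str) -> str:
    # Stand-in for an LLM call handling one subtask.
    return f"result({subtask})"

def orchestrator(task: str) -> str:
    # Decompose, fan out in parallel, aggregate: the
    # orchestrator-workers pattern in miniature.
    subtasks = [f"{task}:part{i}" for i in range(3)]
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(worker, subtasks))  # map preserves order
    return " | ".join(results)
```

The aggregation step is where these systems tend to break: workers drift out of sync with each other's context, which is one way the patterns above fail.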
Fine-tuning changes how the model thinks. RAG changes what it sees. A practical decision framework for when to use each — and when to use both.
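The distinction above — fine-tuning changes behavior, RAG changes knowledge — reduces to a two-question decision tree. The questions and return labels below are a hypothetical condensation for illustration, not the guide's actual framework.

```python
def choose_adaptation(needs_private_knowledge: bool,
                      needs_behavior_change: bool) -> str:
    # RAG injects knowledge the model never saw; fine-tuning shifts
    # style, format, or reasoning habits the model already has.
    if needs_private_knowledge and needs_behavior_change:
        return "both"
    if needs_private_knowledge:
        return "rag"
    if needs_behavior_change:
        return "fine-tuning"
    return "prompting"
```

Note the default branch: if neither question is a yes, neither technique is warranted and prompting alone usually suffices.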
A practical guide to LLM evaluation — code-based checks, LLM-as-a-judge, human review, and how to build an eval suite that catches regressions before they ship.
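The cheapest tier mentioned above, code-based checks, can anchor a minimal regression suite: deterministic assertions over stored model outputs, rolled up into a pass rate you can gate CI on. The check names and case format here are invented for the sketch; LLM-as-a-judge and human review would plug in as additional check functions.

```python
import json

def check_valid_json(output: str) -> bool:
    # Code-based check: deterministic, fast, zero-cost to run.
    try:
        json.loads(output)
        return True
    except json.JSONDecodeError:
        return False

def check_no_refusal(output: str) -> bool:
    # Crude string heuristic for unwanted refusals.
    return "i cannot help" not in output.lower()

def run_eval_suite(cases: list[dict]) -> float:
    # Each case pairs a recorded model output with the checks it must
    # pass; the suite returns a pass rate to gate releases on.
    checks = {"json": check_valid_json, "no_refusal": check_no_refusal}
    passed = sum(
        all(checks[name](case["output"]) for name in case["checks"])
        for case in cases
    )
    return passed / len(cases)
```

Running this on every prompt change is what turns "it seemed fine in the playground" into a regression signal.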
Patterns that separate prompts that work in demos from prompts that work in production — context management, structured outputs, few-shot engineering, and version control.
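One of the patterns named above, few-shot engineering, has a mechanical core: examples are injected as prior conversation turns so the model imitates their format. The sketch below assembles a chat-style message list — the message schema follows the common role/content convention, and keeping the template in a function like this is what makes it diffable and version-controllable like any other code.

```python
def build_prompt(system: str,
                 examples: list[tuple[str, str]],
                 query: str) -> list[dict]:
    # Few-shot examples become alternating user/assistant turns;
    # the live query goes last.
    messages = [{"role": "system", "content": system}]
    for user_msg, assistant_msg in examples:
        messages.append({"role": "user", "content": user_msg})
        messages.append({"role": "assistant", "content": assistant_msg})
    messages.append({"role": "user", "content": query})
    return messages
```

Because the template is a pure function of its inputs, two versions of it can be compared in a diff and evaluated side by side — the version-control half of the pattern.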