Article List
Explore latest news, discover interesting content, and dive deep into topics that interest you
Product Evals in Three Simple Steps
Label some data, align LLM-evaluators, and run the eval harness with each change....
Advice for New Principal Tech ICs (i.e., Notes to Myself)
Based on what I've learned from role models and mentors in Amazon...
Training an LLM-RecSys Hybrid for Steerable Recs with Seman…
An LLM that can converse in English & item IDs, and make recommendations w/o retrieval or tools....
Evaluating Long-Context Question & Answer Systems
Evaluation metrics, how to build eval datasets, eval methodology, and a review of several benchmarks....
AI Engineer 2025 - Improving RecSys & Search with LLM techn…
Recsys & search are converging with LLMs via semantic IDs, data augmentation, and unified foundation models....
Exceptional Leadership: Some Qualities, Behaviors, and Styl…
What makes a good leader? What do good leaders do? And commando, soldier, and police leadership....
Building News Agents for Daily News Recaps with MCP, Q, and…
Learning to automate simple agentic workflows with Amazon Q CLI, Anthropic MCP, and tmux....
An LLM-as-Judge Won't Save The Product—Fixing Your Process …
Applying the scientific method, building via eval-driven development, and monitoring AI output....
Frequently Asked Questions about My Writing Process
How I started, why I write, who I write for, how I write, and more....
NVIDIA GTC 2025 - Building LLM-Powered Applications
Chip Huyen and I share what we've learned, best practices, and insights at NVIDIA GTC 2025....
Improving Recommendation Systems & Search in the Age of LLMs
Model architectures, data generation, training paradigms, and unified frameworks inspired by LLMs....
Building AI Reading Club: Features & Behind the Scenes
Exploring how an AI-powered reading experience could look like....
2024 Year in Review
A peaceful year of steady progress on my craft and health....
A Spark of the Anti-AI Butlerian Jihad (on Bluesky)
How the sharing of 1M Bluesky posts uncovered the strong anti-AI sentiment on Bluesky....
Seemingly Paradoxical Rules of Writing
With regard to writing, there are many rules and also no rules at all....
How to Run a Weekly Paper Club (and Build a Learning Commun…
Benefits of running a weekly paper club, how to start one, and how to read and facilitate papers....
My Minimal MacBook Pro Setup Guide
Setting up my new MacBook Pro from scratch...
39 Lessons on Building ML Systems, Scaling, Execution, and …
ML systems, production & scaling, execution & collaboration, building for users, conference etiquette....