Article List

Explore latest news, discover interesting content, and dive deep into topics that interest you

Clear Filters
Beyond Standard LLMs Machine Learning Research

Beyond Standard LLMs

Linear Attention Hybrids, Text Diffusion, Code World Models, and Small Recursive Transformers...

2 months, 1 week ago Ahead of AI
94253 words 314 min
Understanding the 4 Main Approaches to LLM Evaluation (From Scratch) Machine Learning Research

Understanding the 4 Main Approaches to LLM Evaluation (From…

Multiple-Choice Benchmarks, Verifiers, Leaderboards, and LLM Judges with Code Examples...

3 months, 1 week ago Ahead of AI
70353 words 234 min
Understanding and Implementing Qwen3 From Scratch Machine Learning Research

Understanding and Implementing Qwen3 From Scratch

A Detailed Look at One of the Leading Open-Source LLMs...

4 months, 1 week ago Ahead of AI
4570 words 15 min
From GPT-2 to gpt-oss: Analyzing the Architectural Advances Machine Learning Research

From GPT-2 to gpt-oss: Analyzing the Architectural Advances

And How They Stack Up Against Qwen3...

5 months ago Ahead of AI
79049 words 263 min
The Big LLM Architecture Comparison Machine Learning Research

The Big LLM Architecture Comparison

From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design...

5 months, 3 weeks ago Ahead of AI
147672 words 492 min
LLM Research Papers: The 2025 List (January to June) Machine Learning Research

LLM Research Papers: The 2025 List (January to June)

A topic-organized collection of 200+ LLM research papers from 2025...

6 months, 1 week ago Ahead of AI
15618 words 52 min
Understanding and Coding the KV Cache in LLMs from Scratch Machine Learning Research

Understanding and Coding the KV Cache in LLMs from Scratch

KV caches are one of the most critical techniques for efficient inference in LLMs in production....

6 months, 3 weeks ago Ahead of AI
40977 words 136 min
Coding LLMs from the Ground Up: A Complete Course Machine Learning Research

Coding LLMs from the Ground Up: A Complete Course

Why build LLMs from scratch? It's probably the best and most efficient way to learn how LLMs really work. Plus, many readers have told me they had a l...

8 months ago Ahead of AI
6195 words 20 min
The State of Reinforcement Learning for LLM Reasoning Machine Learning Research

The State of Reinforcement Learning for LLM Reasoning

Understanding GRPO and New Insights from Reasoning Model Papers...

8 months, 3 weeks ago Ahead of AI
106954 words 356 min
First Look at Reasoning From Scratch: Chapter 1 Machine Learning Research

First Look at Reasoning From Scratch: Chapter 1

Welcome to the next stage of large language models (LLMs): reasoning. LLMs have transformed how we process and generate text, but their success has be...

9 months, 2 weeks ago Ahead of AI
1841 words 6 min
The State of LLM Reasoning Model Inference Machine Learning Research

The State of LLM Reasoning Model Inference

Inference-Time Compute Scaling Methods to Improve Reasoning Models...

10 months ago Ahead of AI
77792 words 259 min
Understanding Reasoning LLMs Machine Learning Research

Understanding Reasoning LLMs

Methods and Strategies for Building and Refining Reasoning Models...

11 months, 1 week ago Ahead of AI
59222 words 197 min
Noteworthy AI Research Papers of 2024 (Part Two) Machine Learning Research

Noteworthy AI Research Papers of 2024 (Part Two)

Six influential AI papers from July to December...

11 months, 4 weeks ago Ahead of AI
74700 words 249 min
Noteworthy AI Research Papers of 2024 (Part One) Machine Learning Research

Noteworthy AI Research Papers of 2024 (Part One)

Six influential AI papers from January to June...

1 year ago Ahead of AI
41183 words 137 min
LLM Research Papers: The 2024 List Machine Learning Research

LLM Research Papers: The 2024 List

A curated list of interesting LLM-related research papers from 2024, shared for those looking for something to read over the holidays....

1 year, 1 month ago Ahead of AI
94753 words 315 min
Understanding Multimodal LLMs Machine Learning Research

Understanding Multimodal LLMs

An introduction to the main techniques and latest models...

1 year, 2 months ago Ahead of AI
88361 words 294 min
Building A GPT-Style LLM Classifier From Scratch Machine Learning Research

Building A GPT-Style LLM Classifier From Scratch

Finetuning a GPT Model for Spam Classification...

1 year, 3 months ago Ahead of AI
9948 words 33 min
Building LLMs from the Ground Up: A 3-hour Coding Workshop Machine Learning Research

Building LLMs from the Ground Up: A 3-hour Coding Workshop

If your weekend plans include catching up on AI developments and understanding Large Language Models (LLMs), I've prepared a 1-hour presentation on th...

1 year, 4 months ago Ahead of AI
4807 words 16 min
1 / 2