The archive

All articles

100 stories from the Inference Daily desk.

AI Agents·Jun 23, 2026

Agents Are Eating the Backend, One long-context workflows at a Time

The interesting story is not the demo. It is the second month, when long-context workflows either earns its keep or gets quietly rolled back.

Yuki Tanabe3 min read

LLMs·Jun 22, 2026

The Quiet Architecture Shift Behind GPT-5.1

The interesting story is not the demo. It is the second month, when computer-use agents either earns its keep or gets quietly rolled back.

Priya Raman3 min read

AI in Business·Jun 20, 2026

The Honest ROI Story Behind Notion's AI Bet

Beyond the launch posts, Claude 4.5 Sonnet is reshaping how product managers approach incident response. We talked to the people actually using it in production.

Elena Brost3 min read

AI News·Jun 20, 2026

The Week in AI: Gemini 3 Pro, memory, and a New small-model orchestration

Duolingo is the latest in a string of teams treating small-model orchestration as the default, not the experiment. Here is what they got right — and what they are still figuring out.

Mira Castellanos3 min read

AI Search·Jun 18, 2026

Retrieval Is Eating Search: A Look Inside code execution

Beyond the launch posts, Phi-4 is reshaping how founders approach lead qualification. We talked to the people actually using it in production.

Mira Castellanos3 min read

AI Agents·Jun 16, 2026

Agents Are Eating the Backend, One evals-first development at a Time

Intercom is the latest in a string of teams treating small-model orchestration as the default, not the experiment. Here is what they got right — and what they are still figuring out.

Mira Castellanos3 min read

Automation·Jun 14, 2026

When to Replace expense reporting With an Agent — and When Not To

Ramp is the latest in a string of teams treating RAG-as-a-service as the default, not the experiment. Here is what they got right — and what they are still figuring out.

Priya Raman3 min read

Automation·Jun 14, 2026

When to Replace data cleanup With an Agent — and When Not To

Beyond the launch posts, GPT-5.1 is reshaping how sales teams approach compliance review. We talked to the people actually using it in production.

Priya Raman3 min read

AI QA·Jun 11, 2026

The Quiet Discipline Behind Reliable Grok 4 Apps

The interesting story is not the demo. It is the second month, when evals-first development either earns its keep or gets quietly rolled back.

Yuki Tanabe3 min read