Agents Are Eating the Backend, One long-context workflows at a Time
The interesting story is not the demo. It is the second month, when long-context workflows either earns its keep or gets quietly rolled back.
The archive
100 stories from the Inference Daily desk.
The interesting story is not the demo. It is the second month, when long-context workflows either earns its keep or gets quietly rolled back.
The interesting story is not the demo. It is the second month, when computer-use agents either earns its keep or gets quietly rolled back.
Beyond the launch posts, Claude 4.5 Sonnet is reshaping how product managers approach incident response. We talked to the people actually using it in production.
Duolingo is the latest in a string of teams treating small-model orchestration as the default, not the experiment. Here is what they got right — and what they are still figuring out.
Beyond the launch posts, Phi-4 is reshaping how founders approach lead qualification. We talked to the people actually using it in production.
Intercom is the latest in a string of teams treating small-model orchestration as the default, not the experiment. Here is what they got right — and what they are still figuring out.
Ramp is the latest in a string of teams treating RAG-as-a-service as the default, not the experiment. Here is what they got right — and what they are still figuring out.
Beyond the launch posts, GPT-5.1 is reshaping how sales teams approach compliance review. We talked to the people actually using it in production.
The interesting story is not the demo. It is the second month, when evals-first development either earns its keep or gets quietly rolled back.