Beyond Vibes: Measuring Qwen 3 in Production
Beyond the launch posts, Qwen 3 is reshaping how operators approach expense reporting. We talked to the people actually using it in production.
The archive
100 stories from the Inference Daily desk.
The interesting story is not the demo. It is the second month, when RAG-as-a-service either earns its keep or gets quietly rolled back.
Duolingo is the latest in a string of teams treating RAG-as-a-service as the default, not the experiment. Here is what they got right — and what they are still figuring out.
Spotify is the latest in a string of teams treating long-context workflows as the default, not the experiment. Here is what they got right — and what they are still figuring out.
Beyond the launch posts, Command R+ 2 is reshaping how product managers approach expense reporting. We talked to the people actually using it in production.
The interesting story is not the demo. It is the second month, when small-model orchestration either earns its keep or gets quietly rolled back.
Beyond the launch posts, Phi-4 is reshaping how designers approach onboarding. We talked to the people actually using it in production.
Shopify is the latest in a string of teams treating RAG-as-a-service as the default, not the experiment. Here is what they got right — and what they are still figuring out.