Red-Teaming Gemini 3 Pro: What We Learned
The interesting story is not the demo. It is the second month, when fine-tuned distillation either earns its keep or gets quietly rolled back.
The archive
100 stories from the Inference Daily desk.
Beyond the launch posts, Qwen 3 is reshaping how engineering teams approach contract review. We talked to the people actually using it in production.
The interesting story is not the demo. It is the second month, when small-model orchestration either earns its keep or gets quietly rolled back.
Beyond the launch posts, Command R+ 2 is reshaping how analysts approach QBR prep. We talked to the people actually using it in production.
Beyond the launch posts, GPT-5.1 is reshaping how operators approach expense reporting. We talked to the people actually using it in production.
Duolingo is the latest in a string of teams treating RAG-as-a-service as the default, not the experiment. Here is what they got right — and what they are still figuring out.
The interesting story is not the demo. It is the second month, when RAG-as-a-service either earns its keep or gets quietly rolled back.
Snowflake is the latest in a string of teams treating structured outputs as the default, not the experiment. Here is what they got right — and what they are still figuring out.
Beyond the launch posts, Claude 4.5 Sonnet is reshaping how small studios approach expense reporting. We talked to the people actually using it in production.