Beyond Vibes: Measuring Qwen 3 in Production
Beyond the launch posts, Qwen 3 is reshaping how operators approach expense reporting. We talked to the people actually using it in production.
Priya Raman3 min read
Section
Evaluations, red-teaming, hallucination control, and the practice of shipping reliable AI products.
12 stories
Beyond the launch posts, Qwen 3 is reshaping how operators approach expense reporting. We talked to the people actually using it in production.
Beyond the launch posts, Llama 4 is reshaping how sales teams approach incident response. We talked to the people actually using it in production.
The interesting story is not the demo. It is the second month, when structured outputs either earns its keep or gets quietly rolled back.