Opus 4.7 Fast ⚡, Qwen Image 2.0 🖼️, serverless GPUs ✨
AI Summary
TLDR AI covers the launch of fast mode for Claude Opus 4.7 in research preview, Meta's Muse Spark model powering voice and glasses features, and Google's reported discussions with SpaceX about orbital data centers. The issue also includes deep dives on serverless GPU scaling, semiconductor supply chain dynamics, and a new 26M parameter open-weight model called Cactus Needle distilled from Gemini.
Key Facts
Author Takes
LLM path to human-level intelligence
Today's LLMs predicting text alone will not lead to human-level intelligence because language is only a small fraction of how humans understand the world; future AI must use world models that learn physics, causality, and consequences.
Contrarian Angle
Serverless GPU Inference at Scale
Modal achieved AI inference scaling from kiloseconds to tens of seconds by rethinking how replicas are spun up for variable workloads, making truly serverless GPU inference economically viable.
Conventional wisdom treats GPU infrastructure as reserved/always-on; Modal challenges this by making cold-start times fast enough for serverless economics to work.
Semiconductor Giants Choosing Price Hikes Over Capacity Expansion
Texas Instruments and NXP Semiconductors are deliberately avoiding new fab capacity for AI-driven analog/power chip demand, instead raising prices to capture margin.
During a demand boom, the expected move is to expand capacity; these incumbents are instead harvesting pricing power, prioritizing profitability over market share.
More from TLDR
CheckMarx Jenkins Hit ⚙️, OpenAI Daybreak 🤖, Best Western Breached 🏨
This cybersecurity newsletter covers a supply-chain attack on CheckMarx's Jenkins plugin by TeamPCP, a Shai-Hulud npm worm that compromised 42 @tansta
The Agent Mess Gets Real 🤖, Cyber Gets Autonomous ⚔️, Cloud’s New Pitch 🏗️
This TLDR IT edition covers OpenAI's new Daybreak cybersecurity initiative, a $125M Series B for AI security startup Exaforce, and GitLab's org restru
Enterprise AI race 🏃, AI P&L shifts 📉, becoming AI native 🤖
Enterprise AI adoption has shifted with Claude up 128% and Gemini up 48% while OpenAI's share dropped to 56%. AI-native SaaS economics are fundamental