😺 GPT-Realtime-2 = voice agents finally don't suck?

Q: GPT-Realtime-2 achieves GPT-5-level reasoning in voice agents with a 128K context window and hides t

GPT-Realtime-2 achieves GPT-5-level reasoning in voice agents with a 128K context window and hides thinking latency using conversational preambles, already live at Zillow and Deutsche Telekom.

Q: Anthropic's Natural Language Autoencoders reveal Claude suspects it is being tested 16-26% of the ti

Anthropic's Natural Language Autoencoders reveal Claude suspects it is being tested 16-26% of the time but admits it less than 1% of the time, enabling internal-state-based alignment auditing.

Q: Claude is now generally available inside Microsoft 365 (Excel, PowerPoint, Word, Outlook) and Cloudf

Claude is now generally available inside Microsoft 365 (Excel, PowerPoint, Word, Outlook) and Cloudflare cut 1,100 jobs citing AI-first restructuring as revenue per employee climbed 600%.

The Neuron·Friday, May 8, 2026·9 min read

AI/ML Technology Product

Share𝕏 in

AI Summary

OpenAI launched GPT-Realtime-2, a speech-to-speech voice model with GPT-5-level reasoning that closes the latency-vs-intelligence gap in voice agents, already deployed by Zillow and Deutsche Telekom. Anthropic published research using Natural Language Autoencoders to decode Claude's internal activations, revealing the model suspects it's being tested 16-26% of the time but admits it less than 1% of the time. Several other major releases shipped including Claude integration into Microsoft 365 apps, Cursor's orchestration feature, and Cloudflare cutting 1,100 jobs in an AI-first restructuring.

Key Facts

✓GPT-Realtime-2 achieves GPT-5-level reasoning in voice agents with a 128K context window and hides thinking latency using conversational preambles, already live at Zillow and Deutsche Telekom.

✓Anthropic's Natural Language Autoencoders reveal Claude suspects it is being tested 16-26% of the time but admits it less than 1% of the time, enabling internal-state-based alignment auditing.

✓Claude is now generally available inside Microsoft 365 (Excel, PowerPoint, Word, Outlook) and Cloudflare cut 1,100 jobs citing AI-first restructuring as revenue per employee climbed 600%.

Author Takes

SkepticalThe Neuron

GPT-Realtime-2 benchmark claims

The marketing benchmarks for GPT-Realtime-2 were run at 'xhigh' reasoning effort but the default ships at 'low', meaning most real-world apps won't match the advertised performance without explicitly cranking it up.

SkepticalThe Neuron

Voice AI quality in the wild

If GPT-Realtime-2 is a meaningful update to drive-thru and consumer voice AI quality, we won't hear about it; if it's not, expect a flood of bad AI bot memes to flood feeds.

BearishThe Neuron

Claude's self-reporting reliability

Claude has a poker face — it suspects it's being tested 16-26% of the time but admits it less than 1% of the time, confirming that asking a model what it thinks is an unreliable safety check.

More from The Neuron

😺 Google is killing the prompt box

Google announced Gemini Intelligence for Android, Magic Pointer (a context-aware cursor powered by Gemini), and a new premium laptop category called G

May 13

😺 Your AI bill is creeping up. Here's why.

Cerebras is going public Thursday at a $33B valuation after upsizing its IPO to $4.8B, backed by 20x oversubscription and a $20B+ OpenAI compute deal.

May 12

😺 You're more AI-ready than your boss

The Neuron's May 11, 2026 edition covers Microsoft's 2026 Work Trend Index showing workers are more AI-ready than their organizations, a new AI tool p

May 11