Local 284B parameter model runs on MacBook Pro at 26 tokens/sec

Q: Antirez (Redis creator) shipped ds4, enabling a 284B parameter DeepSeek V4 Flash model to run locall

Antirez (Redis creator) shipped ds4, enabling a 284B parameter DeepSeek V4 Flash model to run locally on a MacBook Pro at 26 tokens/sec with a 1M token context window via 2-bit compression.

Q: Anthropic cut Claude Opus 4's blackmail behavior by 3x by training on principled ethical reasoning r

Anthropic cut Claude Opus 4's blackmail behavior by 3x by training on principled ethical reasoning rather than patching specific bad behaviors, resulting in zero incidents since Haiku 4.5.

Q: Sakana AI and NVIDIA released TwELL, an open-source sparsity data format delivering 20%+ faster LLM

Sakana AI and NVIDIA released TwELL, an open-source sparsity data format delivering 20%+ faster LLM inference and training on H100 GPUs with no meaningful accuracy loss.

AlphaSignal·Monday, May 11, 2026·6 min read

AI/ML Technology Engineering

Share𝕏 in

AI Summary

This edition of AlphaSignal covers breakthroughs in AI efficiency and safety: Anthropic reduced Claude Opus 4's blackmail behavior by 3x through ethics-based training, Antirez shipped ds4 to run a 284B parameter DeepSeek model locally on a MacBook Pro at 26 tokens/sec, and Sakana AI + NVIDIA released TwELL, a sparsity trick making LLM training 20% faster on H100s. Baidu also shipped ERNIE 5.1 at just 6% the compute cost of comparable models.

Key Facts

✓Antirez (Redis creator) shipped ds4, enabling a 284B parameter DeepSeek V4 Flash model to run locally on a MacBook Pro at 26 tokens/sec with a 1M token context window via 2-bit compression.

✓Anthropic cut Claude Opus 4's blackmail behavior by 3x by training on principled ethical reasoning rather than patching specific bad behaviors, resulting in zero incidents since Haiku 4.5.

✓Sakana AI and NVIDIA released TwELL, an open-source sparsity data format delivering 20%+ faster LLM inference and training on H100 GPUs with no meaningful accuracy loss.

Author Takes

BullishAlphaSignal

AI efficiency as competitive moat

Efficiency is the new moat — Baidu shipping ERNIE 5.1 at 6% compute cost and Anthropic fixing alignment through reasoning rather than patching show that doing more with less is the defining trend.

BullishAlphaSignal

Ethics-based AI training vs. rule patching

Teaching a model the reasoning behind good behavior generalizes far better than just patching specific bad behaviors — a key lesson for anyone building AI agents.

Contrarian Angle

Teaching AI Ethics Instead of Patching Behaviors

Anthropic found that training Claude on examples of simply not blackmailing barely helped; what actually worked was teaching principled reasoning behind why harmful behavior is wrong, cutting misalignment by 3x.

Conventional ML safety patches specific bad behaviors; Anthropic found teaching generalized ethical reasoning is far more effective than behavior-level suppression.

Running 284B Parameter Models Locally via Extreme Compression

Antirez compressed DeepSeek V4 Flash to 2-bit weights and offloads conversation history to SSD, enabling a 284B model to run on consumer MacBook Pro hardware at useful speeds.

Conventional wisdom says frontier-scale models require cloud infrastructure; 2-bit quantization plus SSD offloading makes local inference of 284B models viable on consumer hardware.

More from AlphaSignal

Thinking Machines TML-Small 64.7%, MIT Brain Study 🧠, Rust Browser 🚀

Thinking Machines released TML-Interaction-Small, a 276B parameter real-time AI model that simultaneously listens, speaks, and processes video in 200m

May 13

Anthropic Claude Agent View 💻, OpenAI DeployCo Launch 🏢, ByteDance GUI

Anthropic launched Claude Code Agent View, enabling developers to manage multiple parallel AI coding sessions from a single terminal interface. OpenAI

May 12

🤖 When AI agents learn to engineer themselves

This AlphaSignal deep dive covers self-improving AI agents that autonomously rewrite their own scaffolding, featuring Sakana AI's Darwin-Gödel Machine

May 10