Divide-and-conquer value learning reduces RL's Bellman recursion depth from linear to logarithmic in the horizon
A Berkeley researcher proposes a third RL paradigm, beyond TD and Monte Carlo, that exploits the triangle inequality in goal-conditioned settings, sidestepping the error accumulation that keeps Q-learning from scaling to long-horizon tasks.
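To make the logarithmic claim concrete, here is a minimal tabular sketch (an illustration under simplifying assumptions, not the paper's algorithm; the function names `bellman_sweeps` and `divide_and_conquer_sweeps` are hypothetical). In a deterministic shortest-path view of goal-conditioned RL, a one-step Bellman backup extends learned paths by a single transition per sweep, so propagating values over horizon H takes on the order of H sweeps. A divide-and-conquer backup instead stitches two already-learned subpaths through a waypoint, which is sound because shortest-path costs obey the triangle inequality; the reachable path length doubles per sweep, so on the order of log H sweeps suffice (this is min-plus matrix squaring).

```python
# A minimal sketch (not the paper's method): tabular goal-conditioned
# "distances" d[s, g] over a deterministic graph with edge costs `cost`.
import numpy as np

def bellman_sweeps(cost, n_sweeps):
    """One-step TD-style backups: extend each path by one transition per sweep."""
    d = cost.copy()
    for _ in range(n_sweeps):
        # d[s, g] <- min_w cost[s, w] + d[w, g]: one environment step, one bootstrap.
        d = np.minimum(d, (cost[:, :, None] + d[None, :, :]).min(axis=1))
    return d

def divide_and_conquer_sweeps(cost, n_sweeps):
    """Divide-and-conquer backups: compose two learned subpaths through a waypoint."""
    d = cost.copy()
    for _ in range(n_sweeps):
        # d[s, g] <- min_w d[s, w] + d[w, g]: covered path length doubles per sweep.
        d = np.minimum(d, (d[:, :, None] + d[None, :, :]).min(axis=1))
    return d

# Example: a 64-state chain s -> s+1, so the longest horizon is 63 steps.
n = 64
cost = np.full((n, n), np.inf)
np.fill_diagonal(cost, 0.0)
cost[np.arange(n - 1), np.arange(1, n)] = 1.0

exact = bellman_sweeps(cost, n)  # fully converged reference
print(np.allclose(bellman_sweeps(cost, 6), exact))             # False: 6 sweeps << 63
print(np.allclose(divide_and_conquer_sweeps(cost, 6), exact))  # True: 2**6 >= 63
```

In this toy setting the design point is visible directly: each divide-and-conquer backup bootstraps from two learned values rather than chaining one learned value per environment step, which is where the per-step error accumulation of long TD chains is avoided.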