DeepSeek-V4 cuts KV cache memory to 2% of standard attention cost to make million-token agent contexts practical
DeepSeek-V4 combines two new attention mechanisms with agent-specific post-training to cut KV cache memory to roughly 2% of a standard grouped-query-attention (GQA) architecture, prioritizing long-horizon agentic workloads over chat.
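To give a sense of scale for the 2% figure, the sketch below applies the standard KV-cache sizing formula (2 tensors, K and V, per layer) at a million-token context. The model dimensions used are illustrative assumptions for a GQA baseline, not published DeepSeek-V4 specifications.

```python
# Back-of-the-envelope KV-cache sizing. All model dimensions below are
# illustrative assumptions, not published DeepSeek-V4 specifications.

def kv_cache_bytes(seq_len, n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    """KV-cache size in bytes: K and V tensors per layer, fp16/bf16 by default."""
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem

# Hypothetical GQA baseline: 64 layers, 8 KV heads of dim 128, 1M-token context.
baseline = kv_cache_bytes(seq_len=1_000_000, n_layers=64, n_kv_heads=8, head_dim=128)
reduced = int(baseline * 0.02)  # the claimed ~2% footprint

GiB = 1024 ** 3
print(f"GQA baseline KV cache at 1M tokens: {baseline / GiB:.1f} GiB")
print(f"At ~2% of baseline:                 {reduced / GiB:.1f} GiB")
```

Under these assumed dimensions, a million-token KV cache drops from the hundreds-of-gigabytes range (exceeding a single accelerator's memory) to a few gigabytes, which is what makes long-horizon agent sessions plausible to serve.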