Top AI News Weekly

Week 23 · Jun 1 – Jun 7, 2026

Microsoft AI Build Week + NVIDIA Physical AI Wave. Microsoft AI ships its full in-house stack at Build 2026: MAI-Thinking-1 (1T MoE, 97% AIME 2025, matches Opus 4.6 on SWE-Bench Pro), MAI-Voice-2 (15-lang expressive TTS), MAI-Image-2.5 (#2 on image edit leaderboards), MAI-Transcribe-1.5 (SOTA in 18 of 43 languages), and Microsoft Discovery GA — the agentic R&D platform that drove the Majorana 2 quantum chip (~1000× reliability, 20s qubit lifetime). NVIDIA launches Cosmos 3 (open Physical AI omni-model with Reasoner + Generator MoT, tops PAI-Bench), Nemotron 3 Ultra (550B MoE, 55B active, 5.9× throughput vs GLM-5.1, NVFP4), OmniDreams (real-time AV world model), RTX Spark superchip (Blackwell + Grace, slim laptops), open Unitree H2 Plus humanoid reference design (75 DoF total). MiniMax M3 hits 59% SWE-Bench Pro with 1M context + native multimodality + Sparse Attention (15.6× decode). Alibaba Qwen3.7-Plus (1M context multimodal agent, 79 ScreenSpot Pro). Google Gemma 4 12B (encoder-free native audio-in), Magenta RealTime 2 (2.4B live music, ~200ms latency, MLX support). OpenAI ships ChatGPT Dreaming (background memory synthesis) + GPT-5.5 Instant June refresh. Ideogram 4.0 (9.3B open-weight, structured prompts, 2K native). Reve 2.0 (Large Layout Model, #2 T2I Arena). Boson AI ships Higgs Audio v3 TTS (4B, 102 languages, zero-shot) + Higgs Avatar v1 (real-time talking head). Baidu NAVA (native audio-visual alignment), Bernini video gen+edit, StreamChar real-time character AV. Research wave: Déjà View looping Transformers for 3D, PaGeR panoramic geometry, MAMMA multi-person mocap, Stable-Layers Flow-GRPO, WavTTS waveform diffusion. Tooling: EveryInc compound-engineering-plugin (37 skills/51 agents), Baidu LoongForge (5.04× over Megatron), Kasetto (declarative Rust agent manager), learn-claude-code (20-lesson harness curriculum), Odysseus self-hosted, earendil-works/pi toolkit, MisoTTS 8B, synthteam Slack persona plugin, QuantDinger AI quant trading. Industry: GitHub Copilot moves to usage-based AI Credits ($0.01/credit), Perplexity Personal Computer for Windows (19-model orchestrator), Vals AI Finance Agent v2 benchmark (GPT-5.5 leads at 51.76%), FlashDreams inference library, NVIDIA Cosmos Coalition (Black Forest Labs, Runway, LTX), Suno iOS Notes/Voice Memos integration.

44 launches and research drops that matter for enterprise AI builders—curated, tagged, and ready for your next roadmap sync.

New drops

Unique sources

Key themes

Immersive · Frontier · Agents

Jump to week

frontier

Frontier Models & Research

New reasoning systems, world models, and alignment papers.

Memory SystemOpenAI

ChatGPT Dreaming (V3 Memory)

New ChatGPT memory architecture that replaces the manually curated saved-memories list with a background synthesis process revising memories over time (e.g. trip "will go" → "went"); enabled by ~5× compute-cost reduction, doubles memory capacity for Plus/Pro.

Week 23 · Jun 1 – Jun 7, 2026

Frontier Models & Research

ChatGPT Dreaming (V3 Memory)

Gemma 4 12B

Qwen3.7-Plus

MiniMax M3

Majorana 2

NVIDIA Nemotron 3 Ultra

MAI-Thinking-1

GPT-5.5 Instant — June refresh

Immersive Media & Simulation

MisoTTS 8B

Bernini

Déjà View

PaGeR

Magenta RealTime 2

MAMMA

Reve 2.0

Ideogram 4.0

NVIDIA Cosmos 3

Stable-Layers

WavTTS

StreamChar

NVIDIA OmniDreams

Higgs Audio v3 TTS

NAVA

MAI-Voice-2

MAI-Image-2.5

Higgs Avatar v1

Unitree H2 Plus + Isaac GR00T Ref Design

MAI-Transcribe-1.5

NVIDIA Cosmos Coalition

Suno iOS — Notes + Voice Memos integration

Agents & Embodied Intelligence

Compound Engineering Plugin

learn-claude-code

Pi Agent Toolkit

synthteam

QuantDinger

Microsoft Discovery (GA)

Perplexity Personal Computer for Windows

Developer Tooling & Infra

LoongForge

Kasetto

Odysseus

Finance Agent v2 Benchmark

NVIDIA RTX Spark

GitHub Copilot AI Credits

FlashDreams