-
Anthropic Launches Claude Managed Agents: Dreams, Outcomes, Multiagent Orchestration
Anthropic
tools
-
Google Announces Gemini Intelligence for Android with Cross-App Automation
Google
tools
-
Claude Code v2.1.139–v2.1.140: Agent View Research Preview and /goal Command
Anthropic
tools
-
Google DeepMind Co-Scientist: Multi-Agent Research System Published in Nature, Tool Now in Labs
Google DeepMind
research
-
Code as Agent Harness: Survey Positions Code as the Substrate for Executable Agent Systems (159 HF upvotes)
Multi-institution (42 authors)
research
-
SkillsVote: Lifecycle Governance of Agent Skills — Collection, Recommendation, Evolution (219 HF upvotes)
Memtensor Research Group / IAAR-Shanghai
research
-
Windsurf Rebrands as Devin Desktop and Launches Open Agent Client Protocol (ACP)
Cognition
tools
-
GitHub Copilot Standalone Desktop App Launches in Technical Preview at Microsoft Build 2026
GitHub
tools
-
Microsoft Launches Scout: Always-On Autopilot AI Agent for Microsoft 365
Microsoft
tools
-
Sakana AI Releases Fugu: Multi-LLM Orchestrator Achieving SoTA on SWE-Bench Pro
Sakana AI
research
-
MLEvolve: Self-Evolving Multi-Agent LLM Framework for Automated ML Algorithm Discovery
research
-
Ctx2Skill: Self-Improving Framework for Autonomous Context-Skill Discovery in LLMs
research
-
ARIS: Autonomous ML Research via Adversarial Multi-Agent Collaboration
Shanghai Jiao Tong University
research
-
VS Code 1.120: Agents Window Ships to Stable with Terminal Risk Assessment
Microsoft
tools
-
Crafter: Multi-Agent Harness for Editable Scientific Figure Generation Scores +16pt Over Baselines (103 HF Upvotes)
Tsinghua University
research
-
Moonshot AI Opens Kimi Work Desktop Agent with 300-Sub-Agent Swarm and WebBridge
Moonshot AI
tools
-
EvoArena: LLM Agents Score Only 39.6% on Dynamic Evolving Environments Benchmark
MIT
research
-
GitHub Copilot CLI v1.0.45: /autopilot Toggle and /fork Session Branching
GitHub
tools
-
Cursor Launches Microsoft Teams Integration for Cloud Agent Delegation
Cursor
tools
-
NanoResearch: Co-Evolving Skills, Memory, and Policy for Personalized AI Research Automation
Shanghai AI Lab
research
-
TMAS: Scaling Test-Time Compute via Multi-Agent Synergy with Hierarchical Memory
research
-
OpenClaw v2026.5.16-beta.5/6: Grok OAuth, Mac Settings Redesign, Python Debugging
OpenClaw
tools
-
OpenAI Codex CLI v0.137.0: Multi-Agent v2, Enterprise Config Bundles, TUI Keybindings
OpenAI
tools
-
SearchSwarm: Delegation Intelligence for LLM Agents in Long-Horizon Deep Research
research
-
Google DeepMind and Partners Launch $10M Multi-Agent AI Safety Research Fund
Google DeepMind
industry
-
Google DeepMind Takes Minority Stake in CCP Games for Multi-Agent Research in EVE Online
Google DeepMind
industry