#swe-bench
- Microsoft Build 2026: MAI Model Family Launched to Power GitHub Copilot Without OpenAI Dependency Microsoft models-llm
- Mistral releases Medium 3.5 — 128B dense, 256k context, open weights Mistral models-llm
- Mistral Launches Medium 3.5 Open-Weight Flagship and Remote Coding Agents in Vibe Mistral AI models-llm
- Poolside Open-Sources Laguna XS.2 and M.1: First Open-Weight Agentic Coding Models from a US Startup Poolside models-llm
- DeepReinforce Releases Ornith-1.0: Open-Source Coding Models That Learn Their Own RL Scaffolds DeepReinforce tools
- SHERLOC: Structured Diagnostic Localization Cuts Code Repair Token Usage by 36.7% research