Frontier lab releases, open-source checkpoints, multimodal systems, inference stacks, and model capability shifts.
Anthropic’s Claude Opus 4.7 and Claude Mythos Preview sharpen agentic and software-engineering use cases
OpenAnthropic’s frontier lineup now centers on **Claude Opus 4.7** as its flagship generally available model, adding stronger software engineering and vision capabilities on top of 1M context and agent-team orchestration foundations introduced in 4.6.[3] In parallel, **Claude Mythos Preview** is positioned as a frontier-tier experimental model available to restricted partners via Project Glasswing, signaling Anthropic’s next generation beyond generally available Opus.[3]
OpenAI GPT‑5.5 pushes agentic desktop benchmarks with 1M-token context
OpenOpenAI’s **GPT‑5.5** is described as its flagship agentic model, offering a 1M-token context window—4× larger than GPT‑5.4—and becoming the first model to exceed human baseline on the OSWorld‑V agentic desktop benchmark at 75%.[3] The model is optimized for long-horizon, tool-rich workflows such as multi-step research, code refactoring across large codebases, and complex UI automation.[3]
Google DeepMind’s Gemma 4 and Mistral’s Medium 3.5 strengthen open-weight multimodal options
Open**Gemma 4** is Google DeepMind’s most capable open-weight family so far, released under Apache 2.0 with on-device variants and a 26B MoE option that natively supports vision and audio, 256K context, and 140+ languages.[3] **Mistral Medium 3.5** ships as a frontier-class multimodal model with open weights under a Modified MIT license, and Mistral also released **Voxtral TTS**, its first open text-to-speech model, expanding its open audio ecosystem.[3]