Frontier lab releases, open-source checkpoints, multimodal systems, inference stacks, and model capability shifts.
TeamDay.ai: GPT‑5.3 Codex hits OpenAI’s “high” cyber-preparedness bar
OpenTeamDay.ai’s February–March 2026 frontier roundup reports that OpenAI’s **GPT‑5.3 Codex** is the first OpenAI model rated “high” on its internal cybersecurity preparedness framework, with capabilities that could “meaningfully enable real-world cyber harm” if automated or used at scale.[1] The model is positioned as OpenAI’s flagship agentic coding system, optimized for long‑running development and operations tasks.[1]
Anthropic’s Claude Sonnet 4.6 pushes “mid‑tier” toward flagship performance
OpenThe same frontier roundup highlights **Claude Sonnet 4.6** (released Feb 17, 2026) as a full upgrade that delivers near‑Opus performance in coding, computer use, long‑context reasoning, agent planning, and design while targeting lower cost.[1] Sonnet 4.6 ships with a 1M‑token context window in beta, aligning with other frontier systems now offering million‑token contexts.[1]
Open‑weight frontier models: GLM‑5 and Kimi K2.5 expand self‑hosted options
OpenTeamDay.ai notes that **GLM‑5** (Zhipu AI) and **Kimi K2.5** (Moonshot AI) are large mixture‑of‑experts frontier models released as open‑weight systems suitable for self‑hosting.[1] GLM‑5 is a 745B‑parameter MoE with strong coding, multi‑step reasoning, and long‑context support, while Kimi K2.5 is a 1T‑parameter multimodal MoE with an “Agent Swarm” capability trained via Parallel Agent Reinforcement Learning (PARL), and the model weights are available via Hugging Face.[1]