Frontier lab releases, open-source checkpoints, multimodal systems, inference stacks, and model capability shifts.
TeamDay: GPT-5.3 Codex hits OpenAI’s “high” cyber preparedness bar
OpenTeamDay’s February–March 2026 frontier-model roundup reports that OpenAI’s **GPT-5.3 Codex** is the first OpenAI model rated “high” on its internal cybersecurity preparedness framework, reflecting concern that its coding and reasoning skills could meaningfully enable cyber harm if automated at scale.[1] The model is positioned as OpenAI’s flagship agentic coding system, optimized for long-running development and operations tasks.[1]
Anthropic’s Claude Sonnet 4.6 pushes mid-tier toward flagship performance
OpenTeamDay highlights **Claude Sonnet 4.6** as a full upgrade across coding, computer use, long‑context reasoning, agent planning, and design, with a 1M‑token context window in beta.[1] The model delivers near‑Opus performance at roughly one-fifth the price, targeting high‑volume production workloads.[1]
Open‑weight surge: GLM‑5, Kimi K2.5, and DeepSeek V3.2/V4 for self‑hosting
OpenTeamDay notes several new open‑weight frontier-class options: **GLM‑5** (745B‑parameter MoE with 44B active parameters), **Moonshot Kimi K2.5** (1T‑parameter MoE with Agent Swarm and PARL for parallelized tasks), and **DeepSeek V3.2/V4**, with V3.2 priced at roughly $0.27 per million tokens and V4 offering 1M+ context windows.[1] These models are designed for creative work, code generation, multi‑step reasoning, and agentic intelligence, and GLM‑5 and Kimi K2.5 are available for self‑hosting if