Frontier lab releases, open-source checkpoints, multimodal systems, inference stacks, and model capability shifts.
TeamDay.ai maps February–March 2026 frontier model wave: GPT‑5.3 Codex, Claude 4.6, Mistral Large 3, DeepSeek V4, GLM‑5, Kimi K2.5
OpenTeamDay.ai catalogs eleven major frontier releases, highlighting OpenAI’s GPT‑5.3 Codex, Anthropic’s Claude Sonnet 4.6, Mistral Large 3, DeepSeek V3.2/V4, Moonshot’s Kimi K2.5 and Kimi Claw, Zhipu’s GLM‑5, MiniMax M2.5, and ByteDance Seed models.[1] Several of these (GLM‑5, Kimi K2.5, DeepSeek V4, Mistral Large 3, Seed‑OSS‑36B) ship as open weights and push 1M+ token context, strong coding, and agentic capabilities toward or near Claude Opus–level performance.[1]
GPT‑5.3 Codex flagged as “high” cyber‑preparedness risk by OpenAI
OpenTeamDay.ai notes that GPT‑5.3 Codex is the first OpenAI model rated “high” on the company’s cybersecurity preparedness framework, reflecting its capability to “meaningfully enable real‑world cyber harm, especially if automated or used at scale.”[1] The model targets long‑running agentic coding tasks and shows substantial gains in code generation and reasoning performance over prior Codex‑class models.[1]
Open‑weight GLM‑5 and Kimi K2.5 reach near‑frontier benchmarks at scale
Zhipu’s GLM‑5 (745B MoE, 44B active) and Moonshot’s Kimi K2.5 (1T MoE, 32B active) are reported as open‑weight models that match or surpass Claude Opus 4.5 on several benchmarks, including SWE‑bench and Humanity’s Last Exam, while supporting 1M+ token contexts.[1] Kimi K2.5 also introduces an Agent Swarm capability via Parallel Agent Reinforcement Learning (PARL) to decompose and parallelize complex tasks.[1]