Frontier lab releases, open-source checkpoints, multimodal systems, inference stacks, and model capability shifts.
Third Way maps current frontier model set (GPT‑5.5, Gemini 3.1 Pro, Muse Spark, Mistral Large 3, DeepSeek V4)
OpenThird Way’s policy memo defines **frontier AI models** as the most advanced systems by capability and training compute, and lists a current frontier set including OpenAI’s ChatGPT‑5.5, Google’s Gemini 3.1 Pro, Meta’s Muse Spark, Mistral Large 3, and DeepSeek V4.[4] It notes that major regulations (EU AI Act, New York RAISE Act, California SB 53) are now anchoring obligations to compute thresholds around 10^25–10^26 FLOP used in training.[4]
Understanding AI survey: five US labs have all shipped major models in the last two months
OpenUnderstanding AI reports that OpenAI, Anthropic, Google, Meta, and xAI have each released significant new models in roughly the last two months, and offers a comparative primer on their strengths and weaknesses.[1] The piece emphasizes that these frontier models now differ more in alignment, latency, and tool integration than in basic multimodal capability, which has become table stakes.[1][6]
TeamAI comparison: 22 frontier models show multimodal as a floor, not a differentiator
OpenTeamAI’s 2026 comparison charts 22 frontier models (GPT, Claude, Gemini, DeepSeek, Qwen, Kimi, and others) and notes that *every* major model now handles text, images, and documents; multimodality is described as a baseline capability rather than a differentiator.[6] The article instead highlights divergence in context length, pricing, and agentic support as the primary axes for model selection.[6]