Frontier lab releases, open-source checkpoints, multimodal systems, inference stacks, and model capability shifts.
OpenAI ships GPT-5.4 Thinking / Pro for professional and agentic workflows
OpenMapify reports OpenAI has released **GPT-5.4 Thinking / GPT-5.4 Pro**, described as its newest frontier model targeting professional work, with stronger reasoning, coding, and agent workflows than prior 5.x iterations.[5] GPT-5.4 Thinking is rolling out across ChatGPT and the API, while GPT-5.4 Pro is positioned for maximum performance on complex tasks and deep automation.[5]
Google DeepMind advances Gemini 3.x line with fast frontier ‘Flash’ variants
OpenMapify highlights **Gemini 3 Flash** as a fast, cost-effective frontier-intelligence model optimized for speed while preserving strong reasoning, complementing heavier Gemini 3 Pro-class models.[5] The piece also notes Google’s broader Gemini 3 family is aimed at multimodal understanding and code generation, integrated into Google’s ecosystem.[5]
Meta’s Llama 4 and other open models push multimodal, MoE, and Apache-2.0 stacks
OpenA ranking and roadmap overview notes **Llama 4** has moved to a mixture-of-experts, natively multimodal design, with the Scout variant fitting on a single H100 and the Maverick variant reportedly outperforming GPT‑4o on many benchmarks.[7] The same source highlights **Small 4**, a 119B MoE model under Apache 2.0 that unifies reasoning, multimodal, and coding while running about 40% faster than its Small 3 predecessor.[7]