Frontier lab releases, open-source checkpoints, multimodal systems, inference stacks, and model capability shifts.
Irregular on Gemini 3 Pro, GPT‑5.1‑Codex‑Max, and Claude Opus 4.5 frontier jump
OpenIrregular’s "A Frontier Fortnight" outlines how Google DeepMind’s **Gemini 3 Pro**, OpenAI’s **GPT‑5.1‑Codex‑Max**, and Anthropic’s **Claude Opus 4.5** collectively raise the bar on benchmarks and software engineering workflows.[1] The piece emphasizes improved cybersecurity-relevant capabilities, with Anthropic using the SOLVE framework to score Opus 4.5 on vulnerability discovery and exploit development tasks.[1]
John C. Derrick on Gemini 3.5 Flash, Gemini Omni, and Llama 4 MoE
OpenJohn C. Derrick’s 2026 model ranking notes Google I/O’s release of **Gemini 3.5 Flash** (general availability) and the rollout of **Gemini Omni** and an upgraded Antigravity agent platform, with 3.5 Flash beating Gemini 3.1 Pro on several coding and agentic benchmarks.[6] The same piece highlights **Llama 4**’s mixture‑of‑experts, natively multimodal design, with the Scout variant fitting on a single H100 and Maverick outperforming GPT‑4o on many benchmarks.[6]
Microsoft Azure Foundry expands multi‑vendor foundation model catalog
OpenMicrosoft’s Foundry Models hub aggregates flagship foundation models from OpenAI, Anthropic Claude, Meta, Mistral AI, DeepSeek, xAI, Cohere, Hugging Face, NVIDIA, Stability and others in a single catalog optimized for out‑of‑the‑box implementation in Foundry.[7] The catalog spans language and vision models and is positioned as a way to quickly discover and deploy popular frontier and open‑source systems.[7]