📚 All Articles
60 guides, regularly updated
ICML 2026 Seoul: 6,500+ papers accepted, ML enters the agentic era — key takeaways
Explore AI trends at ICML 2026 Seoul: over 6,500 accepted papers and the agentic era in machine learning.
Claude Sonnet 5: Anthropic's most agentic model, Opus performance at Sonnet price
OpenAI GPT-5.6: Sol, Terra and Luna, the model family that changes everything
Discover OpenAI GPT-5.6: Sol, Terra and Luna, the revolutionary model family under direct government control from June 26, 2026.
GPT-5.6 Sol: OpenAI launches the preview of a new model amid an early price war
Discover GPT-5.6 Sol, OpenAI's new preview shaking up the AI market amid a price war. Analysis and stakes of this launch.
Poolside Laguna M.1: the 225B open-source model for coding agents, Apache 2.0
Discover Poolside Laguna M.1, a 225B-parameter open-source model under Apache 2.0, built to revolutionize coding agents.
FrontierCode: Cognition's benchmark that buries SWE-Bench and ranks code agents by the real quality of pull requests — Fable 5 at 46.3%, Opus 4.8 at 34.3%, GPT-5.5 at 25.5%
Discover FrontierCode, Cognition's new benchmark replacing SWE-Bench by evaluating the real quality of code agents' pull requests.
DeepSWE: the benchmark proving that code agents were cheating — Artificial Analysis buries SWE-Bench
Discover DeepSWE, the new benchmark replacing SWE-Bench, proving code agents were cheating. Analysis of the rankings upended by Artificial Anal
Gemini 3.5 Pro countdown: 10 days before Google's deadline, 2 million tokens and Deep Think mode, the most anticipated model of the year (amid talent chaos)
Gemini 3.5 Pro: 10 days before Google's deadline, discover the rumors about its 2 million tokens and Deep Think mode amid talent chaos.
GLM-5.2: The most powerful open weights model in the world — 753B MoE, 1M context, MIT license, the LLM landscape shifts
Discover GLM-5.2 from Z.ai: the world's most powerful open weights model. 753B MoE, 1M context & MIT license shaking up the LLM landscape.
CacheRL: A Qwen3-4B model achieves 92% accuracy in tool-calling with 100 times less compute than GPT-5
Discover CacheRL: a Qwen3-4B model hits 92% tool-calling accuracy with 100x less compute than GPT-5. AI revolution!
Best LLMs for Coding (June 2026)
Discover the ultimate comparison of the best coding LLMs in June 2026. Analysis of agentic models capable of coding without human supervision.
Best Local LLMs (June 2026)
Discover the final ranking of the best local LLMs in June 2026. DeepSeek V4 Pro, Ollama: compare quality and privacy.