📚 All Articles
28 guide(s) — regularly updated
Gemini 3.5 Flash: the fast model that beats Opus 4.7 and GPT-5.5 on agent benchmarks at 289 tokens/second
Discover Gemini 3.5 Flash: the ultra-fast model at 289 tokens/sec beating Claude Opus 4.7 and GPT-5.5 on agent benchmarks.
General Preference RL: this paper unifies reinforcement learning and preference optimization for LLMs
Discover the General Preference RL paper unifying reinforcement learning and preference optimization to solve LLM post-training.
OpenAI Parameter Golf: The challenge that proves small models are the future of AI
Discover the OpenAI Parameter Golf challenge: why compressing an LLM into 16 MB proves small models are the future of AI.
Meta Muse Spark: why Meta betrayed open-source — the first closed model from the Superintelligence Lab
Discover why Meta Muse Spark is a turning point: the first closed model from the Superintelligence Lab that betrays Meta's open-source promise.
MeMo (Memory as a Model): memory as an autonomous model for updating LLMs without retraining
Discover MeMo (Memory as a Model): the innovative solution to update LLMs without retraining and defeat knowledge obsolescence.
SDAR (Self-Distillation Agentic Reinforcement): how to train AI agents with reinforcement learning without breaking them
Discover SDAR (Self-Distillation Agentic Reinforcement): the method to train your AI agents with reinforcement learning without breaking them.
OpenDeepThink: Bradley-Terry comparison-based parallel reasoning changes the game for LLM inference
Discover OpenDeepThink: how Bradley-Terry comparison-based parallel reasoning revolutionizes LLM inference and outperforms sequential chain-of-thought.
Negation Neglect: when fine-tuning makes LLMs blind to falsehoods
Discover the Negation Neglect phenomenon: how fine-tuning LLMs against fake news ends up making them blind to falsehoods.
KV-Fold: The training-free trick that revolutionizes long-context inference in LLMs
Discover KV-Fold, the training-free trick revolutionizing LLM long-context inference and solving the token management nightmare.
Attractor Models: the new architecture that beats Transformers at reasoning
Discover Attractor Models, the new AI architecture that outperforms Transformers on reasoning at equivalent parameters.
UniPool: the newcomer in MoE architectures decouples network depth from expert growth
Discover UniPool, the innovation revolutionizing MoE architectures by decoupling network depth from expert growth.
Best Free LLMs (May 2026)
Discover the best free LLMs of May 2026. Our comparison helps you find the ideal open-source or freemium AI without paying.