📚 All Articles
48 guide(s) — regularly updated
Qwen-AgentWorld : when an LLM simulates the world to train autonomous agents — the new frontier of language world modeling
Discover Alibaba's Qwen-AgentWorld: a revolutionary LLM that simulates the world to train autonomous agents. The new frontier of language world mo
Agentic Resource Discovery: the open standard that will unify AI agents
Discover Agentic Resource Discovery, the new open standard from Google and Microsoft designed to unify AI agents and automate their tool discove
Google launches the Interactions API in general availability: the new default interface for building Gemini agents (and generateContent retires)
Google launches Interactions API to GA. Discover the new default interface for your Gemini agents and the end of generateContent.
Vercel eve: the open source framework that wants to do for AI agents what Next.js did for the web
Discover Vercel eve: the revolutionary open source framework for building production AI agents, just as Next.js transformed the web.
OpenAI Codex Record & Replay : show a task once, the agent repeats it endlessly — the end of manual scripting
Discover OpenAI Codex Record & Replay: show a task once, and the agent repeats it infinitely. The end of manual scripting is here.
Claude Code switches to monthly credits: what changes for devs and autonomous agents
Anthropic ends Claude Code's free tier. Discover how the new monthly credit billing changes things for developers and autonomous agents.
MiroFish: an undergrad builds 700,000 AI agents in 10 days — this open source project predicts the future and explodes on GitHub
MiroFish: a student creates 700K AI agents in 10 days. Discover this open source project exploding on GitHub and predicting the future.
EEVEE : the first test-time prompt learning framework for self-improving AI agents
Discover EEVEE, the first test-time prompt learning framework designed to create self-improving and adaptable AI agents in real time.
Life-Harness : boosting LLM agents by 88.5% without retraining, the open source runtime revolution
Discover Life-Harness: the open source runtime revolution boosting LLM agents by 88.5% without retraining. No more brute force!
OmniGameArena: The UE5 benchmark that measures the learning dynamics of VLM agents in games
Discover OmniGameArena, the UE5 benchmark to evaluate VLM agents' learning dynamics in video games, beyond mere scores.
Life-Harness : boosting LLM agents by 88.5% without touching the model — the runtime revolution
Discover Life-Harness: the runtime method boosting LLM agents by 88.5% without fine-tuning. No more production failures!
OmniGameArena : the UE5 benchmark revolutionizing the evaluation of VLM agents in games
Discover OmniGameArena, the revolutionary UE5 benchmark for evaluating VLM agents in games. Forget simple scores and measure real progression.