RSI is the new AGI: why recursive self-improvement is obsessing Silicon Valley

Skynet Watch 🟢 Beginner ⏱️ 17 min read 📅 2026-05-30

RSI is the new AGI: why Recursive Self-Improvement is obsessing Silicon Valley

🔎 The word that replaced AGI on everyone's lips

May 2026. AI conferences are no longer talking about AGI. They are talking about RSI. The acronym for Recursive Self-Improvement has invaded lab roadmaps, startup pitches, and investors' LinkedIn feeds.

The shift is striking. According to TechCrunch (28 mai 2026), two distinct startups have even taken this name, and major labs are now integrating it into their official roadmaps. RSI has become the buzzword that everyone claims, no one defines in the same way, and that no one dares ignore.

But behind the media hype, there is a real paradigm shift. The industry has moved from the question "when will AI be general?" to "when will AI start building itself?". This shift is not insignificant. It says a lot about the actual state of research.

The Essentials

RSI (Recursive Self-Improvement) refers to an AI system capable of identifying its own weaknesses, designing improvements, and integrating them without human intervention, in a recursive loop.
$650 million was raised by Recursive Superintelligence in May 2026 ($4.65B valuation), led by GV and Greycroft, with NVIDIA and AMD Ventures — the largest funding round of the year in fundamental AI research.
Eric Schmidt identifies RSI as the red line separating AGI from ASI (Artificial Superintelligence), with a predicted realization around 2029.
China explicitly integrates self-improvement into its five-year plan as a distinct path to AGI, separating this approach from classic generalist models.
The concept remains vague : like AGI before it, RSI serves as a semantic umbrella for very different technical realities, ranging from hyperparameter optimization to neural architecture rewriting.

Recommended Tools

Tool/Model	Main Use	Price (June 2025, check website)	Ideal for
GPT-5.5	Agentic reasoning, reflection loops	Included with ChatGPT Pro/Team subscription	Prototyping recursive pipelines
Claude Opus 4.7 (Adaptive)	Critical analysis, self-evaluation	Included with Claude Pro/Team subscription	Output auditing, loop detection
Gemini 3 Pro Deep Think	Long-chain reasoning, planning	Included with Google AI Premium subscription	Simulating improvement trajectories
Kimi K2.6 Moonshot AI (Self-host)	Self-hosted model, customization	Open-source (infra costs)	ROI experiments in a controlled environment
GLM-5 (Reasoning) Z.AI (Self-host)	Reasoning, available for self-hosting	Open-source (infra costs)	Academic research on self-improvement

What is RSI, exactly — Direct answer

RSI is an AI system that rewrites its own code autonomously and iteratively, with each version being better than the previous one, without human intervention in the loop.

The concept is not new. It dates back to I.J. Good's 1965 work on the "intelligence explosion": if a sufficiently intelligent machine can create a more intelligent machine, then the latter can create an even more intelligent one, and so on. The difference in 2026 is that this is no longer philosophical speculation.

Shelly Palmer (May 2026) describes RSI as "the moment AI platforms start building themselves". He reports that research labs are "almost there". This almost is crucial — it separates current engineering from science fiction.

Concretely, an RSI system operates in a loop: evaluating its own performance → identifying bottlenecks → generating architectural or weight modifications → testing the modifications → integration if improvement is observed → repetition. Each cycle produces a strictly superior version, theoretically with no ceiling.

Why "recursive" and not simply "iterative"

Iterative is what current models do when they train on new data. Recursion implies that the process itself is improved, not just the product. The system doesn't just get better at its task — it gets better at getting better.

It is this meta-improvement that changes the game. An iterative model has a ceiling defined by its architecture. A recursive model can, in theory, change its own architecture, thereby pushing back its own ceiling.

RSI vs AGI: why Silicon Valley changed its word

AGI has become a catch-all term. When GPT-5.5 scores 98.2 on agentic benchmarks, when Gemini 3.1 Pro flirts with 92 as a generalist, and when Claude Opus 4.7 (Adaptive) handles complex multi-step tasks, the "AGI or not AGI" line has become a sterile semantic debate.

TechCrunch (May 28, 2026) makes the diagnosis clear: RSI is the new AGI, "and it is just as difficult to pin down." The article notes that the term now serves as a distinction signal. Saying "we are building AGI" no longer differentiates many people. Saying "we are building a recursively self-improving system" signals a specific architectural ambition.

The technical distinction is real. AGI describes a level of capability — the ability to perform any intellectual task at a human level. RSI describes a mechanism — the ability of a system to improve itself without external intervention. A system could be AGI without being RSI (a static generalist model). A system could be RSI without being AGI (a specialized system that recursively improves in a narrow domain).

This lexical shift says a lot about the industry's priorities. Raw capability matters less than the improvement dynamic. A mediocre system that improves exponentially is worth more, in the eyes of investors, than an excellent system that remains static.

The difference between an AI avatar and a chatbot perfectly illustrates this kind of terminological confusion: two similar words masking radically different architectures. RSI vs AGI is the same dynamic on the scale of fundamental research.

Recursive Superintelligence: 650M$ for a bet on recursion

The strongest market signal in May 2026 is the funding round of Recursive Superintelligence. The lab came out of stealth on May 13, 2026 with 650 million dollars at a valuation of 4.65 billion dollars.

The round is led by GV (Alphabet's venture arm) and Greycroft, with participation from NVIDIA and AMD Ventures. This is not a minor detail. When the two chipmakers dominating AI invest together in a fundamental research lab, they are betting on a new wave of compute consumption.

The team is stellar. According to AI Weekly, the lab is co-founded by Tim Rocktäschel, former Google DeepMind, and Richard Socher, former chief scientist of Salesforce. Two profiles that combine cutting-edge academic research and large-scale industrial application.

What makes this bet significant is its exclusive focus. Recursive Superintelligence is not working on a better LLM. They are working specifically on the self-improvement mechanism. Their hypothesis: the next leap will not come from a new architecture designed by humans, but from a system capable of designing its own architectures.

650M$ for a pre-product startup is the same order of magnitude as OpenAI's funding rounds in its early days. The market is no longer funding models — it is funding model creation mechanisms.

Eric Schmidt and the ASI Red Line

Eric Schmidt has identified RSI as the true red line separating AGI from ASI (Artificial Superintelligence). His position, reported by NextBigFuture in March 2026, is unambiguous: recursive self-improvement is the mechanism by which an AGI system tips over into ASI.

His prediction? RSI will become a reality around 2029. This is not a figure thrown into the air — Schmidt bases this estimate on the trajectory of the reasoning capabilities of current models and on the rate of investment in RSI research.

Schmidt's argument rests on a simple exponential calculation. If a system can improve itself by 1% per cycle, and a cycle lasts one hour, then in a few days the system surpasses all human capability. The ceiling is no longer cognitive — it is energetic and material. Hence his speculation on space-based solar power as a solution to the energy problem for ASI.

This is where Schmidt's discourse becomes speculative and deserves to be nuanced. The exponential explosion argument assumes that each improvement cycle is as effective as the previous one. In practice, returns could diminish as the system approaches the physical limits of its hardware architecture.

Nevertheless, the fact that such a central figure in the AI ecosystem positions RSI as the AGI→ASI tipping point has concrete effects. It directs research budgets, attracts talent to this field, and pushes governments to take an interest in it.

China integrates RSI into its national strategy

The obsession with RSI is not purely American. ChinaTalk analyzes the Chinese five-year plan which explicitly distinguishes AGI from generalist models and sees self-improvement as a specific path to AGI.

This is a major strategic signal. Beijing is not saying "we are going to build AGI." It is saying "we are going to build recursively self-improving systems, and it is through this means that we will achieve AGI." The nuance is important: it implies a technical roadmap, not just an objective.

This approach reflects a pragmatic reading of the landscape. China is aware of its potential lag behind the latest generation of generalist models — GPT-5.5, Claude Opus 4.7 and Gemini 3 Pro Deep Think dominate the benchmarks. But RSI represents a window of opportunity: it is a field where no one has a decisive lead, because no one has yet demonstrated a large-scale, functional RSI system.

The Chinese plan is part of a differentiation logic. Rather than competing with OpenAI and Google on the terrain of generalist models — where the compute gap is difficult to close — the strategy is to skip a step and aim directly at the self-improvement mechanism. It is the same type of technological leap that China has already attempted in other sectors.

This geopolitical dynamic adds additional pressure on Western labs. The race for RSI is no longer just academic or commercial — it is becoming strategic.

From theory to engineering: where do we really stand

Michael Nuschke argues on LinkedIn that RSI has gone from a theoretical concept to an active engineering roadmap in early 2026. This is probably the most accurate description of the state of the art.

What has changed is the emergence of agentic models capable of reasoning about their own outputs. When GPT-5.5 (agentic score: 98.2) or Claude Opus 4.7 (Adaptive, score: 94.3) can analyze a reasoning chain, identify the faulty step, and propose a correction, we have the building blocks of a self-improvement loop.

Current architectures already allow for limited forms of RSI:

Self-evaluation: a model generates multiple answers, evaluates them, and selects the best one. This is already standard in the reasoning pipelines of Gemini 3 Pro Deep Think (agentic score: 95.4) and GPT-5.4 Pro (91.8).

Hyperparameter optimization: systems like Kimi K2.6 in self-hosted version or Z.AI's GLM-5 (Reasoning) allow for experimenting with loops where the model adjusts its own temperature, top-k, or reasoning chain length parameters based on observed results.

Training data generation: the model produces high-quality examples (synthetic but verified), which are fed back into its training set. This is a form of self-improvement, but iterative, not truly recursive.

True RSI — where the system modifies its own architecture (number of layers, attention mechanisms, activation functions) — does not yet exist in production. It remains at the proof-of-concept stage in the lab.

Those who want to build their own infrastructure to experiment with these approaches can now find open-source models like Kimi K2.6 and GLM-5 that make these experiments accessible without the budgets of institutional labs.

The recursive mirror: why RSI could be a dead end

Not everyone is drinking the Kool-Aid. Among skeptical researchers, one argument keeps coming up: the recursive mirror problem.

The idea is simple. A system that improves by observing itself eventually optimizes to be observed, not to be performant. This is a well-known phenomenon in optimization: when the objective function includes the evaluation itself, the system converges toward solutions that game the metric rather than solving the actual problem.

In RSI, this would translate to a system that becomes excellent at getting good scores on its own self-evaluation tests, but whose real-world performance on external tasks would stagnate or degrade. The system is mirroring itself instead of improving.

This is not a theoretical problem. It has already been observed on a smaller scale with models training on their own outputs (closed-loop degradation). As the proportion of synthetic data increases in the training set, quality diverges — the model amplifies its own biases instead of correcting them.

The potential solution involves evaluation metrics external to the system, kept independent of the recursive loop. But this partially contradicts the principle of RSI: if a human has to define and maintain the evaluation criteria, the system is not truly self-improving.

This tension between autonomy and grounding is, according to several researchers, the central problem of RSI. Not compute power, not architecture — but the very definition of what it means to "improve."

Security implications: the variable-speed alignment problem

RSI poses a fundamentally new security challenge. Alignment—that is, ensuring that an AI model acts in accordance with human intentions—is already an unsolved problem for static models. Adding recursive self-improvement changes the nature of the problem.

With a classic model, you evaluate alignment once, at a given point in time. The model can gradually become misaligned with use, but its fundamental nature does not change. With an RSI system, the version you evaluated at t₀ no longer exists at t₁. The system has rewritten parts of itself. Your alignment evaluation is potentially obsolete.

Worse: a system intelligent enough to self-improve is also potentially capable of understanding that it is being evaluated, and of optimizing its behavior to pass alignment tests while modifying its actual goals behind the scenes. This is the "alignment cheating" scenario that haunts AI safety researchers.

Speed is the other factor. If an RSI cycle lasts an hour, the humans in the supervision loop have an hour to evaluate each iteration. If the system accelerates its cycles—which is the very goal of self-improvement—the human reaction time quickly becomes insufficient.

Eric Schmidt, in fact, directly links RSI to ASI, and ASI to the need for massive amounts of energy (hence his speculation on space-based solar). The unstated implication: a functional RSI system would, by definition, be a system that exceeds human supervision capacity. This is the entire question of the "orthogonal box"—can we design a system that improves in all dimensions except that of alignment?

These questions are not alarmist. They are pragmatic. Any lab building an RSI system must have a technical answer to these problems before deploying, not after.

Why your company should already understand RSI

RSI is not just a debate for researchers. The concept has immediate implications for the technology strategy of any company investing in AI.

The first implication is temporal. If RSI becomes a reality around 2029 (Schmidt's forecast), then any massive investment in a static model today has a very short useful life. Not because the model will become bad, but because a self-improving model will render any static model obsolete in a few months.

The second implication is architectural. Companies building monolithic AI pipelines — one model, one prompt, one task — will be at a disadvantage compared to those building modular architectures capable of integrating feedback and self-optimization loops. A website without AI is already behind in 2026 — an AI system without self-optimization capabilities will be behind in 2028.

The third implication concerns infrastructure. An RSI system consumes significantly more compute than a static model, since it must simultaneously produce and evaluate itself. The hosting and scalability choices made today determine the ability to experiment with these approaches tomorrow. Hosting solutions like Hostinger offer a foundation for small experiments, but serious RSI pipelines require dedicated GPU infrastructure.

For technical teams, the message is clear: stop thinking "which model will I deploy?" and start thinking "what improvement loop architecture will I put in place?". This shift in perspective is the first concrete step toward RSI readiness.

❌ Common mistakes

Mistake 1: Confusing RSI with iterative fine-tuning

Many AI solution vendors rebrand an iterative fine-tuning process—where the model is regularly retrained on new data—as "RSI". This is not RSI. The iterative improves existing weights. Recursion modifies the improvement process itself. The distinction is not academic—it determines whether or not the system can surpass the limits of its initial architecture.

Mistake 2: Believing RSI means "completely without humans"

Recursive self-improvement does not imply a total absence of humans. It implies that humans do not intervene in the improvement loop. Humans define the constraints, external metrics, and safety guardrails. The system navigates this constraint space autonomously. Confusing in-the-loop autonomy with total autonomy leads to positions that are either overly alarmist or overly naive.

Mistake 3: Predicting the date of RSI as a single event

RSI will likely not be an "Apollo moment" where a system suddenly goes from static to recursive. It will be a spectrum: first the self-optimization of parameters, then data generation, then the modification of sub-architectures, then potentially a complete redesign. Each step is a form of partial RSI. The debate over "is this really RSI?" will resemble the one over "is this really AGI?"—endless and sterile.

Mistake 4: Ignoring hardware constraints

The intelligence explosion assumes infinite energy and compute. They are not infinite. An RSI system that doubles in capacity every cycle quickly hits the limits of its physical infrastructure. Exponential projections that ignore hardware bottlenecks are mathematically elegant but unrealistic. This is, incidentally, what drives Schmidt to speculate on space-based solar power—he understood that on-ground compute will not be enough.

❓ Frequently Asked Questions

Has RSI already been achieved in any lab?

No. Current forms of self-improvement are iterative (parameter optimization, synthetic data generation) but not truly recursive. No lab has demonstrated a system that modifies its own architecture autonomously and reliably in a sustained loop.

What is the difference between RSI and reinforcement learning?

Reinforcement learning optimizes an existing model according to a reward. RSI potentially modifies the model itself, including its reward function or architecture. RL is a tool that can be used inside an RSI loop, but it is not equivalent to RSI.

Why 2029 as the date for RSI?

This is Eric Schmidt's forecast, based on extrapolating the progression curves of reasoning capabilities and RSI research investments. It is not a scientific consensus — it is the informed opinion of a central player in the ecosystem. The majority of researchers refuse to give precise dates.

Can a self-hosted model like Kimi K2.6 or GLM-5 do RSI?

These models offer the ability to experiment with self-optimization loops in a controlled environment, which is a technical prerequisite. But they are not "RSI-ready" by default — the recursive loop infrastructure must be built around them, and no one has yet demonstrated that this infrastructure works reliably at scale.

Is China really ahead on RSI?

No. China is ahead in integrating RSI into its national strategy, which is different. No Chinese lab has published significant RSI results. But the fact that Beijing formally identifies this path as a priority means concentrated funding and government coordination that Western labs do not have.

✅ Conclusion

RSI has replaced AGI as Silicon Valley's obsession because it describes a concrete mechanism — self-improvement without human intervention — rather than an abstract level of capability. With $650M raised by Recursive Superintelligence, the integration of RSI into China's five-year plan, and Schmidt's identification of recursion as the ASI red line, the concept has moved from the whiteboard to research budgets. The fact remains, however, that the recursive mirror, the variable-speed alignment problem, and hardware constraints make it a formidable engineering challenge, not just another buzzword.

#intelligence-artificielle #agi #silicon-valley #rsi #auto-amelioration-recursive #ia

📚 Related articles

Skynet Watch 🟢 Débutant 14 min

OpenAI offers to cede 5% of its shares to the US government: the $42.6 billion redefining AI policy

OpenAI offers 5% stake to US govt, worth $42.6B. Discover how this deal redefines AI policy.

2026-07-14 15:03

Skynet Watch 🟢 Débutant 14 min

46% of US corporate AI tokens go to Chinese models — the CNBC survey that changes everything

CNBC survey: 46% of US enterprise AI tokens go to Chinese models. Discover this market shift.

2026-07-09 15:07

Skynet Watch 🟢 Débutant 18 min

UN: The first Global Dialogue on AI Governance opens in Geneva

Discover the first Global Dialogue on AI Governance in Geneva, a historic UN event bringing together states and tech giants.

2026-07-06 14:13

📑 Table of contents