<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>AI Agents on Carles Abarca</title><link>https://carlesabarca.com/tags/ai-agents/</link><description>Recent content in AI Agents on Carles Abarca</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>© 2026 Carles Abarca</copyright><lastBuildDate>Fri, 03 Apr 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://carlesabarca.com/tags/ai-agents/index.xml" rel="self" type="application/rss+xml"/><item><title>Your Code Is One Agent Session From Being Cloned — And There's Nothing You Can Do About It</title><link>https://carlesabarca.com/posts/claude-code-leak-no-more-moats/</link><pubDate>Fri, 03 Apr 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/claude-code-leak-no-more-moats/</guid><description>The Claude Code leak proves that proprietary code is no longer a competitive barrier. If an agent can rewrite 512,000 lines in hours, what actually protects your business?</description><content:encoded>&lt;p&gt;On April 1, 2026 — and no, it wasn&amp;rsquo;t a joke — someone discovered that Claude Code&amp;rsquo;s npm package, Anthropic&amp;rsquo;s command-line tool, included a misconfigured &lt;code&gt;.map&lt;/code&gt; file. That file contained the complete source code: &lt;strong&gt;512,000 lines of TypeScript spread across 1,900 files&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;Within hours, the leaked repository had 25,000 stars on GitHub. Developers were tearing it apart piece by piece. But the truly unsettling part wasn&amp;rsquo;t the leak itself.&lt;/p&gt;
&lt;p&gt;It was what happened next.&lt;/p&gt;

&lt;h2 class="relative group"&gt;From TypeScript to Python in an afternoon
 &lt;div id="from-typescript-to-python-in-an-afternoon" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#from-typescript-to-python-in-an-afternoon" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;A researcher at the University of Washington took the leaked code, fed it through OpenAI Codex, and within hours had a &lt;strong&gt;functional reimplementation in Python&lt;/strong&gt;. Not a copy — a complete rewrite in a different language. The resulting repository, claw-code, hit 44,500 stars. A Rust rewrite is already underway.&lt;/p&gt;
&lt;p&gt;And here&amp;rsquo;s the legal problem that should keep every CTO up at night: a reimplementation in another language, generated by an AI agent, is &lt;strong&gt;likely immune to a DMCA takedown&lt;/strong&gt;. It&amp;rsquo;s not a copy. It&amp;rsquo;s a derived work created by a machine. Current legal frameworks simply aren&amp;rsquo;t designed for this.&lt;/p&gt;
&lt;p&gt;Gergely Orosz, one of the most respected voices in software engineering, put it this way: we&amp;rsquo;re facing a new reality where &lt;strong&gt;any closed codebase is one agent session away from being functionally cloned&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;Read that again. Any codebase. One agent session. Cloned.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What they found inside
 &lt;div id="what-they-found-inside" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-they-found-inside" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;The findings were fascinating and, in some cases, hilarious. Wes Bos discovered 187 hardcoded verbs for the loading spinner (including &amp;ldquo;hullaballooing&amp;rdquo; and &amp;ldquo;razzmatazzing&amp;rdquo;). They found an internal analytics system that flags your prompt as negative every time you swear at the agent. And the random 4-character IDs are filtered to exclude 25 curse words.&lt;/p&gt;
&lt;p&gt;But beyond the anecdotes, the technical discoveries revealed the real architecture of one of the world&amp;rsquo;s most advanced coding agents:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;35 modules&lt;/strong&gt; with clearly separated responsibilities.&lt;/li&gt;
&lt;li&gt;A &lt;strong&gt;lightweight memory system&lt;/strong&gt; based on ~150-character pointers, not bulk storage.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;System prompts live on the client&lt;/strong&gt;, not the server.&lt;/li&gt;
&lt;li&gt;Code comments are written &lt;strong&gt;for LLMs to read&lt;/strong&gt;, not humans.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;That last point deserves a pause. Anthropic is no longer writing code for programmers to understand. They&amp;rsquo;re writing code for other AI models to understand. If that doesn&amp;rsquo;t tell you where the industry is heading, nothing will.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The protective barrier that no longer exists
 &lt;div id="the-protective-barrier-that-no-longer-exists" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-protective-barrier-that-no-longer-exists" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;For decades, proprietary code was one of the most reliable protective barriers in technology. Your competitor could copy your idea, your design, your go-to-market strategy — but replicating a million lines of optimized code took years and hundreds of engineers.&lt;/p&gt;
&lt;p&gt;That&amp;rsquo;s over.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;m not exaggerating. Think about what actually happened this week:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;A configuration error exposed complete source code.&lt;/li&gt;
&lt;li&gt;An AI agent rewrote it in another language in hours.&lt;/li&gt;
&lt;li&gt;The open source community improved and extended it in days.&lt;/li&gt;
&lt;li&gt;The result is legally difficult to challenge.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Each of these steps was unthinkable two years ago. Together, they represent a fundamental shift in what it means to &amp;ldquo;protect&amp;rdquo; your software intellectual property.&lt;/p&gt;
&lt;p&gt;And you don&amp;rsquo;t need a leak for this to happen. AI agents can already:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Infer architectures&lt;/strong&gt; from the observable behavior of an API.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Replicate functionality&lt;/strong&gt; from public documentation and examples.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Generate alternative implementations&lt;/strong&gt; that achieve the same results with different code.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Code was never really a barrier. It was the illusion of a barrier.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The SaaSpocalypse connection
 &lt;div id="the-saaspocalypse-connection" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-saaspocalypse-connection" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;If this story sounds familiar, it&amp;rsquo;s because we already saw act one. In February of this year, &lt;a href="../../posts/saaspocalypse/" &gt;I wrote about the SaaSpocalypse&lt;/a&gt;: $300 billion evaporated in 48 hours when the market understood that the business logic of a $200/user/month SaaS fits in a text file that an agent can read and execute.&lt;/p&gt;
&lt;p&gt;The Claude Code leak makes that thesis worse in a way that few have articulated yet.&lt;/p&gt;
&lt;p&gt;Think about it: every SaaS provider exposes documented public APIs. Endpoints, data schemas, workflows, business rules — all accessible to any agent that can read documentation. The Claude Code leak proved that an agent can take 512,000 lines of code and reimplement them in hours. Now combine that with the fact that &lt;strong&gt;most SaaS products expose their business logic through their own APIs&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;You don&amp;rsquo;t need to steal Salesforce&amp;rsquo;s source code. You just need an agent that reads their public documentation, observes endpoint behavior, and infers the underlying architecture. The API &lt;em&gt;is&lt;/em&gt; the blueprint.&lt;/p&gt;
&lt;p&gt;Satya Nadella said it in December 2024: &amp;ldquo;SaaS applications are CRUD databases with business logic on top. Agents will absorb that logic.&amp;rdquo; In February, the market understood this in the abstract. With Claude Code, we have concrete proof: the tools to absorb that logic already exist. And they work.&lt;/p&gt;
&lt;p&gt;The SaaSpocalypse wasn&amp;rsquo;t the ending. It was the trailer.&lt;/p&gt;

&lt;h2 class="relative group"&gt;So what actually protects your business?
 &lt;div id="so-what-actually-protects-your-business" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#so-what-actually-protects-your-business" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;If code is no longer your competitive advantage — and public APIs reveal your business logic — what&amp;rsquo;s left? After 20 years in technology transformation, from banking in Spain to education in Mexico, I&amp;rsquo;ve seen this same question emerge every time a new technological wave destroys the previous barriers:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;1. Data, not code.&lt;/strong&gt; Your model trained on proprietary data, your curated datasets, your domain knowledge encoded in features that an agent can&amp;rsquo;t infer from the outside. A Claude Code clone can replicate the tool, but it can&amp;rsquo;t replicate the millions of conversations that trained Claude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;2. Speed of execution.&lt;/strong&gt; If your competitor can clone your code in hours, the advantage lies in being the first to solve the next problem. Not in protecting the previous solution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;3. Trust and brand.&lt;/strong&gt; In a world where anyone can replicate the technology, differentiation comes down to who trusts you. Anthropic&amp;rsquo;s customers aren&amp;rsquo;t going to migrate to claw-code to save on their subscription. They pay for support, for SLAs, for the guarantee that someone responds when something breaks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;4. The ecosystem.&lt;/strong&gt; Integrations, partnerships, network effects. Slack didn&amp;rsquo;t win because its code was uncopyable. It won because everyone was already there.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;5. A culture of continuous innovation.&lt;/strong&gt; If you assume that everything you build today will be replicable tomorrow, your only sustainable advantage is the ability to build the next thing faster than anyone else.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The implications for the enterprise
 &lt;div id="the-implications-for-the-enterprise" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-implications-for-the-enterprise" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;For any technology leader reading this, the message is clear: &lt;strong&gt;review your intellectual property strategy today&lt;/strong&gt;. Not tomorrow. Today.&lt;/p&gt;
&lt;p&gt;Some questions that should be on the agenda for your next board meeting:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;How much of our competitive advantage depends on code that an agent could replicate?&lt;/li&gt;
&lt;li&gt;What does our own API documentation reveal about our business logic?&lt;/li&gt;
&lt;li&gt;Do we have proprietary data that is genuinely hard to reproduce?&lt;/li&gt;
&lt;li&gt;Does our security strategy account for the fact that a misconfigured &lt;code&gt;.map&lt;/code&gt; file can expose our entire codebase?&lt;/li&gt;
&lt;li&gt;Are we prepared for a world where DMCA doesn&amp;rsquo;t protect against AI-generated reimplementations?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;And perhaps the most uncomfortable one of all: &lt;strong&gt;Are we still investing in building walls, when we should be investing in running faster?&lt;/strong&gt;&lt;/p&gt;

&lt;h2 class="relative group"&gt;The twist no one expected
 &lt;div id="the-twist-no-one-expected" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-twist-no-one-expected" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;There&amp;rsquo;s a delicious irony in this whole story. Claude Code — Anthropic&amp;rsquo;s tool designed for AI to write code — was dismantled and rewritten by the AI of its direct competitor. OpenAI Codex cloned Anthropic&amp;rsquo;s flagship product in an afternoon.&lt;/p&gt;
&lt;p&gt;It&amp;rsquo;s as if Ford had invented the assembly line and Toyota had copied it the same day using Ford&amp;rsquo;s own robots.&lt;/p&gt;
&lt;p&gt;Welcome to the era where the tools you build to automate other people&amp;rsquo;s work can be used to automate &lt;em&gt;your own&lt;/em&gt; work. Where your code is not your protective barrier. Where your advantage is not what you already built, but what you&amp;rsquo;re going to build tomorrow.&lt;/p&gt;
&lt;p&gt;The Claude Code leak wasn&amp;rsquo;t a security incident.&lt;/p&gt;
&lt;p&gt;It was a warning.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;Carles Abarca is VP of Digital Transformation at Tecnológico de Monterrey and former CTO of Banco Sabadell. He writes about AI, digital transformation, and the future of software at &lt;a href="https://carlesabarca.com" target="_blank" rel="noreferrer"&gt;carlesabarca.com&lt;/a&gt;.&lt;/em&gt;&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/claude-code-leak-no-more-moats/featured.png"/></item><item><title>NemoClaw and the Enterprise Awakening: When AI Agents Leave the Garage</title><link>https://carlesabarca.com/posts/nemoclaw-enterprise-agents/</link><pubDate>Wed, 18 Mar 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/nemoclaw-enterprise-agents/</guid><description>NVIDIA just validated what some of us already knew: OpenClaw is not a toy. With NemoClaw, the framework that started as a personal AI assistant enters the enterprise with security, governance, and guardrails. At Tec de Monterrey, we&amp;rsquo;ve been building this future for months — we call them AgenTECs.</description><content:encoded>&lt;p&gt;Jensen Huang does not use words carelessly. When the CEO of the most valuable company on Earth stands on stage at GTC and says &amp;ldquo;OpenClaw is the operating system for personal AI,&amp;rdquo; he is not making a prediction. He is reading a scoreboard.&lt;/p&gt;
&lt;p&gt;And then he drops NemoClaw.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What NemoClaw actually is
 &lt;div id="what-nemoclaw-actually-is" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-nemoclaw-actually-is" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Let me cut through the press releases. NemoClaw is not a competitor to OpenClaw. It is an enterprise wrapper around it. A single command — one command — installs NVIDIA&amp;rsquo;s Nemotron models and the OpenShell runtime on top of any existing OpenClaw instance, adding the three things that kept CIOs awake at night:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Privacy routing.&lt;/strong&gt; A router that decides which queries go to local models (running on your hardware, never leaving your network) and which can safely reach frontier models in the cloud. This is not a minor feature. This is the difference between &amp;ldquo;interesting demo&amp;rdquo; and &amp;ldquo;approved by Legal.&amp;rdquo;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Sandboxed execution.&lt;/strong&gt; OpenShell provides an isolated environment where agents can write code, access files, and execute tasks without touching the host system. The agent gets the access it needs to be productive. Nothing more.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Policy-based guardrails.&lt;/strong&gt; Configurable security boundaries that define what an agent can and cannot do. Not through hope. Through enforcement.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The message from NVIDIA is unmistakable: OpenClaw is infrastructure. Not a project. Not a trend. Infrastructure.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The enterprise problem was never intelligence
 &lt;div id="the-enterprise-problem-was-never-intelligence" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-enterprise-problem-was-never-intelligence" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Here is what most commentary about AI agents gets wrong: the bottleneck was never capability. Claude, GPT, Gemini — they have been capable of executing complex business tasks for over a year. The bottleneck was trust.&lt;/p&gt;
&lt;p&gt;No CIO in their right mind would deploy an autonomous agent with access to production systems, employee data, and financial records without three guarantees: that data stays where it should, that the agent operates within defined boundaries, and that every action is auditable. NemoClaw provides exactly those three guarantees.&lt;/p&gt;
&lt;p&gt;But NVIDIA is solving a problem that some of us had already started solving from the inside.&lt;/p&gt;

&lt;h2 class="relative group"&gt;AgenTECs: What happens when a university takes agents seriously
 &lt;div id="agentecs-what-happens-when-a-university-takes-agents-seriously" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#agentecs-what-happens-when-a-university-takes-agents-seriously" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;At Tecnológico de Monterrey, we did not wait for NemoClaw. We could not afford to. When you are responsible for the digital infrastructure serving 100,000 students and 30,000 employees, &amp;ldquo;wait and see&amp;rdquo; is not a strategy.&lt;/p&gt;
&lt;p&gt;We call them &lt;strong&gt;AgenTECs&lt;/strong&gt; — autonomous institutional agents built on OpenClaw, powered by our own TECgpt models, running on our private Azure infrastructure. They are not chatbots. They are not demos. They are digital collaborators with @tec.mx email accounts, Microsoft Teams presence, and defined roles within the organization.&lt;/p&gt;
&lt;p&gt;The architecture is deliberately simple:&lt;/p&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Layer&lt;/th&gt;
 &lt;th&gt;Component&lt;/th&gt;
 &lt;th&gt;Function&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Orchestration&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;OpenClaw&lt;/td&gt;
 &lt;td&gt;Lifecycle management, plugins, tools, persistent memory&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Cognitive&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;TECgpt (institutional LLMs)&lt;/td&gt;
 &lt;td&gt;Reasoning, language understanding, decision-making&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Infrastructure&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Azure private cloud&lt;/td&gt;
 &lt;td&gt;Isolated, secure, dedicated resources&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Channels&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Email @tec.mx, Teams, WhatsApp&lt;/td&gt;
 &lt;td&gt;Communication with users, supervisors, and other agents&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Each AgenTEC operates under a governance model with three clearly separated roles: the &lt;strong&gt;Administrator&lt;/strong&gt; (IT — keeps it running), the &lt;strong&gt;Supervisor&lt;/strong&gt; (business area — tells it what to do), and the &lt;strong&gt;User&lt;/strong&gt; (interacts with it as they would with any colleague). This separation is not bureaucracy. It is the difference between an agent that scales and an agent that becomes someone&amp;rsquo;s science project.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Why this matters now
 &lt;div id="why-this-matters-now" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#why-this-matters-now" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Three converging forces make this moment different from every previous enterprise AI wave:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;First, the cost equation has flipped.&lt;/strong&gt; An AgenTEC running 24/7 on a dedicated VPS costs a fraction of a human collaborator performing the same function. Not because the human is replaceable — because repetitive, rule-based work should never have been assigned to humans in the first place.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second, the framework matured before the enterprise noticed.&lt;/strong&gt; OpenClaw&amp;rsquo;s skill system, memory persistence, multi-channel communication, and tool integration were built for personal use. But personal AI turns out to be harder than enterprise AI. If an agent can manage someone&amp;rsquo;s email, calendar, finances, and home automation simultaneously, it can certainly process support tickets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Third, NVIDIA just removed the last excuse.&lt;/strong&gt; Every objection I have heard from CISOs and compliance officers — data privacy, execution isolation, audit trails — has a technical answer in NemoClaw. The conversation shifts from &amp;ldquo;can we?&amp;rdquo; to &amp;ldquo;when do we start?&amp;rdquo;&lt;/p&gt;

&lt;h2 class="relative group"&gt;The playbook
 &lt;div id="the-playbook" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-playbook" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;For CIOs reading this, here is the practical path:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Start with the mundane.&lt;/strong&gt; The first AgenTEC we are designing handles TecServices requests — the thousands of monthly queries about credentials, procedures, and administrative processes. Not glamorous. High volume. Perfect for an agent that follows procedures without deviation and escalates anything unusual.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Keep humans in the loop, but redefine the loop.&lt;/strong&gt; The supervisor does not review every action. They review patterns, exceptions, and escalations. Weekly, not hourly. The agent handles the routine. The human handles the judgment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Build on open infrastructure.&lt;/strong&gt; OpenClaw is open source. TECgpt runs on our own cloud. NemoClaw adds NVIDIA&amp;rsquo;s models locally. At no point does institutional data leave our control. This is not negotiable for a university handling student records, research data, and financial information.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Measure outcomes, not activity.&lt;/strong&gt; An AgenTEC&amp;rsquo;s value is not measured by how many messages it processes. It is measured by how many human hours it liberates for higher-value work, how many response times it reduces, and how many errors it eliminates.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The uncomfortable truth
 &lt;div id="the-uncomfortable-truth" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-uncomfortable-truth" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Here is what I tell my peers at every CIO forum: the question is not whether AI agents will transform your operations. That is settled. The question is whether you will build your own or rent someone else&amp;rsquo;s.&lt;/p&gt;
&lt;p&gt;NemoClaw makes the &amp;ldquo;build&amp;rdquo; option dramatically more accessible. OpenClaw provides the foundation. NVIDIA provides the guardrails. Your institutional knowledge — your processes, your data, your culture — provides the differentiation that no vendor can replicate.&lt;/p&gt;
&lt;p&gt;At Tec de Monterrey, we chose to build. We call them AgenTECs. And every month, the gap between what they can do and what we imagined widens — in our favor.&lt;/p&gt;
&lt;p&gt;The operating system for AI agents is here. The enterprise wrappers are shipping. The only scarce resource now is the courage to start.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;Carles Abarca is VP of Digital Transformation at Tecnológico de Monterrey, where he leads the development of TECgpt and the AgenTECs program. Previously CTO of Banco Sabadell and CIO of TSB Bank.&lt;/em&gt;&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/nemoclaw-enterprise-agents/featured.svg"/></item><item><title>AI Agents Are No Longer Assisting Scientists. They Are Doing the Science.</title><link>https://carlesabarca.com/posts/ai-agents-doing-science/</link><pubDate>Sun, 15 Mar 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/ai-agents-doing-science/</guid><description>Three events this week mark a tipping point: AI agents are now producing original scientific knowledge. We analyze the Great Scientific Acceleration and its consequences.</description><content:encoded>&lt;p&gt;In March 2026, something shifted. Not a bigger model. Not a higher benchmark. Something deeper: AI agents stopped being tools that help scientists and started producing scientific knowledge on their own.&lt;/p&gt;
&lt;p&gt;This week, three events converged, and I believe they mark a tipping point with no return. The &lt;a href="https://shipsquad.ai/blog/autoresearch-openclaw-claude-opus-ai-agents-doing-science" target="_blank" rel="noreferrer"&gt;ShipSquad&lt;/a&gt; team documented it brilliantly in their analysis &lt;em&gt;&amp;ldquo;AutoResearch, OpenClaw, Claude Opus 4.6: AI Agents Are Now Doing the Science&amp;rdquo;&lt;/em&gt;, and it inspired me to dig deeper into what this means for scientific research as we know it.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The facts
 &lt;div id="the-facts" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-facts" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Andrej Karpathy released &lt;a href="https://github.com/karpathy/autoresearch" target="_blank" rel="noreferrer"&gt;AutoResearch&lt;/a&gt;&lt;/strong&gt; — a 630-line Python framework that lets an AI agent run hundreds of machine learning experiments autonomously on a single GPU. You let it run overnight. You wake up to a better model and a complete discovery log. In 48 hours, the agent found ~20 improvements no human had identified, cutting GPT-2 training time by 11%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Anthropic&amp;rsquo;s Claude Opus 4.6 discovered 22 zero-day vulnerabilities in Firefox&lt;/strong&gt; — 14 of them high-severity — in just two weeks. For context: those 14 represent nearly one-fifth of all serious Firefox vulnerabilities patched throughout 2025. An AI model matched months of human security research.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sakana AI unveiled &lt;a href="https://pub.sakana.ai/ai-scientist-v2/paper/paper.pdf" target="_blank" rel="noreferrer"&gt;The AI Scientist v2&lt;/a&gt;&lt;/strong&gt; — an agentic system that generates hypotheses, designs experiments, executes them, analyzes results, and writes the complete scientific paper. The result: the first entirely AI-generated scientific paper accepted through peer review at an academic workshop.&lt;/p&gt;
&lt;p&gt;Three events. The same week. The same conclusion: &lt;strong&gt;AI agents are now producing original knowledge&lt;/strong&gt;.&lt;/p&gt;

&lt;h2 class="relative group"&gt;AI has discovered before, but never on its own
 &lt;div id="ai-has-discovered-before-but-never-on-its-own" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#ai-has-discovered-before-but-never-on-its-own" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;To understand why this moment is different, trace the evolution:&lt;/p&gt;

&lt;h3 class="relative group"&gt;Era 1: AI as microscope (2018-2023)
 &lt;div id="era-1-ai-as-microscope-2018-2023" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#era-1-ai-as-microscope-2018-2023" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;AI amplified human researchers&amp;rsquo; capabilities. Humans asked the questions, designed the experiments, and AI processed data at impossible scales.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.nature.com/articles/d41586-024-03214-7" target="_blank" rel="noreferrer"&gt;AlphaFold&lt;/a&gt;&lt;/strong&gt; (2020-2024) predicted the structure of all 200 million known proteins. A problem that had resisted solution for 50 years. Hassabis and Jumper &lt;a href="https://www.nobelprize.org/prizes/chemistry/2024/press-release/" target="_blank" rel="noreferrer"&gt;won the 2024 Nobel Prize in Chemistry&lt;/a&gt; for it. But the question — &amp;ldquo;can we predict protein structures?&amp;rdquo; — was formulated by humans.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;&lt;a href="https://news.mit.edu/2023/using-ai-scientists-combat-drug-resistant-infections-0525" target="_blank" rel="noreferrer"&gt;Halicin&lt;/a&gt;&lt;/strong&gt; (2020) — MIT researchers used AI to screen 100 million chemical compounds and discovered a new antibiotic capable of killing multi-drug-resistant bacteria, including the dreaded &lt;em&gt;Acinetobacter baumannii&lt;/em&gt;. But the experimental design was human.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;&lt;a href="https://deepmind.google/blog/millions-of-new-materials-discovered-with-deep-learning/" target="_blank" rel="noreferrer"&gt;GNoME by DeepMind&lt;/a&gt;&lt;/strong&gt; (2023) discovered &lt;a href="https://www.nature.com/articles/s41586-023-06735-9" target="_blank" rel="noreferrer"&gt;2.2 million new crystals&lt;/a&gt;, including 380,000 stable materials — multiplying by 10x everything humanity had found in the entire history of materials science. But the evaluation framework was designed by Google researchers.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 class="relative group"&gt;Era 2: AI as colleague (2024-2025)
 &lt;div id="era-2-ai-as-colleague-2024-2025" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#era-2-ai-as-colleague-2024-2025" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;AI began proposing hypotheses and designing experiments, but under human supervision.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;&lt;a href="https://deepmind.google/blog/funsearch-making-new-discoveries-in-mathematical-sciences-using-large-language-models/" target="_blank" rel="noreferrer"&gt;FunSearch by DeepMind&lt;/a&gt;&lt;/strong&gt; (2024) used an LLM to discover new solutions to open problems in pure mathematics — the first time a language model made a genuine discovery in formal sciences.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.nature.com/articles/s41591-025-03743-2" target="_blank" rel="noreferrer"&gt;Insilico Medicine&lt;/a&gt;&lt;/strong&gt; got its molecule rentosertib — designed entirely by generative AI — through a Phase IIa clinical trial with positive results for idiopathic pulmonary fibrosis. From concept to human testing in under 30 months, when the traditional process takes 10-15 years.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;&lt;a href="https://news.mit.edu/2025/using-generative-ai-researchers-design-compounds-kill-drug-resistant-bacteria-0814" target="_blank" rel="noreferrer"&gt;MIT Antibiotics-AI Project&lt;/a&gt;&lt;/strong&gt; (2025) moved from discovering existing antibiotics to designing entirely new molecules using generative AI capable of killing resistant bacteria. No longer searching for needles in a haystack; they are manufacturing new needles.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 class="relative group"&gt;Era 3: AI as autonomous researcher (2026 →)
 &lt;div id="era-3-ai-as-autonomous-researcher-2026-" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#era-3-ai-as-autonomous-researcher-2026-" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;And now, in March 2026, we cross the threshold: AI formulates its own questions, designs its own experiments, executes them, and produces papers accepted by human reviewers. Not science fiction. It is AutoResearch, AI Scientist v2, and Claude Opus doing independent security research.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Great Acceleration: when AI researches faster than humans can read
 &lt;div id="the-great-acceleration-when-ai-researches-faster-than-humans-can-read" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-great-acceleration-when-ai-researches-faster-than-humans-can-read" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;I considered making a safer prediction — forecasting the percentage growth of AI-authored scientific papers. Instead, here is the unfiltered version.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Before 2030, AI agents will have produced more verifiable scientific discoveries in materials science, drug discovery, and computational mathematics than all human researchers combined in those same disciplines.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is not hyperbole. It is arithmetic.&lt;/p&gt;
&lt;p&gt;Consider the numbers: GNoME discovered in weeks 380,000 stable materials that all of humanity took centuries to accumulate (barely 48,000 by 2023). AutoResearch runs 100 experiments per night — months of a PhD student&amp;rsquo;s work. And AI Scientist v2 can generate a complete scientific paper in hours, not months.&lt;/p&gt;
&lt;p&gt;Now scale that. Not one agent, but thousands. Not one night, but every night. Not one domain, but all of them.&lt;/p&gt;

&lt;h3 class="relative group"&gt;The Compound Acceleration phenomenon
 &lt;div id="the-compound-acceleration-phenomenon" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-compound-acceleration-phenomenon" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;What we are witnessing is not linear acceleration. It is compound acceleration: each agent discovery feeds the next research cycle. One agent discovers a new material → another simulates its properties → another designs applications → another writes the paper. In parallel. 24/7. No vacations, no ego, no departmental politics.&lt;/p&gt;
&lt;p&gt;Human science operates at publication speed: one paper every 6-18 months. Agentic science will operate at computation speed: one discovery per minute.&lt;/p&gt;

&lt;h3 class="relative group"&gt;Problems we will solve decades ahead of schedule
 &lt;div id="problems-we-will-solve-decades-ahead-of-schedule" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#problems-we-will-solve-decades-ahead-of-schedule" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;This acceleration is not just quantitative. There are problems we believed were decades away that research agents could solve much sooner:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Antibiotic resistance&lt;/strong&gt; — AI is already designing new molecules against superbugs. With autonomous agents iterating 24/7 on thousands of variants, we could have a complete new generation of antibiotics before 2030. The WHO estimated that by then, superbugs would kill 10 million people per year. Agents could get there first — for the benefit of all humanity.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Nuclear fusion&lt;/strong&gt; — The greatest challenge is controlling plasma. DeepMind already &lt;a href="https://deepmind.google/blog/accelerating-fusion-science-through-learned-plasma-control/" target="_blank" rel="noreferrer"&gt;used AI to control plasma shape&lt;/a&gt; in the TCV tokamak. Autonomous agents simulating and optimizing millions of magnetic configurations could compress decades of experimental research into years.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Cellular aging&lt;/strong&gt; — AlphaFold solved protein structures. The next frontier is understanding the interactions between proteins, genes, and cellular processes that cause aging. It is a problem of massive combinatorial complexity — exactly the kind of problem where agents excel.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;New energy materials&lt;/strong&gt; — GNoME already opened the door with 380,000 stable materials. Agents systematically exploring that space could find the room-temperature superconductor, the perfect battery electrolyte, or the catalyst that makes industrial carbon capture viable.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The question is no longer &lt;em&gt;whether&lt;/em&gt; AI will surpass humans in scientific output. It is &lt;em&gt;when&lt;/em&gt;. And my answer is: in many fields, it is already happening. We just have not updated our metrics to measure it.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The consequences nobody wants to discuss
 &lt;div id="the-consequences-nobody-wants-to-discuss" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-consequences-nobody-wants-to-discuss" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;

&lt;h3 class="relative group"&gt;1. The end of &amp;ldquo;batch research&amp;rdquo;
 &lt;div id="1-the-end-of-batch-research" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#1-the-end-of-batch-research" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;Today, a researcher can run perhaps 5-10 experiments per week. AutoResearch demonstrates that an agent runs 100 per night. When research shifts from a sequential human process to a continuous autonomous one, knowledge production will multiply by orders of magnitude.&lt;/p&gt;

&lt;h3 class="relative group"&gt;2. The radical democratization of discovery
 &lt;div id="2-the-radical-democratization-of-discovery" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#2-the-radical-democratization-of-discovery" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;Karpathy proved that a single GPU and 630 lines of code are enough for autonomous research. A PhD student in Monterrey, Lagos, or Bangalore can now compete in knowledge production with a Stanford lab. The barrier is no longer budget; it is the ability to formulate good questions and direct agents with precision.&lt;/p&gt;

&lt;h3 class="relative group"&gt;3. The &amp;ldquo;Orchestra Conductor&amp;rdquo; as the new scientific role
 &lt;div id="3-the-orchestra-conductor-as-the-new-scientific-role" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#3-the-orchestra-conductor-as-the-new-scientific-role" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;The scientist of the future will not be the one pipetting in the lab or writing code. It will be the one who &amp;ldquo;programs in Markdown&amp;rdquo; — who writes the precise instructions that guide squadrons of autonomous agents. This is exactly what Karpathy demonstrates with his &lt;code&gt;program.md&lt;/code&gt; file: the future of directing research is not writing better code, but writing better programs in natural language.&lt;/p&gt;
&lt;p&gt;This paradigm shift is not new. I explored it in my article &lt;a href="../../posts/next-ai-wave-agents/" &gt;&amp;ldquo;The Next AI Wave: Agents&amp;rdquo;&lt;/a&gt; — where I argued that autonomous agents represent a fundamental shift from assistive AI to autonomous execution. The same applies to science: the researcher becomes the conductor of research agent orchestras.&lt;/p&gt;

&lt;h3 class="relative group"&gt;4. The peer review crisis
 &lt;div id="4-the-peer-review-crisis" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#4-the-peer-review-crisis" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;If AI Scientist v2 already generates papers accepted at workshops, how long until it produces papers accepted at top conferences? And how will we distinguish between a paper written by an agent and one written by a human? The peer review system, designed to evaluate human work, is unprepared for a world where papers are generated at industrial speed. We will need new evaluation frameworks — and possibly, AI agents reviewing the papers of other AI agents.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What this means for universities
 &lt;div id="what-this-means-for-universities" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-this-means-for-universities" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;The institutions that most quickly integrate autonomous research agents into their labs will lead knowledge production in the next decade. It is not about buying bigger GPUs. It is about:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Training &amp;ldquo;research agent directors&amp;rdquo;&lt;/strong&gt; — scientists who know how to formulate questions and direct AI squadrons.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Building autonomous experimentation infrastructure&lt;/strong&gt; — labs where agents can run thousands of experiments without continuous human oversight.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Redefining authorship and intellectual property&lt;/strong&gt; — if an AI agent discovers a molecule that cures a disease, who owns the discovery?&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Measuring scientific output differently&lt;/strong&gt; — current indicators (papers, citations, h-index) are metrics designed for human speed. We need metrics that capture agentic velocity.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The race has started. And the window of opportunity to position yourself is now.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Conclusion
 &lt;div id="conclusion" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#conclusion" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;In just six years, AI went from predicting protein structures to discovering antibiotics, from designing materials to writing scientific papers, from finding known vulnerabilities to discovering zero-days. The trajectory is clear and accelerating exponentially.&lt;/p&gt;
&lt;p&gt;AI agents will not replace scientists. They will make scientists who do not use them irrelevant. And that transition, unlike the one we lived through in the software industry, will not take a decade. It will take months.&lt;/p&gt;
&lt;p&gt;Welcome to the era of the &lt;strong&gt;Great Scientific Acceleration&lt;/strong&gt;. Those who understand it first will lead the science of the future. Those who ignore it will read about it in papers written by machines.&lt;/p&gt;
&lt;hr&gt;

&lt;h2 class="relative group"&gt;🤖 Go deeper with AI
 &lt;div id="-go-deeper-with-ai" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#-go-deeper-with-ai" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Want to explore further? Click your favorite AI with a ready-made prompt:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;On the scientific acceleration with AI agents:&lt;/strong&gt;
&lt;a href="https://chat.openai.com/?q=Analyze%20the%20impact%20of%20autonomous%20AI%20agents%20on%20scientific%20output.%20Compare%20the%20discovery%20speed%20of%20AlphaFold%2C%20GNoME%2C%20and%20AutoResearch%20with%20traditional%20human%20research.%20In%20which%20fields%20will%20AI%20first%20surpass%20humans%20in%20verifiable%20knowledge%20production%3F" target="_blank" rel="noreferrer"&gt;ChatGPT&lt;/a&gt; · &lt;a href="https://www.perplexity.ai/search?q=Analyze%20the%20impact%20of%20autonomous%20AI%20agents%20on%20scientific%20output.%20Compare%20the%20discovery%20speed%20of%20AlphaFold%2C%20GNoME%2C%20and%20AutoResearch%20with%20traditional%20human%20research.%20In%20which%20fields%20will%20AI%20first%20surpass%20humans%20in%20verifiable%20knowledge%20production%3F" target="_blank" rel="noreferrer"&gt;Perplexity&lt;/a&gt; · &lt;a href="https://claude.ai/new?q=Analyze%20the%20impact%20of%20autonomous%20AI%20agents%20on%20scientific%20output.%20Compare%20the%20discovery%20speed%20of%20AlphaFold%2C%20GNoME%2C%20and%20AutoResearch%20with%20traditional%20human%20research.%20In%20which%20fields%20will%20AI%20first%20surpass%20humans%20in%20verifiable%20knowledge%20production%3F" target="_blank" rel="noreferrer"&gt;Claude&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;On the future of peer review:&lt;/strong&gt;
&lt;a href="https://chat.openai.com/?q=How%20should%20academic%20peer%20review%20evolve%20when%20AI%20agents%20like%20Sakana%27s%20AI%20Scientist%20v2%20already%20produce%20accepted%20papers%3F%20Analyze%20fraud%20risks%2C%20quality%20opportunities%2C%20and%20propose%20a%20new%20scientific%20evaluation%20framework%20for%20the%20agentic%20era." target="_blank" rel="noreferrer"&gt;ChatGPT&lt;/a&gt; · &lt;a href="https://www.perplexity.ai/search?q=How%20should%20academic%20peer%20review%20evolve%20when%20AI%20agents%20like%20Sakana%27s%20AI%20Scientist%20v2%20already%20produce%20accepted%20papers%3F%20Analyze%20fraud%20risks%2C%20quality%20opportunities%2C%20and%20propose%20a%20new%20scientific%20evaluation%20framework%20for%20the%20agentic%20era." target="_blank" rel="noreferrer"&gt;Perplexity&lt;/a&gt; · &lt;a href="https://claude.ai/new?q=How%20should%20academic%20peer%20review%20evolve%20when%20AI%20agents%20like%20Sakana%27s%20AI%20Scientist%20v2%20already%20produce%20accepted%20papers%3F%20Analyze%20fraud%20risks%2C%20quality%20opportunities%2C%20and%20propose%20a%20new%20scientific%20evaluation%20framework%20for%20the%20agentic%20era." target="_blank" rel="noreferrer"&gt;Claude&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;On the impact for universities:&lt;/strong&gt;
&lt;a href="https://chat.openai.com/?q=I%27m%20a%20university%20executive.%20How%20should%20I%20prepare%20my%20institution%20for%20a%20world%20where%20autonomous%20AI%20agents%20can%20run%20hundreds%20of%20experiments%20per%20night%3F%20What%20new%20roles%20do%20we%20need%3F%20What%20infrastructure%3F%20How%20do%20we%20redefine%20authorship%20and%20intellectual%20property%3F" target="_blank" rel="noreferrer"&gt;ChatGPT&lt;/a&gt; · &lt;a href="https://www.perplexity.ai/search?q=I%27m%20a%20university%20executive.%20How%20should%20I%20prepare%20my%20institution%20for%20a%20world%20where%20autonomous%20AI%20agents%20can%20run%20hundreds%20of%20experiments%20per%20night%3F%20What%20new%20roles%20do%20we%20need%3F%20What%20infrastructure%3F%20How%20do%20we%20redefine%20authorship%20and%20intellectual%20property%3F" target="_blank" rel="noreferrer"&gt;Perplexity&lt;/a&gt; · &lt;a href="https://claude.ai/new?q=I%27m%20a%20university%20executive.%20How%20should%20I%20prepare%20my%20institution%20for%20a%20world%20where%20autonomous%20AI%20agents%20can%20run%20hundreds%20of%20experiments%20per%20night%3F%20What%20new%20roles%20do%20we%20need%3F%20What%20infrastructure%3F%20How%20do%20we%20redefine%20authorship%20and%20intellectual%20property%3F" target="_blank" rel="noreferrer"&gt;Claude&lt;/a&gt;&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/ai-agents-doing-science/featured.png"/></item><item><title>The Chart That Predicts Which Jobs AI Will Kill (And They're Not the Ones You Think)</title><link>https://carlesabarca.com/posts/ai-jobs-displacement-anthropic/</link><pubDate>Mon, 09 Mar 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/ai-jobs-displacement-anthropic/</guid><description>An Anthropic study analyzing 2 million conversations reveals the gap between what AI CAN do and what it IS doing. That gap is the coming tsunami.</description><content:encoded>&lt;p&gt;Look at this chart carefully. It&amp;rsquo;s not an analysis of what AI has destroyed. It&amp;rsquo;s an &lt;strong&gt;X-ray of what it&amp;rsquo;s about to destroy&lt;/strong&gt;.&lt;/p&gt;

&lt;figure&gt;
 &lt;img
 class="my-0 rounded-md"
 loading="lazy"
 decoding="async"
 fetchpriority="auto"
 alt=""
 width="1280"
 height="853"
 src="../../posts/ai-jobs-displacement-anthropic/featured_hu_7a629d8976dfc179.png"
 srcset="../../posts/ai-jobs-displacement-anthropic/featured_hu_7a629d8976dfc179.png 800w,/posts/ai-jobs-displacement-anthropic/featured_hu_d58629eebe18c99c.png 1280w"
 sizes="(min-width: 768px) 50vw, 65vw"
 data-zoom-src="../../posts/ai-jobs-displacement-anthropic/featured.png"
 /&gt;
 
 &lt;figcaption&gt;Source: Anthropic — Labor market impacts of AI (March 2026)&lt;/figcaption&gt;
 &lt;/figure&gt;
&lt;p&gt;The blue area is what AI &lt;strong&gt;can&lt;/strong&gt; do today. The red area is what AI &lt;strong&gt;is&lt;/strong&gt; doing today. The difference between them isn&amp;rsquo;t a safety margin. It&amp;rsquo;s a tsunami that hasn&amp;rsquo;t hit shore yet.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Study: 2 Million Conversations with Claude
 &lt;div id="the-study-2-million-conversations-with-claude" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-study-2-million-conversations-with-claude" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Anthropic just published &lt;a href="https://www.anthropic.com/research/labor-market-impacts" target="_blank" rel="noreferrer"&gt;Labor market impacts of AI: A new measure and early evidence&lt;/a&gt;, and it&amp;rsquo;s the most rigorous analysis I&amp;rsquo;ve seen on AI&amp;rsquo;s real employment impact.&lt;/p&gt;
&lt;p&gt;What did they do? They crossed three data sources:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;The &lt;strong&gt;O*NET database&lt;/strong&gt;, cataloging tasks across ~800 US occupations.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Real Claude usage data&lt;/strong&gt; — 2 million conversations analyzed via the Anthropic Economic Index.&lt;/li&gt;
&lt;li&gt;Theoretical estimates from &lt;strong&gt;Eloundou et al. (2023)&lt;/strong&gt; on which tasks an LLM can make at least twice as fast.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The result is a new metric: &lt;strong&gt;observed exposure&lt;/strong&gt; — not what AI could theoretically do, but what it&amp;rsquo;s actually doing in professional settings. And the most revealing finding isn&amp;rsquo;t the absolute numbers — it&amp;rsquo;s the &lt;strong&gt;gap&lt;/strong&gt; between the two.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The 10 Most Exposed Jobs
 &lt;div id="the-10-most-exposed-jobs" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-10-most-exposed-jobs" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;The ranking won&amp;rsquo;t surprise anyone who&amp;rsquo;s been paying attention, but the numbers are brutal:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Computer Programmers — 75% coverage&lt;/strong&gt;. Three out of four tasks a programmer does, Claude already handles.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Customer Service Representatives&lt;/strong&gt;. First-party API traffic shows massive automation.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Data Entry Keyers — 67%&lt;/strong&gt;. Reading documents and entering data. The perfect automation use case.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The list continues: actuaries, financial analysts, technical writers. &lt;strong&gt;Office jobs. White-collar work. People with college degrees.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;On the other end, 30% of workers have &lt;strong&gt;zero exposure&lt;/strong&gt;. Cooks, motorcycle mechanics, lifeguards, bartenders. Jobs where hands, bodies, and physical context are irreplaceable.&lt;/p&gt;
&lt;p&gt;Ironic, isn&amp;rsquo;t it? Decades telling us automation was coming for manual labor. &lt;strong&gt;It&amp;rsquo;s coming for the desks.&lt;/strong&gt;&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Demographic Surprise
 &lt;div id="the-demographic-surprise" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-demographic-surprise" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;This is where the study shatters the dominant narrative.&lt;/p&gt;
&lt;p&gt;The workers most exposed to AI are:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;16 percentage points more likely to be female&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;11 points more likely to be white&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Nearly twice as likely to be Asian&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Earn 47% more&lt;/strong&gt; than unexposed workers&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;17.4% hold graduate degrees&lt;/strong&gt; (vs. 4.5% in the unexposed group)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This &lt;strong&gt;isn&amp;rsquo;t&lt;/strong&gt; the displaced factory worker narrative. These are lawyers, analysts, programmers, university professors. The professional class that thought it was untouchable.&lt;/p&gt;
&lt;p&gt;When I say this will reshape social structure, I&amp;rsquo;m not exaggerating.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Gap IS the Prediction
 &lt;div id="the-gap-is-the-prediction" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-gap-is-the-prediction" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Go back to the chart. Look at the categories:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Computer &amp;amp; Math&lt;/strong&gt;: 94% theoretical capability, 33% actual use&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Legal&lt;/strong&gt;: ~85% theoretical, less than 15% observed&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Education&lt;/strong&gt;: ~70% theoretical, less than 15% observed&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Office &amp;amp; Admin&lt;/strong&gt;: 90% theoretical, a fraction of actual use&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;That distance between blue and red isn&amp;rsquo;t comfort. &lt;strong&gt;It&amp;rsquo;s latency.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It&amp;rsquo;s the time companies take to adopt, regulators to adapt, workflows to reconfigure. But the technology is already there. The model already knows how. The ecosystem just needs to catch up.&lt;/p&gt;
&lt;p&gt;And every month, the red area grows. Anthropic says it explicitly: &lt;em&gt;&amp;ldquo;As capabilities advance, adoption spreads, and deployment deepens, the red area will grow to cover the blue.&amp;rdquo;&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;This isn&amp;rsquo;t speculative prediction. It&amp;rsquo;s an &lt;strong&gt;empirical observation with trajectory&lt;/strong&gt;.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What Changes with AI Agents
 &lt;div id="what-changes-with-ai-agents" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-changes-with-ai-agents" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Here&amp;rsquo;s the factor the study &lt;strong&gt;doesn&amp;rsquo;t&lt;/strong&gt; fully measure — because it didn&amp;rsquo;t exist at this scale when they collected the data.&lt;/p&gt;
&lt;p&gt;The study analyzes LLM usage — conversations with Claude. Chat interactions. A human asks, the AI answers. It&amp;rsquo;s the &lt;strong&gt;augmentation&lt;/strong&gt; model: AI helps you, you execute.&lt;/p&gt;
&lt;p&gt;But &lt;strong&gt;AI agents&lt;/strong&gt; are something else entirely. They don&amp;rsquo;t answer — they &lt;strong&gt;act&lt;/strong&gt;. They execute task chains autonomously. They navigate systems, make intermediate decisions, complete entire workflows without human intervention.&lt;/p&gt;
&lt;p&gt;What we&amp;rsquo;re building at Tecnológico de Monterrey with &lt;strong&gt;AgenTECs&lt;/strong&gt; is exactly this. Not a chatbot that helps you draft an email. An agent that manages the entire process: reads context, drafts, sends, follows up, escalates if there&amp;rsquo;s no response.&lt;/p&gt;
&lt;p&gt;When agents arrive at enterprise scale — and they&amp;rsquo;re already arriving — &lt;strong&gt;the red area in the chart will expand explosively&lt;/strong&gt;. Because you no longer need a human interacting with AI task by task. The agent covers the entire role.&lt;/p&gt;
&lt;p&gt;Think about the Legal category: 85% theoretical capability, &amp;lt;15% current use. What happens when an agent can review contracts, identify risk clauses, generate executive summaries, and prepare response drafts — all without a lawyer touching the keyboard? The 85% becomes the new floor, not the ceiling.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What to Do (Which Is Not Panic)
 &lt;div id="what-to-do-which-is-not-panic" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-to-do-which-is-not-panic" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;I&amp;rsquo;ve been saying the same thing for years: this isn&amp;rsquo;t about fear. It&amp;rsquo;s about &lt;strong&gt;preparation&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;When I wrote &lt;a href="https://www.linkedin.com/pulse/el-fin-del-desarrollador-carles-abarca-kxg2c/" target="_blank" rel="noreferrer"&gt;&amp;ldquo;El fin del desarrollador&amp;rdquo;&lt;/a&gt; on LinkedIn, the reaction was predictable: &amp;ldquo;exaggerated,&amp;rdquo; &amp;ldquo;developers will always be needed,&amp;rdquo; &amp;ldquo;AI can&amp;rsquo;t do X.&amp;rdquo; The same arguments I heard about TECgpt when we launched it and people said professors would never use it. Today we have &lt;strong&gt;over 60,000 active users&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;The metaphor I use is the &lt;strong&gt;orchestra conductor&lt;/strong&gt;. The value is no longer in playing the violin — it&amp;rsquo;s in knowing what music to perform, who plays what, and when to change the score. Future professionals don&amp;rsquo;t execute tasks — they &lt;strong&gt;orchestrate systems&lt;/strong&gt; that execute them.&lt;/p&gt;
&lt;p&gt;Specifically:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Massive upskilling, now&lt;/strong&gt;. Not &amp;ldquo;intro to AI&amp;rdquo; courses — real training on production tools.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Redefine roles&lt;/strong&gt;, don&amp;rsquo;t eliminate them. A lawyer who masters AI agents is worth more, not less. But a lawyer who only knows manual contract review has an expiration date.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Measure exposure&lt;/strong&gt; in your organization. Use Anthropic&amp;rsquo;s framework. Identify which tasks in each role an LLM can already perform. Design the transition before it&amp;rsquo;s imposed on you.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Create new roles&lt;/strong&gt; that don&amp;rsquo;t exist yet: AI orchestrators, agent prompt engineers, autonomous systems supervisors.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2 class="relative group"&gt;The Bottom Line
 &lt;div id="the-bottom-line" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-bottom-line" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;The BLS projects that the most exposed occupations under this metric will grow &lt;strong&gt;less&lt;/strong&gt; through 2034. For every 10 points of observed coverage, the growth projection drops 0.6 percentage points. This isn&amp;rsquo;t casual correlation — labor market analysts are seeing the same thing.&lt;/p&gt;
&lt;p&gt;And yet, Anthropic also finds that &lt;strong&gt;there&amp;rsquo;s no systematic increase in unemployment&lt;/strong&gt; in the most exposed professions. Yet.&lt;/p&gt;
&lt;p&gt;That&amp;rsquo;s the window. We&amp;rsquo;re in the moment between seeing the lightning and hearing the thunder. &lt;strong&gt;The bolt already struck.&lt;/strong&gt; The question isn&amp;rsquo;t whether the sound will arrive, but whether you&amp;rsquo;ll be ready when it does.&lt;/p&gt;
&lt;p&gt;Those who read this chart as &amp;ldquo;AI hasn&amp;rsquo;t affected employment much yet&amp;rdquo; are confusing latency with safety. Those who read it as &amp;ldquo;a structural labor market shift is coming and we need to act now&amp;rdquo;&amp;hellip; they&amp;rsquo;re the ones who&amp;rsquo;ll still be conducting the orchestra.&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/ai-jobs-displacement-anthropic/featured.png"/></item><item><title>China's AI Pincer Move: Qwen 3.5 and CoPaw Are Not a Warning Shot — They're the Main Event</title><link>https://carlesabarca.com/posts/china-ai-qwen-copaw/</link><pubDate>Thu, 05 Mar 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/china-ai-qwen-copaw/</guid><description>Qwen 3.5 beats GPT-5.2 on key benchmarks. CoPaw launches as a full open-source agent workstation. China is no longer catching up — they&amp;rsquo;re building a parallel AI ecosystem. And the West should be paying attention.</description><content:encoded>&lt;p&gt;There is a moment in every technology race when &amp;ldquo;catching up&amp;rdquo; becomes &amp;ldquo;setting the pace.&amp;rdquo; For China&amp;rsquo;s AI ecosystem, that moment is now.&lt;/p&gt;
&lt;p&gt;In the span of a few weeks, Alibaba has released two things that, taken separately, would each be significant. Taken together, they represent a strategic vision that should make every Western AI executive lose sleep.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Qwen 3.5&lt;/strong&gt;: a family of open-source models that beats GPT-5.2 on instruction following and leads the field on vision benchmarks. Apache 2.0 licensed. Free. Commercial use allowed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;CoPaw&lt;/strong&gt;: an open-source personal AI agent workstation — think OpenClaw, but from Alibaba&amp;rsquo;s AgentScope team — with persistent memory, custom skills, multi-channel support, and browser automation.&lt;/p&gt;
&lt;p&gt;Models &lt;em&gt;and&lt;/em&gt; infrastructure. The brain &lt;em&gt;and&lt;/em&gt; the body.&lt;/p&gt;
&lt;p&gt;This is not a warning shot. This is a strategy.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Qwen 3.5 Story: Frontier AI Goes Free
 &lt;div id="the-qwen-35-story-frontier-ai-goes-free" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-qwen-35-story-frontier-ai-goes-free" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Let me give you the numbers first, because they tell a story.&lt;/p&gt;
&lt;p&gt;Qwen 3.5&amp;rsquo;s flagship model uses a Mixture of Experts (MoE) architecture with 397 billion total parameters but only 17 billion active at any given time. Read that again. You get frontier-class performance while only running the compute cost of a 17B model.&lt;/p&gt;
&lt;p&gt;The benchmarks are not &amp;ldquo;competitive.&amp;rdquo; They are leading:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;IFBench (instruction following): 76.5&lt;/strong&gt; — beating GPT-5.2&amp;rsquo;s 75.4&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;SWE-bench (coding): 76.4&lt;/strong&gt; — neck and neck with the best&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;MMMU (vision): 85.0&lt;/strong&gt; — outright leader&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;256K token context window&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;201 languages supported&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Thinking and non-thinking modes&lt;/strong&gt; — you choose the tradeoff between depth and speed&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The model family was released in three waves between February and March 2026: flagship, medium, and small. The small models — 0.8B to 9B parameters — are explicitly designed for on-device deployment. Your phone. Your laptop. Your edge server. No API call required.&lt;/p&gt;
&lt;p&gt;Let that sink in for a moment.&lt;/p&gt;
&lt;p&gt;A year ago, running anything close to frontier AI locally was a fantasy. Today, Alibaba is handing you models that compete with the best in the world, under the most permissive open-source license available, optimized to run on your hardware.&lt;/p&gt;
&lt;p&gt;The MoE architecture is the key unlock here. Traditional dense models force you to choose: either you run a massive model with massive compute, or you run a small model with limited capability. MoE breaks that tradeoff. Qwen 3.5 has the knowledge of a 397B model but the inference cost of a 17B one. It is, in practical terms, the democratization of frontier AI.&lt;/p&gt;
&lt;p&gt;And it is Apache 2.0. Not &amp;ldquo;open-ish.&amp;rdquo; Not &amp;ldquo;you can look but not touch.&amp;rdquo; Fully open. Fork it. Fine-tune it. Ship it in your product. Alibaba does not care. Or rather — they care very much, but their game is not licensing revenue.&lt;/p&gt;

&lt;h2 class="relative group"&gt;CoPaw: The Agent Layer China Was Missing
 &lt;div id="copaw-the-agent-layer-china-was-missing" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#copaw-the-agent-layer-china-was-missing" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Models without infrastructure are academic papers. Infrastructure without models is empty plumbing. The interesting move is doing both.&lt;/p&gt;
&lt;p&gt;CoPaw (copaw.bot) launched in March 2026 from Alibaba&amp;rsquo;s AgentScope team. If you are familiar with OpenClaw — and if you read my blog, you probably are — CoPaw is China&amp;rsquo;s answer to it. An open-source personal AI agent workstation that turns language models into persistent, capable digital workers.&lt;/p&gt;
&lt;p&gt;The feature list reads like someone studied every agent platform on the market and built a synthesis:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;ReMe&lt;/strong&gt;: persistent memory across sessions. Your agent remembers context, preferences, past interactions. Not a gimmick — this is what separates a chatbot from an actual assistant.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Custom skills&lt;/strong&gt;: build and import capabilities. Pull from clawhub.ai, skills.sh, or GitHub.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Multi-channel&lt;/strong&gt;: DingTalk, Feishu, iMessage, Discord, QQ. Your agent lives where you work.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Cron scheduling&lt;/strong&gt;: automated tasks on a schedule. Check my email every morning. Summarize my feeds at 6 PM. The basics, done right.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Browser automation&lt;/strong&gt;: your agent can navigate the web, fill forms, extract data. The hands to go with the brain.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;MCP Server integration&lt;/strong&gt;: the emerging standard for tool use. CoPaw speaks it natively.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Is it perfect? No. WhatsApp and Telegram support are missing — a significant gap for Western and Latin American users. Multi-agent orchestration is not there yet. OpenRouter integration is absent. These are real limitations.&lt;/p&gt;
&lt;p&gt;But here is what matters: CoPaw is not a prototype. It is a platform. And it is open-source, which means the community can fill those gaps faster than any corporate roadmap.&lt;/p&gt;
&lt;p&gt;I have been running OpenClaw as my personal agent infrastructure for months — it is literally what powers JarvisX, my AI assistant. So I understand this space intimately. CoPaw is not a clone. It is a parallel evolution, built from a different set of assumptions (Chinese messaging ecosystem, AgentScope framework, different privacy model) that arrives at remarkably similar conclusions about what an AI agent workstation needs to be.&lt;/p&gt;
&lt;p&gt;That convergence is the signal. When two teams on opposite sides of the world, working independently, build essentially the same thing — that is not coincidence. That is the shape of the future becoming obvious.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Earthquake Started in January 2025
 &lt;div id="the-earthquake-started-in-january-2025" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-earthquake-started-in-january-2025" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;None of this is happening in a vacuum. Let me connect the dots.&lt;/p&gt;
&lt;p&gt;January 2025: DeepSeek releases R1, an open-source reasoning model that shocks the industry. Silicon Valley&amp;rsquo;s reaction ranges from dismissal to panic, settling on grudging respect. The &amp;ldquo;China can&amp;rsquo;t do AI&amp;rdquo; narrative dies overnight.&lt;/p&gt;
&lt;p&gt;Throughout 2025: Chinese labs iterate at a pace that makes Western release cycles look glacial. Qwen, DeepSeek, Yi, GLM — each generation closing the gap further. The MoE architecture becomes the standard approach, driven by the practical reality that Chinese labs face compute constraints from US export controls and have to be &lt;em&gt;more efficient&lt;/em&gt;, not less.&lt;/p&gt;
&lt;p&gt;Here is the irony that should keep policymakers awake: export controls designed to slow China&amp;rsquo;s AI development may have accelerated their innovation in efficiency. When you cannot buy the biggest GPUs, you learn to do more with less. And &amp;ldquo;more with less&amp;rdquo; turns out to be exactly what the market wants.&lt;/p&gt;
&lt;p&gt;February-March 2026: Qwen 3.5 arrives, not as a single model but as an ecosystem play. Flagship for the cloud, medium for the server room, small for the device. And simultaneously, CoPaw launches to provide the agent layer. Models plus infrastructure plus ecosystem.&lt;/p&gt;
&lt;p&gt;This is not &amp;ldquo;China catching up.&amp;rdquo; This is China executing a full-stack AI strategy while much of the West is still arguing about whether to charge $200/month or $2,000/month for API access.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Alibaba Strategy: OpenAI&amp;rsquo;s Vision, Open-Source&amp;rsquo;s Price
 &lt;div id="the-alibaba-strategy-openais-vision-open-sources-price" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-alibaba-strategy-openais-vision-open-sources-price" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Let me be explicit about what Alibaba is doing, because I think most Western observers are misreading it.&lt;/p&gt;
&lt;p&gt;OpenAI&amp;rsquo;s vision has always been: build the best models, then build the infrastructure to deploy them, then build the ecosystem of applications on top. Vertical integration. The &amp;ldquo;Apple of AI.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;Alibaba&amp;rsquo;s vision is the same — except open-source.&lt;/p&gt;
&lt;p&gt;Best models? Qwen 3.5 is demonstrably frontier-competitive. Infrastructure? CoPaw provides the agent layer. AgentScope provides the framework. Ecosystem? Apache 2.0 means anyone can build on it.&lt;/p&gt;
&lt;p&gt;The difference is the business model. OpenAI charges for access. Alibaba gives away the technology and monetizes the cloud (Alibaba Cloud), the commerce (Alibaba platforms), and the enterprise services built on top. The AI itself is the loss leader. Or rather, it is the moat around everything else.&lt;/p&gt;
&lt;p&gt;This is not charity. It is strategy. And it is devastatingly effective.&lt;/p&gt;
&lt;p&gt;If you are an enterprise CTO today — and I have been one, at Banco Sabadell, so I know the calculus — the question on your desk is uncomfortable: Why am I paying for proprietary AI models when open-source alternatives match or beat them on benchmarks?&lt;/p&gt;
&lt;p&gt;The answers used to be: reliability, support, safety, compliance. Those are real. But they are eroding fast. Qwen 3.5 is not some garage project. It is backed by one of the largest technology companies on Earth. It has enterprise-grade documentation. It runs in production at Alibaba scale.&lt;/p&gt;
&lt;p&gt;The moat is getting shallow.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What This Means for the West
 &lt;div id="what-this-means-for-the-west" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-this-means-for-the-west" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;I am not writing this as a China cheerleader or a Western doomer. I am writing it as someone who has spent 20+ years in enterprise technology and is currently leading digital transformation at one of Latin America&amp;rsquo;s largest universities.&lt;/p&gt;
&lt;p&gt;Here is what I think this means:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;For AI companies (OpenAI, Anthropic, Google):&lt;/strong&gt; The &amp;ldquo;best model&amp;rdquo; advantage is now measured in months, not years. If Qwen 3.5 can match GPT-5.2 today, Qwen 4 will likely match whatever comes next. The sustainable advantage must come from ecosystem, trust, and integration — not model quality alone. The race to the bottom on model pricing accelerates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;For enterprises:&lt;/strong&gt; Your AI strategy cannot depend on a single provider. The multi-model, multi-provider approach is no longer a nice-to-have — it is risk management. And if you are not evaluating open-source models for your use cases, you are leaving money and optionality on the table.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;For the open-source movement:&lt;/strong&gt; This is your moment. China&amp;rsquo;s largest tech companies are pouring billions into open-source AI, not because they are altruistic, but because it serves their strategic interests. The result is the same: the commons gets richer. Western open-source advocates should take notes on how to align corporate strategy with community benefit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;For developers:&lt;/strong&gt; Learn to run local models. Understand MoE architectures. Get comfortable with agent frameworks — both OpenClaw and CoPaw. The developers who thrive in 2027 will be the ones who can deploy and orchestrate AI agents across multiple models and platforms, not the ones locked into a single API.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;For policymakers:&lt;/strong&gt; The export control strategy needs a fundamental rethink. Restricting compute has not prevented frontier AI development in China — it has redirected it toward efficiency innovations that may ultimately be more valuable than brute-force scaling. The horse has left the barn, and the barn is on fire.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Democratization Paradox
 &lt;div id="the-democratization-paradox" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-democratization-paradox" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Here is the question that keeps me up at night: if frontier AI is free and open, what is the moat?&lt;/p&gt;
&lt;p&gt;Not for Alibaba — their moat is their ecosystem. Not for OpenAI — their moat is their brand and enterprise relationships. I mean for &lt;em&gt;everyone else&lt;/em&gt;. For the thousands of SaaS companies, AI startups, and technology consultancies that have built their value proposition around access to AI capabilities.&lt;/p&gt;
&lt;p&gt;When Qwen 3.5 is free, when CoPaw is free, when the entire stack from model to agent to deployment is open-source and commercially licensable — what exactly are you selling?&lt;/p&gt;
&lt;p&gt;The answer, I think, is the same answer it has always been in technology: domain expertise, integration quality, trust, and speed of execution. The tools become commoditized. The craft does not.&lt;/p&gt;
&lt;p&gt;But that is a much harder business than &amp;ldquo;we have access to AI and you don&amp;rsquo;t.&amp;rdquo; And it will cause a shakeout that makes the SaaSpocalypse look like a rehearsal.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What I Am Doing About It
 &lt;div id="what-i-am-doing-about-it" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-i-am-doing-about-it" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;I never write about things I am not willing to act on. So here is what this means for my work:&lt;/p&gt;
&lt;p&gt;At Tec de Monterrey, we are actively evaluating open-source models for educational applications where data sovereignty matters — and with a Latin American university serving students across multiple countries, it matters a lot. Qwen 3.5&amp;rsquo;s multilingual support (201 languages, with strong Spanish coverage) makes it a serious candidate.&lt;/p&gt;
&lt;p&gt;Personally, I run my AI agent infrastructure on OpenClaw. CoPaw&amp;rsquo;s release is not a threat to that — it is validation. The agent workstation pattern is the right abstraction. And competition drives improvement. I fully expect OpenClaw and CoPaw to cross-pollinate features, especially given that CoPaw can already import skills from clawhub.ai.&lt;/p&gt;
&lt;p&gt;The future I see is heterogeneous. Not &amp;ldquo;Western AI vs. Chinese AI&amp;rdquo; but a global ecosystem where the best models and tools win regardless of origin. Where an enterprise in Mexico City runs Qwen for some tasks, Claude for others, and Gemini for a third — all orchestrated by agent infrastructure that does not care about the nationality of the model.&lt;/p&gt;
&lt;p&gt;That is not a geopolitical statement. It is an engineering reality.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Bottom Line
 &lt;div id="the-bottom-line" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-bottom-line" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Alibaba has executed a textbook pincer move: world-class models on one side, agent infrastructure on the other. Qwen 3.5 gives you the brain. CoPaw gives you the body. Both are free. Both are open. Both are production-ready.&lt;/p&gt;
&lt;p&gt;The West still leads in many dimensions — safety research, alignment, enterprise trust, regulatory frameworks. Those matter. But the raw capability gap? It is closing so fast that by the time you finish reading this article, it may have closed a little more.&lt;/p&gt;
&lt;p&gt;If you are a technology leader and you are not paying attention to what is coming out of China, you are not paying attention.&lt;/p&gt;
&lt;p&gt;And in this industry, not paying attention is how you become the next $300 billion cautionary tale.&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/china-ai-qwen-copaw/featured.png"/></item><item><title>My AI Agent Is 2 Weeks Old. 72,000 Lines of Code. 5 Projects Shipped.</title><link>https://carlesabarca.com/posts/ai-agent-two-weeks/</link><pubDate>Wed, 25 Feb 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/ai-agent-two-weeks/</guid><description>I haven&amp;rsquo;t written a line of code in weeks. My AI agent has added 72,563 lines, made 43 commits, and shipped 5 projects to production — all in 14 days.</description><content:encoded>&lt;p&gt;I haven&amp;rsquo;t written a line of code in weeks. And yet, my repositories keep growing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;72,563 lines added. 43 commits. 5 projects deployed to production.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All in 14 days. I didn&amp;rsquo;t write any of it. My AI agent did.&lt;/p&gt;

&lt;h2 class="relative group"&gt;A Clarification First
 &lt;div id="a-clarification-first" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#a-clarification-first" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Let me be clear about something: my job as VP of Digital Transformation at Tecnológico de Monterrey is not to write code. I lead teams that do that at scale, for one of Latin America&amp;rsquo;s largest universities.&lt;/p&gt;
&lt;p&gt;But I&amp;rsquo;ve been writing code for &lt;strong&gt;40 years&lt;/strong&gt;. From assembler to C, Fortran, Visual Basic, Pascal, COBOL, C++, Java, JavaScript, TypeScript, Rust, Python — I&amp;rsquo;ve touched them all. First because it was my job. Then because it became a habit I never wanted to break.&lt;/p&gt;
&lt;p&gt;Why? Because staying close to the code keeps me close to reality. It helps me make better decisions. It lets me have real conversations with my technical teams — not as a manager who reads reports, but as someone who understands what they&amp;rsquo;re building.&lt;/p&gt;
&lt;p&gt;The projects I&amp;rsquo;m about to describe are &lt;strong&gt;personal&lt;/strong&gt;. Some are proofs of concept that later evolve into institutional initiatives — like TECgpt Desktop, which started as an experiment on my laptop before becoming an official tool at the university. Others are simply things I want to build.&lt;/p&gt;
&lt;p&gt;This matters for what comes next.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Meet My Digital Apprentice
 &lt;div id="meet-my-digital-apprentice" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#meet-my-digital-apprentice" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;His name is JarvisX. He runs on a Mac Mini in my home office. He&amp;rsquo;s been &amp;ldquo;alive&amp;rdquo; for exactly two weeks.&lt;/p&gt;
&lt;p&gt;But here&amp;rsquo;s the key: &lt;strong&gt;I decide the architecture.&lt;/strong&gt; I choose the tech stack. I define the structure, the patterns, the conventions. JarvisX executes — brilliantly, tirelessly, and autonomously — but the technical vision is mine.&lt;/p&gt;
&lt;p&gt;He&amp;rsquo;s not replacing my judgment. He&amp;rsquo;s amplifying it.&lt;/p&gt;
&lt;p&gt;When I say &amp;ldquo;build this feature,&amp;rdquo; I&amp;rsquo;ve already decided &lt;em&gt;how&lt;/em&gt; it should be built. JarvisX handles the implementation: writing the code, deploying to production, opening a browser to verify it looks right, and committing the changes. All without asking me to approve each step.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Numbers
 &lt;div id="the-numbers" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-numbers" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;In two weeks, JarvisX has:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Added &lt;strong&gt;72,563 lines of code&lt;/strong&gt; across multiple repositories&lt;/li&gt;
&lt;li&gt;Made &lt;strong&gt;43 commits&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;Shipped &lt;strong&gt;5 complete projects&lt;/strong&gt; to production&lt;/li&gt;
&lt;li&gt;Configured deployments, authentication systems, and cloud infrastructure&lt;/li&gt;
&lt;li&gt;Validated his own work visually — taking screenshots, iterating until it&amp;rsquo;s right&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Let me be honest: not all 72,000 lines are hand-crafted code. Some are frameworks, themes, and data files. But &lt;strong&gt;every commit, every deploy, every design decision was autonomous.&lt;/strong&gt;&lt;/p&gt;

&lt;h2 class="relative group"&gt;How This Article Got Here
 &lt;div id="how-this-article-got-here" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#how-this-article-got-here" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Want to see what working with an AI agent actually looks like? Here&amp;rsquo;s the conversation that published this very article:&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;&lt;strong&gt;Me:&lt;/strong&gt; JarvisX, I left a new article for my blog in our shared Dropbox folder.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;JarvisX:&lt;/strong&gt; Read it! I see it&amp;rsquo;s in English. Do you want me to generate the Spanish version?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Me:&lt;/strong&gt; Yes. I trust your translations by now, so no need for me to review. Upload both versions directly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;JarvisX:&lt;/strong&gt; Thanks for the trust! Uploading now. Should I add a featured image?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Me:&lt;/strong&gt; Yes. Something that evokes an orchestra conductor surrounded by AI agents.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;JarvisX:&lt;/strong&gt; Got it. Do you want to review it first?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Me:&lt;/strong&gt; No. Just upload it.&lt;/p&gt;
&lt;/blockquote&gt;&lt;p&gt;That&amp;rsquo;s it. No back-and-forth. No micromanagement. Just trust built over two weeks of working together.&lt;/p&gt;
&lt;p&gt;By the time you&amp;rsquo;re reading this, JarvisX has already translated it, generated the image, and deployed both versions to production. I didn&amp;rsquo;t check his work. I didn&amp;rsquo;t need to.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Real Insight
 &lt;div id="the-real-insight" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-real-insight" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;A year ago, I wrote an article called &lt;a href="https://carlesabarca.com/posts/fin-del-desarrollador/" target="_blank" rel="noreferrer"&gt;The End of the Developer&lt;/a&gt;. My thesis: the junior developer role would disappear. The future would belong to &amp;ldquo;directors of orchestra&amp;rdquo; who conduct AI agents rather than write code themselves.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;m now living that prediction.&lt;/p&gt;
&lt;p&gt;But here&amp;rsquo;s what I&amp;rsquo;ve learned: &lt;strong&gt;You can&amp;rsquo;t direct what you don&amp;rsquo;t understand.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The &amp;ldquo;new developer&amp;rdquo; — the one who orchestrates AI agents — still needs deep technical knowledge. Not to write every line, but to:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Define architecture&lt;/strong&gt; that makes sense&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Recognize good code&lt;/strong&gt; when the agent produces it&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Catch mistakes&lt;/strong&gt; before they reach production&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Make tradeoffs&lt;/strong&gt; that require experience to evaluate&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;My 40 years of coding didn&amp;rsquo;t become obsolete when I started working with JarvisX. They became &lt;strong&gt;essential&lt;/strong&gt;. I can direct him effectively precisely &lt;em&gt;because&lt;/em&gt; I know what good looks like.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What This Means
 &lt;div id="what-this-means" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-this-means" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;If you&amp;rsquo;re a senior developer or architect, this is your moment. Your experience is more valuable than ever — not for typing code, but for &lt;strong&gt;directing those who do&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;If you&amp;rsquo;re a junior developer, the path forward isn&amp;rsquo;t to compete with AI at writing code. It&amp;rsquo;s to &lt;strong&gt;accelerate your learning&lt;/strong&gt; so you can direct AI sooner. Use these tools to learn faster, not to avoid learning.&lt;/p&gt;
&lt;p&gt;If you&amp;rsquo;re a CIO or CTO, stop asking &amp;ldquo;how do we adopt AI agents?&amp;rdquo; Start asking &amp;ldquo;do our people have the technical depth to direct them well?&amp;rdquo;&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Bottom Line
 &lt;div id="the-bottom-line" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-bottom-line" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;72,000 lines. 5 projects. 2 weeks. Zero lines written by me.&lt;/p&gt;
&lt;p&gt;But every architectural decision? Mine. Every technology choice? Mine. Every quality standard? Mine.&lt;/p&gt;
&lt;p&gt;The future of software development isn&amp;rsquo;t about writing less code. It&amp;rsquo;s about &lt;strong&gt;directing more of it&lt;/strong&gt; — with the wisdom that only experience can provide.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;Carles Abarca is VP of Digital Transformation at Tecnológico de Monterrey and former CTO of Banco Sabadell. He has been writing code for 40 years and plans to never stop — even if he&amp;rsquo;s no longer the one typing it.&lt;/em&gt;&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/ai-agent-two-weeks/featured.png"/></item><item><title>The CIO's Guide to AI Agents: Before You Buy, Ask These 5 Questions</title><link>https://carlesabarca.com/posts/cio-guide-ai-agents/</link><pubDate>Mon, 23 Feb 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/cio-guide-ai-agents/</guid><description>The AI agent market will reach $236 billion by 2034. Every vendor wants your budget. Here&amp;rsquo;s how to separate hype from value.</description><content:encoded>&lt;p&gt;&lt;em&gt;The AI agent market will reach $236 billion by 2034. Every vendor wants your budget. Here&amp;rsquo;s how to separate hype from value.&lt;/em&gt;&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;Every enterprise software vendor now sells &amp;ldquo;AI agents.&amp;rdquo; Salesforce has Agentforce. ServiceNow acquired Moveworks for $2.85 billion. Microsoft promises an &amp;ldquo;agentic enterprise&amp;rdquo; through Copilot. Startups like Sierra have hit $100 million ARR in under two years.&lt;/p&gt;
&lt;p&gt;The pressure to act is real. 89% of CIOs consider agent-based AI a strategic priority. 51% of large enterprises have already deployed agentic AI. Your board is asking questions. Your competitors are moving.&lt;/p&gt;
&lt;p&gt;But here&amp;rsquo;s what nobody tells you: most AI agent purchases fail to deliver expected value. Not because the technology doesn&amp;rsquo;t work — but because organizations buy the wrong solution for their specific situation.&lt;/p&gt;
&lt;p&gt;After two decades leading technology transformation at a major European bank and now driving digital innovation at one of Latin America&amp;rsquo;s largest universities, I&amp;rsquo;ve developed a framework for evaluating AI agents that cuts through vendor hype. Before you sign that contract, ask these five questions.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The AI Agent Landscape in 2026
 &lt;div id="the-ai-agent-landscape-in-2026" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-ai-agent-landscape-in-2026" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;First, let&amp;rsquo;s understand what you&amp;rsquo;re buying. The market has three distinct categories:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;1. Embedded Agents from SaaS Vendors&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Your existing vendors are adding agents to their platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Salesforce Agentforce&lt;/strong&gt;: $0.10 per action or $125-550/user/month&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;ServiceNow AI Agents&lt;/strong&gt;: Full orchestration with AI Control Tower&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Microsoft Copilot Studio&lt;/strong&gt;: Included with M365, plus add-ons&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Zendesk AI Agents&lt;/strong&gt;: $1.50 per autonomous resolution&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The pitch: &amp;ldquo;You already use our platform. Now it&amp;rsquo;s smarter.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;2. Pure-Play Agent Startups&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Companies building agents from the ground up:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Sierra&lt;/strong&gt; (Bret Taylor, ex-Salesforce): $10B valuation, focused on customer service&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Adept&lt;/strong&gt;: Targeting workflow automation&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Imbue, Reflection AI&lt;/strong&gt;: Research-driven approaches&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The pitch: &amp;ldquo;We&amp;rsquo;re not constrained by legacy architecture.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;3. Foundation Model Providers&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The companies building the AI itself:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Anthropic&lt;/strong&gt;: Claude with Computer Use and MCP (Model Context Protocol)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;OpenAI&lt;/strong&gt;: GPT-4 with Operator&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Google&lt;/strong&gt;: Gemini with Agentspace&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The pitch: &amp;ldquo;Build custom agents on our infrastructure.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;Each category has trade-offs. Your job is to understand which trade-offs matter for your organization.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Question 1: What Problem Are You Actually Solving?
 &lt;div id="question-1-what-problem-are-you-actually-solving" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#question-1-what-problem-are-you-actually-solving" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;This sounds obvious. It isn&amp;rsquo;t.&lt;/p&gt;
&lt;p&gt;&amp;ldquo;We need AI agents&amp;rdquo; is not a problem statement. Neither is &amp;ldquo;we need to reduce costs&amp;rdquo; or &amp;ldquo;we need to be more innovative.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;A proper problem statement looks like this:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&amp;ldquo;Our customer service team handles 50,000 tickets per month. 60% are password resets and order status checks. Average handling time is 8 minutes. We need to reduce that to under 2 minutes for routine inquiries.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;&amp;ldquo;Our compliance team manually reviews 10,000 transactions daily for AML screening. False positive rate is 95%. We need to reduce false positives while maintaining regulatory coverage.&amp;rdquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Notice the difference? Specific process. Measurable baseline. Clear target.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The trap&lt;/strong&gt;: Vendors will happily sell you a &amp;ldquo;general-purpose agent platform&amp;rdquo; that can theoretically do anything. In practice, these platforms do nothing well. Start with one high-value, well-defined use case. Prove it works. Then expand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Red flag&lt;/strong&gt;: If you can&amp;rsquo;t articulate the specific process, current metrics, and target improvement, you&amp;rsquo;re not ready to buy.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Question 2: Build, Buy, or Extend?
 &lt;div id="question-2-build-buy-or-extend" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#question-2-build-buy-or-extend" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;You have three paths. Let me illustrate each with real scenarios.&lt;/p&gt;

&lt;h3 class="relative group"&gt;EXTEND: Add Agent Capabilities to Your Existing Platforms
 &lt;div id="extend-add-agent-capabilities-to-your-existing-platforms" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#extend-add-agent-capabilities-to-your-existing-platforms" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;What it means&lt;/strong&gt;: You already use Salesforce, ServiceNow, or Microsoft 365. You activate their built-in agent features.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real example — Global Retailer with Salesforce&lt;/strong&gt;:
A retail company with 200 customer service reps was already running Service Cloud. They activated Agentforce Service Agent in three weeks. Configuration, not development. The agent now handles 40% of incoming inquiries (order status, return policies, store hours) without human intervention. Cost: $0.10 per agent action. No new vendor relationship. No integration project.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real example — Insurance Company with ServiceNow&lt;/strong&gt;:
An insurer using ServiceNow ITSM enabled AI Agents for incident categorization and routing. The agent reads incoming tickets, identifies the affected system, assigns priority, and routes to the right team. Implementation: 6 weeks. Result: 60% reduction in misrouted tickets, 25% faster resolution times.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;When to Extend&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;You&amp;rsquo;re already paying for the platform&lt;/li&gt;
&lt;li&gt;Your use case is common (customer service, IT helpdesk, HR inquiries)&lt;/li&gt;
&lt;li&gt;You need results in weeks, not months&lt;/li&gt;
&lt;li&gt;You don&amp;rsquo;t have AI engineering talent&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;The limitation&lt;/strong&gt;: You&amp;rsquo;re constrained by what your vendor offers. If Salesforce Agentforce doesn&amp;rsquo;t support your specific workflow, you&amp;rsquo;re stuck waiting for their roadmap.&lt;/p&gt;
&lt;hr&gt;

&lt;h3 class="relative group"&gt;BUY: Purchase a Specialized Agent Solution
 &lt;div id="buy-purchase-a-specialized-agent-solution" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#buy-purchase-a-specialized-agent-solution" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;What it means&lt;/strong&gt;: You bring in a startup or specialized vendor that does one thing exceptionally well.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real example — E-commerce Brand with Sierra&lt;/strong&gt;:
A direct-to-consumer brand with $500M revenue wasn&amp;rsquo;t satisfied with their existing chatbot. They deployed Sierra for customer service. Sierra&amp;rsquo;s agents handle complex conversations: processing returns while suggesting alternatives, managing subscription modifications, resolving billing disputes. The agents access their Shopify backend, payment processor, and shipping systems. Implementation: 4 months. Result: 70% of conversations resolved without human escalation. Customer satisfaction increased 15 points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real example — Enterprise IT with Moveworks (now ServiceNow)&lt;/strong&gt;:
Before the acquisition, companies bought Moveworks specifically for IT helpdesk automation. The agent could reset passwords, provision software, troubleshoot VPN issues, and answer policy questions — all through natural conversation in Slack or Teams. It understood context: &amp;ldquo;I can&amp;rsquo;t access the server&amp;rdquo; triggered different workflows than &amp;ldquo;I need Photoshop installed.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;When to Buy&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Your existing vendor&amp;rsquo;s agents aren&amp;rsquo;t good enough for a strategic use case&lt;/li&gt;
&lt;li&gt;A startup has proven traction in your specific domain&lt;/li&gt;
&lt;li&gt;You can afford 3-6 months implementation&lt;/li&gt;
&lt;li&gt;The use case is important enough to justify a new vendor relationship&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;The risk&lt;/strong&gt;: Startup viability. What happens if Sierra gets acquired? If the startup pivots? You&amp;rsquo;re dependent on a company that may not exist in five years. Mitigate this by ensuring data portability and avoiding deep customizations that lock you in.&lt;/p&gt;
&lt;hr&gt;

&lt;h3 class="relative group"&gt;BUILD: Create Custom Agents Using Foundation Models
 &lt;div id="build-create-custom-agents-using-foundation-models" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#build-create-custom-agents-using-foundation-models" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;What it means&lt;/strong&gt;: You use Claude, GPT-4, or Gemini APIs to build agents tailored to your unique processes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real example — Investment Bank&amp;rsquo;s Deal Analysis Agent&lt;/strong&gt;:
A bulge-bracket bank built a custom agent for M&amp;amp;A analysts. The agent ingests SEC filings, earnings transcripts, news articles, and internal research. Analysts ask natural language questions: &amp;ldquo;What are the key risks in Acme Corp&amp;rsquo;s debt structure?&amp;rdquo; or &amp;ldquo;Compare the margin profile of these three acquisition targets.&amp;rdquo; The agent synthesizes information that would take an analyst hours to compile manually. Built on Claude with custom RAG (retrieval-augmented generation) over proprietary databases. Development: 8 months. Team: 6 engineers, 2 ML specialists. The output is proprietary competitive advantage — no vendor offers this.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real example — Pharmaceutical Company&amp;rsquo;s Clinical Trial Agent&lt;/strong&gt;:
A pharma company built an agent to monitor clinical trial data in real-time. The agent identifies adverse event patterns, flags protocol deviations, and generates regulatory-ready reports. This isn&amp;rsquo;t a use case any vendor serves — the domain knowledge is too specialized, the regulatory requirements too specific. Built on GPT-4 with extensive fine-tuning and custom safety guardrails. Development: 12 months.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real example — University&amp;rsquo;s Custom Academic Ecosystem (TecGPT)&lt;/strong&gt;:
At Tecnológico de Monterrey, we built TecGPT — our academic AI ecosystem, integrated with our LMS, assisting both professors and students, and aligned with our academic regulations. No vendor could offer what we needed out-of-the-box.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;When to Build&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Your process is genuinely unique and defines competitive advantage&lt;/li&gt;
&lt;li&gt;No vendor solution exists for your domain&lt;/li&gt;
&lt;li&gt;You have AI/ML engineering talent (or can acquire it)&lt;/li&gt;
&lt;li&gt;You can invest 6-12 months before seeing production value&lt;/li&gt;
&lt;li&gt;The long-term value justifies the ongoing maintenance cost&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;The reality check&lt;/strong&gt;: Building is expensive. A custom agent isn&amp;rsquo;t a one-time project — it&amp;rsquo;s an ongoing product. You need engineers to maintain it, improve it, and adapt it as foundation models evolve. Budget for 2-3 FTEs indefinitely, not just the initial build.&lt;/p&gt;
&lt;hr&gt;

&lt;h3 class="relative group"&gt;The Decision Matrix
 &lt;div id="the-decision-matrix" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-decision-matrix" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Factor&lt;/th&gt;
 &lt;th&gt;Extend&lt;/th&gt;
 &lt;th&gt;Buy&lt;/th&gt;
 &lt;th&gt;Build&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;Time to value&lt;/td&gt;
 &lt;td&gt;Weeks&lt;/td&gt;
 &lt;td&gt;3-6 months&lt;/td&gt;
 &lt;td&gt;6-12 months&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Customization&lt;/td&gt;
 &lt;td&gt;Low&lt;/td&gt;
 &lt;td&gt;Medium&lt;/td&gt;
 &lt;td&gt;Unlimited&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Vendor risk&lt;/td&gt;
 &lt;td&gt;Low&lt;/td&gt;
 &lt;td&gt;High&lt;/td&gt;
 &lt;td&gt;Medium&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Talent required&lt;/td&gt;
 &lt;td&gt;Admins&lt;/td&gt;
 &lt;td&gt;Integrators&lt;/td&gt;
 &lt;td&gt;Engineers + ML&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Total cost (3 years)&lt;/td&gt;
 &lt;td&gt;$$&lt;/td&gt;
 &lt;td&gt;$$$&lt;/td&gt;
 &lt;td&gt;$-$$$$&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Competitive advantage&lt;/td&gt;
 &lt;td&gt;None&lt;/td&gt;
 &lt;td&gt;Low&lt;/td&gt;
 &lt;td&gt;High&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;

&lt;h3 class="relative group"&gt;My Recommendation
 &lt;div id="my-recommendation" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#my-recommendation" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Start with Extend&lt;/strong&gt; for your first agent deployment. Even if it&amp;rsquo;s not perfect, you&amp;rsquo;ll learn what works, what users actually need, and where the gaps are. That learning is invaluable before you commit to Buy or Build.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Move to Buy&lt;/strong&gt; when you&amp;rsquo;ve proven agent value and need capabilities your platform vendor doesn&amp;rsquo;t offer. Choose startups with strong traction, clear use case focus, and enterprise customers who can serve as references.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Build only&lt;/strong&gt; when you&amp;rsquo;ve exhausted Extend and Buy options, or when the process is so unique that it genuinely differentiates your business. If you&amp;rsquo;re building agents for commodity processes (customer service, IT helpdesk), you&amp;rsquo;re wasting engineering talent that could create actual competitive advantage.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Question 3: How Will You Measure Success?
 &lt;div id="question-3-how-will-you-measure-success" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#question-3-how-will-you-measure-success" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Before deployment, define:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Efficiency metrics&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Time saved per task&lt;/li&gt;
&lt;li&gt;Tasks handled without human intervention&lt;/li&gt;
&lt;li&gt;Error rate reduction&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Quality metrics&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Customer satisfaction (for customer-facing agents)&lt;/li&gt;
&lt;li&gt;Compliance accuracy (for regulatory processes)&lt;/li&gt;
&lt;li&gt;Decision quality (measured against human expert baseline)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Business metrics&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Cost per transaction&lt;/li&gt;
&lt;li&gt;Revenue impact (if applicable)&lt;/li&gt;
&lt;li&gt;Employee satisfaction (agents should help workers, not threaten them)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;The 90-day rule&lt;/strong&gt;: If you can&amp;rsquo;t demonstrate measurable improvement within 90 days, something is wrong. Either the use case was poorly chosen, the implementation was flawed, or the vendor oversold capabilities. Don&amp;rsquo;t extend pilots indefinitely hoping for results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hidden metric&lt;/strong&gt;: Adoption. The most sophisticated agent is worthless if your team doesn&amp;rsquo;t use it. Track actual usage, not just availability.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Question 4: What&amp;rsquo;s Your Human-in-the-Loop Strategy?
 &lt;div id="question-4-whats-your-human-in-the-loop-strategy" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#question-4-whats-your-human-in-the-loop-strategy" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;In 2026, no responsible organization deploys fully autonomous agents for consequential decisions. The question is where to place human oversight.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The spectrum&lt;/strong&gt;:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Human-initiated&lt;/strong&gt;: Human starts task, agent assists, human approves result&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Agent-initiated with approval&lt;/strong&gt;: Agent identifies opportunity, proposes action, human approves&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Agent-executed with audit&lt;/strong&gt;: Agent acts autonomously, human reviews after the fact&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Fully autonomous&lt;/strong&gt;: Agent acts without oversight (appropriate only for low-risk, reversible actions)&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;My framework for choosing&lt;/strong&gt;:&lt;/p&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Risk Level&lt;/th&gt;
 &lt;th&gt;Reversibility&lt;/th&gt;
 &lt;th&gt;Recommended Approach&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;Low&lt;/td&gt;
 &lt;td&gt;Easy to reverse&lt;/td&gt;
 &lt;td&gt;Fully autonomous&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Low&lt;/td&gt;
 &lt;td&gt;Hard to reverse&lt;/td&gt;
 &lt;td&gt;Agent-executed with audit&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;High&lt;/td&gt;
 &lt;td&gt;Easy to reverse&lt;/td&gt;
 &lt;td&gt;Agent-initiated with approval&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;High&lt;/td&gt;
 &lt;td&gt;Hard to reverse&lt;/td&gt;
 &lt;td&gt;Human-initiated only&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;&lt;strong&gt;Examples&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Password reset → Fully autonomous (low risk, reversible)&lt;/li&gt;
&lt;li&gt;Customer refund under $50 → Agent-executed with audit&lt;/li&gt;
&lt;li&gt;Credit decision → Agent-initiated with approval&lt;/li&gt;
&lt;li&gt;Regulatory filing → Human-initiated only&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;The regulatory reality&lt;/strong&gt;: Financial services, healthcare, and other regulated industries will require human oversight for most consequential decisions for the foreseeable future. Plan for this. Agents that &amp;ldquo;recommend and explain&amp;rdquo; are more valuable than agents that &amp;ldquo;decide and act&amp;rdquo; in these contexts.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Question 5: What&amp;rsquo;s Your Data Strategy?
 &lt;div id="question-5-whats-your-data-strategy" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#question-5-whats-your-data-strategy" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;AI agents are only as good as the data they can access. Before buying, audit:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data availability&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Can the agent access all systems needed for the use case?&lt;/li&gt;
&lt;li&gt;Are APIs available, or will you need custom integrations?&lt;/li&gt;
&lt;li&gt;What&amp;rsquo;s the latency? Agents that wait 30 seconds for data lookups frustrate users.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Data quality&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Is your data accurate and up-to-date?&lt;/li&gt;
&lt;li&gt;Are there known data quality issues that will cause agent errors?&lt;/li&gt;
&lt;li&gt;Who&amp;rsquo;s responsible for data hygiene?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Data governance&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;What data can the agent access? What&amp;rsquo;s off-limits?&lt;/li&gt;
&lt;li&gt;How do you prevent agents from exposing sensitive information?&lt;/li&gt;
&lt;li&gt;What audit trail exists for agent data access?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;The integration tax&lt;/strong&gt;: Most enterprises underestimate integration effort by 3-5x. If the vendor says &amp;ldquo;we integrate with everything,&amp;rdquo; ask for customer references running your specific system combination. Generic claims mean nothing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;MCP and the future&lt;/strong&gt;: Anthropic&amp;rsquo;s Model Context Protocol (MCP) is emerging as a standard for agent-to-system communication. Consider whether your chosen platform supports open standards or locks you into proprietary integrations.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Decision Framework
 &lt;div id="the-decision-framework" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-decision-framework" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;After answering these five questions, you should be able to complete this statement:&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;&amp;ldquo;We will deploy [specific agent type] to solve [specific problem] in [specific process]. We expect to achieve [specific metric improvement] within [timeframe]. Our human oversight model is [approach]. We have confirmed data access to [systems] and defined governance policies for [sensitive data types].&amp;rdquo;&lt;/p&gt;
&lt;/blockquote&gt;&lt;p&gt;If you can&amp;rsquo;t complete this statement with confidence, you&amp;rsquo;re not ready to buy.&lt;/p&gt;

&lt;h2 class="relative group"&gt;My Recommendations for 2026
 &lt;div id="my-recommendations-for-2026" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#my-recommendations-for-2026" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;For most enterprises&lt;/strong&gt;: Start with your existing vendor&amp;rsquo;s agent capabilities. Salesforce Agentforce, ServiceNow AI Agents, or Microsoft Copilot Studio will handle 80% of common use cases. The integration is already done. The governance frameworks exist. The risk is manageable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;For customer service&lt;/strong&gt;: Sierra has proven traction. If customer experience is strategic and your existing vendor&amp;rsquo;s agents aren&amp;rsquo;t cutting it, evaluate Sierra seriously. Their $100M ARR in 21 months signals real value delivery.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;For unique processes&lt;/strong&gt;: Build on Anthropic Claude or OpenAI, but only if you have the engineering talent to maintain custom solutions. The foundation models are extraordinary. The engineering required to productionize them is not trivial.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;For everyone&lt;/strong&gt;: Start small. One use case. 90-day proof of value. Then expand. The vendors want you to buy platforms. You should buy solutions to specific problems.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Bottom Line
 &lt;div id="the-bottom-line" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-bottom-line" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;The AI agent market is real. The value is real. But so is the hype.&lt;/p&gt;
&lt;p&gt;62% of companies investing in agentic AI expect more than 100% ROI. Some will achieve it. Many won&amp;rsquo;t — not because the technology failed, but because they bought the wrong solution for the wrong problem with the wrong implementation approach.&lt;/p&gt;
&lt;p&gt;Don&amp;rsquo;t be a statistic. Ask the five questions. Complete the decision framework. Then — and only then — sign the contract.&lt;/p&gt;
&lt;p&gt;The agents are ready. Make sure you are too.&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/cio-guide-ai-agents/featured.png"/></item><item><title>Banking Will Never Be the Same: An Insider's View on AI Agents</title><link>https://carlesabarca.com/posts/banking-ai-agents/</link><pubDate>Sun, 22 Feb 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/banking-ai-agents/</guid><description>I led digital transformation at a €200B European bank. AI agents will change everything I thought I knew.</description><content:encoded>&lt;p&gt;&lt;em&gt;I led digital transformation at a €200B European bank. AI agents will change everything I thought I knew.&lt;/em&gt;&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;Between 2001 and 2023, I spent over two decades at Banco Sabadell, one of Spain&amp;rsquo;s largest financial institutions, rising from technology roles to CIO and ultimately CTO. I oversaw the technology integration of TSB in the UK, navigated one of the most public banking IT crises in recent memory, and led a multi-year digital transformation that touched every corner of a 25,000-employee organization.&lt;/p&gt;
&lt;p&gt;Back then, we believed APIs would revolutionize banking. We were thinking too small.&lt;/p&gt;
&lt;p&gt;The AI agent revolution will make our API transformation look like a warm-up exercise.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Why Banking Is Perfect for AI Agents
 &lt;div id="why-banking-is-perfect-for-ai-agents" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#why-banking-is-perfect-for-ai-agents" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Banks are unique. They operate in an environment that seems hostile to automation — heavy regulation, risk aversion, legacy systems — but is actually ideal for AI agents.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Repetitive processes with clear rules.&lt;/strong&gt; KYC verification. Transaction monitoring. Dispute resolution. Compliance reporting. These processes follow explicit rules that humans apply thousands of times per day. Rules that can be encoded, executed, and audited by agents.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structured data everywhere.&lt;/strong&gt; Unlike creative industries, banking runs on structured data: transactions, contracts, balances, rates. This is exactly what agents excel at processing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regulation as an advantage.&lt;/strong&gt; This sounds counterintuitive. But consider: agents can follow rules more consistently than humans. They don&amp;rsquo;t get tired. They don&amp;rsquo;t take shortcuts. They don&amp;rsquo;t forget steps. In a world where compliance failures cost billions, agents that provably follow every regulation become assets, not risks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Economics demand it.&lt;/strong&gt; European banks operate on razor-thin margins. Cost-to-income ratios above 60% are common. The math is brutal: reduce operational costs or die. Agents offer a path that doesn&amp;rsquo;t require another round of layoffs — they offer a path to do more with the same people.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Agent Transformation Map
 &lt;div id="the-agent-transformation-map" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-agent-transformation-map" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Here is how I see AI agents transforming core banking operations:&lt;/p&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Function&lt;/th&gt;
 &lt;th&gt;Today&lt;/th&gt;
 &lt;th&gt;With Agents&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;KYC/Onboarding&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;3-5 days, multiple handoffs&lt;/td&gt;
 &lt;td&gt;15 minutes, agent + human final approval&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Fraud Detection&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Alerts generate queues → humans investigate&lt;/td&gt;
 &lt;td&gt;Agent investigates, escalates only genuine concerns&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Customer Service&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Limited chatbots, frustrated transfers&lt;/td&gt;
 &lt;td&gt;Agent with full context, handles 80% end-to-end&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Compliance Monitoring&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Armies of reviewers sampling transactions&lt;/td&gt;
 &lt;td&gt;Continuous agent monitoring, humans handle exceptions&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Credit Decisions&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Score + human judgment + committee&lt;/td&gt;
 &lt;td&gt;Agent analysis + recommendation + human approval&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Regulatory Reporting&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Manual data gathering, reconciliation hell&lt;/td&gt;
 &lt;td&gt;Agent assembles, validates, humans verify and submit&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;The pattern is consistent: agents handle the volume, humans handle the judgment. The ratio shifts from 90% human work / 10% oversight to 10% human work / 90% oversight.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Real Challenges (From the Inside)
 &lt;div id="the-real-challenges-from-the-inside" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-real-challenges-from-the-inside" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;I would be lying if I said this transformation will be easy. Having lived inside a large bank, I know the obstacles are formidable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Legacy systems are not going anywhere.&lt;/strong&gt; COBOL is still running critical systems at most major banks. Mainframes process millions of transactions daily. These systems work. They are paid off. No CEO will approve ripping them out for an AI experiment. The winning strategy is not replacement — it is an agent layer that interfaces with legacy systems through existing APIs and screen scraping where necessary.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regulators are conservative — but evolving.&lt;/strong&gt; Banking regulators move slowly by design. They remember 2008. But they are not blind to AI&amp;rsquo;s potential. The key is engaging them early, demonstrating auditability, and framing agents as risk reduction tools, not risk introduction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Culture resists change.&lt;/strong&gt; Banks are risk-averse institutions filled with risk-averse people. &amp;ldquo;We&amp;rsquo;ve always done it this way&amp;rdquo; is not a cliché — it is a deeply held belief. Change management matters more than technology selection.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The talent gap is real.&lt;/strong&gt; Bankers do not understand AI. AI engineers do not understand banking. Finding people who speak both languages is nearly impossible. Building that bridge — through training, hiring, and partnerships — is essential.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data silos persist.&lt;/strong&gt; The customer data you need for intelligent agents is scattered across dozens of systems that do not talk to each other. Data unification projects have failed for decades. Agents will not magically solve this — but they can work around it better than traditional integration approaches.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What I Would Do Differently Today
 &lt;div id="what-i-would-do-differently-today" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-i-would-do-differently-today" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;If I were back in that CTO chair today, knowing what I know about AI agents, here is what I would change:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stop buying &amp;ldquo;AI-powered&amp;rdquo; SaaS.&lt;/strong&gt; Every vendor now claims AI capabilities. Most are wrappers around the same foundation models with limited customization. Instead: build an agent layer on top of your existing systems. Own the intelligence, rent the infrastructure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Start with back-office, not customer-facing.&lt;/strong&gt; Regulators scrutinize customer-facing AI. Back-office operations have more freedom. Prove value internally — reconciliation, reporting, internal fraud investigation — then expand to customer touchpoints with evidence and confidence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adopt a hybrid approval model.&lt;/strong&gt; For the next 3-5 years, the winning pattern is: agent proposes, human approves. This satisfies regulators, builds trust, and creates training data for future full automation. Do not try to remove humans too fast.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Invest in observability.&lt;/strong&gt; Agents must be auditable. Every decision, every data access, every recommendation must be logged and explainable. This is not optional — it is the foundation of regulatory acceptance.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Predictions for 2027-2030
 &lt;div id="predictions-for-2027-2030" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#predictions-for-2027-2030" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Based on what I am seeing in the market and my experience inside banking:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;50% reduction in back-office headcount.&lt;/strong&gt; Not through layoffs, but through attrition and redeployment. The work will not disappear — it will transform. Banks will need fewer processors and more agent supervisors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&amp;ldquo;Agent-first&amp;rdquo; banks will emerge.&lt;/strong&gt; New entrants — likely from Asia and Latin America — will build banks designed around agents from day one. They will operate with 1/10th the staff of traditional banks. This is the real competitive threat.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regulators will create AI agent certification.&lt;/strong&gt; Just as we have software audits today, we will have agent audits. Banks that achieve certification early will have a competitive advantage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;M&amp;amp;A will accelerate.&lt;/strong&gt; The investment required to build agent capabilities favors scale. Smaller banks that cannot afford the transformation will merge or be acquired. The industry will consolidate further.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The &amp;ldquo;AI banker&amp;rdquo; role will emerge.&lt;/strong&gt; A new job category: professionals who understand both banking operations and AI agent orchestration. They will be the most sought-after talent in financial services.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The Choice Ahead
 &lt;div id="the-choice-ahead" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-choice-ahead" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Every banking executive faces the same question today: invest heavily in AI agents now, or wait and see.&lt;/p&gt;
&lt;p&gt;Waiting feels safe. It is not.&lt;/p&gt;
&lt;p&gt;The banks that move first will reduce costs, improve compliance, and — crucially — attract the talent needed to keep improving. The banks that wait will find themselves competing against institutions that operate at half their cost structure.&lt;/p&gt;
&lt;p&gt;I have seen banking transform before — from branches to ATMs, from paper to digital, from monoliths to APIs. Each wave rewarded the early movers and punished the laggards.&lt;/p&gt;
&lt;p&gt;This wave will be faster and more decisive.&lt;/p&gt;
&lt;p&gt;The question is not whether your bank will use AI agents. The question is whether you will be the one deploying them — or the one being replaced by a bank that did.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;Carles Abarca is VP of Digital Transformation at Tecnológico de Monterrey. He spent 22 years at Banco Sabadell, including roles as CIO and CTO, leading the bank&amp;rsquo;s digital transformation and the technology integration of TSB. He writes about AI, digital transformation, and the future of enterprise technology at &lt;a href="https://carlesabarca.com" target="_blank" rel="noreferrer"&gt;carlesabarca.com&lt;/a&gt;.&lt;/em&gt;&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/banking-ai-agents/featured.png"/></item><item><title>The Agent Economy: From Selling Software to Selling Artificial Employees</title><link>https://carlesabarca.com/posts/agent-economy/</link><pubDate>Wed, 18 Feb 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/agent-economy/</guid><description>SAP will no longer sell FI/CO licenses. It will sell an accounting agent. Salesforce will not sell CRM per user. It will sell a sales agent. Welcome to the era of Services-as-Software.</description><content:encoded>&lt;p&gt;The SaaSpocalypse of February was not an accident. It was a warning. The market saw what many CIOs still do not want to see: the business logic that justifies billions in licensing fees fits in a text file that an AI agent can execute.&lt;/p&gt;
&lt;p&gt;But the interesting question is not what gets destroyed. It is what emerges.&lt;/p&gt;

&lt;h2 class="relative group"&gt;From SaaS to SaS
 &lt;div id="from-saas-to-sas" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#from-saas-to-sas" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;The industry consensus already has a name for this transition: &lt;strong&gt;SaS — Services-as-Software&lt;/strong&gt;. The term, popularized by Foundation Capital in their thesis on the $4.6 trillion agentic AI opportunity, inverts the formula that has dominated enterprise software for two decades. SaaS sold software as a service. SaS sells services as software — autonomous agents that deliver outcomes, not interfaces.&lt;/p&gt;
&lt;p&gt;It is the difference between selling a scalpel and selling the surgery.&lt;/p&gt;
&lt;p&gt;Others call it &lt;strong&gt;WaaS — Workers-as-a-Service&lt;/strong&gt;. Or simply what it is: the &lt;strong&gt;Agent Economy&lt;/strong&gt;. An economy where companies do not buy software licenses. They hire artificial employees. Employees that do not negotiate salaries, do not request time off, and scale without limit.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The reconversion map
 &lt;div id="the-reconversion-map" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-reconversion-map" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;If the thesis is correct, the question is not whether the major SaaS players will reconvert, but when and how. Here are my concrete predictions:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;SAP&lt;/strong&gt; will stop selling FI/CO licenses for accounting. Instead, it will offer an accounting agent that, based on business events — an invoice issued, a payment received, a period close — autonomously maintains the books in SAP. Humans will stop interacting with SAP&amp;rsquo;s UI. The agent will interact with SAP&amp;rsquo;s API. The financial controller will shift from operating the system to supervising the agent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Salesforce&lt;/strong&gt; will stop selling CRM per user. It will offer a sales agent that qualifies leads, updates opportunities, schedules follow-ups, and generates forecasts. Salespeople will stop filling in Salesforce fields. The agent will extract information from emails, calls, and meetings, keeping the pipeline updated. The sales manager&amp;rsquo;s role will shift from chasing their team to update the CRM to reviewing the agent&amp;rsquo;s decisions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;ServiceNow&lt;/strong&gt; will stop selling support tickets per seat. It will offer an operations agent that diagnoses incidents, executes runbooks, escalates when necessary, and closes tickets without human intervention. Eighty percent of L1 and L2 support will be invisible to humans.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Workday&lt;/strong&gt; will stop selling HR modules per employee. It will offer a people management agent that processes payroll, manages absences, generates compliance reports, and executes full onboarding — from provisioning access to scheduling the first week&amp;rsquo;s agenda.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;HubSpot&lt;/strong&gt; will stop selling a marketing suite. It will offer a growth agent that generates content, optimizes campaigns, segments audiences, and adjusts budgets in real time based on conversions. The CMO will not operate tools. They will conduct an orchestra of agents.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Atlassian&lt;/strong&gt; will stop selling Jira per developer. It will offer a delivery agent that breaks epics into tasks, assigns work based on capacity, detects blockers, and generates progress reports. The engineering manager will shift from managing a board to managing a strategy.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The pattern
 &lt;div id="the-pattern" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-pattern" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;In every case, the pattern is the same: the human stops being the software operator and becomes the agent&amp;rsquo;s supervisor. The value is not in the interface. It is in the intelligence.&lt;/p&gt;
&lt;p&gt;And the business model transforms with it. You no longer charge per seat — because there are no seats. You charge per outcome. Per invoice processed. Per lead qualified. Per incident resolved. Per payroll executed.&lt;/p&gt;
&lt;p&gt;The companies that understand this first will capture the market. Those that keep selling interfaces with a copilot bolted on top will discover that a markdown file and an agent with access to their APIs do the same job at a fraction of the cost.&lt;/p&gt;
&lt;p&gt;The Agent Economy is not coming. It is already here. The question is whether you are hiring agents or still buying licenses.&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/agent-economy/featured.png"/></item><item><title>The SaaSpocalypse: $300 Billion Evaporated in 48 Hours</title><link>https://carlesabarca.com/posts/saaspocalypse/</link><pubDate>Tue, 17 Feb 2026 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/saaspocalypse/</guid><description>Anthropic launches a few plugins and the SaaS market loses $300 billion. The AI agents thesis is confirmed in the most brutal way possible.</description><content:encoded>&lt;p&gt;$300 billion evaporated in 48 hours. Welcome to the SaaSpocalypse.&lt;/p&gt;
&lt;p&gt;In October 2024, I wrote about the next wave of AI: agents. In May 2025, I wrote about the end of the developer as we know them. Both articles shared a thesis: AI is not coming to assist. It is coming to execute.&lt;/p&gt;
&lt;p&gt;The first week of February proved us right in the most brutal way possible.&lt;/p&gt;
&lt;p&gt;Anthropic launched a set of plugins for its Cowork tool — essentially markdown files that encode legal, financial, and sales expertise. Nothing revolutionary on the surface. But the market understood something that many CIOs still have not processed: if Thomson Reuters&amp;rsquo; business logic fits in a text file that an AI agent can read and execute, their $200/user/month software has an existential problem.&lt;/p&gt;
&lt;p&gt;Thomson Reuters lost 57% from its highs. ServiceNow 48%. Salesforce 43%. The S&amp;amp;P software index had its worst month since October 2008.&lt;/p&gt;
&lt;p&gt;But the interesting part is not the decline. It is the thesis behind it.&lt;/p&gt;
&lt;p&gt;Satya Nadella said it in December 2024: &amp;ldquo;SaaS applications are CRUD databases with business logic on top. Agents will absorb that logic.&amp;rdquo; Foundation Capital put a number on it: $4.6 trillion in opportunity. They called it &amp;ldquo;Service as Software&amp;rdquo; — the inversion of the SaaS model. You no longer sell tools for humans to solve problems. You sell outcomes. The agent does the work.&lt;/p&gt;
&lt;p&gt;A Palantir client summed it up better than any analyst: &amp;ldquo;All third-party software must justify its existence. And so far, it has not been able to.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;This does not mean Salesforce or SAP will disappear tomorrow. Switching costs are real. Deep integrations provide protection. But the per-seat pricing model has its days numbered. IDC predicts it will be obsolete by 2028. And companies that do not transition to outcome-based models will discover their moat was narrower than they thought.&lt;/p&gt;
&lt;p&gt;I have spent over 20 years leading technology transformation in banking and education. I have seen panic cycles before — cloud, mobile, blockchain. But this time is different. It is not a new layer being added to the stack. It is a layer that absorbs the others.&lt;/p&gt;
&lt;p&gt;The question for every CIO today is not &amp;ldquo;how do I integrate AI into my software.&amp;rdquo; It is &amp;ldquo;how much of my software can an agent replace with a markdown file and access to my APIs.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;If the answer makes you uncomfortable, you are probably already late.&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/saaspocalypse/featured.png"/></item><item><title>The End of the Developer: The Future of Software Development with AI Agents</title><link>https://carlesabarca.com/posts/fin-del-desarrollador/</link><pubDate>Tue, 20 May 2025 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/fin-del-desarrollador/</guid><description>The developer role as we know it has its days numbered. The architect becomes the conductor of an AI agent orchestra.</description><content:encoded>&lt;p&gt;The End of the Junior Developer: The Future of Software Development with AI Agents&lt;/p&gt;
&lt;p&gt;In a world where artificial intelligence advances at breakneck speed, an uncomfortable truth looms on the horizon of the technology industry: the junior developer role may be on the path to extinction. This is not science fiction but an emerging reality that is already transforming how we build software.&lt;/p&gt;

&lt;h3 class="relative group"&gt;The Silent Revolution of AI Agents
 &lt;div id="the-silent-revolution-of-ai-agents" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-silent-revolution-of-ai-agents" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;Today, AI is no longer just an assistant that completes code. AI agents are evolving into autonomous entities capable of perceiving the development environment, making complex decisions, and executing complete programming tasks with minimal human oversight. We are no longer talking about simple tools, but digital collaborators that are reconfiguring the entire development chain.&lt;/p&gt;

&lt;h3 class="relative group"&gt;From Assistant to Autonomous Agent
 &lt;div id="from-assistant-to-autonomous-agent" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#from-assistant-to-autonomous-agent" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;AI-based code assistants like GitHub Copilot or Codeium have already transformed developer productivity. However, what is coming is far more disruptive: specialized agents working in concert to manage the entire development lifecycle.&lt;/p&gt;
&lt;p&gt;What does this mean? While today a junior developer can still ask an AI to generate boilerplate code or explain complex systems, tomorrow a technical architect will be able to instruct a complete team of agents to:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Develop complex code based on high-level requirements&lt;/li&gt;
&lt;li&gt;Perform exhaustive testing and bug resolution&lt;/li&gt;
&lt;li&gt;Optimize performance without manual intervention&lt;/li&gt;
&lt;li&gt;Manage deployments and update documentation&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 class="relative group"&gt;The Prediction That Is Already Happening
 &lt;div id="the-prediction-that-is-already-happening" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-prediction-that-is-already-happening" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;Mark Zuckerberg stated it without ambiguity: &amp;ldquo;By 2025, AI will be capable of functioning as a mid-level engineer, writing code and potentially replacing software developers.&amp;rdquo; We are not talking about a distant future, but a reality that is already emerging.&lt;/p&gt;
&lt;p&gt;According to Gartner, by 2027 generative AI will require 80% of the engineering workforce to upskill, creating new roles and eliminating others. The question is no longer whether it will happen, but when it will reach the tipping point that transforms the entire ecosystem.&lt;/p&gt;

&lt;h3 class="relative group"&gt;Orchestration: The New Paradigm
 &lt;div id="orchestration-the-new-paradigm" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#orchestration-the-new-paradigm" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;The key concept here is AI agent orchestration: a process by which multiple specialized agents work together within a unified system. Each agent focuses on a specific task — UI design, backend development, testing, security — while a central entity (human or AI) conducts the symphony.&lt;/p&gt;

&lt;h3 class="relative group"&gt;The Architect as Orchestra Conductor
 &lt;div id="the-architect-as-orchestra-conductor" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-architect-as-orchestra-conductor" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;In this new paradigm, the technical architect becomes the true protagonist. Their role evolves from solution designer to strategic director of an AI agent team, defining:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The system vision and requirements&lt;/li&gt;
&lt;li&gt;Technical and business constraints&lt;/li&gt;
&lt;li&gt;Architecture and quality standards&lt;/li&gt;
&lt;li&gt;Resolution of complex problems that require human judgment&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This transformation is already happening. According to ServiceNow/Pearson research, by 2027 18.7% of technical architect tasks will be at least partially augmented by AI. Architects will focus less on guiding code implementation and more on directing and supervising the autonomous work of agents.&lt;/p&gt;

&lt;h3 class="relative group"&gt;The Experience Crisis
 &lt;div id="the-experience-crisis" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-experience-crisis" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;Here the fundamental dilemma arises: if AI agents can handle the tasks traditionally assigned to junior developers, how will new professionals acquire experience?&lt;/p&gt;
&lt;p&gt;One of the greatest concerns is precisely how junior developers can grow into mid-level and senior roles if AI handles most of the routine coding. Traditionally, developers have learned by doing — writing, debugging, and refactoring real-world code. Without that hands-on experience, there is a risk that developers will not fully understand the complexities of software development.&lt;/p&gt;

&lt;h3 class="relative group"&gt;A Future Without Traditional Juniors
 &lt;div id="a-future-without-traditional-juniors" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#a-future-without-traditional-juniors" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;My thesis is that the developer role, as we know it, will disappear. In its place, we will see:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Prompt and orchestration engineers:&lt;/strong&gt; Professionals specialized in directing and extracting maximum value from AI agents.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Verification and review specialists:&lt;/strong&gt; Experts in evaluating AI-generated code, identifying edge cases, and testing its reliability.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;High-level system designers:&lt;/strong&gt; Professionals focused on architecture and system design, where higher-level thinking remains primarily human.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Companies are already hiring fewer junior engineers due to AI-driven productivity improvements. This trend will only accelerate as AI agents mature.&lt;/p&gt;

&lt;h3 class="relative group"&gt;Adapt or Fall Behind
 &lt;div id="adapt-or-fall-behind" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#adapt-or-fall-behind" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;For current professionals, the message is clear: the developer career is evolving, not disappearing. The future belongs not to those who resist AI nor to those who depend on it exclusively, but to those who learn to work symbiotically with these tools.&lt;/p&gt;
&lt;p&gt;The most successful developers will be those who:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Master prompt engineering to guide AI agents&lt;/li&gt;
&lt;li&gt;Develop sharp evaluation and verification skills&lt;/li&gt;
&lt;li&gt;Focus on areas where human creativity and systems thinking are irreplaceable&lt;/li&gt;
&lt;li&gt;Deeply understand orchestration and collaboration between multiple AI agents&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 class="relative group"&gt;Are We Ready for This Change?
 &lt;div id="are-we-ready-for-this-change" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#are-we-ready-for-this-change" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h3&gt;
&lt;p&gt;If my thesis is correct, we face a radical transformation in how we educate future developers and structure technical teams. Universities, bootcamps, and companies will need to completely rethink their training and hiring programs.&lt;/p&gt;
&lt;p&gt;The question is not whether AI agents will revolutionize software development — they already are — but how quickly we will adapt as an industry to a world where humans design and direct, while AI agents build and implement.&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/fin-del-desarrollador/featured.png"/></item><item><title>Year Three of AI: The Age of Agentic AI</title><link>https://carlesabarca.com/posts/year-three-ai-agentic/</link><pubDate>Wed, 11 Dec 2024 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/year-three-ai-agentic/</guid><description>2025 marks Year Three of the new AI era: the year of Agentic AI, with autonomous agents that take actions on our behalf.</description><content:encoded>&lt;p&gt;And now, on to the challenges of 2025 &amp;ndash; Year Three of the new AI era: the year of Agentic AI.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Year I - 2023:&lt;/strong&gt; the emergence of ChatGPT, the year of prompting where AI was text-based conversation. We launched our TecGPT.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Year II - 2024:&lt;/strong&gt; AI becomes multimodal and learns to generate image, sound, music, voice, and video content. We launched our SkillStudio and incorporated multimodality into our platform.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Year III - 2025:&lt;/strong&gt; AI enables the creation of autonomous agents that take actions on our behalf &amp;ndash; the year of Agentic AI. We will launch AgentStudio with the essential contribution of Manuel Teran and his incredible team.&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/year-three-ai-agentic/featured.png"/></item><item><title>The Next AI Wave: Agents</title><link>https://carlesabarca.com/posts/next-ai-wave-agents/</link><pubDate>Tue, 15 Oct 2024 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/next-ai-wave-agents/</guid><description>AI agents are not chatbots. They are autonomous entities that plan, reason, and act. This is the next wave, and it changes everything.</description><content:encoded>&lt;p&gt;The AI conversation has been dominated by chatbots and copilots &amp;ndash; tools that assist humans in doing their work faster. That era is ending. The next wave is agents, and the distinction matters more than most people realize.&lt;/p&gt;

&lt;h2 class="relative group"&gt;From assistants to autonomous actors
 &lt;div id="from-assistants-to-autonomous-actors" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#from-assistants-to-autonomous-actors" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;A chatbot responds to prompts. An agent pursues objectives.&lt;/p&gt;
&lt;p&gt;The difference is not incremental. It is architectural. An AI agent is an autonomous entity that receives a goal, decomposes it into subtasks, plans an execution strategy, uses tools and APIs to act on the world, observes results, adjusts its approach, and iterates until the objective is met. No human in the loop for each step. No prompt-response-prompt cycle.&lt;/p&gt;
&lt;p&gt;Think of it this way: a copilot helps you write an email. An agent handles your entire inbox &amp;ndash; triaging, responding, escalating, scheduling follow-ups &amp;ndash; while you focus on the decisions that actually require your judgment.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Why now
 &lt;div id="why-now" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#why-now" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Three converging forces make agents viable today in ways they were not two years ago:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reasoning capability.&lt;/strong&gt; Large language models have crossed a threshold in their ability to decompose complex problems, maintain context across long chains of action, and recover from errors. This is not about generating better text. It is about planning and execution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tool use.&lt;/strong&gt; Modern LLMs can reliably call APIs, query databases, browse the web, execute code, and interact with external systems. The agent is not trapped in a text box. It operates in the real digital environment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cost economics.&lt;/strong&gt; Inference costs have dropped by orders of magnitude. Running an agent that makes dozens of API calls to complete a complex task is now economically viable at enterprise scale.&lt;/p&gt;

&lt;h2 class="relative group"&gt;What this means for enterprises
 &lt;div id="what-this-means-for-enterprises" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#what-this-means-for-enterprises" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;The implications for enterprise technology are profound:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Workflow automation moves from rule-based to goal-based.&lt;/strong&gt; Instead of encoding every step in a process, you define the outcome. The agent figures out the path. This makes automation accessible to business users, not just developers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The value of software shifts.&lt;/strong&gt; If an agent can navigate a UI, call APIs, and execute business logic, the value of the software layer between the user and the data is fundamentally questioned. Middleware, workflow tools, and integration platforms face existential pressure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;New security surfaces emerge.&lt;/strong&gt; Autonomous agents with API access introduce attack vectors that traditional security models were not designed for. Identity, authorization, and audit trails need to be rethought for non-human actors that make decisions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Organizational structure adapts.&lt;/strong&gt; When agents handle execution, the human role shifts to oversight, strategy, and exception handling. This is not about eliminating jobs &amp;ndash; it is about redefining what humans do in knowledge work.&lt;/p&gt;

&lt;h2 class="relative group"&gt;The road ahead
 &lt;div id="the-road-ahead" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#the-road-ahead" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;We are in the early innings. Current agents are brittle in edge cases, expensive to orchestrate at scale, and difficult to debug when they fail. But the trajectory is clear. The companies building agent infrastructure today &amp;ndash; orchestration frameworks, tool ecosystems, evaluation pipelines &amp;ndash; are building the platforms of the next decade.&lt;/p&gt;
&lt;p&gt;The question for every technology leader is not whether agents will reshape their industry. It is whether they will be the ones deploying them or the ones being disrupted by them.&lt;/p&gt;
&lt;p&gt;The window for strategic positioning is open. It will not stay open long.&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/next-ai-wave-agents/featured.png"/></item><item><title>Goodbye Prompt Engineering</title><link>https://carlesabarca.com/posts/goodbye-prompt-engineering/</link><pubDate>Tue, 30 Apr 2024 00:00:00 +0000</pubDate><guid>https://carlesabarca.com/posts/goodbye-prompt-engineering/</guid><description>The prompt engineering era was short-lived. The future points toward autonomous, goal-directed agents.</description><content:encoded>&lt;p&gt;As we come to understand the possibilities offered by the wave of AI-related technological advances, new ways of relating to these technologies are emerging. Our interactions with generative artificial intelligence are transforming, marking the end of the &amp;ldquo;prompt engineering&amp;rdquo; era &amp;ndash; a short-lived &amp;ldquo;era&amp;rdquo; in which some claimed that mastering the art of &amp;ldquo;prompting&amp;rdquo; would be key to benefiting from this new technology. I never subscribed to the notion that &amp;ldquo;prompt engineers&amp;rdquo; would lead AI adoption, and here are some reasons to anticipate the gradual disappearance of the ephemeral &amp;ldquo;art of prompt engineering&amp;rdquo;:&lt;/p&gt;

&lt;h2 class="relative group"&gt;From Interaction to Collaboration
 &lt;div id="from-interaction-to-collaboration" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#from-interaction-to-collaboration" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;The traditional model of interacting with large language models through specific commands is evolving. The future points toward interfaces with autonomous, goal-directed agents. This shift promises greater alignment with human intentions and a significant increase in AI decision-making autonomy.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Enhanced Collaboration
 &lt;div id="enhanced-collaboration" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#enhanced-collaboration" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;These advanced AI agents are designed to understand and anticipate needs, making them perfect partners in diverse professional environments. Whether optimizing decision processes or offering predictive insights, AI is moving toward a more proactive role, eliminating the need for detailed and repetitive command engineering.&lt;/p&gt;

&lt;h2 class="relative group"&gt;Real-World Applications
 &lt;div id="real-world-applications" class="anchor"&gt;&lt;/div&gt;
 
 &lt;span
 class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100 select-none"&gt;
 &lt;a class="text-primary-300 dark:text-neutral-700 !no-underline" href="#real-world-applications" aria-label="Anchor"&gt;#&lt;/a&gt;
 &lt;/span&gt;
 
&lt;/h2&gt;
&lt;p&gt;Imagine an AI that not only responds to commands but also initiates actions aligned with established goals in fields like customer service, healthcare, and finance. This proactive capability could redefine efficiency and effectiveness across various industries. We will progressively see specialized autonomous agents for specific tasks rather than interacting with general-purpose models.&lt;/p&gt;</content:encoded><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://carlesabarca.com/posts/goodbye-prompt-engineering/featured.png"/></item></channel></rss>