Skip to content

Back to blog

The five levels of agentic automation

From AI-augmented tasks to autonomous digital knowledge workers, we introduce a guide to understanding AI agent evolution.

Author July 12, 2024 Antti Karjalainen, Co-Founder

Understanding AI agent capabilities

AI agent is a term that’s often hard to nail down and define precisely. This is because agentic behavior is a spectrum, and we don’t have a clear definition of the capabilities “an agent” should have.

To make things more confusing, as agents have become the AI buzzword of the moment, everything becomes an agent. To cut through this noise and help companies set expectations, we introduce a way to categorize AI agents based on their capabilities and outcomes. 

Our focus is on agentic automation systems that help organizations automate knowledge work. This article is the first version of categorization for agentic automation, and this framework will evolve as we learn more about agentic systems and experience more breakthroughs in AI capabilities.

Knowledge work on autopilot

Like self-driving cars, agentic automation has different capabilities, and it’s useful to develop a similar way to categorize agents as we have for autonomous vehicles. We all understand what it means to call a level five self-driving car a robotaxi: The human in the car can focus on reading a book instead of worrying about traffic.

Using the same analogy, the level five equivalent for agentic automation is a system that can complete knowledge work tasks fully autonomously with very few guardrails. We can give the agent a domain like accounting, and it can work within a given set of boundaries and perform assignments it has never done before, using creative problem-solving and collaboration to formulate solutions.

But before we can proceed further in this classification, we need to define what it means to call a system agentic.

All agents are programs, but all programs are not agents. What separates these two is agency – the ability to take action or to choose what action to take to achieve a particular outcome.

All agentic automation systems exhibit some form of planning, reasoning, and decision-making that is more complex and nuanced than purely rule-based if-this-then-that type of logic. To classify the different levels of agentic automation we can consider the distinguishing characteristics of AI agents:

ReasoningThe action of thinking about something in a logical, sensible way. How does the system formulate and reflect on its plan of action? This includes determining the intent and outcome of a task, inspecting ongoing work, and making decisions on future actions and plans.

CollaborationThe action of working with someone to produce something. How does the system interact with humans, tools, and other agentic systems to achieve an outcome? Can the system defer to other actors for decisions and seek guidance when needed? What level of human involvement is needed to complete a task?

ActionThe process of doing something, typically to achieve an outcome. How does the system interact with the outside world? What range of tasks can it perform through actions?

With these questions in mind, let’s define levels of agentic automation from non-agentic automation to digital knowledge worker.

Levels of agentic automation

Level 0: Fixed automation
At the ground level, fixed automation represents no true agentic behavior. It’s equivalent to traditional Robotic Process Automation (RPA) with a fixed set of rules and fully deterministic outcomes. There’s no planning or execution control, as everything is predetermined during programming. Human interaction is limited to handling exceptions, and the task range is confined to rule-based logic.

Example: Data entry to forms, websites, and business applications. Data collection and scraping.

Level 1: AI-augmented automation
To begin with, the first level introduces basic agentic behavior at the individual decision level. It’s essentially fixed automation with some steps augmented by large language models (LLMs). While it offers limited benefits over traditional automation, it represents the first step towards more advanced agency through constrained decision-making.

Example: Classifying customer support emails and forwarding them to the right teams. 

Level 2: Agentic assistant
As we move up a level, we see task-specific agentic automation assistants capable of using tool-calling. These systems can interpret user intent, determine the desired outcome, and take appropriate action – such as summarizing text, generating content, or using specific tools. However, they are limited to static, short-term plans.

Example: Conversational co-pilot for searching, summarizing, and drafting emails.

Level 3: Plan and reflect
Today, this level is commonly called an AI agent, and it’s the first level to exhibit constrained autonomy. These agentic systems can create plans based on given intents, execute them, reflect on their success, and modify plans mid-execution if necessary. Many of today’s advanced AI agents operate at this level, capable of multiple reasoning cycles and planning to achieve desired outcomes.

Example: Reconciling a hundred-page invoice against internal systems based on a human-level set of rules and guidelines. Handles complexity, ambiguity, and variability in the process flow and data.

Level 4: Self-refinement
Level 4 agentic automation will be capable of meaningful self-improvement with or without human collaboration. It can examine and modify its instructions and learning data, create new tools, and connect to new data sources. This level of automation allows agents to keep up with evolving tasks and environments. While currently theoretical, we know that level 4 systems are possible. However, today’s AI models lack the reasoning power to support them in real-world business applications. 

Example: Agent for complex invoice reconciliation that can add new vendors with minimal human coordination and improve accuracy over time.

Level 5: Autonomy
The highest level of agentic automation represents what many consider Artificial General Intelligence (AGI). These hypothetical agents exhibit original thinking and will synthesize solutions to previously unseen tasks. Leveraging advanced logical reasoning and creativity, Level 5 agents will be able to solve complex problems beyond their initial training.

Example: Digital knowledge worker capable of end-to-end processing of complex tasks with no oversight.

Charting the future of agentic automation

As AI technology continues to advance, this framework provides a useful way to categorize and understand the capabilities of agentic automation and AI agents. By clearly defining these levels, we can better communicate about AI systems, set realistic expectations, and chart the path forward in agentic automation development.

It’s important to note that we’re currently operating primarily within the first three levels. Levels beyond this represent future possibilities that will improve the way we work. As we progress, this framework may evolve to accommodate new breakthroughs and insights in AI capabilities.