Introduction to Agents
If you want to see the full version, click here: https://www.kaggle.com/whitepaper-introduction-to-agents Disclaimer: The discussion I wrote in...

This post is a continuation of Introduction to Agents , the second discussion after From Predictive AI to AI Agents: When Machines No Longer Just Answer Questions About the new way of understanding AI Agents: no longer just smart models, but “creatures made of software” with brains, hands, nervous systems, and bodies that work in a loop to achieve goals.
In my previous article, I discussed the shift from AI that simply “asks → answers → finishes” to systems that can begin to be given mission, make plans, and execute several steps yourself. That is what is referred to as a shift from predictive artificial intelligence to autonomous agents.
Now, let's take a step further:
Actually, what exactly is an “AI Agent”?
And why do people who are serious about this need to break it down into:
So that I'm not just repeating terms, I'll try to explain slowly, using language that even non-hardcore developers can understand.
Table of Contents
The discussion I have written here is purely for the purpose of mutual understanding. I have tried to translate and summarize the material as accurately as possible from the whitepaper “Introduction to Agents and Agent Architectures”
Still, there may be parts of the translation or explanation that are inaccurate.
If you find any errors, explanations that seem odd, or have a different perspective, I am very open to discussion. Please share your thoughts in the comments section so that we can learn and correct things together.
If we just say “AI,” that's too broad. It could mean:
In white paper here, they propose the term AI Agent to describe something more specific:
A system that combines models, tools, orchestration, and deployment, which uses language models within loop to achieve a goal.
So the difference is:
If we compare it to a human being, an agent is not just a “brain.” It has a body, hands, nerves, and a way of moving in the world.
Simplified from white paper, An AI agent can be defined as follows:
A combination of language models, tools, orchestration layers (“Nervous System”), and runtime services that run models in loops to achieve a goal.
The four main components:
This combination makes the agent feel more alive because of these four components, not just the model.
In that model, it displays or demonstrates knowledge (data). Let's make sure we understand it correctly, okay?
If you understand the definition of a model: Showcasing or demonstrating products (clothing).
Why am I explaining this? Because there will be many words like "Model" without any AI behind them.
Let's start with the part you hear most often: language model.
This is:
In the context of agents, model this acts as brain which:
White paper emphasizing that choosing a model is not just a matter of “the largest size” or “the most sophisticated,” but rather a matter of balance between:
(See the image below to understand what this means.)

https://ollama.com/library/deepseek-r1
This image shows the AI data model for Deepseek-r1, where there is code behind its name deepseek-r1:5b-617b (the code behind it indicates how much data is used)
Sometimes, agent architecture actually uses more than one model:
A model without tools is like a genius brain locked in an empty room.
Tools is the agent's method:
Examples of tools:
Here, agents have the ability to:
These tools are what make agents worthy of being called “hand”. Without tools, he is just a commentator. With tools, he can become an operator.
Now for the part that is often overlooked but very important: orchestration layer.
If the model is the brain and the tools are the hands, then the orchestration layer is the nervous system which:
The orchestration task is roughly as follows:
Here, old prompt engineering develop into context engineering:
The orchestration layer makes agents feel like systems that life above the model, not just “one call and it's done.”.
A good agent on a developer's laptop is useless if:
Therefore, white paper enter deployment & services as one of the core elements of an agent:
This part may sound “less sexy,” but this is precisely where the agent transforms from an experiment into truly useful services and can be used.
If we were to compare it:
There is an interesting analogy in white paper which I think adequately describes the new role of developers in the world of agents:

In the past, developers were like masons who built logic line by line.
Nowadays, developers are more like directors.
What does that mean?
Current developer:
It's something like this:
Of course, the code remains. The framework remains important. But the focus has shifted from detailed logic coding, to curating the context and environment in which the agent will operate.
Imagine you have a special agent for:
“Helping manage all your brand's blog and social media content needs.”
With the concept of brain–hand–nervous system–body, it looks something like this:
You no longer just ask:
“Please create a caption for product A.”
But more like:
“Please update next week's content calendar for product A's campaign, using last week's performance data as a reference. Create a draft and send me a summary.”
From the section “Introduction to AI Agents,” there is one idea that is repeated throughout:
Agents are not just “smarter AI,” but a new way of building software:
LMs in a loop, with interconnected tools, orchestration, and deployment.
By dividing it into:
we now have a clearer language for:
In the next article, we will begin to delve into How agents solve problems through the “Think – Act – Observe” loop”, and how that process distinguishes it from AI that only responds once.
For now, I would like to conclude with one simple sentence:
The more complex technology becomes, the more important it is to have a simple way of explaining it — not so that we can join the hype, but so that we can maintain control over how we use technology in our lives and work.