Chatbots respond. Agents take action. The terminology is a mess. Here's how to tell them apart and pick the right one for your use case.
TL;DR:
- Chatbots respond with text. Agents take action through tool calls.
- The line blurred in 2024 when OpenAI shipped function calling and Anthropic released MCP.
- Most teams should start with a chatbot. Agents are powerful, but they fail in louder ways.
- Agent infrastructure costs 3 to 10x more per interaction once you factor retries, tool latency, and human review.
- If your top 20 support questions don't require a database write, you don't need an agent yet.
The word "agent" got hijacked in 2025. Every vendor with a chatbot slapped "AI agent" on the box and called it a day. The result: a customer asks for an AI agent and gets a chatbot. Or asks for a chatbot and pays for agent infrastructure they never use.
Here's the actual distinction, what changed in 2026, and how to figure out which one you need.
Chatbot: A conversational interface that responds to user input with text (or sometimes voice). It can retrieve information from a knowledge base. It cannot change anything in the outside world.
Example: A user asks "what's your return policy?" The chatbot pulls the policy from your docs and responds.
Agent: A chatbot plus the ability to call tools. Tools are functions the model can invoke: an API endpoint, a database query, a calculator, a browser. The agent decides when to call which tool, reads the result, and continues the conversation.
Example: A user says "I want to return order #4521." The agent looks up the order in Shopify, checks the return window, generates a return label, emails it to the customer, and updates the ticket in Zendesk.
The chatbot answers the question. The agent does the thing.
Three shifts collapsed the gap between chatbot and agent over the last 18 months:
1. Function calling went mainstream. OpenAI introduced function calling in mid-2023. By 2026, every serious model provider supports it: Anthropic (tool use), Google (function calling on Gemini), Meta (Llama tool use). Adding a tool to a chatbot is now a JSON schema definition, not a custom integration.
2. Anthropic shipped the Model Context Protocol (MCP). Released November 2024, MCP is an open standard for connecting LLMs to data sources and tools. Instead of writing a custom integration for each vendor, you point your model at an MCP server. By May 2026, there are MCP servers for Slack, GitHub, Postgres, Google Drive, Notion, Linear, Sentry, and dozens more.
3. Multi-step reasoning got cheaper. Claude 4.5 Sonnet and GPT-5 can plan, call a tool, read the output, and decide the next step inside a single inference call. The "agentic loop" used to require careful orchestration code. Now it runs natively.
The practical effect: in 2026, any chatbot can become an agent in an afternoon. The hard part isn't the technology. It's deciding whether you should.
| Dimension | Chatbot | Agent |
|---|---|---|
| What it does | Reads, retrieves, responds | Reads, retrieves, acts |
| Typical latency | 1 to 3 seconds | 3 to 30 seconds per tool call |
| Cost per interaction | $0.001 to $0.02 | $0.02 to $0.50+ |
| Risk if wrong | Bad answer | Bad answer plus side effects |
| Setup time | Hours to a day | Days to weeks |
| Monitoring needed | Light | Heavy (every tool call logged) |
| Best for | FAQ, knowledge lookup, routing | Order changes, refunds, scheduling |
Cost ranges assume Claude 4.5 Sonnet or GPT-5 (default tier). A chatbot that hits cache and answers from a 5KB context costs pennies. An agent that calls 4 tools, retries one, and writes back to a database can hit 50 cents per resolution.
For most SMB support workflows, a chatbot does the job. Here's the test: look at your last 200 tickets. How many of them ended with a customer asking a question and your agent answering it from internal docs or a help center article?
If the answer is more than 70 percent, you don't need an agent. You need a chatbot with a clean knowledge base.
The cases where a chatbot wins:
A well-tuned chatbot on Claude 4.5 Sonnet or GPT-5 with RAG over your help center will resolve 40 to 70 percent of these tickets without escalation. Industry benchmarks from Intercom, Zendesk, and others put automated resolution at the higher end of that range for mature deployments.
You don't need tools for any of this. You need good retrieval.
The case for an agent is when the customer's desired outcome requires changing state somewhere. Some examples:
Ecommerce returns and exchanges. Customer wants to swap a size. The agent looks up the order, checks inventory, generates a new label, refunds the difference, and updates the customer record. Five tool calls. A chatbot can only tell them where the return form lives.
Appointment scheduling. Customer wants to reschedule. The agent reads their calendar, checks staff availability, writes the new appointment, sends confirmation, and updates the CRM.
Account changes. Password resets, email changes, plan upgrades, address updates. Anything that writes to a user record.
Order tracking with real-time data. Not "where do I track my order" (chatbot) but "where is my order right now" (agent calling the carrier API).
Cross-system workflows. Customer reports a bug. The agent creates a Linear ticket, posts in the engineering Slack channel, and replies to the customer with a tracking number.
These workflows save real money. A typical ecommerce return handled by a human costs $4 to $15 in agent time (Zendesk and Gartner have both published figures in that range). An agent that completes the workflow autonomously costs 30 to 50 cents.
The catch: when the agent gets it wrong, the cost is higher too. A misrouted ticket is annoying. An incorrectly issued refund is a financial loss. Build agents for workflows where the upside justifies the operational overhead.
Agents look cheap per interaction until you add up the full bill.
A chatbot interaction:
An agent interaction with 4 tool calls:
Then add the things you can't skip:
For most teams, the all-in cost of running an agent is 3 to 10x what they expected. Not because tokens got expensive, but because the supporting infrastructure adds up.
Three questions:
1. Does the user's desired outcome require changing state? If no, build a chatbot. If yes, agent is on the table.
2. What does a wrong action cost? A wrong answer wastes time. A wrong action might cost real money or break trust. If the cost of a wrong action is greater than the savings from automation, keep a human in the loop or stick with a chatbot.
3. Do you have the engineering bandwidth to monitor it? Agents need eval suites, observability, and on-call coverage when they start acting up. A chatbot can run unattended for weeks. An agent needs eyes on it, especially in the first three months.
If you answer yes to all three, build the agent. Otherwise, ship the chatbot first. You can always add tools later.
A customer emails: "I want to cancel my subscription and get a refund for the unused portion."
Chatbot build: The chatbot identifies the intent (cancellation), pulls the relevant policy from the help center, explains how to cancel in the dashboard, and offers to escalate to a human if the customer wants a prorated refund. Resolution requires the customer to take action. Cost: 1 cent. Time: 2 seconds.
Agent build: The agent verifies the customer's identity via their email, looks up their subscription in Stripe, calculates the prorated refund, processes the cancellation, issues the refund, sends a confirmation email, and updates the CRM. Resolution is complete. Cost: 18 cents. Time: 14 seconds.
Both are valid. The chatbot is faster, cheaper, and lower risk. The agent is better customer experience, more expensive, and requires you to trust the workflow.
If you process 50 cancellations a day, the agent saves about 25 hours of agent time per month. If you process 2 a day, the chatbot is fine.
Before MCP, building an agent that touched 5 systems meant writing 5 integrations. Each one custom. Each one had to be maintained when the vendor changed their API.
MCP standardizes this. You point your model at an MCP server (Anthropic publishes a registry, and the community has built hundreds). The server exposes tools. The model uses them.
For support teams, the practical impact is that connecting your agent to Slack, Shopify, Zendesk, or Postgres takes hours instead of weeks. It's still maturing, MCP servers are uneven in quality, and the security model is being figured out, but the direction is clear.
If you're building an agent in 2026, start with MCP servers where they exist. Fall back to custom tools where they don't.
This article is wrong for you if:
Is every AI assistant an agent now? No. The marketing language has gotten loose, but the technical distinction holds: if it can call tools and change state, it's an agent. If it only generates text, it's a chatbot. A lot of "AI agents" sold in 2026 are actually chatbots with a vendor's CRM integration bolted on.
Can I start with a chatbot and upgrade to an agent later? Yes, and most teams should. Tool use is additive. The underlying model is the same. If you build your chatbot on Claude 4.5 Sonnet or GPT-5, adding tool calls later is a matter of defining the tools and updating the prompt, not switching infrastructure.
Are agents safe? They're as safe as your tool design. The model doesn't decide what's destructive, you do. Mark dangerous tools as requiring human approval. Limit refund amounts. Sandbox account deletions. The agent is only as careful as your guardrails.
What about cost? Are agents priced differently? Same per-token pricing as the underlying model. The cost difference comes from doing more inference calls per interaction (one call per tool response) and from the tool infrastructure itself. Budget for 3 to 10x what a chatbot interaction costs.
The chatbot vs agent question isn't about which is better. It's about which fits your workflow. Chatbots are good at answering questions. Agents are good at doing things. Most support workflows are 70 percent question, 30 percent action. Build for that split.
Start with a chatbot that handles the question side cleanly. Add agent capabilities for the highest-volume action workflows once your chatbot is stable. Don't build an agent because the vendor calls it an agent.
Want to start with a chatbot that's actually good at the question side? Try Chatsy free or see pricing.
How to evaluate a customer support chatbot before launch using a 25-question golden set, four scoring dimensions, and the right tooling stack.