Blog

Agentic AdTech Is Older Than You Think

"Agentic" advertising is sold as a 2024 invention. It isn't. Autonomous, multi-agent campaign optimization was being researched and run on live brand campaigns years before the current wave — and that history is the best guide to what's real now and what's just relabeled.

Author: Ad360 engineering
Discipline: Platform engineering

Read the trade press in 2026 and you would think "agentic" advertising was invented the day large language models learned to call tools. Autonomous agents that plan campaigns, allocate budgets, and optimize toward goals are presented as a brand-new species, born of the generative-AI wave. It is an exciting story. It is also historically wrong.

Autonomous, multi-agent optimization of advertising is not new. The idea that software agents — not humans clicking in a console — should continuously decide where budget goes and how campaigns adapt has been researched, formalized, and run against live brand spend for years. Understanding that history is not nostalgia. It is the single best tool for telling which of today's "agentic" claims are substance and which are a familiar idea wearing a new label.

What "agentic" actually means

Before the history, a definition, because the word is being stretched to cover everything. An agent, in the meaningful sense, is a system that pursues a goal by taking actions in an environment, observing the results, and adjusting — with some degree of autonomy from step-to-step human instruction. A multi-agent system has several such components cooperating or competing, each responsible for part of the problem.

By that definition, large parts of programmatic advertising have been quietly agentic for a long time. A pacing controller that observes delivery and adjusts participation to hit a goal is a control agent. A budget allocator that shifts spend toward what is working is an optimization agent. None of this requires a language model. What the current wave adds is a new interface (natural language) and new planning capabilities — not the foundational concept of autonomous optimization.

The exploration–exploitation engine that predates the hype

The clearest example of "agentic before agentic was cool" is the multi-armed bandit. A bandit is, in essence, an autonomous allocator: it balances exploration (trying options to learn) against exploitation (spending on what is already winning), and it updates its own behaviour from observed reward. It is reinforcement-learning-flavored decision-making with no human in the per-decision loop.

This is not a thought experiment in Ad360's lineage; it is implemented. The optimization library carries a multi-armed bandit (BanditStats) tracking rewards, penalties, per-arm observations, and action/selection counts — explore/exploit allocation as a concrete primitive — alongside goal algorithms that compute and adjust delivery targets. An allocator that learns which options to fund and shifts spend accordingly is precisely the behaviour now marketed as "agentic budget management." The mechanism is decades old in the literature and years old in production.

A 2019 autonomous multi-agent system

The deeper point is that the explicit framing of advertising as a multi-agent autonomous system was written down years ago. Ad360's research lineage includes a body of applied work — academic-style publications on an RTB optimization layer, feature-engineering frameworks, ad-event prediction, and ensemble-based anomaly detection — and proprietary scientific papers dating to 2017, 2018, and 2019.

Among them is a 2019 paper titled "An Autonomous Multi-Agent System for Digital Marketing." Sit with the date. That is the concept now presented as a 2024–2026 breakthrough, formalized and named five-plus years earlier — and not in the abstract. The same lineage includes named proprietary algorithms deployed as cloud functions and a series of 2019 experiments on live brand campaigns (hospitality, automotive, travel, non-profit, among others), complete with feature-importance analyses and conversion/CPA heatmaps. The research was validated against real spend, not just published.

A paper titled "An Autonomous Multi-Agent System for Digital Marketing" — dated 2019 — describes, names, and tests the concept the market now sells as brand-new. The history is the credential.

Why the history matters now

This is not about claiming the past to dismiss the present. The current wave is genuinely additive: natural-language interfaces lower the barrier to commanding these systems, and modern planning models can compose actions in ways earlier systems could not. Those are real advances.

But history is the antidote to two failure modes. The first is credulity — believing that "agentic" is automatically new and therefore automatically better, when much of it is a relabeling of optimization techniques that already existed. The second is amnesia — repeating hard-won lessons about autonomy as if no one had encountered them before. The teams that ran autonomous optimization on live campaigns in 2019 already met the questions the market is only now asking: How much authority should an agent have? How do you bound it? How do you prove what it did? Treating 2024 as year zero throws that knowledge away.

What actually changed, and what didn't

What's genuinely new (2023+): natural-language control surfaces; LLM-based planning and tool-use; the accessibility of agentic systems to non-technical operators.
What's older than the hype: autonomous budget allocation (bandits), goal-seeking control loops (pacing), multi-agent decomposition of the optimization problem, and learning from observed campaign reward.
What's perennial: the governance problem. Whether the agent is a 2019 bandit or a 2026 LLM planner, the hard part has always been bounding its authority, keeping a human in command, and making its actions auditable.

That last point is the one the market still under-discusses — and the one a long history makes unavoidable.

Common misconceptions

"Agentic advertising started with generative AI." Autonomous, multi-agent optimization was formalized and run years earlier; LLMs added an interface and planning, not the concept.
"Agentic means an LLM is in charge." Most durable autonomous optimization is bandits and control loops; a language model is one possible controller, not the definition.
"New means better." Some 2026 "agentic" features are relabeled optimization that has existed for years; novelty is not evidence.
"Autonomy removes the human." The serious lineage treats human command and auditability as the central problem, not an afterthought.
"There's nothing to learn from the past." The questions about authority, bounds, and proof were already being answered on live campaigns years ago.

What good operation looks like

Judge an "agentic" claim by its mechanism (what decides, how it learns, how it's bounded), not its branding.
Recognize bandits and control loops as the autonomous workhorses they are.
Treat governance and auditability as the hard part — because the history says it always was.
Use the new interfaces and planning for what they're genuinely good at, without mistaking them for the whole field.

Open questions

Where should an LLM planner sit relative to proven optimization primitives — commanding them, or being constrained by them?
How do lessons from pre-LLM autonomous systems transfer to language-model agents that can take a wider range of actions?
What does auditing an autonomous decision look like when the agent's reasoning is opaque?

The honest version of the agentic story is more interesting than the hype. Autonomous, multi-agent advertising is not a 2024 invention; it is a field with real history, real research, and real campaigns behind it — including a multi-agent system named and tested in 2019. The new wave is a genuine step forward in interface and planning, sitting on top of foundations that are older and better understood than the marketing admits. Knowing the difference is what separates an informed buyer of "AI" from a credulous one.