Agentic AI in Production: What Canadian Enterprises Are Learning in Year One

IT Strategy

May 22

OutSystems' global 2026 State of AI Development report, surveying nearly 1,900 IT leaders, found that 96 percent of enterprises are already using AI agents in some capacity and 97 percent are exploring system-wide agentic AI strategies. The shift from experimentation to production is real and happening faster than most organizations expected. The problems that come with that shift are also real, and arriving faster than most organizations prepared for.

Ninety-four percent of those same organizations report that AI sprawl is increasing complexity, technical debt, and security risk. Only 12 percent have implemented a centralized platform to manage it. Salesforce's 2026 Connectivity Benchmark found that 50 percent of enterprise agents operate in isolated silos with no coordination, no shared context, and no unified governance. Twenty-seven percent of the APIs connecting those agents are ungoverned, with no audit trail and no access controls.

In Canada specifically, KPMG's May 2026 survey of 306 Canadian executives found that 77 percent of Canadian organizations are deploying agentic AI, while only 3 percent have achieved measurable returns on their AI investments. The gap between deployment and value is larger in Canada than the global average, where 8 percent of organizations have achieved measurable returns. Canadian employee resistance to agentic AI sits at 31 percent, nearly double the global average of 16 percent.

Year one of production agentic AI is producing a consistent set of lessons across Canadian enterprises. They are not lessons about whether agentic AI works. It does, in the use cases where the conditions are right. They are lessons about what the conditions actually are, what breaks when those conditions are not met, and what the organizations producing the 3 percent of measurable returns are doing differently from the 97 percent that are not.

Lesson One: Deployment Is Easy. Governance Is Not.

The barrier to deploying an AI agent in 2026 is genuinely low. Most major enterprise platforms, Salesforce, ServiceNow, Microsoft 365, SAP, have native agent-building capabilities that allow a technically capable business user to deploy a working agent in hours without writing code. Gartner projects that by 2027, 75 percent of employees will acquire, modify, or create technology without IT oversight, up sharply from 41 percent in 2022. Natural language has replaced logic as the barrier to entry for building autonomous systems.

The governance barrier, defining who is allowed to build agents, what systems they are permitted to connect to, what data they are authorized to access, how their actions are logged and reviewed, and how they are retired when they are no longer needed, has not fallen at the same pace. The result is the agent sprawl problem: organizations that started 2025 with fewer than 15 AI agents in production and are on track to have 150,000 by 2028, according to Gartner's projections, without the governance infrastructure to know what those agents are doing, what permissions they have inherited, or whether they are still serving their intended purpose.

The agents that are producing the most risk in year one are not the ones that were formally approved and deployed with IT oversight. They are the ones that individual teams built using platform-native tools, connected to production data and systems using service accounts with broad permissions, and deployed without informing IT, security, or compliance. These agents are technically functional and organizationally invisible. They inherit the permissions of the user or service account that created them, which means they can access everything that user can access. When that user leaves the organization, the agent often continues running with retained permissions, accessing systems and data on behalf of a former employee whose access should have been revoked.

The governance design that addresses this is not technically complex. It requires an agent registry that every deployed agent, including business-user-built agents, must be registered in before connecting to production systems. It requires that agents use dedicated service identities with minimum necessary permissions rather than inheriting user credentials. It requires a lifecycle process that reviews active agents on a defined cadence and retires those that are no longer serving their stated purpose. And it requires that the governance framework is enforced at the platform level, not through policy alone, because policy compliance in environments where thousands of agents can be created by non-technical users without IT involvement is not enforceable through documentation.

Lesson Two: The Cost Model Is Different From What Was Projected

The AI cost model that most organizations used to justify agentic AI investment was based on productivity gains and headcount cost avoidance. The cost model that most organizations are experiencing in production has additional dimensions that were not in the original projections.

Consumption costs for agentic systems scale with usage in ways that are fundamentally different from traditional software licensing. An agent that runs hundreds of LLM inference calls per task, connects to multiple enterprise systems, and executes actions that trigger downstream processes generates consumption costs at every step. The per-task cost of an agentic workflow is often higher than the per-task cost of a human performing the same task, at current LLM pricing, before the productivity multiplier justifies the investment. At scale with high adoption, the economics improve substantially. At partial adoption with low utilization, the economics are often worse than the pre-AI baseline.

Gartner's forecast that global software spending will surge 15.2 percent in 2026, driven significantly by AI consumption costs that bypass traditional procurement processes, describes a dynamic that Canadian IT leaders are encountering in their 2026 budget reviews. Agents deployed by business teams on consumption-based platforms are generating API costs, storage costs, and integration costs that were not captured in the original business case because the business teams that deployed the agents did not include infrastructure cost in their ROI calculation. The IT finance function discovers these costs in the cloud bill rather than in the approved project budget.

The FinOps discipline that cloud cost management requires has a direct agentic AI equivalent. Organizations that instrument their agent deployments to track cost per agent, cost per task, and cost per outcome are the ones that can identify which agents are producing value at an acceptable cost and which are generating cost without proportionate value. Organizations that track only output metrics, tasks completed, interactions handled, without tracking cost metrics, are producing the activity reports that boards and CFOs are no longer accepting as evidence of AI value.

Lesson Three: Integration Failures Are the Most Common Production Problem

Eighty-six percent of IT leaders in Salesforce's 2026 survey worry that agents add more complexity than value due to integration failures. That concern is grounded in production experience. The most common failure mode for agentic AI in year one is not the agent's reasoning capability, which is generally adequate for well-scoped use cases. It is the agent's ability to reliably connect to the enterprise systems it needs to act on.

Enterprise systems were not designed for programmatic access by AI agents. They were designed for human users or for specific point-to-point integrations built to defined specifications. An AI agent attempting to access an ERP system to retrieve a customer record, check an inventory level, and update a service request is making three API calls to systems that may have rate limits, session timeouts, inconsistent error handling, and authentication models that were designed for human login flows rather than service account access. When any one of those calls fails, the agent needs to handle the failure gracefully, retry appropriately, and escalate to a human when the failure is unrecoverable. Most agents deployed in year one were not designed with robust failure handling because the integration problems were not anticipated during the pilot, which ran against demo environments with clean data and reliable connectivity.

The integration architecture that supports production agentic AI is more complex than the integration architecture that supported the pilot. It requires API gateway management that provides rate limiting, retry logic, and circuit breaking between the agent and each connected system. It requires data quality validation before the agent acts on retrieved data, because an agent that acts on a stale or incorrect record can cause downstream errors that are more expensive to remediate than the efficiency gain the agent produced. And it requires monitoring that detects integration failures in real time rather than discovering them after users report that the agent stopped working.

Lesson Four: Employee Trust Is a Production Requirement, Not a Change Management Nice-To-Have

Canada's 31 percent employee resistance rate, nearly double the 16 percent global average, is the most specifically Canadian dimension of the year one agentic AI lesson. The KPMG analysis connects this directly to the value realization gap: when employees do not understand how to work alongside AI tools or do not trust them, adoption stalls, and so does the return on investment.

The adoption problem in Canadian enterprises is not primarily that employees refuse to use AI agents. It is that they use them inconsistently, reverting to manual processes for tasks that feel high-stakes or unfamiliar, routing around the agent when they are not confident in its output, and reporting the agent as unreliable when the problem is actually that they were not trained to evaluate its outputs correctly. An agent that is technically producing correct outputs 90 percent of the time but that employees override 60 percent of the time because they cannot tell the correct outputs from the incorrect ones is not delivering 90 percent of its potential value. It is delivering 36 percent, and the remaining 64 percent is being consumed by the overhead of human review that was not factored into the original efficiency calculation.

The trust gap has a specific solution that is different from general AI literacy training. It is role-specific calibration: training that teaches the people who use a specific agent what it is good at, what it is not good at, which types of its outputs require human verification and which do not, and how to recognize when the agent is operating outside the conditions it was designed for. Generic AI literacy training does not produce this. Role-specific calibration for specific agents in specific workflows does, and it is the investment that separates the organizations producing measurable returns from those with high adoption and low value realization.

Lesson Five: The Governance Architecture Needs to Precede Scale, Not Follow It

The organizations that are in the best position in year two of production agentic AI are the ones that invested in governance infrastructure before they scaled deployment, not after. The AI Centre of Excellence post in this series describes the organizational structure that provides this governance. The responsible AI framework post describes the policy and documentation layer. Together they constitute the governance architecture that prevents the sprawl, the cost overruns, and the integration failures that characterize year one agentic AI deployments that were scaled without governance infrastructure.

The specific governance investments that produce the most value before scale are three. An agent registry with mandatory registration before production deployment, enforced at the platform level. A cost instrumentation framework that tracks consumption cost per agent from day one. And a deployment approval process that requires a defined use case, a named business owner, a data access scope, and an exit criterion, the conditions under which the agent will be retired or redesigned, before any agent connects to production systems.

Gartner's projection that a typical Fortune 500 will manage 150,000 AI agents by 2028, up from fewer than 15 in 2025, describes a governance problem of a scale that cannot be solved retroactively. The organizations that build the registry, the cost instrumentation, and the deployment approval process now, while the number of agents is still manageable, will have a governable agentic AI estate in 2028. The ones that defer governance until the sprawl becomes a crisis will be remediating a problem that is orders of magnitude more complex than the one they could have prevented.

Talk to Us

ClarityArc helps Canadian organizations design agentic AI governance frameworks, build agent registries, and establish the cost instrumentation and deployment approval processes that prevent sprawl before it becomes a remediation problem. If your organization is scaling agentic AI and wants to build the governance infrastructure before the deployment gets ahead of the controls, we are ready to help.

Get in Touch

Administrator