Data Strategy for Canadian Healthcare: The Interoperability Imperative

Data Strategy

May 19

Approximately 20 percent of patients may have incorrect medical records due to data fragmentation across healthcare systems, according to research on interoperability challenges in healthcare organizations. Canadian healthcare organizations rely on an average of 20 or more different software systems, most of which do not share data with each other reliably or at all. Twenty-two percent of Canadian healthcare organizations implemented domain-specific AI tools in 2025, according to Techno-soft's review of AI-integrated Canadian healthcare systems. Nearly all of them discovered the same constraint: the data required to make AI clinically useful is fragmented, inaccessible, or insufficiently standardized to train reliable models or support real-time clinical decision support.

The Canadian healthcare data strategy problem in 2026 is not a shortage of ambition or a shortage of investment. It is a structural problem: health data is governed provincially, collected in systems that were designed before modern interoperability standards existed, owned by organizations with different incentives for sharing, and subject to privacy legislation that varies by jurisdiction. The federal government has signalled its direction clearly through the Pan-Canadian Interoperability Roadmap, the Connected Care for Canadians Act, and Health Canada's 2025-26 Departmental Plan. The signal is that interoperability is coming, whether as a regulatory requirement or as a market necessity driven by AI adoption. The strategic question for healthcare data leaders is not whether to build an interoperability-ready data foundation. It is how to build it in a way that serves the clinical, operational, and AI program objectives the organization has committed to, within the governance constraints that the Canadian healthcare system imposes.

The Policy Moment: What Is Actually Advancing

Bill C-72, the Connected Care for Canadians Act, which would have mandated interoperability of health IT and prohibited data-blocking by vendors, died on the Order Paper when Parliament was dissolved in January 2025. Bill S-5, a successor effort, is advancing in the Senate. Chambers and Partners' 2025 analysis of the Canadian healthcare AI regulatory environment describes the legislative direction clearly: the policy signal is consistent even as the specific legislative vehicle has changed. Federal direction on common standards for exchange is established. The implementation timeline is uncertain but the trajectory is not.

The Pan-Canadian Interoperability Roadmap, developed collaboratively by Canada Health Infoway and the Canadian Institute for Health Information, provides the operational framework within which provincial and territorial health authorities are building their interoperability programs. The Roadmap identifies HL7 FHIR as the preferred standard for health data exchange and establishes a phased implementation timeline for patient-facing and provider-facing data access. Health Canada's 2025-26 Departmental Plan continues the multi-year commitment to digital modernization and identifies interoperability as a central policy objective, building on the Roadmap and ongoing efforts to ensure health information is securely accessible to providers and patients across provinces and territories.

Canadian Healthcare Technology's May 2026 analysis of the connected care opportunity frames the current moment precisely: digital health connectivity is certainly the main goal, but the strategies available today were not available or mature just a few years ago. The combination of FHIR maturation, agentic AI capabilities, and the legislative push creates an opportunity to build genuinely connected care infrastructure rather than continuing to layer point solutions on top of fundamentally fragmented data architecture. The organizations that position their data strategy around that opportunity rather than around managing the existing fragmentation are building toward the system the policy environment is directing. The organizations that continue optimizing within the current fragmented model are building toward obsolescence.

What FHIR Actually Means for Healthcare Data Strategy

HL7 FHIR, Fast Healthcare Interoperability Resources, has become the de facto global standard for health data exchange and the foundation of the Canadian interoperability agenda. Understanding what FHIR actually is and what it requires from a data strategy perspective is essential for healthcare data leaders who are being asked to build FHIR-compliant data infrastructure without always having a clear picture of what that entails in operational terms.

FHIR is a standard that defines how health data is structured, described, and exchanged through APIs. It provides a set of resource types, such as Patient, Encounter, Observation, Medication, and Condition, each with defined data elements and relationships, and specifies how those resources should be exposed through RESTful APIs. A FHIR-compliant system can share a patient's medication history with another FHIR-compliant system using a standardized API call, without either system needing to know the other's internal data model or maintaining a custom point-to-point integration.

The clinical implications are direct. AI clinical decision support tools integrated through SMART on FHIR and CDS Hooks can access real-time patient data at the point of care, enabling risk prediction, care gap identification, and clinical recommendation at the moment when the clinician is making a decision rather than in a reporting environment removed from clinical workflow. FHIR R5 introduced enhancements for patient engagement, care coordination, and population health in March 2023. FHIR R6, expected in late 2026, will achieve normative status for most clinical and administrative resources, providing the long-term stability that has made some organizations cautious about accelerating FHIR adoption before the standard matured.

For data leaders building a healthcare data strategy in 2026, FHIR readiness has three distinct layers. API readiness means that clinical systems can expose their data through FHIR APIs and receive data from external FHIR sources. Semantic readiness means that the clinical data elements are mapped to FHIR resource types with sufficient consistency that a patient's Observation in one system means the same thing as an Observation in another. Governance readiness means that the consent, access control, and audit requirements for FHIR-based data exchange are defined and implemented, including the patient-directed access mechanisms that the Connected Care for Canadians Act would have required and that provincial FHIR implementations are building toward regardless of the federal legislative timeline.

Most Canadian healthcare organizations are at the first layer and working toward the second. The semantic readiness challenge is the hardest, because it requires not just technical mapping but clinical consensus on what data elements mean and how they should be coded, which requires governance work that spans organizational and sometimes provincial boundaries. The organizations that invest in semantic standardization alongside API readiness are building a data foundation that is genuinely useful for AI and analytics. Those that build API readiness without semantic standardization are building integration plumbing that moves data without ensuring the data means what it claims to mean on the other side.

The Data Strategy Problems That Block Clinical AI

The 22 percent of Canadian healthcare organizations that implemented domain-specific AI tools in 2025 encountered a consistent set of data problems that determined whether those implementations produced clinical value or produced technically functional systems that clinicians did not trust or use. Understanding these problems specifically is more useful than a general discussion of data quality, because each has a specific root cause and a specific remediation path.

Incomplete and Inconsistent Patient Records

An AI model trained on patient records that are incomplete, inconsistently coded, or that represent a partial view of the patient's history learns from a distorted picture of clinical reality. A predictive risk model trained on emergency department records without access to the patient's primary care history or community pharmacy records will systematically underestimate risk for patients whose relevant history is not in the ED system. The error is not in the model. It is in the data the model was trained on.

The remediation requires both technical and governance investment. Technical remediation addresses the integration layer: building the FHIR-based APIs and patient identity matching infrastructure that allows a patient's records across multiple systems to be aggregated into a longitudinal view. Governance remediation addresses the consent and access control layer: establishing the patient consent framework that permits the longitudinal aggregation and defines the purposes for which it can be used. In the Canadian context, the consent framework varies by province, and a data strategy that works in Ontario may require modification for British Columbia or Quebec. Building the consent architecture with provincial variation in mind from the start is less expensive than building a single-province model and retrofitting it for others.

Terminology and Coding Inconsistency

Canadian healthcare organizations use multiple clinical terminology standards across different systems and different care settings. SNOMED CT for clinical findings, ICD-10-CA for diagnostic coding, LOINC for laboratory observations, and various local coding systems that accumulated before standardized terminologies were widely adopted all coexist within the same health authority's data estate. An AI system attempting to identify patients with a specific diagnosis across systems that code that diagnosis differently in each system will either miss patients or misidentify them, depending on how the mapping is handled.

Terminology management is governance work rather than technical work in its primary dimension. The technical tools for terminology mapping and value set management are mature. What is typically missing is the clinical informatics governance structure that defines the authoritative mapping between local codes and standard terminologies, maintains it as standards evolve, and ensures that new system implementations use the approved terminology mappings rather than introducing new inconsistencies. Health Canada's and Canada Health Infoway's work on the pan-Canadian reference value sets provides a national reference point for this governance work, but most organizations need to build the local governance structure that connects their own coding practices to the national standards.

Data Residency and Access Control Complexity

Healthcare data in Canada is subject to provincial privacy legislation, in addition to federal requirements under PIPEDA and its successors, that imposes specific constraints on where data can be stored, who can access it, and under what circumstances it can be shared. A health authority that wants to train an AI model on data from multiple provincial systems needs to navigate consent requirements, data residency requirements, and de-identification standards that may differ between the contributing systems and the jurisdiction where the model training will occur.

These constraints are real and non-negotiable, and data strategies that treat them as implementation details to be addressed after the architecture is designed consistently produce compliance problems that require expensive remediation. The data governance framework for a healthcare AI program needs to be designed with the provincial privacy landscape explicitly mapped: which data elements are subject to which legislative requirements, what the consent and access control model is for each use case, and how de-identification and aggregation will be handled in ways that satisfy the applicable requirements in each contributing jurisdiction.

The investment in getting this governance design right before the AI program scales is substantially less than the investment in retrofitting governance into a deployed AI system after a privacy breach or a regulatory finding. The data governance principles described in this series apply in healthcare with additional specificity: the governance needs to be clinically credible as well as technically sound, which means clinical informaticists and privacy counsel need to be involved in the governance design rather than ratifying it after the data team has built it.

The AI Investment That Requires the Data Foundation First

The clinical AI investment agenda for Canadian health systems in 2026 and 2027 is ambitious. Predictive risk stratification for population health management. Clinical decision support embedded in EHR workflows. Diagnostic AI for imaging and pathology. Care coordination AI that manages transitions across the continuum of care. Each of these categories requires a different kind of data foundation, and the organizations that understand this distinction are sequencing their investments more intelligently than those that are evaluating AI tools before assessing whether their data can support them.

Population health management AI requires longitudinal patient data aggregated across primary care, specialist care, and acute care settings, consistently coded, and linked through a reliable patient identity matching system. Without that foundation, population risk stratification produces incomplete lists that miss the patients most at risk precisely because their care has been fragmented across systems that do not share data. The data investment required to support population health AI is primarily an interoperability investment: building the FHIR-based aggregation layer and the governance framework that permits longitudinal data use.

Clinical decision support embedded in EHR workflows requires real-time access to the patient's current data, including data from external systems, with latency low enough to support clinical workflow rather than interrupt it. A CDS alert that arrives after the clinician has already made the decision it was supposed to support is a governance and patient safety liability rather than a clinical tool. The data infrastructure requirement is SMART on FHIR integration that provides the CDS engine with real-time access to the EHR's current data state, which requires FHIR readiness at the API and semantic layers described above.

Diagnostic AI for imaging and pathology is technically the most mature category of clinical AI and the one where the data foundation requirements are best understood. The primary requirements are data quality, standardized metadata, and sufficient volume of labeled cases for model training. The Canadian challenge in this category is typically the absence of standardized metadata across imaging archives from different systems and different time periods rather than a fundamental interoperability problem. Remediation requires metadata standardization work on existing archives alongside standards-compliant metadata collection for new acquisitions.

The Data Strategy That Serves Multiple Masters

Canadian healthcare data leaders operate in an environment where the data strategy needs to serve clinical operations, population health management, regulatory reporting, AI program development, and research simultaneously, using data that was collected for clinical care under consent frameworks that may not have anticipated its use for any of the other purposes. Designing a data strategy that serves all of these purposes without creating governance conflicts requires explicit design choices that most organizations have not made.

The most important design choice is the consent architecture: how patient consent is obtained, recorded, and governed for each category of data use, and how the consent record is accessible to each system that needs to enforce it. A consent architecture that supports clinical care but does not extend to secondary uses like AI model training or research means that the data collected for clinical purposes cannot legally be used for the AI program without additional consent processes, which are expensive to operate and may produce participation rates that bias the resulting data.

The growing body of evidence on patient attitudes toward health data sharing in Canada is modestly encouraging: most patients are willing to share their data for purposes that benefit their own care and the health system more broadly, provided they understand how it will be used and have meaningful control over its use. The consent architectures that achieve high participation rates are those that are transparent about the purposes, granular enough that patients can make meaningful choices rather than consenting or not consenting to everything, and that deliver visible benefit back to the patient rather than treating consent as purely a regulatory compliance requirement.

The data strategy for Canadian healthcare in 2026 is at its core a design problem that requires clinical, legal, technical, and governance expertise working together rather than sequentially. Organizations that engage all four perspectives early and build the data foundation around the constraints they collectively identify will produce data infrastructure that supports the AI program the health system needs. Organizations that build the technical infrastructure and then bring legal and governance in to ratify what has been built will produce infrastructure that is expensive to remediate and limited in its clinical utility precisely in the ways that the regulatory and governance constraints were trying to prevent.

Talk to Us

ClarityArc's data strategy practice supports Canadian healthcare organizations in designing data foundations that are interoperability-ready, governance-sound, and built to support the clinical AI investments that require them. If you are building or reviewing your healthcare data strategy and want a perspective that understands the Canadian regulatory context, we are ready to help.

Get in Touch