Canadian Property Data in 2026: Categories and Sources

Q: What are the main types of Canadian property data?

Five categories cover most use cases. Government land-registry records document the legal status of each property. Appraisal and assessment data reflects valuation at points in time. MLS transaction records capture what sold and at what price. Repeat-sale series track how the same property changes in value across transactions. Listing-lifecycle data captures the full market behaviour of a property over time, not just its completed sales.

Q: What is a persistent property identifier?

A persistent property identifier is a stable reference that links every record touching the same physical property, regardless of which listing number, agent, or address format appears in the underlying source. Without one, a property that relists under a new number looks like a new record, and time-series analysis breaks.

Q: Which data type is right for my use case?

It depends on the time dimension. For a single legal or valuation point, registry and appraisal data are correct. For current market activity, MLS transaction data is correct. For multi-year price movement, repeat-sale series are correct. For behavioural signals like relisting, price changes, or conversions between sale and rental, listing-lifecycle data is correct. Most enterprise use cases combine two or three.

Canadian property data is not a single thing. It is five distinct categories of information, sourced differently, structured differently, and licensed under different terms. Enterprise teams shopping for property data for the first time tend to assume there is one market; there are several, and picking the wrong one wastes procurement cycles. This is a guide to the categories, what each is good for, and how they are delivered.

The size of the Canadian residential market

Roughly 475,000 residential real estate properties change hands each year through the cooperative listing systems run by Canadian real estate boards. The Canadian Real Estate Association, which compiles national figures, reported a national average sale price near the $675,000 mark in early 2026, with active inventory running below long-term averages for most of the past year. Those two numbers, transaction volume and average price, anchor every downstream property data product sold into the Canadian market.

Transaction activity is heavily concentrated by province. Ontario and British Columbia account for the largest share of national volume, with Alberta and Quebec behind them. The Canada Mortgage and Housing Corporation estimates a national housing supply shortfall of roughly 3.45 million units by 2030, concentrated in Ontario, Quebec, and British Columbia. These fundamentals shape why different property data categories exist and which ones are in demand.

Five categories of Canadian property data

Every property data product in Canada fits into one of five categories. The categories are not interchangeable. Each was built for a specific use case, and that original use case still shapes how the data is structured today.

The right data category is defined by the time dimension you need. Point-in-time data answers what is true right now. Longitudinal data answers how things have changed. Most enterprise use cases require both.

1. Government land-registry records

Provincial land-registry systems record the legal status of each property: who owns it, what liens or charges are registered against it, what the legal description is. These records exist because ownership has to be documented for the property to be enforceable as collateral, inheritable, or transferable.

Registry data is authoritative for its own purpose. If the question is who legally owns a property right now, registry records are the answer. The limits of registry data become obvious when the question is anything else. Prices recorded in registries are the consideration at closing, which may or may not match the actual sale price; transfer data lags the market by weeks or months; there is no standard schema across provinces. For use cases like valuation, market analysis, or behavioural targeting, registry data is a necessary reference but not a sufficient source.

Registry data is typically accessed through provincial portals, title-insurance companies, or specialised aggregators. Licensing is usually per-query or per-record.

2. Appraisal and assessment data

Appraisal data is produced by licensed appraisers for specific properties at specific moments, usually to support a mortgage origination, a legal proceeding, or a property sale. Assessment data is produced by municipal assessors to set property tax rates, typically on a multi-year revaluation cycle.

Both are valuation data, and both are point-in-time. An appraisal tells you what a single qualified professional estimated the property was worth on one date. An assessment tells you what a municipality decided the property was worth for tax purposes on one date. Neither refreshes at market speed; an assessment roll may be three or four years old, and an appraisal is current only on the day it was signed.

Appraisal data is primarily used by lenders and is rarely licensed outside that channel. Assessment data is publicly available in most provinces and is a common input into property databases, though coverage varies. For enterprise use cases requiring current valuation, appraisal and assessment data are inputs, not primary sources.

3. MLS transaction records

Multiple listing systems are the cooperative marketing platforms run by Canadian real estate boards. When a property is listed for sale, the listing enters an MLS; when it sells, the transaction is recorded in the MLS. This is the primary data source for most active-market questions: what is for sale right now, what recently sold, at what price.

MLS data is current, detailed, and structured. It is also fragmented. There are dozens of boards across Canada, each with its own schema, its own rules, and its own licensing terms. Accessing MLS data at scale involves negotiating with multiple boards, accepting different data formats, and reconciling field definitions that do not always align.

The deeper limit of MLS data is temporal. MLS numbers are reassigned when a property relists, which means a single property that sold, came back on the market, sold again, and was then leased out may appear as three or four unrelated records in an MLS-native system. Reconstructing a property's history across these events requires joining on something more stable than the listing number.

4. Repeat-sale series

A repeat-sale series is a dataset of property pairs: two verified sales of the same property at different points in time. Repeat-sale methodology is the standard input for serious home-price indices and automated valuation models because it separates genuine price change at the property level from general market movement and property-attribute mix.

Building repeat-sale pairs well is harder than it sounds. Each pair requires two confirmed sale prices, two confirmed sale dates, and a stable link between the two records that survives relisting, address variation, and MLS number reassignment. That stable link is what the industry calls a persistent property identifier. Without one, most properties that sold more than once never get paired.

BrightCat's Canadian Home Price Index dataset contains 194K+ verified repeat-sale pairs across all ten provinces, drawn from sale events reconciled through BrightCat's own pipeline since 2014. Each pair satisfies four conditions: both sales are of the same property, linked by a persistent property identifier; both sale prices are verified; both transaction dates are confirmed; and a minimum ninety-day gap separates the two sales. The AVM training data page covers the methodology in more depth.

5. Listing-lifecycle data

Listing-lifecycle data is the newest category and the one most closely aligned with analytics and AI use cases. A lifecycle record is not just the sale or the current listing; it is the full sequence of events a property goes through in the market. The original listing, every price change, every status transition, every relist, every drop, every completed sale. Lifecycle data captures market behaviour rather than market snapshots.

The value of lifecycle data sits in the patterns between transactions. A property that listed at one price, dropped through three reductions, delisted, waited six months, relisted at a lower price, and eventually sold is telling a different story than a property that sold cleanly on first exposure. MLS data, queried at the transaction layer, shows both as a single sale. Lifecycle data shows the difference.

Lifecycle data also enables cross-track signals. A property that sells and then appears as a rental listing at the same address within a short window is almost certainly an investment property, not an owner-occupier transaction. That signal requires joining sale events and rental events through a persistent property identifier, weekly or better. It is not visible in any single-track data source.

BrightCat's pipeline operates in the lifecycle category. It covers 6 million residential real estate properties and 315,000 commercial real estate properties, with weekly capture across all ten provinces since 2014. The methodology page describes how the pipeline is assembled.

How to pick the right category

Use-case mapping to category is straightforward once the categories are clear:

Legal ownership, lien status, encumbrances: government land-registry records
Point-in-time valuation for a single property: appraisal data (current) or assessment data (retrospective)
What is for sale right now or what recently sold: MLS transaction records
Home price index construction, AVM training, longitudinal valuation research: repeat-sale series
Pre-mover detection, portfolio monitoring, investment property flags, retention timing, AI-native data access: listing-lifecycle data

Most enterprise deployments combine at least two categories. An insurance carrier running a retention model on a policy book might combine registry data (to confirm ownership), MLS transaction records (to flag recent sales), and lifecycle data (to identify sale-to-rent conversions that change the underwriting profile). A bank running collateral monitoring might combine appraisal data, assessment data, and lifecycle signals for the same reason.

How the data gets delivered

Delivery architecture has changed faster than the data categories themselves. Traditional property data arrived as nightly FTP files or monthly CSV extracts. Current enterprise delivery usually means one of three patterns:

Cloud data marketplace (such as Snowflake Marketplace), where data is shared directly into the client's data warehouse with zero ETL
API access, either REST or, increasingly, MCP for AI-native workflows
Secure file delivery for teams that cannot yet consume cloud-native shares

The underlying data is the same regardless of channel. BrightCat ships all three patterns from the same weekly pipeline. The Snowflake Marketplace, MCP Connector, and Developer API pages cover the specifics.

Canadian property data is a category, not a product. Five distinct data types cover most use cases. Picking the right one is a function of whether the answer is point-in-time or longitudinal, property-level or market-level, and what downstream system consumes it.

Market figures: Canadian Real Estate Association (April 2026) · Supply estimates: Canada Mortgage and Housing Corporation

Frequently asked questions

What are the main types of Canadian property data?

Five categories cover most use cases: government land-registry records, appraisal and assessment data, MLS transaction records, repeat-sale series, and listing-lifecycle data. Each was built for a specific purpose and has strengths and limits tied to that purpose.

How large is the Canadian residential transaction market?

Around 475,000 residential real estate properties change hands each year through Canadian MLS Systems, with a national average sale price near the $675,000 to $690,000 range. Figures from the Canadian Real Estate Association, updated monthly.

Why does property data come in so many different forms?

Each data type exists because a specific industry needed it. Government registries exist to record legal ownership. Appraisal data supports lender valuation. MLS records run brokerage cooperation. Repeat-sale series measure price movement. Lifecycle data captures market behaviour over time.

What is a persistent property identifier?

A stable reference that links every record touching the same physical property, regardless of which listing number, agent, or address format appears in the underlying source. Without one, a property that relists under a new number looks like a new record, and time-series analysis breaks.

Where does BrightCat fit in the category?

BrightCat operates in the listing-lifecycle and repeat-sale categories. The pipeline has captured Canadian residential and commercial listings weekly since 2014, producing a longitudinal view of every property's market behaviour, and a repeat-sale pair dataset of 194K+ verified Canadian pairs.

Which data type is right for my use case?

It depends on the time dimension. Registry and appraisal data for a single legal or valuation point. MLS data for current market activity. Repeat-sale series for multi-year price movement. Lifecycle data for behavioural signals like relisting, price changes, or sale-to-rent conversions. Most enterprise use cases combine two or three. Contact us to discuss fit.

Canadian property data in 2026: categories and sources