Intellectual Property Rights in Real Estate Data and Databases
Intellectual property law intersects with real estate data and databases in ways that affect every participant in the property information ecosystem — from multiple provider services and government assessors to data aggregators, proptech firms, and brokerage networks. The protections available under U.S. copyright, trade secret, contract, and database law govern who may compile, license, reproduce, or commercialize property records, provider content, and derived datasets. This page maps the legal frameworks, structural mechanics, classification boundaries, and contested tensions that define this sector, serving as a reference for professionals, researchers, and legal practitioners navigating the property data landscape.
- Definition and scope
- Core mechanics or structure
- Causal relationships or drivers
- Classification boundaries
- Tradeoffs and tensions
- Common misconceptions
- Checklist or steps
- Reference table or matrix
Definition and scope
Intellectual property rights in real estate data and databases refer to the bundle of legal protections — and their acknowledged limits — that attach to the collection, organization, presentation, and commercial use of property-related information. This includes residential and commercial provider data, MLS databases, automated valuation model (AVM) outputs, GIS parcel maps, assessment records, deed indexes, and proprietary analytics layers built on top of public records.
The U.S. Copyright Act, codified at 17 U.S.C. § 101 et seq., protects original works of authorship, but the Supreme Court's ruling in Feist Publications, Inc. v. Rural Telephone Service Co., 499 U.S. 340 (1991), established that bare factual compilations arranged in a non-creative order receive no copyright protection. This ruling has fundamental consequences for real estate data: individual property facts — address, square footage, lot dimensions, assessed value — are not copyrightable. However, the selection, coordination, and creative arrangement of those facts in a database can attract protection if it clears the originality threshold.
Trade secret law, governed federally by the Defend Trade Secrets Act of 2016 (18 U.S.C. § 1836), provides a parallel and often more commercially durable protection for proprietary data architectures, valuation algorithms, and curated datasets maintained in reasonable secrecy. Contractual protections — data licensing agreements, terms of service, and MLS rules — operate independently of both and represent the primary enforcement mechanism in day-to-day industry practice.
The scope of this reference covers U.S. federal frameworks and major industry structures; it does not extend to state-specific database statutes or non-U.S. sui generis database rights frameworks such as those operating under the European Union's Database Directive 96/9/EC.
Core mechanics or structure
Copyright in database structure. For a real estate database to qualify for copyright protection, the compiler must exercise creative judgment in selecting which data fields to include or in arranging records in a manner reflecting authorial choice. A database of 50 fields chosen from 200 possible attributes, organized according to a proprietary taxonomy developed by the compiler, may qualify. A straight alphabetical or geographic dump of all available public records will not. The U.S. Copyright Office (copyright.gov) registers database compilations under the literary works category; the copyright attaches to the expressive elements only, not to the underlying facts.
Trade secret protection. Under the Defend Trade Secrets Act, a real estate data product qualifies as a trade secret if it derives independent economic value from not being generally known and the owner takes reasonable measures to maintain its secrecy. AVM models incorporating proprietary weighting schemes, training datasets, and confidence-interval algorithms are the most commonly asserted trade secrets in the proptech sector. Reasonable measures typically include access-controlled environments, employee non-disclosure agreements, and tiered licensing with data-use restrictions.
Contractual licensing architecture. Multiple provider services operate under rules promulgated by the National Association of Realtors (NAR) and its affiliated MLSs. MLS data is licensed — not sold — to participants, and the license terms govern permitted uses, redistribution rights, and data syndication. NAR's MLS Policy establishes baseline rules, including IDX (Internet Data Exchange) and VOW (Virtual Office Website) frameworks that define how participants may display provider data on public-facing platforms. These contractual restrictions apply independently of whether the data would qualify for copyright protection under Feist.
Government records. Public records generated by county assessors, recorder offices, and court systems are generally in the public domain under federal and state open-records frameworks, including the Freedom of Information Act (5 U.S.C. § 552). However, a private firm that purchases bulk public record feeds and layers proprietary cleaning, geocoding, and normalization processes on top of them may assert trade secret protection or contractual restrictions on the resulting product — even though the underlying facts remain in the public domain.
Causal relationships or drivers
The contested legal terrain in real estate data IP is driven by four structural forces:
-
Data commercialization pressure. The real estate data market, estimated at multiple billions of dollars in annual licensing and analytics revenue by the Consumer Financial Protection Bureau's market monitoring work, has created strong economic incentives for firms to assert maximum IP protection over compiled datasets.
-
The Feist ceiling. Because the Supreme Court denied copyright protection to non-creative factual compilations in 1991, firms cannot rely on copyright alone. This drives investment in trade secret architecture and contractual restriction as the primary protective strategies.
-
MLS consolidation and data governance. NAR's approximately 500 MLS organizations in the U.S. (NAR, MLS Statistics) collectively control the most comprehensive residential provider database in the country. Their licensing frameworks effectively function as a private regulatory system governing data access for agents, brokers, and technology providers.
-
Regulatory scrutiny of data access. The U.S. Department of Justice has monitored MLS data access rules under antitrust frameworks (DOJ Antitrust Division), and the Federal Trade Commission (FTC) has examined data aggregator practices in the proptech market. These regulatory pressures shape how MLSs and aggregators structure licensing terms.
Classification boundaries
Real estate data and database IP falls across four distinct legal categories, with different eligibility criteria and enforcement mechanisms:
1. Copyrightable database structure. Applies where the compiler exercises creative selection or arrangement. Protects expression, not facts. Duration: life of author plus 70 years for natural persons; 95 years from publication for works made for hire (17 U.S.C. § 302).
2. Trade secrets. Applies to proprietary algorithms, model architectures, and curated datasets maintained under reasonable secrecy. No registration required; protection is indefinite as long as secrecy is maintained. Governed by the Defend Trade Secrets Act federally and by state law in all 50 states (most states have adopted the Uniform Trade Secrets Act, per Uniform Law Commission).
3. Contractual data rights. Applies through license agreements, terms of service, and MLS participation rules. These rights are enforceable regardless of the underlying IP status of the data. Breach of contract, not copyright infringement, is the typical cause of action for unauthorized data use in the MLS context.
4. Public domain factual data. Government-generated records — tax assessments, deeds, permits, court filings — are not subject to copyright under 17 U.S.C. § 105, which denies federal copyright to U.S. government works. State analogs vary, but nearly all states treat official public records as publicly accessible under open-records statutes.
The classification of intellectual property providers within these categories determines the enforcement strategy available to data owners and the defense strategy available to alleged infringers.
Tradeoffs and tensions
Access versus exclusivity. The real estate data ecosystem depends on broad participation — agents must access MLS data to serve clients effectively, and consumers benefit from aggregator platforms that consolidate providers. Overly restrictive licensing regimes reduce market efficiency. Underprotected data environments reduce the incentive to invest in database curation and maintenance. This tension has no stable equilibrium and is actively renegotiated through DOJ oversight and NAR policy revision.
Trade secret versus patent. AVM developers and proptech firms choose between trade secret protection (indefinite, no disclosure required) and patent protection (20-year term from filing under 35 U.S.C. § 154, but requiring full public disclosure). Trade secret protection is typically preferred for algorithms because patent applications disclose the innovation; however, independent derivation by a competitor is a valid defense against trade secret claims but not against patent infringement.
Contract law versus IP law. MLS participants can be bound by contractual restrictions on data use that are far broader than copyright would permit. A firm might lawfully scrape publicly available provider data under Feist but simultaneously violate its MLS participation agreement — exposing it to contract damages without any copyright claim being viable. This asymmetry creates legal risk for data entrepreneurs that purely IP-focused analysis misses.
Open data mandates versus proprietary curation. Government open-data initiatives at the federal and municipal level, including HUD's Open Data initiative, push toward broader availability of housing and property data. These mandates can undercut the commercial value of proprietary compilations assembled from the same underlying public sources.
For a broader orientation on how these protections fit within the intellectual property service landscape, the intellectual property provider network purpose and scope page provides sector-level context.
Common misconceptions
Misconception 1: All MLS data is copyrighted.
The factual content of MLS providers — address, price, bedroom count, lot size — is not copyrightable under Feist. The provider photographs and agent-authored property descriptions may qualify separately as copyrightable works because they reflect individual creative expression. MLS enforcement typically proceeds on contract grounds, not copyright grounds, for the data fields themselves.
Misconception 2: Publicly available data is freely reusable.
Data appearing on public-facing websites may be subject to contractual terms of service that restrict scraping, redistribution, and commercial use, regardless of its copyright status. Hiiq v. LinkedIn (N.D. Cal., subsequently addressed by the Ninth Circuit) addressed the Computer Fraud and Abuse Act in the context of scraping, but contractual prohibition remains the dominant enforcement mechanism.
Misconception 3: Government assessor data is always free to redistribute.
State open-records laws vary significantly. While assessor records are generally public, some states charge bulk licensing fees and impose redistribution restrictions through licensing agreements rather than copyright. California, for example, charges fees for bulk parcel data under county-specific terms that restrict resale in certain formats.
Misconception 4: Transforming data breaks the chain of IP liability.
Adding a proprietary analytics layer to licensed data does not automatically insulate the downstream product from licensing claims. MLS participation agreements and data licensing contracts typically include provisions covering derivative works and downstream data products.
Misconception 5: There is a U.S. sui generis database right.
The European Union's 1996 Database Directive created a sui generis right protecting substantial investment in database creation regardless of creativity. The U.S. has not enacted an equivalent statute; Congress considered and declined to pass database protection bills in 1996, 1997, and 1999. U.S. database protection remains dependent on copyright originality, trade secret, and contract.
Checklist or steps
The following sequence describes the standard assessment phases that IP practitioners and data compliance professionals apply when evaluating a real estate database's IP status and associated rights:
-
Identify the data source category. Classify each data element as: (a) government-generated public record, (b) MLS-licensed content, (c) third-party proprietary feed, or (d) self-generated or platform-native data.
-
Assess copyright eligibility. For each non-governmental data category, evaluate whether creative selection or arrangement meets the originality threshold established in Feist (499 U.S. 340). Document the basis for any originality claim.
-
Inventory active licensing agreements. Collect and review all operative license agreements, MLS participation agreements, data feed terms of service, and API terms for each data source. Note permitted uses, prohibited uses, sublicensing rights, and derivative works provisions.
-
Identify trade secret candidates. Document which proprietary algorithms, weighting models, data normalization methods, or curated datasets derive independent economic value from secrecy. Confirm that reasonable secrecy measures — access controls, NDAs, segmented access logs — are in place.
-
Map derivative data products. For each product or data output derived from licensed or protected sources, trace the data lineage to identify which upstream agreements or IP protections flow through to the derivative product.
-
Evaluate contract compliance gaps. Compare current data use practices against the terms of each operative agreement. Flag uses that exceed licensed scope, including redistribution, sublicensing, commercial resale, or use in automated systems not covered by original license terms.
-
Confirm IP registration status. Check the U.S. Copyright Office registration database (cocatalog.loc.gov) for any registered database compilations. Note that registration is not required for copyright protection but is a prerequisite for statutory damages and attorney's fees in infringement litigation under 17 U.S.C. § 412.
-
Document the rights matrix. Produce a written rights matrix mapping each data element or dataset to its source, IP status, licensing restrictions, and applicable enforcement mechanisms.
This framework is referenced in the broader context of IP service navigation on the how to use this intellectual property resource page.
Reference table or matrix
| Data Type | Copyright Protection | Trade Secret Eligible | Governed by Contract | Primary Enforcement Mechanism |
|---|---|---|---|---|
| Individual property facts (address, price, sq ft) | No (Feist, 499 U.S. 340) | No (publicly observable) | Yes (MLS/license terms) | Breach of contract |
| MLS provider photographs | Yes (if original authorship) | No | Yes | Copyright infringement + contract |
| Agent-authored property descriptions | Yes (if original expression) | No | Yes | Copyright infringement + contract |
| AVM algorithms and model weights | No (method, not expression) | Yes (if maintained in secrecy) | Yes | Trade secret misappropriation |
| Proprietary database compilation (creative selection) | Yes (if creative arrangement) | Potentially | Yes | Copyright infringement + contract |
| Government assessor records | No (17 U.S.C. § 105 / state analogs) | No | Potentially (bulk licensing) | Contract (where applicable) |
| GIS parcel base maps (government-generated) | No | No | Potentially | Contract (where applicable) |
| Derived analytics layers on public data | Potentially (if creative expression) | Yes (if secret and valuable) | Yes | Trade secret + contract |
| Aggregated transaction history (from licensed feeds) | Depends on source agreement | Potentially | Yes | Contract (primary) |