Where Does Medical Price Transparency Data Come From?

The data that powers payor rate benchmarking comes from the insurers themselves — published under federal mandate, updated monthly, and publicly accessible to anyone with the infrastructure to use it.

The primary source: payor-published machine-readable files

At Payorology, our medical price transparency data originates primarily from the payors themselves — the organizations that negotiate contracts and issue reimbursements. Under the Transparency in Coverage Final Rule, all commercial insurance companies and group health plans are legally required to publish their in-network negotiated rates and out-of-network allowed amounts in a standardized, machine-readable format.

These payor-reported files are updated monthly, ensuring the data remains current and reflective of the most recent reimbursement structures.

In addition, Payorology also integrates hospital price transparency files when applicable. Hospitals are required to publish their pricing data under a separate federal mandate, though these hospital-reported files are typically updated only once per year.

By combining and analyzing both sources, Payorology delivers the most comprehensive view of medical price transparency data available—providing medical groups, investors, and advisors with unparalleled insight into real reimbursement rates. This enables smarter strategies in contract negotiation, market expansion, M&A, and provider recruitment.

What is the Transparency in Coverage Final Rule?

The Transparency in Coverage Final Rule, issued by the Departments of Health and Human Services (HHS), Labor (DOL), and the Treasury, took effect on July 1, 2022. It requires most health insurers and employer group health plans to publicly disclose:

In-network negotiated rates for all covered services
Out-of-network allowed amounts and billed charges
Monthly updates to all machine-readable files

This rule serves as the foundation of modern medical price transparency, giving providers, employers, and patients access to accurate, payor-level pricing data. The result is a more transparent, competitive, and data-driven healthcare ecosystem that supports informed decision-making across all stakeholders.

What is a machine-readable file (MRF)?

A machine-readable file — commonly referred to as an MRF — is the technical format in which insurers publish their rate disclosures. The term "machine-readable" means the file is structured for software to process, not for a human to open and read directly. In practice, these files are published in JSON format and can be extremely large — a single major insurer's in-network rate MRF can exceed tens of terabytes of compressed data.

Each MRF contains rate data organized by:

Provider — identified by National Provider Identifier (NPI)
Billing code — CPT® codes, HCPCS codes, and DRG codes depending on the service type
Plan — the specific health plan the rate applies to
Rate type — negotiated rate, fee schedule, percentage of billed charges, or per diem

The sheer scale of this data — and the significant variation in how different insurers structure and populate their files — means that MRFs are not practically accessible to most organizations without specialized infrastructure. Errors, inconsistencies, and missing data are common across insurers and require domain expertise to identify and handle correctly.
[ Internal link: Link to: /knowledge-center/glossary/machine-readable-file-mrf-health-insurance/ — Glossary: Machine-readable file (MRF) ]

The hospital price transparency data source

A separate federal mandate — the Hospital Price Transparency rule — requires hospitals to publish their standard charges and payer-specific negotiated rates for inpatient and outpatient services. Unlike the Transparency in Coverage rule which covers insurer-reported rates, hospital price transparency files are reported by the hospitals themselves and cover a defined set of shoppable services.

Hospital price transparency data is particularly useful for medical groups in several scenarios:

Understanding what hospital-employed physicians are being paid for the same CPT codes — a critical benchmark in markets where hospital-affiliated providers dominate
Evaluating potential health system partnership or employment arrangements against independent contracting alternatives
Assessing competitive rate dynamics in markets where hospital outpatient departments bill for services thatoverlap with independent specialty practices

Because hospital price transparency files are updated annually rather than monthly, they provide a useful directional benchmark but should be supplemented with insurer-reported MRF data for current rate analysis.

Why data quality matters as much as data access

The Transparency in Coverage rule made payor rate data technically public. It did not make that data uniformly accurate, complete, or easy to use. Insurers vary significantly in the quality of their MRF disclosures — some publish well-structured, comprehensive files that closely reflect actual contracted rates. Others publish files with errors, missing providers, stale rates, or inconsistent coding that requires substantial work to validate.

This is why the distinction between having access to price transparency data and having reliable price transparency data matters enormously. A benchmarking analysis built on unvalidated MRF data can produce conclusions that are directionally misleading — and in a contract negotiation, directionally misleading is worse than no data at all.

HOW PAYOROLOGY VALIDATES ITS DATA
Before showing any client their market benchmarks, we pull what our data shows for that client's own rates — and ask them to confirm those rates against what they're actually being paid. When the numbers match to the penny, it establishes that our data extraction and validation process is working correctly for that payor and that market. That verification step is the foundation of the trust our clients place in the peer benchmarks that follow.

How Payorology uses this data

Payorology ingests, processes, and validates MRF data from commercial insurers across the United States on a continuous basis. Our data pipeline is built specifically for the scale and complexity of this data — handling the inconsistencies in insurer file structures, normalizing provider identifiers, and applying specialty-specific logic to ensure that rate comparisons are genuinely apples-to-apples.

The output is not a raw data feed. It is a curated, vetted, specialty-specific analysis — built around the CPT and HCPCS codes that drive revenue for the specific type of practice we're working with, in the specific markets that matter to them. Our clients don't navigate a generic database. They see a benchmarking view built for their situation, supported by a team that can help them understand and act on what the data shows.

Have questions? Reach out!

If you think we may be able to help, let's meet for 30 minutes. No pressure, we promise.

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.