Trilliant Health, a leading analytics firm in the healthcare sector, has announced the public release of its comprehensive hospital price transparency dataset in a single DuckDB data lake. This new format removes the need for complex data pipelines and enables immediate comparative analysis of more than 5,000 hospitals across the United States.
This release builds on the company’s November update, which centralized individual hospitals’ machine-readable files and made single-hospital data easier to query. The new combined database contains more than six billion negotiated rates, so analysts can compare prices across hospitals without first merging separate files.
Matt O’Neill, Chief Data Officer at Trilliant Health, explained that hospital price transparency data has long been difficult to work with because each hospital publishes files in different formats and structures. Traditionally, normalizing and merging these datasets required weeks of extraction, transformation, and loading work, as well as significant computational resources. By consolidating all data into a single columnar database file optimized for analytics, Trilliant Health has simplified the process, enabling faster and more accurate analysis.
The DuckDB format is well suited to these analytics tasks because it allows SQL queries on massive datasets without a database server. Analysts can run complex comparisons, such as evaluating price variation for a specific procedure across competing hospitals or analyzing negotiated rate differences between major payers, directly on a personal laptop.
O’Neill added that DuckDB’s architecture supports analytical workloads and can manage large aggregations across billions of rows without requiring infrastructure investments. Analysts can now write a SQL query to compare negotiated rates for a particular CPT code across all hospitals in the country and receive results within seconds. Previously, such an analysis would have required downloading hundreds of files, building normalization scripts, and provisioning cloud infrastructure.
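As a rough illustration of the kind of cross-hospital query described above, the following minimal sketch runs a per-hospital comparison for one CPT code on a toy in-memory table. The table name `negotiated_rates`, its columns, and the sample rows are all hypothetical, since the announcement does not describe the dataset's schema. Python's built-in `sqlite3` module stands in for DuckDB here so the sketch runs without extra installs; with the `duckdb` package, a connection from `duckdb.connect(...)` pointed at the downloaded file would presumably take its place, and the query would need the dataset's actual table and column names.

```python
import sqlite3

# Hypothetical toy rows: (hospital_name, payer_name, cpt_code, negotiated_rate).
# These values are invented for illustration and are not from the real dataset.
ROWS = [
    ("Hospital A", "Payer X", "99213", 110.0),
    ("Hospital A", "Payer Y", "99213", 95.0),
    ("Hospital B", "Payer X", "99213", 150.0),
    ("Hospital B", "Payer Y", "99213", 130.0),
    ("Hospital C", "Payer X", "99213", 80.0),
    ("Hospital A", "Payer X", "70450", 400.0),
]

# Aggregate negotiated rates per hospital for a single CPT code.
# Table and column names are assumptions, not the published schema.
QUERY = """
SELECT hospital_name,
       COUNT(*)             AS n_rates,
       MIN(negotiated_rate) AS min_rate,
       AVG(negotiated_rate) AS avg_rate,
       MAX(negotiated_rate) AS max_rate
FROM negotiated_rates
WHERE cpt_code = ?
GROUP BY hospital_name
ORDER BY avg_rate DESC
"""


def build_toy_db():
    """Create an in-memory database holding the hypothetical sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE negotiated_rates ("
        "hospital_name TEXT, payer_name TEXT, "
        "cpt_code TEXT, negotiated_rate REAL)"
    )
    conn.executemany("INSERT INTO negotiated_rates VALUES (?, ?, ?, ?)", ROWS)
    return conn


def compare_rates(conn, cpt_code):
    """Return one summary row per hospital, highest average rate first."""
    return conn.execute(QUERY, (cpt_code,)).fetchall()


if __name__ == "__main__":
    conn = build_toy_db()
    for row in compare_rates(conn, "99213"):
        print(row)
```

The query filters to one procedure code and groups by hospital, which is the general shape of comparison the article describes. A payer-level comparison would group by the payer column instead.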
Trilliant Health is providing this DuckDB dataset at no cost, continuing its mission to support greater transparency in the U.S. healthcare system and help reduce inefficiencies and waste. By making this comprehensive dataset publicly available, the company is empowering analysts, researchers, and policymakers to make data-driven decisions that improve price visibility and healthcare market efficiency.

