Data Logistics for Healthcare /
How Data Logistics for Healthcare Can Supercharge Your Data Management Strategy
Learn everything you need to know about information blocking.
Author
Datavant
Read Time
12 mins
Chapters
Chapter 1
Introduction
Chapter 2
Contextualizing healthcare data within the patient journey
Chapter 3
Data generated from a single patients office visit
Chapter 4
Where healthcare data sit today: A fragmented supply chain
Chapter 5
Making your organization’s data management strategy more agile is crucial
Chapter 6
Introducing data logistics: The missing part of most data management strategies
Chapter 7
Enterprise-grade solutions are critical for your organization’s data strategy
Bibliography
Chapter 1
Introduction
Data will power true transformation in healthcare, revealing insights and making connections that are otherwise unknown. Take the compelling example of multiple sclerosis (MS) where data is going to help redirect efforts to cure and prevent MS. For years, the cause of this debilitating, fatal disease was unknown. Then researchers studied complete patient journeys using the Department of Defense database, a closed system of 20 years of healthcare information from over 10 million service people. This robust data set enabled researchers to find a clear link between Epstein Barr virus and MS making virology a critical path in MS research. Unfortunately, this data set is unique and will not answer every scientific question we face.
And unfortunately, the way we manage healthcare data today is not setting us up for this type of transformation – or for optimal utilization in general. In fact, a significant amount of valuable healthcare data is not usable, sitting across thousands of siloed sites and databases. The challenge only increases as data multiplies: as patients see more providers, receive more tests, adopt wearables, seek new treatments, and use more devices. Patients also often move between payers and providers, creating a disconnect in their journeys. Compiling these data requires a new way to work with disparate, siloed, and expanding data sets.
The problem isn’t just the fact that data is constantly compounding at mind-spinning rates. It is the opportunity cost of the inability to access, use, and glean insights from this data. Data stands in the way of patient care, medical innovations, and so much more.
That’s where data logistics comes in. Data logistics enables the safe, efficient healthcare data connectivity needed for flexible, fit-for-purpose data sets. This paper explores how data logistics has been the missing piece in making data secure, accessible and usable, offering an opportunity to harness the full potential of healthcare data
Data logistics is key to unifying and harnessing the expansive healthcare data universe, helping us connect the dots to improve patient care and drive healthcare innovation.

Chapter 2
Contextualizing healthcare data within the patient journey
The terms often used to describe the size of digital data can be difficult to comprehend: petabytes, exabytes, zettabytes. Additionally, the digital data universe is growing so quickly, especially in healthcare, that it is on a path to double every 2 years or less.2 Some measures and drivers of this growth include:
- Hospital data: The World Health Forum states that hospitals produce 50 petabytes of data per year
- Genomics and biomarker data: Genomics and associated biomarker data will potentially generate up to 40 exabytes of data by the end of 2025
- Wearables: Globally, wearable devices are expected to create over 90 zettabytes of information in 2025
For healthcare data, it is easier to illustrate the point by focusing on the patient journey. While this is only a subset of the industry landscape, it is a bit more intuitive than, say, calculating claims data (which in 2022 medical plans executed 14 billion claims-related transactions).
To do this, we conducted a study using patient-level health data to better explain how each patient’s longitudinal medical history and their experience engaging with the healthcare system are continuously growing larger, longer, and more complex. Starting with how much more information there is per patient, Figure 1 shows how we found that the volume of patient-level records grew 57% in just 4 years. This growth reflects how much a patient interacts with the healthcare system. A Centers for Medicare and Medicaid Services study from 2019 found that in 1 year alone, each beneficiary enrolled in Medicare Advantage experienced an average of 21.1 outpatient visits, 3.4 outpatient hospital visits, and 0.2 inpatient hospital visits. And that does not account for every interaction they have.
Figure 1. Volume of patient-level records (billions)
CAGR, compound annual growth rate.
Source: Prognos Health data. Valuate and Datavant analysis.
Figure 2. Average records per patient generated per year
Source: Prognos Health data. Valuate and Datavant analysis.
And the number of interactions is only growing, as shown by our study. From 2019 to 2023, per-patient records increased from about 59 to 86 records, even factoring in a decrease during the COVID-19 pandemic (Figure 2).
As Figure 3 suggests, this extra data collection may come from the dramatic increase in the number of provider sources, with a 66% increase in National Provider Identifiers from 2019 to 2023. As all this data grows, so does the risk of duplication and error, with different stakeholders providing and managing information about the same patients. This raises the question: how usable are all these data points?
Figure 3. More providers contributing to patient data pool (provider NPI growth)
Even if a patient’s data have been well organized, their next doctor’s visit could upend these efforts. As shown in Figure 4, a single medical event can trigger a vast amount of healthcare data that are independent of one another, captured in separate data sources, and viewed by healthcare professionals in different places. This ultimately leads to potential misunderstandings, misalignment, and missed opportunities for the patient, provider, and other healthcare stakeholders.
Chapter 3
Data generated from a single patients office visit
Figure 4. Explosion of data from a single patient encounter – showing how much new data is generated across data sets every time a patient visits a healthcare provider

CPT, Current Procedural Terminology; LOINC, Logical Observation Identifiers Names and Codes; NDC, National Drug Code; NPI, National Provider Identifier.
Chapter 4
Where healthcare data sit today: A fragmented supply chain
The healthcare data supply chain is complex, fragmented, and unusual (Figure 5). Where other supply chains are designed to move from input to end product, in many ways, the healthcare data supply chain does the opposite. The system is designed to generate, manage, process, protect, and apply data to specific use cases, but it is not designed to move data.
Figure 5. The healthcare data supply chain: sources and estimated value
Regulatory, data security, privacy
estimated to be $18.2B in 2023: expected to reach $35.3B in 2028 Rules, protocols, oversights, and systems to ensure data are collected, stored, managed, and used to protect privacy Estimated value: Healthcare cybersecurity market size alone9
Data processing and transformation
Ways in which health data are analyzed to drive meaningful macro insights
Estimated value: Global healthcare data analytics market size $21.1B in 2021: expected to reach $85.9B by 202710
Data usability
Ways in which health data are analyzed to drive meaningful macro How data are studied for specific purposes, such as research
Estimated value: Undefined, as governmental and private sources fund this space
Data visibility and interoperability
The processes and technology that enable data sharing across systems, including the ability to find data sources
Estimated value: Global healthcare data interoperability market size expected to reach nearly $16B by 203011
Data management
Ways in which stored data are controlled and accessed to ensure compliance with governance rules, often including quality controls and formatting (eg, data models to map collected data to ensure it is “seen” correctly)
Estimated value: Global enterprise data management market size expected to grow from $77.9B in 2020 to $122.9B by 202512
Data acquisition and collection
Ways in which data are compiled into databases (eg, paper records, EMRS)
Estimated value: Global EMR market size surpassed $27.42B in 2023 projected to reach $41.87B by 203313
Data storage
The secure storage of data that have been collected and may continue to be collected
Estimated value: Up to $11T14
Data generation
Patients generate data, both paper based and digital, through interactions with the healthcare system, services, and technology
Estimated value: Global wearables market size $71.5B in 2022: expected to hit $374.6B by 203215
Source: Prognos data. Valuate and Datavant analysis.
Several key characteristics keep healthcare data “stuck” and hard to use:
Interoperability
Siloing
Privacy concerns
Payer and provider switching
Chapter 5
Making your organization’s data management strategy more agile is crucial
There are specific areas where we have seen organizations successfully enhance their data management strategies to help optimize their use of healthcare data:
Recognizing data gaps
Most healthcare stakeholders have their own data sets. The challenge lies in determining how to complement your own data with other sources to help fill key gaps. We recommend a robust data mapping exercise to assess the data that you currently have access to against the use cases you want to support. For example, a payer organization may be underestimating a member’s risk of medical complications because they do not have access to information on new, relevant tests and clinical visits that occurred since the last assessment. A provider might not see an out-of-system urgent care visit that could affect a procedure’s outcome if they only have access to their system’s electronic health records (EHRs).
Working with the right amount of data to fill the gaps
While relying on a single EHR system or data vendor may make data management easier, it leaves organizations with incomplete data. Additionally, trying to access too much data causes inefficient investments. We recommend proactively expanding your data ecosystem in a safe, timely, and cost-effective way through a flexible, federated model. Ensuring that your organization has access to usable data to address relevant gaps does not mean establishing access to every data set available. However, it is important to understand where the desired data sit and to create access to those data as needed via a federated model.
Using technology and expertise to bring data together
Given the increase in health data – often with duplicative information – requires a scalable approach to managing that information while being compliant with varying governance rules. We recommend plugging into a platform that enables you to identify, link, transform, and access the data you need compliantly. This platform can support multiple types of use cases, from linking all a patient’s data to provide a complete patient record to linking billions of records to support the types of analyses that researchers achieved in MS.
Chapter 6
Introducing data logistics: The missing part of most data management strategies
To increase the usability of healthcare data, organizations need a strategic and systematic capability to move and protect this information across use cases. That capability is called data logistics. It involves managing information and handling processes optimally, including aspects related to time (flow time and capacity), storage, distribution, and presentation. Effective data logistics helps healthcare organizations improve results while managing costs when capturing, creating, searching, and maintaining data. Fundamentally, data logistics plays a crucial role in addressing healthcare data supply chain challenges by coordinating the movement and handling of data across the supply chain efficiently, allowing compliant data sharing between parties while allowing each organization to maintain control. There are a few key components to data logistics:
Ecosystem navigation and access
Data logistics involves tapping into the larger data ecosystem beyond an organization’s boundaries. Understanding the complete patient record, either identified or deidentified, requires accessing relevant data from various sources. Successful ecosystem access (including governance) and navigation enable data to move between healthcare providers, insurance companies, research institutions, and other stakeholders.
Technology platform
Data logistics leverages a technology platform to optimize workflow related to data movement, storage, and processing. It spans fields such as networking, file/database systems, and process management. Authorized access, as governed by regulations like HIPAA, ensures that patient safeguards are maintained and can often be verified through digital interfaces.
Services
Data logistics ensures that data are handled securely, compliantly, and efficiently. Beyond merely moving data, it focuses on protecting patient privacy and connecting data in a compliant manner. Given that most healthcare data are unstructured, and can even be paper-based, people are often the most effective way of securely and compliantly ensuring data get to those who have a use case ready for the data query. Data logistics cannot exist without people.
Chapter 7
Enterprise-grade solutions are critical for your organization’s data strategy
As the leading company bringing data logistics to healthcare, Datavant is dedicated to helping organizations securely and compliantly move health data through our three key pillars of excellence:
Protect
We designed Datavant’s solutions to optimally address privacy, compliance, and security. We protect health data, while always ensuring every organization has complete control over how their data is accessed and used.
Connect
We enable data to move from the very beginning with the broadest footprint of providers in the US. to reach every patient record. We empower organizations by creating access to relevant data sources, so they can put together all the pieces needed for a complete view of the patient.
Deliver
We deliver data that is relevant and timely. Through technology and our value-added services, we power countless decisions that support clinical, operational and research questions with the most relevant, usable data.
As the leading company bringing data logistics to healthcare, Datavant is dedicated to helping organizations securely and compliantly move health data through our three key pillars of excellence:
Governance
Compliant and appropriate sharing of data, both deidentified and identified, to ensure patient privacy.
Source
Working across a variety of systems, such as hospital EHRs, imaging, genomic data, doctor’s visits, and wearables.
Format
Access to structured, semi-structured, and unstructured data.
Use cases
Functioning in various scenarios, from clinical decisions to research purposes and health plan optimization.
Fig. 6 shows the value of this data logistics approach, highlighting the ability to tap into the broadest data network with a platform designed to support the protection, connection, and delivery of data, regardless of use case. Data logistics can help your organization explore a breadth of new opportunities, from truly understanding all the care a patient receives, to identifying patients who might be at higher risk before they incur prescribed treatments, to finding patients with rare diseases who may be eligible for clinical research access.
Figure 6. The value of data logistics
Unparalleled network
Advanced data logistics platform
Exchanging 200 terabytes of data annually
~100M Medical Records/yr, ~100B tokens/month
Prognos Health is a trusted provider of actionable real-world data in the life sciences industry that is driven by its mission to unlock the power of data to improve health. Prognos Health’s exclusive, unique data sets unlock valuable insights in complex clinical populations across the entire commercial life cycle, going beyond traditional real-world data offerings. Prognos helps life sciences companies accelerate the development and delivery of innovative therapies and improve health outcomes by offering fully integrated and harmonized lab and health records on more than 325 million deidentified patients. For more information from Prognos, please reach out to Ashley Triscuit, Marketing Director, [email protected].
Valuate Health Consultancy is a consulting firm that combines deep market access and reimbursement expertise, industry-leading data analytics, and robust market research capabilities to address healthcare organizations’ market access needs. Valuate helps healthcare organizations develop bespoke market access strategies by conducting primary and secondary research, deploying advanced healthcare data analytics and engineering, monitoring and assessing health policy changes, developing pricing and contracting strategies, and more. Our goal is to break through market access barriers to help patients get access to the healthcare they need.
Bibliography
- Bjrnevik K, Cortese M, Healy BC, et al. Longitudinal analysis reveals high prevalence of Epstein-Barr virus associated with multiple sclerosis. Science. 2022;375:296-301. doi:10.1126/science.abj8222
- Statistic determined by using the Rule of 7 and the American Hospital Association’s estimate that healthcare data experienced a compound growth rate of 47% over 7 years,5 while the RBC Capital Markets report suggests a growth rate of 36% by 2025.6
- World Economic Forum. 4 ways data is improving healthcare. December 5, 2019. Accessed March 28, 2024. https://www.weforum.org/agenda/2019/12/four-ways-data-is-improving-healthcare/
- Stephens ZD, Lee SY, Faghri F, et al. Big data: astronomical or genomical? PLoS Biol. 2015;13(7):e1002195. doi:10.1371/journal.pbio.1002195
- Reinsel D, Gantz J, Rydning J. The digitization of the world from edge to core. IDC white paper #US44413318. November 2018. Accessed March 28, 2024. https://www.seagate.com/files/www-content/our-story/trends/files/idc-seagate-dataage-whitepaper.pdf
- American Hospital Association Center for Health Innovation. Leveraging data for health care innovation. Market insights report. Accessed March 28, 2024. https://www.aha.org/system/files/media/file/2021/01/MI_Leveraging_Data_Report.pdf
- RBC Capital Markets. Episode 1: the healthcare data explosion. Accessed March 28, 2024. https://www.rbccm.com/en/gib/healthcare/episode/the_healthcare_data_explosion
- CAQH. 2022 CAQH index: a decade of progress. Accessed March 28, 2024. https://www.caqh.org/hubfs/43908627/drupal/2022-caqh-index-report%20FINAL%20SPREAD%20VERSION.pd
- In this study, “record” is defined as a collected piece of a patient’s medical event comprising standardized test measurements, diagnostic/procedural codes, LOINC® codes, and drug/product details alongside interpreted and structured text from pathology reports.
- Mulcahy AW, Sorbero ME, Mahmud A, et al. Measuring health care utilization in Medicare advantage encounter data: methods, estimates, and considerations for research. RAND Corporation research report for the Centers of Medicare & Medicaid Services. July 25, 2019. Accessed March 28, 2024. https://www.rand.org/pubs/research_reports/RR2681.html
- MarketsandMarkets. Healthcare cybersecurity market report – 2028. Report TC 8616. April 2023. Accessed March 28, 2024. https://www.marketsandmarkets.com/Market-Reports/healthcare-cybersecurity-market-215097518.html?gad_source=1&gclid=CjwKCAiAxaCvBhBaEiwAvsLmWHn8ziP-wU7mv8D6qpAtmtk5gn8IfTyFC92_PBSmgXObPQC9RXVFWBo CuTsQAvD_BwE
- MarketsandMarkets. Healthcare analytics market: global forecast to 2027. Report HIT 2180. December 2022. Accessed March 28, 2024. https://www.marketsandmarkets.com/Market-Reports/healthcare-data-analytics-market-905.html
- Eddy N. Health care data interoperability market worth $16 billion by 2030. Digital CxO. January 19, 2024. Accessed March 27, 2024. https://digitalcxo.com/article/health-care-data-interoperability-market-worth-16-billion-by-2030/
- MarketsandMarkets. Enterprise data management market. Report TC 3589. March 2020. Accessed March 28, 2024. https://www.marketsandmarkets.com/Market-Reports/enterprise-data-management-market-69219339.html
- Precedence Research. Electronic health records (EHRs) market size to reach USD 41.87 bn by 2033. GlobeNewswire News Room. March 5, 2024. Accessed March 28, 2024. https://www.globenewswire.com/news-release/2024/03/05/2840662/0/en/Electronic-Health-Records-EHRs-Market-Size-to-Reach-USD-41-87-Bn-by-2033.html
- CBInsights. The big tech in healthcare report: how Amazon, Google, Microsoft, Apple, & Oracle are fighting for the $11T market. November 20, 2022. Accessed March 28, 2024. https://www.cbinsights.com/research/report/famga-big-tech-healthcare/
- Custom Market Insights. Global mHealth market size, trends, share, forecast 2032. February 28, 2024. Accessed March 28, 2024. https://www.custommarketinsights.com/report/mhealth-market/
- Kelly YP, Kuperman GJ, Steele DJR, Mendu ML. Interoperability and patient electronic health record accessibility: opportunities to improve care delivery for dialysis patients. Am J Kidney Dis. 2020;76(3):427-430. doi:10.1053/j.ajkd.2019.11.001
- O’Reilly-Shah VN, Gentry KR, Van Cleve W, Kendale SM, Jabaley CS, Long DR. The COVID-19 pandemic highlights shortcomings in US Health Care Informatics Infrastructure: a call to action. Anesth Analg. 2020;131(2):340-344. doi:10.1213/ane.0000000000004945
- HIT Consultant. Why unstructured data holds the key to intelligent healthcare systems. March 31, 2015. Accessed March 28, 2024. https://hitconsultant.net/2015/03/31/tapping-unstructured-data-healthcares-biggest-hurdle-realized/
- Reisman M. EHRs: the challenge of making electronic data usable and interoperable. P T. 2017;42(9):572-575.
- Mitchell RL. Medical data sharing: are we there yet? Computerworld. July 20, 2023. Accessed March 28, 2024. https://www.computerworld.com/article/3702109/medical-data-sharing-electronic-health-record-exchanges.html
- PicnicHealth. How to get your medical records from UCSF. Accessed March 28, 2024. https://picnichealth.com/patients/resources/hospitals/ucsf#Radiology
- Li XB, Qin J. Anonymizing and sharing medical text records. Inf Syst Res. 2017;28(2):332-352. doi:10.1287/isre.2016.0676
- Ismail L, Materwala H, Karduck AP, Adem A. Requirements of health data management systems for biomedical care and research: scoping review. J Med Internet Res. 2020;22(7):e17508. doi:10.2196/17508
- US Department of Health and Human Services. Healthcare sector cybersecurity: introduction to the strategy of the U.S. Department of Health and Human Services. Accessed March 28, 2024. https://aspr.hhs.gov/cyber/Documents/Health-Care-Sector-Cybersecurity-Dec2023-508.pdf
- Fang H, Frean M, Sylwestrzak G, Ukert B. Trends in disenrollment and reenrollment within US commercial health insurance plans, 2006-2018. JAMA Netw Open. 2022;5(2):e220320. doi:10.1001/jamanetworkopen.2022.0320
- McCaghy L, Sinha S. Healthcare experience: the difference between loyalty and leaving. Accenture. 2022. Accessed March 28, 2024. https://www.accenture.com/content/dam/accenture/final/accenture-com/document/Accenture-Humanizing-Healthcare-Experience-Payer-Provider-Health-POV.pdf