Life Sciences

Top Takeaways from Real-World Data Connect 2024

Author
Publish Date
Read Time
May 28, 2024
Table of Contents

On May 14, 2024, Datavant held our annual Real-World Data Connect event in Boston, convening more than 130 RWD experts and practitioners from theLife Sciences industry. Our goals were to explore the value of real-world data across clinical and commercial use cases, identify best practices and opportunities for collaboration, and exchange insights to drive innovation at organizations across the health data ecosystem.

We’re excited to share our top takeaways from the day’s panel discussions and look forward to continuing the conversations!

Linking new types of health data enables a comprehensive view of the patient journey

In the past, researchers would typically just connect clinical and claims data, providing insights into patient care and health outcomes based on documented medical histories and billing information. Today, the landscape has expanded dramatically with the accessibility of genomics, imaging, diagnostic, and lab data, as well as social determinants of health (SDoH) data, driving new applications and deeper, more personalized insights into patient health and treatment efficacy.

Pairing multimodal data — ‘omics, imaging, and other unstructured RWD — with an effective data strategy can reveal powerful new insights.

Panelists noted that linking multimodal data — or data derived from multiple sources and types — can create rich datasets and power new research. In one example, aLife Sciences speaker described how his organization is leveraging DNA/RNA sequencing and clinical data for organoid simulations, enabling researchers to explore signals and treatment pathways from tissue-derived models.

In another example, a behavioral health company developed a digital screening device that records consenting patients at their homes. AI/ML models trained on real-world data then analyzed these recordings to detect signals and behavioral patterns indicative of early autism, reducing the time to diagnosis.

Leveraging multimodal data can be complex, and speakers emphasized the importance of robust data strategies, architectures, and governance frameworks. Cloud providers can help manage the size and scale of multimodal data and reduce spend on duplicative RWD, while also making data sharing secure and efficient.

Patient-consented clinical data are powering R&D workflows.

Patient-consented, HIPAA-compliant clinical data is increasingly being used to power large-scale R&D projects. The greater depth, breadth, and utility of robust EHR data is well-suited to supplement traditional RWD, particularly in therapeutic areas with rare or hard-to-find cohorts such as pediatrics and rare diseases. 

One case involved a cancer research registry that faced challenges in retrieving complete patient histories for a clinical study. By obtaining full medical records from consenting patients, including pathology notes, progress notes, radiology reports, and lab tests, the registry enriched its data model to include cancer staging, biomarker details, and comorbidities. 

In another example, a speaker described a study design that combined identified and de-identified workflows to enrich study data and deduplicate an external control arm for a respiratory therapy trial. Tokenization linked relevant trial EDC and PRO data with multiple RWD sources, including claims, EMR, and study drug distribution data, in a privacy-preserving manner.

RWD is reshaping how pharmaceutical products are developed, marketed, and delivered

RWD is supporting regulatory approval for medical devices.

Historically, medical device data lacked the granularity needed for market access and pricing, but several panelists noted that this has changed.

One speaker from a top pharmaceutical company described connecting RWD to pursue label expansions for cardiac medical devices. By linking medical device data with multiple EHR datasets to assess patient outcomes, the pharmaceutical company generated evidence on device safety. This innovative approach led to the FDA's first-ever approval for label expansion based solely on a comparative real-world evidence (RWE) study using EHR databases. This milestone has set the stage for 20 additional test cases funded by the FDA through the National Evaluation System for Health Technology.

In another example, aLife Sciences organization linked data to generate evidence on a wearable device to treat essential tremors, a condition that has limited treatment options. Researchers combined proprietary claims data, longitudinal patient-reported outcomes data, and device data to measure the device's impact on patient health, healthcare utilization, and costs. This evidence supported the device's 510(k) premarket clearance, demonstrating the transformative potential of RWD in advancing medical device innovation.

RWD can improve modeling, engagement, and measurement for commercial teams.

The landscape of healthcare marketing is rapidly evolving, and new technologies are enabling more personalized engagement with patients and providers. In particular, linking de-identified RWD givesLife Sciences organizations and their technology partners insights that can improve audience modeling and campaign measurement accuracy. This, in turn, leads to better patient engagement, with one speaker describing a 30x lift in acquisition for patients with an autoimmune disease.

New partnerships and technologies are making data connectivity and exchange more efficient, controlled, and privacy-centric.

Designing and implementing a linkable data strategy is essential for unlocking new data types and powering innovative clinical and commercial use cases. Efficiently tokenizing and connecting first and third-party data enables organizations to maximize the utility of their data, reduce duplicative spending, and enable researchers to access the data they need when they need it. 

However, this is just one part of the equation. Effective data strategies also depend on the ability to quickly and securely connect fit-for-purpose data. Panelists emphasized that “fit for purpose” involves taking the time upfront to determine the specific research questions, the types and attributes of data needed to answer those questions, and identifying the right partnerships to obtain that data.

RWD partnerships go deeper than price and data quality.

Panelists noted that there are multiple factors to consider when evaluating RWD, including ease and speed of access, transparency, reliability, and relevance. However, as research needs evolve, the degree to which a dataset is fit for purpose changes over time, and attendees generally agreed that the ideal state is where researchers can quickly and easily navigate the landscape of RWD partners to answer their specific research questions.

Clean rooms and federated data models have emerged as privacy-enhancing solutions for secure, efficient collaboration.

RWD practitioners are increasingly using cloud-based clean rooms as a trusted method for data exchange. Commonly used in advertising and marketing, clean rooms are secure environments that enable multiple parties to collaborate without exchanging data, allowing joint analyses on multiple datasets while adhering to privacy controls.

Speakers noted that the advent of clean rooms has encouraged organizations that historically have shied away from sharing data to provide access to de-identified data for health research. But they’re not only valuable for collaborating with external organizations — clean rooms also empower internal teams within an organization to share data in a secure, controlled environment. Bringing tokenization and data assessment tools to the cloud will further enhance the value of clean rooms, enabling organizations to efficiently de-identify, link, and match data without ever moving the data out of their environment.

Federated data models are another method of secure data collaboration, enabling data access and analysis across multiple distributed databases without centralizing the data. One panelist described success with using a federated data model to power oncology studies for multiple cancer diagnoses and Alzheimer’s studies. Multiple research sites ran local AI models on federated datasets, accessing rich clinical data without actually exchanging it. This privacy-centric collaboration model allowed researchers to examine disease progression over time while maintaining strict data privacy and security standards.

Guarded optimism on AI and Large Language Models (LLMs)

AI for predictive modeling is mature and widespread acrossLife Sciences organizations that have governance frameworks in place for model development and management. In the future, proprietary data will be the largest differentiator between the strength and quality of predictive AI models. To advance AI and build better models, organizations must be willing to provide access to high-quality data, whether in a shared setting where first and third-party datasets are being connected or in a federated fashion.

The type and quality of data used for training will also differentiate LLMs, which could transform how we approach data management and analytics. One panelist was optimistic that LLMs will reduce the need to structure data in the future: Researchers will simply be able to ask the model questions and generate analysis in real-time, accelerating the speed to insight.

However, there are still questions regarding proper governance with LLMs. Black box models can be problematic, noted one speaker, andLife Sciences organizations will need to clarify what data sources were used for training and what steps were taken to eliminate biases.

Privacy First and Always

As the volume, modalities, and applications of RWD continue to expand, it’s increasingly important to treat privacy as an essential component of and enabler for data strategy. Experts agreed that privacy should be built into the fabric of the data strategy, at both the enterprise level and project level.

The foundational principles of patient privacy remain unchanged, and speakers noted that researchers must master the essentials. To sustain patient trust and serve as good stewards of health data, organizations must continue to invest in clear governance frameworks, proper data hygiene, data minimization practices, transparency, and organizational literacy on privacy and compliance.

Numerous privacy tools and tactics are available to researchers. From clean rooms and federated data models to contractual controls like data use agreements and new technologies such as unstructured data redaction, researchers should be equipped with the resources they need to connect data securely and compliantly.

Final thoughts

This gathering demonstrated that RWD is becoming a linchpin in the machinery ofLife Sciences, contributing to breakthroughs in therapies, more personalized care, and better patient outcomes. The promise of RWD to transformLife Sciences and patient care is profound, and we left RWD Connect inspired by the progress being made through new technologies, applications, and collaborations. 

Thank you to all who attended and contributed their perspectives! For those unable to attend this year, RWD Connect 2025 will take place in Philadelphia in the spring of 2025. Stay tuned for more information!

Want to join the discussion? Reach out to learn how to join the nation’s largest RWD ecosystem.

Spotlight on AnalyticsIQ: Privacy Leadership in State De-Identification

AnalyticsIQ, a marketing data and analytics company, recently adopted Datavant’s state de-identification process to enhance the privacy of its SDOH datasets. By undergoing this privacy analysis prior to linking its data with other datasets, AnalyticsIQ has taken an extra step that could contribute to a more efficient Expert Determination (which is required when its data is linked with others in Datavant’s ecosystem).

AnalyticsIQ’s decision to adopt state de-identification standards underscores the importance of privacy in the data ecosystem. By addressing privacy challenges head-on, AnalyticsIQ and similar partners are poised to lead clinical research forward, providing datasets that are not only compliant with privacy requirements, but also ready for seamless integration into larger datasets.

"Stakeholders across the industry are seeking swift, secure access to high-quality, privacy-compliant SDOH data to drive efficiencies and improve patient outcomes,” says Christine Lee, head of health strategy and partnerships at AnalyticsIQ. 

“By collaborating with Datavant to proactively perform state de-identification and Expert Determination on our consumer dataset, we help minimize potentially time-consuming steps upfront and enable partners to leverage actionable insights when they need them most. This approach underscores our commitment to supporting healthcare innovation while upholding the highest standards of privacy and compliance."

Building Trust in Privacy-Preserving Data Ecosystems

As the regulatory landscape continues to evolve, Datavant’s state de-identification product offers an innovative tool for privacy officers and data custodians alike. By addressing both state-specific and HIPAA requirements, companies can stay ahead of regulatory demands and build trust across data partners and end-users. For life sciences organizations, this can lead to faster, more reliable access to the datasets they need to drive research and innovation while supporting high privacy standards.

As life sciences companies increasingly rely on SDOH data to drive insights, the need for privacy-preserving solutions grows. Data ecosystems like Datavant’s, which link real-world datasets while safeguarding privacy, are critical to driving innovation in healthcare. By integrating state de-identified SDOH data, life sciences can gain a more comprehensive view of patient populations, uncover social factors that impact health outcomes, and ultimately guide clinical research that improves health. 

The Power of SDOH Data with Providers and Payers to Close Gaps in Care

Both payers and providers are increasingly utilizing SDOH data to enhance care delivery and improve health equity. By incorporating SDOH data into their strategies, both groups aim to deliver more personalized care, address disparities, and better understand the social factors affecting patient outcomes.

Payers Deploy Targeted Care Using SDOH Data

Payers increasingly leverage SDOH data to meet health equity requirements and enhance care delivery:

  • Tailored Member Programs: Payers develop specialized initiatives like nutrition delivery services and transportation to and from medical appointments.
  • Identifying Care Gaps: SDOH data helps payers identify gaps in care for underserved communities, enabling strategic in-home assessments and interventions.
  • Future Risk Adjustment Models: The Centers for Medicare & Medicaid Services (CMS) plans to incorporate SDOH-related Z codes into risk adjustment models, recognizing the significance of SDOH data in assessing healthcare needs.

Payers’ consideration of SDOH underscores their commitment to improving health equity, delivering targeted care, and addressing disparities for vulnerable populations.

Example: CDPHP supports physical and mental wellbeing with non-medical assistance

Capital District Physicians’ Health Plan (CDPHP) incorporated SDOH, partnering with Papa, to combat loneliness and isolation in older adults, families, and other vulnerable populations. CDPHP aimed to address:

  • Social isolation
  • Loneliness
  • Transportation barriers
  • Gaps in care

By integrating SDOH data, CDPHP enhanced their services to deliver comprehensive care for its Medicare Advantage members.

Providers Optimize Value-Based Care Using SDOH Data

Value-based care organizations face challenges in fully understanding their patient panels. SDOH data significantly assists providers to address these challenges and improve patient care. Here are some examples of how:

  • Onboard Patients Into Care Programs: Providers use SDOH data to identify patients who require additional support and connect them with appropriate resources.
  • Stratify Patients by Risk: SDOH data combined with clinical information identifies high-risk patients, enabling targeted interventions and resource allocation.
  • Manage Transition of Care: SDOH data informs post-discharge plans, considering social factors to support smoother transitions and reduce readmissions.

By leveraging SDOH data, providers gain a more comprehensive understanding of their patient population, leading to more targeted and personalized care interventions.

While accessing SDOH data offers significant advantages, challenges can arise from:

  • Lack of Interoperability and Uniformity: Data exists in fragmented sources like electronic health records (EHRs), public health databases, social service systems, and proprietary databases. Integrating and securing data while ensuring data integrity and confidentiality can be complex, resource-intensive and risky.
  • Lag in Payer Claims Data: Payers can take weeks or months to release claims data. This delays informed decision-making, care improvement, analysis, and performance evaluation.
  • Incomplete Data Sets in Health Information Exchanges (HIEs): Not all healthcare providers or organizations participate in HIEs. This reduces the available data pool. Moreover, varying data sharing policies result in data gaps or inconsistencies.

To overcome these challenges, providers must have robust data integration strategies, standardization efforts, and access to health data ecosystems to ensure comprehensive and timely access to SDOH data.

SDOH data holds immense potential in transforming healthcare and addressing health disparities. 

With Datavant, healthcare organizations are securely accessing SDOH data, and further enhancing the efficiency of their datasets through state de-identification capabilities - empowering stakeholders across the industry to make data-driven decisions that drive care forward.

Demo

Connect to the Nation's Largest Health Data Ecosystem

Request a demo
See all blogs

Achieve your boldest ambitions

Explore how Datavant can be your health data logistics partner.

Contact us