Podcast
Brought to you by
Season 4 • Episode 22

Breaking Down Data Silos: Datavant's Vision for Connected Health Information

Claire Manneh, Head of Provider at Datavent

Episode Highlights
Datavant is helping researchers apply AI to massive, linked health datasets, unlocking insights that were previously hidden in siloed systems.
Timestamp: 7:49
Datavant enables global clinical trials by linking real-world data across countries while navigating privacy and compliance barriers.
Timestamp: 11:37
Datavant prioritizes data security using advanced methods like tokenization and audit trails
Timestamp: 18:47

About this episode

We caught up with Claire Manneh, Head of Provider Research at Datavant, to discuss how her team is working to connect the world's health data and break down long-standing silos in healthcare information.  Datavant's innovative approach to data connectivity is paving the way for more comprehensive patient care, advanced research, and improved health outcomes.

Subscribe

In this episode, we cover:

0:00 Introduction + Claire's background

2:36 Overview of Datavant and its mission

3:49 Example: Substance misuse data commons project

5:54 Datavant's roadmap and exciting developments

7:49 AI and data analysis in healthcare research

9:15 Population health projects and achievements

11:37 Global clinical trials and international considerations

14:38 A day in the life at Datavant

16:27 Measuring success and project metrics

18:47 Cybersecurity and data protection measures

21:11 Future vision for health data integration

24:16 Challenges and headwinds in the industry

26:21 How to connect with Claire and Datavant

Chris Hoyd  0:08  

Welcome back to Product in Healthtech,  a community for health tech product leaders by product leaders, I'm Chris Hoyd, a principal at Vynyl. Today, I sat down with Claire Manneh, head of provider research at Datavant. We touch on several topics in the realm of health data connectivity and its impact on healthcare research and outcomes. In our conversation, we explore Datavant's mission to connect the world's health data through their innovative protect, connect and deliver approach, including the power of tokenization in linking disparate health datasets while maintaining patient privacy and data security. Claire offers valuable insights into how Datavant is working to break down data silos in healthcare, empowering researchers and healthcare providers with more comprehensive patient data. Let's dive in. Can you start by telling us a little bit about your journey into health tech and what led you to your current role as the head of provider research at Datavent?

 

Claire Manneh  1:02  

Sure. And first of all, thank you so much for having me. It's a pleasure to be here, Chris. I have always wanted to do something in healthcare. I was pre med - was going through all the classes for med school and to be to go into med school. And realized, after a couple of years from graduating that I wanted to do something that was impactful outside of actually delivering medicine, and worked in bench research, clinical research, consulting. I spent several years working in patient safety, and then got into tech, working at a startup that did second opinions and then found my way to Datavant. And what excited me the most about Datavant is that this was a piece of technology that we had always thought about from years ago when I was on the clinical research side, and it was about linking a patient's health history to follow their journey across the healthcare continuum. And I've been at Datavant for about four years now, and every day excites me so much because I get to work with the very researchers, those who are physicians and those who are academic researchers who are making the impact in healthcare, and I'm excited because I get to be a small sliver of that as part of their work. 

Chris Hoyd  2:29  

Okay, so Claire, for those who might not be familiar, could you give us a brief overview of Datavant and its mission in healthcare?

Claire Manneh  2:36  

Sure, Datavant's mission is to connect the world's health data. We do that in three ways. We protect, connect, and deliver, and they're right above me right now. We protect the data because on the de-identified side, where I primarily focus my work in, is to link data sets using a token. We're connecting the data sets by empowering researchers at life science organizations, at academic med centers and patient advocacy groups to take their very private and precious data and either enhance it with another data set that's coming from the ecosystem, like an SDOH, a social determinants of health data set, or a claims data set. And so they're connecting it. And there are often times where we may need to deliver the tokens, not the data, but we may need to deliver the tokens, match them, and then deliver it to whomever the data aggregator is.

Chris Hoyd  3:37  

That's pretty technical. It's like in the world of healthcare, it's fairly abstract. Can you connect that you know, offering your product to population health or health outcomes?

Claire Manneh  3:49  

A really good example, I can share, one that brings me back to the days when I did do clinical research, is I'm working with researchers at the University of Wisconsin at Madison, and they built a substance misuse data commons. This is a platform that brings together EHR data from the University of Wisconsin health system, their local emergency medical services data their Department of Public Health. There's an all payer claims database. They have social determinants of health data, mortality data, all of this data are combined in a platform that they created using the Datavant token and all the different agencies installed the Datavant software behind their firewall. So we don't own any of the data. We don't house the data. We don't serve as a marketplace in that that regard, but they've been able to link all of their data sets together, and the honest data broker at the University of Wisconsin takes all those tokens together, links them, appends the the necessary data that goes with it and has this complete history of patient data across the state, and they're working to bring about other data sets from other EHRs so that they can get a more complete picture of Wisconsin's population. And this is so exciting, because now they're able to look at trends and follow what's happening at the EMS level. Are we seeing that there are an influx or any trends towards certain types of substance misuse and and looking at at the health of of these patients during treatment and after treatment as well.

Claire Manneh  4:28  

What is exciting to you on Datavant's roadmap, it sounds like you guys are increasingly integral to a lot of very complex stakeholder relationships. Is it growth? Is it new capabilities? What you know? What's exciting to you?

Claire Manneh  5:54  

It's everything that you just mentioned. I think it's exciting to see how many more hospitals, health systems, academic med centers that I work with, and then I also have colleagues who work on the registry side of the house. So anything related to a patient advocacy group, our government organizations, to all of them, are coming together, plus the data aggregators that we work with and life sciences, we're beginning to see a flywheel of all of those parties work together to better understand a patient's journey, to improve health outcomes, to deliver better care, to understand the differences between certain drugs, how they interact with patients across different populations and regions, health equity. The other exciting thing that I'm beginning to see a lot of is the interest in unstructured data. Currently, we work very closely on structured data. So de-identifying the EHR claims record, that's all structured data that now, especially with an increase in the use of different AI technologies, is taking the clinical notes, looking at imaging which we haven't tapped into yet, but hope to one day. But mainly on the clinical notes, is a space that I can't wait to see it come to fruition for a lot of these studies.

Chris Hoyd  7:24  

Yeah I was curious about that. So it sounds like you guys have, you know, there's no shortage of data, right? There might be the bottleneck might be turning that into or your customers turning that into actionable insights. And so it sounds like you're hoping or expecting that AI might help with that. Do you do you guys do that internally, or do you hope your clients will develop some wherewithal around that?

 

Claire Manneh  7:49  

I primarily work with researchers at academic med centers and health systems, and they truly are the brains behind a lot of the AI technology that they're working on. We also have AI capabilities within some of our tools. Our match tool is built on a machine learning algorithm, and it's been trained on billions of public health data records out there. But other than that, most of the research is done by the researchers, whether it's a life science company or an academic med center. They're the ones who are really synthesizing the data. After we've been able to help them link it together, understand it across different data sets, they really distill that and do their analyzes and apply any models to them. The example I shared with you, with the professor at University of Wisconsin Madison, he's been able to create machine learning models to better understand the data sets, even on the structured side, so they all use their own, own methods to understand it.

 

Chris Hoyd  8:58  

Before we jumped into this conversation, you mentioned that you were really proud of the work that your team on the population health, you know, side of things, what you guys do. Outside of the Wisconsin example, is there anything else that you want to highlight within that realm that you're you feel good about?

 

Claire Manneh  9:15  

Yes. I mean, I don't know where to start. There are so many different studies that we work with. Another adjacent hospital to UW Madison is the Medical College of Wisconsin. They put together a registry for sickle cell disease, and sickle cell disease is commonly found among black men in the US, and there isn't a proper registry for this, they received a grant from the CDC, among several other states, to do this as well. They similarly have been able to link their their EHR data to claims data, other SDOH data to better understand the population in Wisconsin and and then other types of. Studies that we work on, believe it or not, there's still a lot of research around Covid. We're still trying to understand what's the long term effects of Covid, and we're seeing a lot of researchers still getting funding and getting refunded on their Covid studies. One of those studies that that we had been a big part of is the National Covid Cohort Collaborative. The NIH put together this collaborative across almost 100 different hospitals in the US, and they all linked to their EHR data to this this platform. And now that we've done a lot of work around Covid. They've renamed it to be the National Clinical Cohort Collaborative. It's still N, 3c but they are studying things beyond conditions and diseases beyond Covid, and that's something really important for us, because a lot of researchers can tap into that beyond the work of Covid, if there's another type of disease that they want to look into that's something that they can reach out to that team to to get a better understanding of their research.

 

Chris Hoyd  11:11  

Okay, awesome. So I think in a previous podcast conversation that you were on you talked about global clinical trials as a potential future focus of the Datavant team, or the is that still a possibility? And if so, how are you thinking about the unique challenges, ethical considerations that come with, you know, tokenizing and linking health data across different countries and healthcare systems?

 

Claire Manneh  11:37  

It's very special, because a lot of the hospitals we work with have relationships with with those at the NHS and and then they have relationships with Ministries of Health and other countries, Australia, Japan, so we know that there are relationships there. We've invested a lot of time working on the HIPAA equivalent in Europe, GDPR, to ensure that we have software that is accessible and usable across different ecosystem partners that have the health data or the hospitals themselves. We're beginning to work on that now and then, of course, I should mention a lot of life science companies are based in Europe, and they have offices in Europe as well. So there is a lot of partnerships that we've already enabled and turned on in in Europe today, on the clinical trial side, but bridging the academic side to the clinical trials with the life sciences is still new, and still working through that. But hopefully in the next several months, we'll begin to see those those linkages actually happen. To answer your question about what is required, what's needed to enable and activate those partnerships. We're working in de-identifying the the data and tokens that - and I should also mention what tokens are in more detail - a token is made up of different PHI or PII, so names, dates of birth or addresses, zip codes and those particular PHI PII elements, are masked into that 44 character hash string, and that can be linked to other data sets. So we can work a lot across the alphanumeric numbers, but we haven't been able to convert that to a different language; Arabic, Chinese, Japanese, etc, Hindi. So those we we haven't worked in yet, but mostly the Roman alphabet is primarily where, where we're working on this. To be able to activate a lot of these partnerships, the considerations we need to be aware of are how our address is captured. Social Security is not used outside of the US, but there could be other numbers, medical numbers and NHS number. So those are the ways in which we're helping to develop those tokens in other countries,

 

Chris Hoyd  14:16  

With our sort of emphasis on product in health tech and and the sort of big tent of product leadership and strategy and research we do try to get a little bit tactical here. So I'm curious, if you could just talk a little bit about what your maybe average day or average week looks like within Datavant.

 

Claire Manneh  14:38  

I would say a lot of the times it's talking to the customer, scoping new projects that they might have. And they, they may not know what we do in its entirety. A lot of the times they they know that we we link disparate health data sets together, but they may not know that we work with genomics partners. They may not know that we're doing the unstructured data for the clinical notes. So those are really exciting, because I get to cross collaborate with my team, engineers, product those who work on our privacy hub, which help us certify the data as well. And we obsess over that piece. Our clinical trials team as well - I often bring the subject matter expert to a lot of these calls, because they can speak to a lot of the work that we touch on outside of the provider and public sector space that I'm in. And then I have a lot of time that I spend talking to to new providers and educating them on on what we do. These are mostly provider researchers, and so a lot of times they either know about Datavant and they want to to find a way to excel at their work or they don't. So I have to start from scratch, too, and it's not always easy explaining what we do, but anytime that you tell a researcher that we could link this data to that data, that gets them so thrilled and excited to learn more.

 

Chris Hoyd  16:14  

That's awesome. Okay, so how do you measure the success of of your team's projects or initiatives? Are there, like, you know, specific KPIs or OKRs?

 

Claire Manneh  16:27  

So once we sign a contract, after we've gone through the security reviews with with the hospital or academic med center, and they're ready to install the software behind their firewall, we want to make sure that they are successful in tokenizing, creating tokens on their patient population, and they have access to a project dashboard, so if they're working on multiple projects, they could see where the tokens have been created for a specific project. And the success metric is to know when everybody has been on board to link their data sets together. So that example project I shared with you about Wisconsin, it's one data aggregator, a data coordinating center rather, and they are waiting to get links from all the other data sources, so making sure all those other data sources have tokenized and submitted their their tokens to them is is a successful metric to me. I also want to know what is the status post connection of all these data sets. Have they been able to analyze the data, and then, are they publishing? Are they having speaking engagements, doing poster sessions. I come from an academic world too, so I I'm so invested in knowing what is the impact of our tools on the patient population. So I want to see the soup to nuts of the entire research itself and and bring it back to our team, because I realize a lot of engineers and our executive assistants and others accounting team, they can work anywhere they want, but they come to Datavant because they know that the work that we do is is very impactful in our community. So I want to be able to share back, because we were able to activate these linkages. And in this particular study, this is what we learned out of the population.

 

Chris Hoyd  18:29  

In kind of this era of escalating cyber threats, especially in the healthcare space - how does Datavant fit into that? How do you guys think about that internally, and how might you work with your partners to guard against that?

 

Claire Manneh  18:47  

And this is a space that is quite constant, and there's always going to be some type of cyber threat. I was on a call with a very large health system a month ago, and they had to pause their work with us because they were addressing a cyber threat that they had, and it didn't impact our work, but in the sense that we weren't involved in the cyber threat itself, but they had another issue internally that didn't involve the tokenization process. So those are things that we are all aware of and alert, and we have alerts that we send out to let the team know that this is going on, and then just explore, is this impacting us in any way, but thankfully, in any of the cyber threats that have gone on at hospitals and other academic med centers, they haven't involved the Datavant tokenization, and I think this is one of the most important things that we obsess over. We obsess over data governance. How is data being shared in the hospital? Who's accessing the tokenization? I work very closely with the C-suite at the hospitals, because they also want to know who is using the software and how is this being managed. And the good news is that the Datavant software is something that is is encrypted, it's site specific, so when they are creating tokens, they're not sharing anything that is going to breach any patient confidentiality, and so that that's one of the reasons why we spend a lot of time and resources and in ensuring that the tokens, when shared, are not going to release any any PHI or PII.

 

Chris Hoyd  20:36  

So let's assume, you know, some years out, that Datavant has been, you know, smoothly adopted by some critical threshold of stakeholders. And so the benefits of your approach are starting to be, starting to be felt. You know, in real ways. What do you think that world looks like? Are there any structural changes that would need to take place for that to have happen from a regulatory standpoint to a technology standpoint, maybe it hinges on what AI becomes capable of. Just curious what that, what that dream state looks like.

 

Claire Manneh  21:11  

I'm going to start from my, my little microcosm of this, this ecosystem that I live in, and then take it to the larger state. So within hospitals and academic med centers, more and more, I'm beginning to see multiple silos that are in hospitals exist at the much larger enterprise side of the hospital. So a CIO, a chief information officer or chief data officer will be able to create their own platform, so their researchers can tap into that use data in any way that it's available on a Datavant token and empower the research that these researchers are working on. Today. it's siloed in such a way that everyone is on their own island, and they're creating their own data sets, and oftentimes researchers at that level don't even know that the data is already there. And so if there's a way in which every hospital, every AMC can can connect and have access to that data, that would be most useful. We're seeing on the government side, the importance of bringing the EHR data sets, the patient advocacy data, and any other types of real world data into some sort of platform that they may be creating, that could be at the sub agency level, or it could potentially be one day at the NIH or the CDC at large, and I think that is going to help connect all those spokes to one place in a privacy preserving record linkage way, and all those other research studies could be benefiting from that data set, instead of, again, all these multiple silos that we see today.  Now zooming out even further with my counterparts on the ecosystem side, that's when life science organizations, many of whom we work with, they'll be able to have partnerships at the agency level or at the hospital level, and ensure that there's a way to to get patients to a clinical trial and be able to identify where there are times and opportunities to to get those patients to support them in a trial. And this is huge in oncology, huge in in adult congenital heart diseases and and even in pediatric cases as well and rare diseases. And I think this is going to help that flywheel move.

 

Chris Hoyd  23:49  

Now that that makes me curious if you have any sort of, you know, personal beliefs about where headwinds might come from in terms of technological developments, is it AI and biotech creates more demand for clinical trials like, what are you sort of excited about, in terms of what you're seeing in the economy and ecosystem that might accelerate that vision?

 

Claire Manneh  24:16  

I think the headwinds are, are mostly going to be on a case by case basis, and and having institutional review board IRB approvals consent for specific use cases, there are times where we you can do health surveillance, population health research without the need to consent the population, but when it becomes very specific At a clinical trial level, there's the headwind of having to oftentimes, go back to to the patient and request them directly. And if you're working on a patient population from from a health system that's trying to ask anywhere from 10s of 1000s to millions of patients if they would be open to that. So I think there needs to be a way to to ask the patient from the get go, before establishing this type of platform, to ensure that, going forward, they they will be considered for for any type of trial that can come up and and help save their lives. And the other area of headwinds and challenges is making sure that everyone is aware of what this is. It's still new to the patient population, to the consumer. The technology itself is not new. It's been around for for decades now, but trying to voice this over to researchers, to the patients like ourselves, to understand its value is something that's going to take some more time to. The researchers who we work with are quite pleased and and happy to share this with our patient population. I think that is going to be one of the biggest benefits, is ensuring that there's a feedback loop of the research going back to the patient community to understand this is the value that this type of data brings.

 

Chris Hoyd  26:14  

Listeners who might want to reach out to you or have additional questions about datavant, how can they get ahold of you?

 

Claire Manneh  26:21  

If someone wants to get in touch with me, they're welcome to email me directly, [email protected] or they can message me on LinkedIn.

 

Chris Hoyd  26:29  

Thank you so much for joining us. You can also connect with us on LinkedIn, YouTube or on our website, www.productinhealthtech.com. If you have ideas or suggestions on what you'd like to hear in a future episode, or if you'd like to be a guest, please shoot us an email at [email protected].

About Claire Manneh

Claire Manneh, Head of Provider Research at Datavant.

Follow Name Claire Manneh
LinkedIn

About Datavent

Datavant is a data platform company for healthcare whose products and solutions enable organizations to move and connect data securely. Through proprietary technology, the world's most robust health data network, and value-added services we protect, connect, and deliver the world's health data. Datavant enables more than 40 million healthcare records to move between thousands of organizations, including 75% of the 100 largest U.S. health systems, 100% of U.S. payers and an ecosystem of 300+ real-world data partners.

Your Strategic Product Development Partner | © 2025 VYNYL LLC