OHDSI

Harmonising observational data, and more, for patient- and population-level predictions

The OHDSI suite is an open-source, modular solution that enables organizations to explore 360° patient journeys and turn data into evidence. The ecosystem provides a broad range of tools that cover all aspects of real-world data and evidence, from data characterization to a standardized data model (the OMOP CDM). This enables large-scale cross-database analytics with OHDSI.

Training and workshops

Our workshops have trained 150+ participants from more than 20 European and US-based life science organizations that have decided to adopt the OMOP CDM and the OHDSI tool suite. We help bring various stakeholders within an organization to the same level of expertise and understanding of the multi-faceted OHDSI ecosystem.

Data Quality Assessment

Drawing on our extensive knowledge of and experience with the OHDSI data quality assessment tools, The Hyve can help you verify the quality of your data after each conversion into the OMOP Common Data Model (CDM). This approach helps ensure that your CDM-harmonised data is analysis-ready and of the highest possible quality.

Consultancy

Our data engineers and semantic experts help identify relevant analysis use cases for your data and enable integration of real-world data with other data sources. Our expertise lies in using and integrating healthcare data for a wide range of purposes, from clinical and observational research to virtual trials and value-based healthcare.

OMOP CDM conversion

From a fully outsourced approach to ETL to hands-on training aimed at self-service transformations, our experts can provide support with mapping electronic health records, registry and commercial claims data.

Deployment

We can help with deployment of the OHDSI stack in your existing infrastructure – in the cloud or on-premises – under the Apache 2.0 license.

Development

Our data engineers can develop custom functionality on top of an existing stack and integrate it with existing technologies and software.

"The Hyve is one of Europe’s leading technology IT services providers who have established an international reputation within the biomedical informatics domain, from open standards such as OHDSI to working with FAIR principles. With their passionate leadership, they have been involved in numerous projects including the European Health Data & Evidence Network (EHDEN). I have no doubt that all of these projects have benefited greatly from their thinking, insights and hands-on expertise."

Nigel Hughes, Industry Lead at IMI EHDEN Project

"The workshops provided by The Hyve were very helpful to understand and perform all the processes needed to transform our cohort data into the common data model (OMOP) and use the cutting-edge OHDSI tools for analyzing observational studies. We are glad to have been introduced to the OHDSI real-world data platform and networks by the experts of The Hyve."

Alexis Sentís Fuster, Epidemiologist at Centre for Epidemiological Studies of Sexually Transmitted Disease and AIDS in Catalonia (CEEISCAT)

“We are thrilled with the ETL development to OMOP CDM and data quality validation services produced by The Hyve. What truly sets The Hyve apart is their commitment to collaboration and knowledge sharing. Throughout the project, their team worked closely with ours, empowering us with the skills and insights needed to optimize our in-house ETL processes. Together, we implemented key improvements to our pipelines, introducing innovative techniques to streamline operations and bolster data quality testing. Together with The Hyve team, we've not only achieved our immediate goals but also laid a foundation for continued success in managing and leveraging our data assets.”

Vicki Theurer Crider, Senior Technical Project Manager at Critical Path Institute

"The Hyve's acknowledgment of the FAIR principles and close connection with the OHDSI community result in an efficient support of open science with respect for data privacy and sensitivity. Our cooperation with The Hyve on harmonization of two health data sources within the international projects BigData@Heart and EHDEN was very smooth. Perfect communication, together with The Hyve’s experience in iterative and agile development using synthetic data helped to separate the development process and the ETL deployment on a client side."

Spiros Denaxas and Václav Papež

"We recently engaged The Hyve to support the setup of our first OMOP ETL pipeline with large volumes of multi-source real-world data (medical claims, electronic health records, and laboratory test results) on our Wayfinder Platform, built on Databricks. The Hyve’s team of data engineers brought deep expertise in the OMOP standard and was able to begin ETL development within several days from onboarding. What also stood out to us was their collaborative and customer-centric approach - quality, speed in execution, and tangible outcomes were important to us. The Hyve shared weekly progress and demos, leveraging our data expertise to inform and improve design decisions. Knowledge transfer during their implementation was continuous, so that our life science and analytics teams could independently manage and develop the ETL post-implementation. With The Hyve’s support, we now have a strong foundation for our OMOP transformation and are well-positioned to scale our real-world data initiatives."

Glynn Dennis, PhD, Chief Science Officer at Kythera Labs

We contribute to European flagship projects such as IMI EHDEN, a public-private partnership that leverages and further improves the OMOP/OHDSI ecosystem. The aim of the project is to build a federated network for analysis of longitudinal observational real-world data from hundreds of millions of European patients.

Frequently Asked Questions

What are the most important benefits of harmonizing data to the OMOP Common Data Model?

The OMOP CDM provides a common structure and standardized vocabulary for observational healthcare data, making it easier to combine, analyze, and reuse data across organizations and studies. Once data is harmonized, one can leverage the broader OHDSI ecosystem for cohort definition, data quality assessment, large-scale analytics, and federated research.
For many organizations, the biggest benefit is that data is transformed once and can then be reused for multiple research, regulatory, and analytics use cases.
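
As a rough illustration of what "common structure and standardized vocabulary" means in practice, the sketch below converts one source patient record into a row shaped like the OMOP CDM person table. The source field names (patient_no, sex, dob) and the helper to_person_row are hypothetical and only serve the example. The gender concept IDs 8507 (male) and 8532 (female) are the standard OMOP gender concepts, and unmapped values fall back to 0, the OMOP convention for "no matching concept". A real ETL covers many more fields and tables.

```python
from datetime import date

# Standard OMOP gender concepts; any other value maps to 0 ("no matching concept").
GENDER_CONCEPTS = {"M": 8507, "F": 8532}


def to_person_row(source: dict) -> dict:
    """Convert a hypothetical source record into an OMOP-style person row."""
    birth = date.fromisoformat(source["dob"])
    sex = source["sex"].strip().upper()
    return {
        "person_id": source["patient_no"],
        "gender_concept_id": GENDER_CONCEPTS.get(sex, 0),
        "year_of_birth": birth.year,
        "month_of_birth": birth.month,
        "day_of_birth": birth.day,
        # Keep the original value so the mapping stays traceable.
        "gender_source_value": source["sex"],
    }


row = to_person_row({"patient_no": 1, "sex": "f", "dob": "1980-05-17"})
unmapped = to_person_row({"patient_no": 2, "sex": "X", "dob": "1975-01-02"})
```

Because every harmonized source ends up with the same column names and concept IDs, an analysis written once against this structure can run unchanged on another OMOP database.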

Do you need access to our source data to harmonize it to OMOP?

Not necessarily. Every project has different governance and privacy requirements. In many cases, we can start by reviewing metadata, data dictionaries, synthetic data, and source system documentation. The OHDSI open source tool suite provides tooling for initial source data profiling. Direct access to source data may be helpful during mapping validation and troubleshooting, but our goal is always to minimize access to sensitive data while ensuring a high-quality OMOP transformation.

How much effort is required to maintain an OMOP database?

The largest investment is typically the initial harmonization project, including source data profiling, mapping, ETL development, and data quality validation. After that, maintenance is often limited to periodic data refreshes, vocabulary updates, source system changes, and ongoing data quality monitoring.
A well-designed, automated ETL pipeline can significantly reduce the effort required to keep an OMOP database up to date.

Will all of our data fit into OMOP?

Most structured healthcare data from sources such as EHR systems, registries, and claims databases can be well represented in OMOP CDM, including diagnoses, procedures, medications, laboratory results, measurements, encounters, and patient demographics.
Some highly specialized or local data elements may require additional modeling decisions. Rather than forcing data into a standard, we work with data owners to determine which information is most valuable for future research and how it can best be represented within the OMOP CDM or by using custom extensions.

Our data is already available in FHIR or openEHR. Does that help?

Yes. FHIR and openEHR provide well-structured data models and often contain valuable semantic information that can simplify the OMOP mapping.
However, they serve a different purpose. FHIR and openEHR focus on interoperability and data exchange, while OMOP is designed to support observational research and large-scale analytics. Having data available in FHIR or openEHR can accelerate parts of the transformation process, but a dedicated OMOP harmonization step is still required.

How do you ensure consistency across multiple data partners in a research network?

Consistency requires more than a common data model. Successful networks combine standardized vocabularies, shared mapping conventions, common quality controls, and transparent governance.

We typically support research networks by implementing standardized ETL processes, reviewing mapping across partners, and running OHDSI data quality tools to identify differences in implementation. This helps ensure that databases from multiple organizations can be analyzed in a consistent and reproducible manner.
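
To give a feel for the kind of cross-partner comparison involved, here is a minimal sketch that computes, per data partner, the share of records mapped to a standard concept (a concept ID other than 0) and flags partners below a threshold. It is not the OHDSI data quality tooling itself; the function names, the 0.9 threshold, the partner names, and the non-zero concept IDs are all illustrative assumptions.

```python
def mapped_fraction(concept_ids):
    """Share of records mapped to a standard concept (concept_id != 0)."""
    if not concept_ids:
        return 0.0
    return sum(1 for c in concept_ids if c != 0) / len(concept_ids)


def flag_partners(partner_concepts, threshold=0.9):
    """Return, sorted by name, the partners whose mapped fraction is below the threshold."""
    return sorted(
        name
        for name, ids in partner_concepts.items()
        if mapped_fraction(ids) < threshold
    )


# Illustrative input: one list of concept IDs per (hypothetical) partner.
partners = {
    "site_a": [1001, 1001, 0, 1002],  # 3 of 4 records mapped
    "site_b": [1001] * 10,            # all records mapped
}
flagged = flag_partners(partners)
```

A large gap in mapping coverage between partners is often a sign that mapping conventions have diverged, which is worth reviewing before running a network study.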

Engage with our OHDSI team

Our mapping experts can work with EHR, EMR, and registry data as well as the most popular commercial and claims datasets.
We train all new mapping service providers in the EU (EHDEN).
We integrate OHDSI with semantic standards.