1. The “Silo Tax”
Every hospital that joins an OMOP network faces the same mapping wall. Their lab system uses “Cr_Serum” for creatinine. The one across town uses “CREAT_BLD.” The academic medical center uses “Creatinine_S_mg.” All three mean the same thing: LOINC 2160-0, Creatinine [Mass/volume] in Serum or Plasma. But each hospital maps it independently. Three data engineers, three weeks of work, three times the cost - for the same result. Scale this across 500 hospitals and 800 unique lab codes each, and you’re looking at hundreds of thousands of hours of duplicated mapping labor per year. This is the Silo Tax: the cost of every institution solving the same vocabulary problem in isolation. The OHDSI community has partial solutions. USAGI is an open-source mapping tool that helps data engineers find standard concept matches for local codes. Athena distributes the vocabularies themselves. The OMOP CDM has asource_to_concept_map table designed specifically to store local-to-standard mappings. But none of these provide a way to share completed mappings between institutions.
OMOPHub doesn’t solve the sharing problem either - it doesn’t have a collaborative mapping repository. But it accelerates the creation of mapping files by providing fast vocabulary search without requiring a local Athena installation. And because mapping files follow a standard format (source_to_concept_map), they can be shared between institutions manually - via GitHub, shared drives, or project-specific data exchanges.
This article shows the practical workflow: use OMOPHub to build mapping files fast, format them as source_to_concept_map records, and share them between sites participating in the same research network.
Honest scoping: OMOPHub does not currently have a shared mapping repository, user-contributed mappings, or collaborative mapping API. The workflow described here uses OMOPHub for vocabulary search + standard file formats for sharing. A future collaborative mapping feature would be valuable - but this article works with what exists today.
2. The Core Concept: The source_to_concept_map as a Sharing Format
The OMOP CDM includes a table purpose-built for local-to-standard mappings:
source_to_concept_map for their lab codes and Hospital B has similar local codes, Hospital B can import Hospital A’s mappings as a starting point - reviewing and adjusting as needed.
OMOPHub’s role: Populating the target_concept_id and target_vocabulary_id columns. For each local code, search OMOPHub to find the standard concept match, then write the result into the mapping table format.
3. Use Case A: Building a Shareable Mapping File with OMOPHub
A multi-site sepsis research project needs all participating hospitals to map their local lab codes for inflammatory markers to standard LOINC concepts. Instead of each hospital starting from scratch, the coordinating center builds an initial mapping file using OMOPHub, then distributes it.Python
source_to_concept_map-formatted file in minutes - work that would take days of manual USAGI review. The output is a CSV that can be shared with other hospitals in the network. Hospital B imports it, reviews the mappings (adjusting for their local code variants), and has a head start on their own mapping work.
4. Use Case B: Importing and Adapting a Shared Mapping File
Hospital B receives Hospital A’s mapping file. Their local codes are different (“CRP_Serum” instead of “CRP_HS”), but the target OMOP concepts are the same. They adapt the shared file and fill in the gaps.Python
source_to_concept_map format makes it portable. Hospital B’s new mappings (like IL-6) get added to the shared file for Hospital C.
5. The Sharing Workflow
The practical workflow for a multi-site research network: Step 1: Coordinating center builds initial mapping file (Use Case A)- Extract all unique local codes from the first participating site
- Look up each via OMOPHub
- Human reviews and approves each mapping
- Save as
source_to_concept_mapCSV
- Share via GitHub repo, shared drive, or project data package
- Include review metadata (_match_method, _reviewed, _reviewer)
- Match their local codes against the shared mappings by display name similarity
- Use OMOPHub to fill gaps (codes not in the shared file)
- Contribute new mappings back to the shared file
- The mapping file grows with each new site
- Later sites have fewer gaps to fill
- The coordinating center merges contributions and resolves conflicts
- Centralized, API-accessible mapping repository (search other institutions’ mappings)
- Confidence scoring based on number of institutions that agree on a mapping
- Version-controlled mapping provenance
- Automated conflict detection when institutions map the same local code differently
6. Conclusion: Share Mappings, Not Just Vocabularies
The Silo Tax persists because sharing local-to-standard mappings is harder than it should be. Standard vocabularies are shared (via Athena). Phenotype definitions are shared (via OHDSI PhenotypeLibrary). But the translation layer - thesource_to_concept_map that each site builds laboriously - stays locked in institutional silos.
OMOPHub makes building that mapping file fast: search for a local code display name, get back the standard OMOP concept, write it to the mapping table. The source_to_concept_map format makes the file portable: CSV in, CSV out, same schema everywhere.
The missing piece isn’t technology - it’s workflow discipline. If your research network adopts the pattern of building, sharing, and iterating on mapping files, the Silo Tax drops with every site that joins. The first hospital does 100% of the mapping work. The second does 30%. By the fifth, it’s mostly review and edge cases.
Start with one mapping file. Build it with OMOPHub. Share it with your network. Let the next site extend it. That’s how collaborative mapping works in practice - not with a magic platform, but with standard formats and a fast vocabulary API.