1. The “Contribution Friction”
Every institution converting local EHR data to OMOP hits unmappable codes. Local lab abbreviations with no LOINC equivalent. Institutional procedure codes that don’t exist in SNOMED. Novel biomarkers that haven’t been added to any standard vocabulary yet. These unmapped codes are gaps - and they’re valuable. Each one is a potential contribution to the OHDSI vocabulary ecosystem. If your hospital has a local code for a clinical concept that SNOMED doesn’t cover, and you identify it, the OHDSI Vocabulary Team can add it in the next Athena release. Every institution that converts to OMOP after that benefits from your discovery. But finding these gaps is tedious. You have 3,000 local codes. Most will map fine. Maybe 50 are genuinely missing from the standard vocabularies. The other 2,950 are just lookup work. How do you find the 50 that matter? OMOPHub makes the gap detection step fast. Search for each local code programmatically. The ones that return no match (or only weak matches) are your gap candidates. That’s your shortlist for human review and potential OHDSI contribution. What OMOPHub does not do: submit contributions to OHDSI, add concepts to Athena, or determine whether a gap is “truly missing” vs. “just badly named.” Gap detection is automated; gap interpretation requires human clinical expertise. The OHDSI contribution itself goes through the community forums and vocabulary team - not through an API.2. The Core Concept: Search Failures as Signals
The insight is simple: if OMOPHub’s search can’t find a match for a clinical term, that’s information. Tiered search strategy:- Basic search with vocabulary and domain filters - catches exact and close matches
- Semantic search - catches misspellings, abbreviations, word-order variations
- If both fail - flag as a gap candidate
- Basic fails, semantic succeeds → probably a naming/abbreviation issue, not a real gap
- Both fail → likely a genuine gap or very institution-specific term
- Both return results, but wrong domain → mapping ambiguity, needs human review
3. Use Case A: Automated Gap Detection for Local Codes
A hospital joining an OMOP research network has 200 unique local diagnosis codes related to sepsis. The data engineer needs to identify which ones have no standard OMOP equivalent - those are the gaps to review manually and potentially contribute to OHDSI.Python
4. Use Case B: Categorizing Gaps for Targeted Action
Not all gaps are the same. A gap candidate could be:- A genuinely missing concept - the clinical idea doesn’t exist in any standard vocabulary (e.g., a brand-new biomarker)
- A missing mapping - the concept exists in SNOMED but not in the vocabulary you searched (e.g., exists as a Procedure, not a Condition)
- A local abbreviation - the term is too institution-specific for search to match, but the concept exists under a different name
- A composite concept - the local code combines multiple clinical ideas that are separate concepts in OMOP
Python
genuinely_missing - no matches anywhere. That’s your OHDSI contribution candidate. “Positive qSOFA Score” might find semantic_near_miss results (qSOFA-related SNOMED concepts exist but may not match the exact phrasing). “Atypical Sepsis Presentation” might get partial_match - “Sepsis” matches, but “Atypical” is too vague for a specific concept. Each category drives a different action.
5. The OHDSI Contribution Pathway
Once you’ve identified genuinely missing concepts, the contribution process is community-driven, not API-driven:- Prepare a gap report from your analysis (the output of Use Cases A + B)
- Post to OHDSI Forums (forums.ohdsi.org → Vocabulary category) describing:
- The missing concept and its clinical definition
- The source vocabulary where it originates (if applicable)
- How many records in your data use this code (impact/frequency)
- Any near-miss concepts from your OMOPHub search (shows you did due diligence)
- OHDSI Vocabulary Team reviews the proposal and decides whether to add it
- If accepted, it appears in the next Athena vocabulary release
- Update your local vocabularies from Athena to get the new concept
- USAGI - OHDSI’s mapping tool for the manual review step (reviewing near-misses, approving mappings)
- Athena - Where accepted contributions land (vocabulary downloads)
- OMOPHub - Fast vocabulary search for gap detection (no local vocab DB needed)