Silent fragmentation
One person shows up as multiple patients across eligibility, claims, and clinical sources, which breaks downstream patient-level measures.
In-Warehouse Product
Patient identity breaks quietly, then it breaks trust. EMPI Engine + Workbench gives healthcare data teams transparent matching, analyst-grade stewardship, and auditable golden records without moving PHI outside the warehouse.
Deploy the matching engine, configuration editor, and review workbench inside Snowflake, Databricks, or Microsoft Fabric.
Keep every threshold, rule, and analyst decision inspectable so governance teams can understand exactly how a person_id was created.
Product brief
EMPI turns identity resolution into governed reconciliation: every match, split, merge, and survivorship choice can be inspected before it shapes downstream measures.
Problem
Teams often end up choosing between dbt-only pipelines with no usable analyst workflow and SaaS platforms that require PHI to leave the cloud environment.
One person shows up as multiple patients across eligibility, claims, and clinical sources, which breaks downstream patient-level measures.
Stakeholders cannot explain why records matched, why they did not, or how to tune the clerical-review band responsibly.
Uncertain candidates pile up with no deliberate work queue, merge controls, or audit trail that holds up under governance review.
Approach
EMPI combines the warehouse-native linkage engine with the config editor and review workbench so the same operating model covers matching, stewardship, and downstream publishing.
Blocking rules, thresholds, survivorship, and scoring behavior are visible and tunable instead of buried inside an opaque service.
dbt-native SQL and Python models execute where the source data already lives, under your IAM and network controls.
Analysts can split, merge, ignore, and review golden records through a real workbench, with decisions flowing back into the next run.
Capabilities
A guided configuration layer keeps blocking strategy, thresholds, and survivorship logic readable and versionable.
The workbench turns ambiguous candidates into an operational queue instead of a spreadsheet exercise.
Implementation
EMPI implementations follow a clear sequence so matching quality, analyst workflow, and downstream reporting impacts are validated before broad adoption.
Profile source identity quality, define the matching universe, and align survivorship logic to downstream reporting expectations before running at scale.
Stand up the engine, config editor, and review workbench in your cloud environment, then tune thresholds with stewards against real candidate quality.
Move split, merge, and review operations into day-to-day workflows with auditable controls so identity quality improves continuously after go-live.
Next step
We can talk through your current cloud platform, upstream source contract, and the operational workflow that needs to be supported before we recommend an implementation path.