코드 없이 HIPAA용 커스텀 MRN 탐지 구현

의료 기록 번호(MRN)는 병원마다 형식이 다릅니다. HIPAA 세이프 하버는 MRN 제거를 의무화합니다. 코드 없이도 커스텀 MRN 탐지를 구현할 수 있습니다.

George CurtaApril 30, 20268 분 읽기

custom MRN detectionHIPAA pipeline configurationno-code regexAI pattern helperhospital identifier de-identification

MRN 형식 문제

미국에는 약 6,100개의 병원이 있습니다. 각 병원은 자체 EHR 시스템을 운영합니다. 각 병원은 고유한 의료 기록 번호(MRN) 형식을 사용합니다. 국가 표준이 없습니다. 합동위원회(Joint Commission)는 병원이 환자를 식별할 수 있을 것을 요구하지만 형식 규칙은 정하지 않습니다.

형식은 매우 다양합니다. 7자리 정수를 사용하는 곳이 있습니다. 8자리 정수를 사용하는 곳도 있습니다. HOSP-, MRN-, PT- 같은 접두사 코드를 사용하는 곳도 있습니다. SVHS-, CHOP- 같은 기관 코드를 추가하는 곳도 있습니다. 번호에 입원 연도를 포함하는 경우도 있습니다.

HIPAA 세이프 하버는 환자 기록 번호를 18개 식별자 유형 중 8번으로 규정합니다(45 CFR §164.514(b)(2)). 18개를 모두 제거해야 합니다. 이 규칙은 특정 형식에만 적용되지 않습니다. 병원이 커스텀 형식을 사용한다면 그것도 탐지해야 합니다. 이를 놓치는 도구는 나머지 17개를 모두 제거하더라도 세이프 하버를 충족하지 못합니다.

코드 방식의 한계

비식별화 파이프라인에 커스텀 기록 번호 형식을 추가하는 표준 방법은 Microsoft Presidio를 확장하는 것입니다. 이는 Python 코딩을 의미합니다.

개발자는 EntityRecognizer를 확장하는 클래스를 만들고, 정규식을 작성하고, Presidio 레지스트리에 연결하고, 테스트하고, 유지보수해야 합니다. 코딩을 거의 하지 않는 컴플라이언스 팀에게 이는 심각한 장벽입니다. 형식이 바뀔 때마다 엔지니어가 필요합니다.

의료 엔지니어는 바쁩니다. EHR 통합과 임상 시스템에 집중합니다. 컴플라이언스 도구는 최우선 순위가 아닌 경우가 많습니다.

코드 없는 패턴 워크플로

가이드 기반 패턴 방식은 코딩 단계를 없앱니다.

컴플라이언스 담당자가 웹 앱의 커스텀 엔터티 생성기를 열고, 자신의 시스템에서 샘플 번호 5개를 붙여넣습니다. 예시:

SVHS-0012345
SVHS-0987654
SVHS-1122334
SVHS-4455667
SVHS-8899001

패턴 생성 버튼을 클릭합니다. AI가 구조를 읽고 다음을 반환합니다:

패턴: SVHS-\d{7}
신뢰도: 높음
제안 이름: HOSPITAL-MRN
제안 대체값: [MRN]

담당자가 샘플 5개를 더 붙여넣습니다. 패턴이 통과합니다. HIPAA 프리셋에 저장합니다.

이후부터는 모든 세션 — 웹 앱, Office 추가 기능, 데스크톱 앱, API — 이 표준 PHI 처리 과정에서 이 형식을 자동으로 탐지합니다. 코드가 필요 없습니다.

GDPR 제89조는 연구 데이터셋에 대한 가명 처리를 요구합니다. 커스텀 엔터티는 기관별 고유 식별자를 적용 범위에 포함시켜, 범용 도구가 남겨두는 격차를 해소합니다.

결과

이 워크플로는 반나절이면 충분합니다. 커스텀 코드로 구현하면 몇 주가 걸립니다.

컴플라이언스 담당자가 패턴을 정의하고, 테스트하고, 배포합니다. 티켓도, 대기도 필요 없습니다. 프리셋에는 커스텀 엔터티와 함께 표준 17개 세이프 하버 식별자가 모두 포함됩니다.

다음 번 임상 기록 배치 처리 시, 18개 식별자 유형이 모두 처리됩니다. 세이프 하버가 완성됩니다.

실제 세이프 하버 운영 방식은 의료 연구를 위한 HIPAA 세이프 하버 비식별화를 참조하십시오. 병원별 탐지 패턴은 엔지니어링 없이 병원별 MRN 형식 탐지를 참조하십시오.

참고 자료

데이터 보호를 시작할 준비가 되셨나요?

48개 언어로 285개 이상의 엔티티 유형으로 PII 익명화를 시작하세요.

무료 체험 시작 기능 보기

About this page

We update this page when our platform or the law changes.

Read our founder note for how we work.

Each change shows up in the timestamp at the top.

We follow these rules

GDPR (EU 2016/679).
ISO/IEC 27001:2022.
NIS2 (EU 2022/2555).
HIPAA safe harbor under 45 CFR § 164.514(b)(2).

Our promise

We do not sell your data.

We do not train models on your text.

We store your files in Germany.

You can delete your account at any time.

You own your work.

Where we run

Our company HQ is in Saarbrücken, Germany. Our servers run in Hetzner's Falkenstein datacenter.

Hetzner holds ISO 27001 certification.

All data stays in the EU.

Backups run every day.

Need help?

Email support@anonym.legal.

We reply within one business day.

How we test

We run a full check suite on every release.

Each surface gets its own sweep script and report.

Human reviewers spot-check the output each week.

We track recall and precision on a labelled set.

Bad runs block the deploy.

What we never do

We never sell your information to third parties.
We never train models on what you upload.
We never keep your work after you delete it.
We never share keys with any outside firm.
We never run ads inside the product.

Plans in plain words

We sell credits, not seats.

One credit covers one short job.

Long jobs use a few credits each.

You can top up at any time.

Unused credits roll over each month.

Read the plans page for current rates.

Who built this

A small team of engineers and lawyers built this.

We ship from Europe and work in the open.

Our founder note spells out why we started.

Where to start

How the parts fit

A browser add-on cleans text inside Chrome.

A Word plug-in handles drafts in Office.

A small desktop tool works on whole folders.

An agent protocol link feeds large models safely.

All four share one core engine and one rule set.

Words from our team

We started this work after a lunch about cookies.

One friend kept getting odd ads on her phone.

We asked why a court file leaked through a draft.

We sketched the first build on a napkin that week.

By month three we had a tiny demo for a friend.

She used it on her first case the next day.

Common questions we hear

Can the tool read scanned PDFs? Yes, with OCR.

Does it work on long files? Yes, in small chunks.

Can I roll my own rule set? Yes, save it as a preset.

Does it run offline? The desktop build runs offline.

Do you keep my files? No, the cloud build wipes after each run.

Will it learn from my work? No, we never train on inputs.

A short tour of the workflow

Upload a file or paste a snippet of prose.

Pick the entities you want gone from the draft.

Choose a method: replace, mask, hash, encrypt, or redact.

Press run and watch the side panel show each hit.

Skim the result and tweak any rule that misfired.

Save the cleaned file or send it to a teammate.

코드 없이 HIPAA용 커스텀 MRN 탐지 구현

MRN 형식 문제

코드 방식의 한계

코드 없는 패턴 워크플로

결과

참고 자료

관련 기사

HIPAA MRN Detection Without a Regex PhD

HIPAA: Hospital-Specific MRN Detection

HIPAA Safe Harbor De-ID at Scale

데이터 보호를 시작할 준비가 되셨나요?

코드 없이 HIPAA용 커스텀 MRN 탐지 구현

MRN 형식 문제

코드 방식의 한계

코드 없는 패턴 워크플로

GDPR 연구 참고사항

결과

참고 자료

관련 기사

HIPAA MRN Detection Without a Regex PhD

HIPAA: Hospital-Specific MRN Detection

HIPAA Safe Harbor De-ID at Scale

데이터 보호를 시작할 준비가 되셨나요?

About this page

Related reading

We follow these rules

Our promise

Where we run

Need help?

How we test

What we never do

Plans in plain words

Who built this

Where to start

How the parts fit

Words from our team

Common questions we hear

A short tour of the workflow