De-identify surgical pathology reports under HIPAA Safe Harbor – CCPA/HIPAA-compliant de-identification per 45 CFR §164.514(b)(2)
Surgical pathology reports link patient identifiers — name, MRN, date of biopsy, specimen accession number — to histologic findings and oncologic staging information. anonym.legal removes all 18 §164.514(b)(2) Safe Harbor identifiers from pathology reports while preserving histologic descriptions, cancer staging, surgical margin status, and biomarker results for peer review, oncology audit, and tumor board case studies.
When this applies
Apply this workflow when pathology reports are shared with tumor board members at external institutions, included in oncology registry submissions requiring de-identified data, or prepared as educational case materials, and patient identity must be removed under the Safe Harbor standard.
How anonym.legal handles it
- Upload pathology reports in PDF or DOCX format to anonym.legal.
- The engine identifies Safe Harbor identifiers in the report header and body: patient name, MRN, date of birth, date of service, specimen accession number, and ordering and signing pathologist names.
- All 18 Safe Harbor categories are removed; specimen accession numbers (certificate or license numbers under category (11)) are replaced with synthetic codes.
- Histologic diagnosis, tumor grade, TNM staging, surgical margin findings, and immunohistochemistry results are preserved in full.
- Ordering and signing pathologist names are removed; their roles (e.g., 'attending pathologist', 'fellow') are retained.
- A de-identification certificate is generated and the de-identified report is delivered for approved secondary use.
What you provide
- Surgical pathology reports (PDF or DOCX)
- Specimen accession number list (to verify batch completeness)
Limitations & cautions
- Pathology reports for rare tumor histologies in small regional populations may remain re-identifying even after Safe Harbor de-identification; supplement with Expert Determination review for rare-entity cohorts.
- The tool de-identifies the narrative report but does not process associated tissue blocks or glass slides — physical specimen handling remains the responsibility of the pathology department.
- Molecular pathology reports containing CLIA laboratory identifiers (certificate numbers) must include those identifiers in the de-identification scope; confirm field coverage with the laboratory informatics team.
FAQ
Are specimen accession numbers treated as identifiers under Safe Harbor?
Yes. Specimen accession numbers are certificate or license numbers within the meaning of §164.514(b)(2)(i)(K) when they can be linked back to an individual patient. They are replaced with synthetic codes in the de-identified output.
Can de-identified pathology reports be submitted to the National Cancer Database (NCDB)?
NCDB submissions are governed by the Commission on Cancer data-use agreement and typically require identified data for the initial registry submission. De-identified pathology reports under this workflow are more appropriate for secondary analytical or educational uses rather than primary registry submissions.
Are molecular biomarker results (e.g., HER2, PD-L1, KRAS) preserved in the de-identified report?
Yes. Molecular biomarker results, IHC scores, and mutation status fields are clinical data, not identifiers, and are preserved in full in the de-identified output.