By · Last updated 2026-05-18

Vissza a BlograGDPR & Megfelelés

Datatilsynet Dánia: CPR-szám Modulo-11 Érvényesítés...

A dán CPR-szám 10 jegyű, modulo-11 ellenőrző jeggyel. A 1954 előtt születetteknek speciális CPR-formátumuk van.

May 18, 20267 perc olvasás
Denmark DatatilsynetCPR modulus-11Danish healthcare GDPRhealth data anonymizationNordic compliance

Denmark's Datatilsynet issued 31 GDPR enforcement decisions in 2024, with 14 specifically involving healthcare data systems — a concentration reflecting the high stakes of Denmark's comprehensive national health data infrastructure and the technical failures that repeatedly expose patient data.

CPR-Number: The Modulus-11 Requirement

The CPR number (Det Centrale Personregister-nummer) — 10 digits, format DDMMYY-XXXX — encodes birth date (digits 1-6) and a sequential number with check digit (digits 7-10). The final digit is validated using modulus-11 arithmetic:

Modulus-11 check: multiply digits 1-9 by weights (4,3,2,7,6,5,4,3,2), sum, take modulo 11. If result is 0, check digit = 0. If result is 1, the CPR is invalid (no valid check digit exists for this prefix). Otherwise, check digit = 11 minus result.

This creates the important property that some DDMMYY-XXXX patterns can never be valid CPR numbers (those where the modulo-11 calculation produces 1). Tools that pattern-match 10-digit numbers formatted as DDMMYY-XXXX without modulus-11 validation generate false positives from date strings, reference numbers, and invoice codes.

67% of generic NLP tools lack CPR modulus-11 implementation (Datatilsynet 2024). This detection failure is the single most cited technical inadequacy in Datatilsynet's healthcare enforcement actions.

Denmark's Health Data Research Ecosystem

Denmark's health registers — among the most complete longitudinal health datasets in the world — are linked through the CPR number. The CPR enables researchers to link:

  • Hospital discharge records (from 1977)
  • Prescription database (from 1995)
  • Cancer registry (from 1943)
  • Cause of death registry (from 1970)
  • Primary care diagnosis data (from 1990)

This linkability makes Danish health research world-class but creates a re-identification risk that Datatilsynet takes seriously: even "de-identified" datasets that retain CPR-linked attributes (age, sex, diagnosis, year) can be re-identified in combination with other datasets.

Datatilsynet's 2024 guidance on secondary health data use requires that organizations using these registers demonstrate:

Technical anonymization documentation: Not a policy statement, but technical documentation showing exactly which identifiers were removed, which quasi-identifiers were generalized, and what k-anonymity level was achieved in the output dataset.

Third-party validation for research datasets: For research datasets with more than 5,000 individuals, Datatilsynet recommends independent technical review of anonymization procedures.

Data minimization: Research dataset scope must match the documented research question. Datatilsynet has found multiple cases where researchers used complete national registers when a random sample or geographically limited dataset would have served the research purpose.

Specific Healthcare Enforcement Findings

Datatilsynet's 14 healthcare enforcement decisions in 2024 document recurring technical failures:

Case pattern 1: Hospital shares de-identified patient dataset with academic research partner for AI training. Dataset contains CPR birth date components, diagnosis codes, and treatment dates. Datatilsynet finds the combination enables re-identification of rare disease patients (small denominator problem — unusual diagnoses narrow identification significantly).

Case pattern 2: Health tech startup processes Danish patient data through US-based AI API for clinical documentation support. CPR numbers in medical notes are transmitted to US servers without adequate transfer mechanism and without prior CPR detection and removal.

Case pattern 3: Insurance company processes medical certificate data for disability claims. CPR numbers in scanned PDF certificates are not detected by the company's OCR-plus-extraction pipeline (OCR converts image to text; text is processed but without CPR validation, many CPR numbers are missed in the OCR output due to formatting artifacts).

The OCR-plus-extraction failure mode is particularly common in healthcare contexts where documents are received as scanned images. CPR detection must work on OCR-processed text, which often introduces formatting inconsistencies (spaces inserted mid-number, dash position errors) that break simple pattern matching.

For Danish healthcare GDPR compliance: CPR detection with modulus-11 validation in both clean text and OCR-processed output, Danish-language NER (spaCy da_core_news), and technical anonymization documentation meeting Datatilsynet's 2024 secondary use standards are the minimum requirements.

Sources:

Készen áll az adatai védelmére?

Kezdje el a PII anonimizálását 285+ entitástípuson 48 nyelven.

About this page

We update this page when our platform or the law changes.

Read our founder note for how we work.

Each change shows up in the timestamp at the top.

Related reading

We follow these rules

  • GDPR (EU 2016/679).
  • ISO/IEC 27001:2022.
  • NIS2 (EU 2022/2555).
  • HIPAA safe harbor under 45 CFR § 164.514(b)(2).

Our promise

We do not sell your data.

We do not train models on your text.

We store your files in Germany.

You can delete your account at any time.

You own your work.

Where we run

Our servers live in Falkenstein, Germany.

We use Hetzner. They hold ISO 27001 certification.

All data stays in the EU.

Backups run every day.

Need help?

Email support@anonym.legal.

We reply within one business day.

How we test

We run a full check suite on every release.

Each surface gets its own sweep script and report.

Human reviewers spot-check the output each week.

We track recall and precision on a labelled set.

Bad runs block the deploy.

What we never do

  • We never sell your information to third parties.
  • We never train models on what you upload.
  • We never keep your work after you delete it.
  • We never share keys with any outside firm.
  • We never run ads inside the product.

Plans in plain words

We sell credits, not seats.

One credit covers one short job.

Long jobs use a few credits each.

You can top up at any time.

Unused credits roll over each month.

Read the plans page for current rates.

Who built this

A small team of engineers and lawyers built this.

We ship from Europe and work in the open.

Our founder note spells out why we started.

Where to start

How the parts fit

A browser add-on cleans text inside Chrome.

A Word plug-in handles drafts in Office.

A small desktop tool works on whole folders.

An agent protocol link feeds large models safely.

All four share one core engine and one rule set.