Tilbage til BlogGDPR & Overholdelse

CNIL France: GDPR Compliance Under France's Data Protection Authority — What Technical Teams Must Know

CNIL processed 16,433 complaints in 2023 and fined €150M+ since 2019. Its AI guidance mandates documented anonymization for training data. Here's what technical teams must implement.

March 7, 20267 min læsning
CNIL FranceFrench GDPRAI anonymizationFrench data protectionprivacy by design

CNIL's Position as EU's Most Technically Demanding DPA

France's Commission Nationale de l'Informatique et des Libertés (CNIL) publishes the EU's most detailed and technically specific guidance on data protection. Where most EU DPAs issue general guidance, CNIL publishes "recommandations" — detailed technical specifications that constitute the CNIL's interpretation of what GDPR compliance requires.

This technical rigor has established CNIL as the EU benchmark for privacy engineering. Other EU DPAs frequently reference CNIL's technical publications, particularly its 2023 "Guide pratique de l'anonymisation" (practical guide to anonymization) and 2024 generative AI guidance.

CNIL processed 16,433 complaints in 2023 — a 43% increase from 2022 — and has issued approximately €150M in GDPR fines since 2018. The acceleration in complaint volume reflects both increasing public awareness and CNIL's outreach campaigns encouraging data subjects to exercise their rights.

CNIL's AI Training Data Anonymization Requirements

CNIL's 2024 generative AI guidance ("Systèmes d'IA générative") establishes binding requirements for organizations training AI models on French personal data or deploying AI systems that process French users' data.

The guidance identifies six mandatory anonymization categories for AI training data:

  1. Identifiants directs (direct identifiers): Names, addresses, identification numbers — must be removed or replaced before AI training
  2. Identifiants quasi-directs (quasi-identifiers): Combinations of attributes that enable re-identification — must be assessed for k-anonymity
  3. Données sensibles (special categories): Health, biometric, political, religious data — must be segregated with additional anonymization measures
  4. Données comportementales (behavioral data): Browsing history, interaction patterns — must be aggregated or pseudonymized
  5. Données inférées (inferred data): AI-inferred characteristics from behavioral data — subject to purpose limitation controls
  6. Données relatives aux mineurs (children's data): Any data potentially relating to persons under 15 — mandatory age verification and enhanced anonymization

For organizations using LLMs trained on web-scraped data (a common approach), CNIL's guidance requires documentation that the training data was assessed against these six categories and appropriate anonymization applied.

The "Guide Pratique de l'Anonymisation" Requirements

CNIL's 2023 anonymization guide is the EU's most detailed official guidance on what technically constitutes anonymization. Key requirements:

Anonymization techniques endorsed by CNIL:

  • k-anonymity: ensuring each record is indistinguishable from at least k-1 other records
  • l-diversity: requiring diversity in sensitive attributes within equivalence classes
  • Differential privacy: adding calibrated noise to statistical outputs
  • Pseudonymization (explicitly noted as not anonymization but a risk reduction measure)

Documentation requirements: CNIL's guide requires that organizations maintain a "fiche d'anonymisation" (anonymization record) for each processing activity using anonymization, documenting: the anonymization technique applied, the parameters used (k value for k-anonymity, epsilon value for differential privacy), the assessment of residual re-identification risk, and the validation methodology.

Re-identification risk assessment: CNIL requires organizations to conduct a re-identification risk assessment before claiming data is anonymized. The assessment must consider: the "motivated intruder" test (could a motivated individual re-identify the data?), available auxiliary datasets, and the specific context of the data.

CNIL's French-Language PII Detection Considerations

For organizations processing data in French, CNIL's guidance implicitly requires that PII detection tools cover French-language PII. French-specific entity types that must be detected:

  • Numéro de Sécurité Sociale (NIR): 13-digit French Social Security Number with specific format validation
  • Carte vitale number: Health insurance card identifier used in French healthcare administration
  • Numéro d'identification au répertoire (NIR): Population registry identifier
  • SIRET/SIREN: Business identifiers that may appear in personal business contexts
  • Numéro d'ordre professionnel: Professional registration numbers (doctors, lawyers, accountants)
  • Carte nationale d'identité (CNI): French national ID card number

French NER models for person name detection must also handle French naming conventions: compound names (Jean-Pierre), hyphenated names, particles (de, du, des), and French-specific name patterns.

CNIL Enforcement: The AI Fine Pattern

CNIL's enforcement actions against AI systems establish the precedent for what "adequate technical measures" means in the AI context:

Clearview AI (€20M fine, 2022): Processing biometric data of French individuals without legal basis, collected from public web sources. Established that bulk web-scraping of personal data for AI training requires explicit legal basis.

TikTok investigation (2024-2025 ongoing): Focused on algorithmic recommendation systems that may infer sensitive categories from behavioral data. CNIL's investigation methodology has become the EU standard for AI system audits.

Generative AI review (2024-2025): CNIL conducted systematic reviews of LLM vendors operating in France, focusing on training data provenance and anonymization. Vendors without documented anonymization procedures for French users' data were required to implement controls.

The pattern: CNIL enforcement focuses on technical inadequacy — the absence of documented technical controls — rather than purely on procedural violations.

Implementing CNIL-Compliant Anonymization Documentation

For French organizations or organizations serving French users, a CNIL-compliant anonymization posture requires:

1. Fiche d'anonymisation (anonymization record) for each processing activity:

  • Processing purpose and data categories
  • Anonymization technique applied (with parameters)
  • Re-identification risk assessment outcome
  • Validation method (testing, external review)
  • Responsible person and review date

2. Pre-processing for AI systems:

  • Document the PII detection tool and configuration used
  • Record the entity types detected and removed/pseudonymized
  • Maintain processing logs for CNIL audit requests

3. French-language PII coverage:

  • Verify detection coverage for French-specific identifiers (NIR, carte vitale, CNI)
  • Validate French NER model performance on French personal names
  • Document coverage gaps and compensating controls

4. Training data provenance:

  • For AI systems trained on web-scraped data: document the source dataset anonymization assessment
  • For AI systems trained on user data: document the user data anonymization process

CNIL inspection requests for AI systems routinely include requests for these documents. Organizations with pre-existing documentation satisfy inspection requirements significantly faster than those conducting assessments reactively.

Sources:

Klar til at beskytte dine data?

Begynd at anonymisere PII med 285+ enhedstyper på tværs af 48 sprog.