By · Last updated 2026-05-02

返回博客GDPR 与合规

全球个人信息合规:GDPR、LGPD与DPDP

巴西CPF、印度Aadhaar和美国SSN在格式和验证逻辑上存在根本差异。LGPD和印度DPDP法案分别将CPF和Aadhaar列为受保护数据,单一国家工具无法满足全球合规要求。

May 2, 20268 分钟阅读
global PII complianceBrazilian CPF detectionIndian Aadhaar DPDPLGPD compliancemulti-regulatory PII

全球个人信息合规:三项法律,三种标识格式

一家英国市场平台处理来自80个国家的卖家文件,同时面临三项法律:针对欧盟卖家的GDPR、针对巴西卖家的LGPD,以及针对印度卖家的DPDP法案。每项法律指定了不同的国家身份标识为受保护数据,每种格式有其独特的校验逻辑。

巴西CPF:格式与LGPD状态

CPF(Cadastro de Pessoas Físicas,个人纳税人登记号)是巴西的纳税人识别码,共11位,格式为XXX.XXX.XXX-XX,末两位为校验位,通过对前九位数字进行数学运算得出。

巴西LGPD将CPF视为受保护的个人标识符,敏感程度与美国SSN相当。不了解CPF格式的工具无法识别它;跳过校验和验证的工具则会产生误报。

印度Aadhaar:格式与DPDP规则

Aadhaar是由印度唯一身份认证局(UIDAI)签发的12位号码,随机分配,末位为Verhoeff校验位。

印度DPDP法案对处理Aadhaar关联数据的主体规定了相应义务。检测需要两个步骤:首先匹配12位格式并验证Verhoeff校验位,其次结合上下文进行过滤——并非所有12位数字串都是Aadhaar号码。

美国SSN:已知结构

SSN由9位数字组成:前三位为地区号,中间两位为组号,末四位为序列号。每段均有固定规则,验证方法有完善文档记录。

单国工具与全球规则之间的差距

这三种标识的格式和校验规则完全不同。为美国市场开发的工具能够识别SSN,但可能完全遗漏CPF和Aadhaar。

大多数团队是在监管机构询问时才发现这一缺口,而非事先防范。该缺口在三项法律下均产生实质性风险:

  • GDPR第28条要求与每个数据处理方签署书面数据处理协议。如果数据集同时包含CPF号码,而DPIA只将「SSN检测」列为主要控制措施,审计人员能够发现这一已记录在案的缺口。
  • LGPD罚款最高可达巴西收入的2%,每次违规上限为5000万雷亚尔。未检测到的CPF构成直接的LGPD违规。
  • DPDP执法尚处于早期阶段。现在记录覆盖范围的团队,将在早期裁决确立标准时处于更有利的位置。

三套罚款体系同时运转,风险叠加。单一国家工具将全球团队置于暴露之下。

完整覆盖的技术要求

工具需要了解每种标识的格式、校验算法和法律背景。CPF需要模块化校验和,Aadhaar需要Verhoeff校验加上下文过滤,SSN需要地区和组别规则。这是三个独立的技术问题,没有任何单一匹配规则能够全部覆盖。

另请参阅:全球PII标识缺口:SSN、CPF、Aadhaar巴西ANPD LGPD执法指南印度DPDPA隐私法技术合规

参考资料

准备好保护您的数据了吗?

开始使用 285 种实体类型在 48 种语言中匿名化 PII。

About this page

We update this page when our platform or the law changes.

Read our founder note for how we work.

Each change shows up in the timestamp at the top.

Related reading

We follow these rules

  • GDPR (EU 2016/679).
  • ISO/IEC 27001:2022.
  • NIS2 (EU 2022/2555).
  • HIPAA safe harbor under 45 CFR § 164.514(b)(2).

Our promise

We do not sell your data.

We do not train models on your text.

We store your files in Germany.

You can delete your account at any time.

You own your work.

Where we run

Our servers live in Falkenstein, Germany.

We use Hetzner. They hold ISO 27001 certification.

All data stays in the EU.

Backups run every day.

Need help?

Email support@anonym.legal.

We reply within one business day.

How we test

We run a full check suite on every release.

Each surface gets its own sweep script and report.

Human reviewers spot-check the output each week.

We track recall and precision on a labelled set.

Bad runs block the deploy.

What we never do

  • We never sell your information to third parties.
  • We never train models on what you upload.
  • We never keep your work after you delete it.
  • We never share keys with any outside firm.
  • We never run ads inside the product.

Plans in plain words

We sell credits, not seats.

One credit covers one short job.

Long jobs use a few credits each.

You can top up at any time.

Unused credits roll over each month.

Read the plans page for current rates.

Who built this

A small team of engineers and lawyers built this.

We ship from Europe and work in the open.

Our founder note spells out why we started.

Where to start

How the parts fit

A browser add-on cleans text inside Chrome.

A Word plug-in handles drafts in Office.

A small desktop tool works on whole folders.

An agent protocol link feeds large models safely.

All four share one core engine and one rule set.

Words from our team

We started this work after a lunch about cookies.

One friend kept getting odd ads on her phone.

We asked why a court file leaked through a draft.

We sketched the first build on a napkin that week.

By month three we had a tiny demo for a friend.

She used it on her first case the next day.

Common questions we hear

Can the tool read scanned PDFs? Yes, with OCR.

Does it work on long files? Yes, in small chunks.

Can I roll my own rule set? Yes, save it as a preset.

Does it run offline? The desktop build runs offline.

Do you keep my files? No, the cloud build wipes after each run.

Will it learn from my work? No, we never train on inputs.

A short tour of the workflow

Upload a file or paste a snippet of prose.

Pick the entities you want gone from the draft.

Choose a method: replace, mask, hash, encrypt, or redact.

Press run and watch the side panel show each hit.

Skim the result and tweak any rule that misfired.

Save the cleaned file or send it to a teammate.