Rudi kwa BlogKitaalamu

Air-Gapped PII: Jinsi ya Kuweza Anonymize Hata Bilingual

Air-gapped networks (disconnected) ni sehemu ya asili kwa serikali na healthcare. Jinsi ya kuweza anonymize PII offline. Rust, spaCy, Presidio standalone.

March 3, 20268 dakika kusoma
offlineair-gapdesktopITARGDPRgovernmentdefenselocal processing

Air-Gapped Nini?

Air-gapped network ni network iliyotengana kutoka kwenye internet. Serikali, healthcare, na military zina:

  • Hakuna WiFi
  • Hakuna VPN
  • Hakuna wanaofanya kazi na internet

Mfano:

  • Serikali: NSA, CIA, Pentagon
  • Healthcare: VA (Veterans Affairs), TRICARE
  • Nchi nyingine: Rusi, China, Iran

Changamoto ya Anonymization

Mfano #1: VA Hospital

  • 50K medical records na PII
  • Haja kugamia namna ya HIPAA (kupungua jina, SSN, MRN)
  • Lakini hakana internet — haiwezi kutumia anonym.legal API

Mfano #2: Pentagon

  • 1M emails na siri za serikali
  • Haja kugamia (FOIA requests)
  • Lakini hakana cloud access — haiwezi kutumia ChatGPT au Claude

Kuwaeza Anonymization

Hekima #1: Standalone Presidio

Presidio anaweza kuendesha standalone bila API:

# Install Presidio
pip install presidio-analyzer presidio-anonymizer

# Kutumia command-line
echo "John Smith at john@example.com" | presidio_analyzer \
  --entities PERSON EMAIL PHONE_NUMBER

# Output: [(PERSON, 0, 10), (EMAIL, 18, 40)]

Hekima #2: Desktop App na Tauri

anonym.legal Desktop app ni Rust-based, inakimbiya offline:

// src/main.rs
use presidio_analyzer::Analyzer;
use presidio_anonymizer::Anonymizer;

fn main() {
    let analyzer = Analyzer::new();
    let anonymizer = Anonymizer::new();
    
    let text = "John Smith lives at john@example.com";
    let results = analyzer.analyze(text, ENTITIES);
    let anonymized = anonymizer.anonymize(text, results);
    
    println!("{}", anonymized);
    // Output: [PERSON_1] lives at [EMAIL_1]
}

Hekima #3: spaCy Offline

spaCy (NLP library) anaweza kuendesha offline pia:

# Download spaCy model once
python -m spacy download en_core_web_md

# Kisha kutumia offline
python -c "import spacy; nlp = spacy.load('en_core_web_md'); print(nlp('John Smith'))" 

Air-Gapped Mipango

Kwa Serikali / Healthcare:

  1. Install Presidio standalone kwenye air-gapped machine
  2. Download spaCy models kwenye internet, kisha copy USB
  3. Endesha Desktop app (Tauri) offline
  4. Kumfuta sehemu: Presidio + Rust binary + spaCy models = 500MB

Kwa Waandishi (GDPR/HIPAA):

  1. Export PII data kwenye CSV
  2. Tunza Presidio offline
  3. Kumfuta sehemu, kisha import cleaned CSV
  4. Hakana internet + hakana leaks

Mipango ya Nchi

anonym.legal kwa Air-Gapped:

  • Desktop app (v7.4.13) — kuendesha offline, zero internet
  • Rust backend (spaCy + Presidio) — standalone binaries
  • Batch anonymization — process 1000s CSV rows offline
  • Zero-Knowledge — hakana server, hakana leaks possible

Tayari kulinda data yako?

Anza kuanonymisha PII na aina 285+ za vitu katika lugha 48.