Data Methodology
How the IQWorks Data Observatory collects, verifies, and maintains the data that powers our enforcement fines explorer, breach landscape, regulatory heatmap, and ransomware tracker.
Our Principles
Primary Sources First
We prioritize data from official regulatory publications, DPA press releases, and government gazettes over secondary reporting.
Verifiable Claims
Every data point links back to a source URL. Statistics we publish are derived from data you can inspect in our interactive explorers.
Regular Updates
Datasets are refreshed on a regular cadence using automated ingestion pipelines with manual review.
Transparent Limitations
We document known gaps, estimation methods, and coverage boundaries for each dataset.
Enforcement Fines
Coverage
419 actions from 2018-11-26 to 2026-02-22
Sources (33)
AEPD, AEPD Spain, AP, AP Netherlands, APD, Baden-Württemberg DPA (LfDI BW), Bavarian DPA (BayLDA), Bayerisches Landesamt für Datenschutzaufsicht, and 25 more
Enforcement action data is collected from official Data Protection Authority (DPA) publications across Europe, India, the United States, and other jurisdictions. Each action is recorded with the organization name, issuing authority, regulation violated, fine amount in the original currency and EUR equivalent, violation types, industry classification, and a severity score.
Fine amounts in non-EUR currencies are converted using the exchange rate at the date of issuance. Where fine amounts are expressed as a range, we record the upper bound. Severity scores (1-10) are computed based on fine magnitude relative to the issuing authority's typical range, the nature of violations, and whether the action involved repeat offenses.
Data Breaches
Coverage
953 breaches from 2007-07-12 to 2026-03-13
Primary Source
Have I Been Pwned (HIBP)
Breach data is sourced primarily from Have I Been Pwned (HIBP), supplemented with breach disclosures from SEC filings, DPA notifications, and news reporting. Each breach is enriched with industry classification, attack vector categorization, affected data types, and estimated financial impact.
Records affected counts reflect the number of unique accounts or records compromised as reported by the breached organization or verified through HIBP. Financial impact estimates use the IBM/Ponemon Cost of a Data Breach methodology applied to the number of records and industry sector.
Global Regulations
Coverage
174 countries and territories
Sources
LLM-generated (Claude Haiku), IQWorks Enforcement Fines Explorer, Manual verification, ITU Global Cybersecurity Index 2024, OECD AI Policy Observatory
Regulatory data covers privacy law status, cybersecurity law status, and AI governance status for 174 countries. Each country profile includes the law name, enactment year, DPA name, robustness assessment, breach notification requirements, DPO obligation status, cross-border transfer restrictions, and EU adequacy decisions.
Robustness assessments (heavy, moderate, limited, inadequate) are based on the scope of the law, enforcement mechanisms, individual rights granted, and international benchmarking. AI governance data is sourced from the OECD AI Policy Observatory and manual verification of national AI strategies and regulations.
Ransomware & Cyber Incidents
Coverage
61 major incidents + 1551 CISA KEV CVEs
Sources
CISA KEV, LLM-curated historical incidents
Ransomware incident data covers publicly documented attacks with known ransomware groups, ransom demands, payments, and organizational impact. CISA Known Exploited Vulnerabilities (KEV) data is sourced directly from the CISA catalog and cross-referenced with ransomware campaign attribution.
Ransom demand and payment figures are sourced from public disclosures, SEC filings, and investigative reporting. Where exact amounts are not publicly available, we note the source's confidence level. Incidents with unverified ransom amounts are excluded from aggregate totals.
Questions about our data?
If you notice an error or have questions about our methodology, we want to hear from you.
Contact us