What is the legal definition of personal data under GDPR?

Article 4(1) of the GDPR defines personal data as any information relating to an identified or identifiable natural person (the 'data subject'). A person is identifiable if they can be identified, directly or indirectly, by reference to an identifier such as a name, identification number, location data, online identifier, or factors specific to their physical, physiological, genetic, mental, economic, cultural, or social identity.

Is an IP address personal data under GDPR?

Yes. The CJEU confirmed in the Breyer case (C-582/14) that even dynamic IP addresses can constitute personal data when the controller has lawful means to identify the person behind the address. Static IP addresses are generally considered personal data without qualification.

What are special categories of personal data under GDPR?

Article 9 of the GDPR identifies special categories as: racial or ethnic origin, political opinions, religious or philosophical beliefs, trade union membership, genetic data, biometric data (when used for identification), health data, and data concerning a person's sex life or sexual orientation. Processing these categories is prohibited unless a specific exemption under Article 9(2) applies.

Does GDPR apply to business contact information?

Yes. A business email address like jane.smith@company.com is personal data because it identifies a natural person. The GDPR applies to all personal data regardless of the context in which it was collected , professional or personal. The only exception is genuinely generic addresses like info@company.com that do not identify an individual.

How do you map personal data across multiple subsidiaries?

Start with a consistent data taxonomy that defines personal data categories identically across all entities. Then map each processing activity to those categories at the subsidiary level, identifying data flows between entities and to third parties. Tools like Priverion automate this with cross-entity data mapping that gives group DPOs a single, unified view , ensuring the same employee health record is classified consistently in every jurisdiction.

Can aggregated or statistical data be personal data?

It depends on whether individuals can be singled out from the aggregated dataset. If a statistical report covers a group of 3 employees in a department, the data may still be identifiable. True anonymization requires that no individual can be reasonably re-identified from the dataset, even when combined with other available information.

GDPR Essentials Guide

What Is Personal Data Under GDPR , And Why Getting It Wrong Puts Your Entire Compliance Program at Risk

Q: What is the difference between pseudonymized and anonymized data under GDPR?

Pseudonymized data is still personal data under GDPR because the individual can be re-identified using separately held additional information. Anonymized data , where re-identification is irreversible and not reasonably possible , falls outside the scope of GDPR entirely. The critical test is whether any party, using all means reasonably likely to be used, could re-identify the individual.

Updated 2026-06-22

Key Takeaways: Personal data under GDPR includes any information that can directly or indirectly identify a living individual — misclassification risks fines up to €20 million.

Every GDPR obligation , from Records of Processing Activities to Data Protection Impact Assessments , starts with one question: are you processing personal data? Misclassifying data doesn't just create legal exposure; it means your ROPA is incomplete, your DPIAs miss critical risks, and your data subject access requests return the wrong records.

This guide gives you the definitive breakdown, with practical examples and a downloadable checklist built for organizations managing compliance across multiple entities and jurisdictions.

Download the Personal Data Classification Checklist

Free PDF. No credit card. Just your work email.

Trusted by 50+ privacy teams across 14 countries

Healthcare

Aviation

Energy

Legal

Technology

The Legal Definition: Article 4(1) GDPR

The GDPR casts a deliberately wide net. Article 4(1) defines personal data as:

"Any information relating to an identified or identifiable natural person ('data subject'); an identifiable natural person is one who can be identified, directly or indirectly, in particular by reference to an identifier such as a name, an identification number, location data, an online identifier or to one or more factors specific to the physical, physiological, genetic, mental, economic, cultural or social identity of that natural person."
Article 4(1), Regulation (EU) 2016/679

Four elements make this definition work , and each one matters for how you classify data across your organization:

"Any information" , no limit on format. Text, numbers, photos, audio recordings, biometric templates, metadata, and behavioral patterns all qualify.
"Relating to" , the information must concern the individual, either by content (it describes them), purpose (it's used to evaluate them), or result (its processing impacts them).
"Identified or identifiable" , the person doesn't need to be named. If you can single them out by combining data points, that's enough.
"Natural person" . GDPR protects living individuals, not companies. But data about a sole trader or a named employee is personal data.

The practical consequence: if there's a reasonable possibility that anyone , not just you, but any party with access , could link data back to an individual, it's personal data under GDPR. This is why classification errors cascade through your entire compliance program.

Real-World Examples: What Counts and What Doesn't

The boundary between personal data and non-personal data is less obvious than most organizations assume. Here's how common data types classify under GDPR, based on regulatory guidance and CJEU case law:

Data Type	Personal Data?	Why
Full name	Yes	Directly identifies an individual
Email: [email protected]	Yes	Identifies a natural person by name
Email: [email protected]	No	Generic address, no identifiable person
Dynamic IP address	Yes	CJEU Breyer ruling (C-582/14): identifiable with ISP records
Cookie ID / device fingerprint	Yes	Online identifier under Recital 30; singles out a user
Employee ID number	Yes	Identification number linked to a specific person
GPS location data	Yes	Tracks an individual's movements; location identifier
Salary data linked to role	Yes	Combined with department/role, identifies the individual
Anonymized survey results	It depends	Only non-personal if re-identification is not reasonably possible
Pseudonymized customer records	Yes	Re-identification possible with the separately held key
CCTV footage	Yes	Images of identifiable individuals
Aggregated stats (large dataset)	No	Only if individuals cannot be singled out from the aggregate
Genetic test results	Yes (special category)	Article 9 special category , genetic data
Trade union membership	Yes (special category)	Article 9 special category , explicitly listed

The "Mosaic Effect" Trap

Many organizations classify individual data points in isolation , an employee number here, a department code there , and conclude they're not personal data. But GDPR looks at identifiability through combination. When your HR system, payroll platform, and access control logs can be cross-referenced, data that seems anonymous in one system becomes personal data in the aggregate. This is exactly the gap that cross-entity data mapping is designed to close.

Special Categories: Article 9 Data Requires Extra Protection

GDPR treats certain types of personal data as inherently high-risk. Article 9 prohibits processing these categories unless a specific legal exemption applies , and the penalties for getting it wrong are proportionally higher.

The special categories are:

Racial or ethnic origin , includes nationality fields in HR systems if they reveal ethnicity
Political opinions , political party donations, voter registration data
Religious or philosophical beliefs , dietary preference fields that reveal religion (e.g., "halal" or "kosher" in catering systems)
Trade union membership , payroll deductions for union dues
Genetic data . DNA test results, hereditary information
Biometric data (when used for identification) , fingerprint scans, facial recognition templates. Note: biometric data used for authentication (unlocking a phone) may not trigger Article 9 in all interpretations, but fingerprint access logs certainly do
Health data , sick leave records, disability accommodations, occupational health assessments, insurance claims. This is the most commonly misclassified category in enterprise environments
Sex life or sexual orientation . HR diversity monitoring fields, beneficiary designations that reveal partner gender

Where Multi-Entity Organizations Get Caught

The most common failure we see in organizations managing privacy across multiple subsidiaries: health data classification inconsistency. A sick leave record is classified as "standard HR data" in one subsidiary and "Article 9 health data" in another. When a supervisory authority audits the group, the inconsistency itself becomes evidence of inadequate governance. This is why a unified data taxonomy across all entities isn't optional . it's the foundation of defensible compliance.

Legal Bases for Processing Special Categories

Processing special category data requires both a legal basis under Article 6 and a separate exemption under Article 9(2). The most commonly relied-upon exemptions include:

Explicit consent (Article 9(2)(a)) , must be freely given, specific, informed, and unambiguous. Implied consent is never sufficient.
Employment law obligations (Article 9(2)(b)) , processing necessary for carrying out obligations under employment and social security law
Vital interests (Article 9(2)(c)) , emergency medical situations where the data subject cannot consent
Substantial public interest (Article 9(2)(g)) , must be proportionate and have safeguards

Pseudonymized vs. Anonymized Data: The Critical Distinction

This is where more compliance programs go wrong than almost anywhere else. The distinction determines whether GDPR applies at all , and the line is far less clear than most organizations assume.

Pseudonymized Data = Still Personal Data

Pseudonymization replaces direct identifiers with artificial ones (tokens, codes, hashes) while keeping the re-identification key separate. GDPR explicitly defines pseudonymization in Article 4(5) and treats pseudonymized data as personal data throughout.

Why? Because re-identification is possible. The key exists somewhere. As long as any party , you, a processor, a data recipient, or an attacker with reasonable effort , could reconnect the pseudonym to the individual, it remains personal data subject to all GDPR obligations.

Pseudonymization is a security measure, not an exemption. It can reduce risk (and GDPR recognizes it as a safeguard in Articles 25 and 32), but it doesn't remove the data from GDPR's scope.

Anonymized Data = Outside GDPR Scope

Truly anonymized data , where re-identification is irreversible and not reasonably possible by any party using any means reasonably likely to be used , falls outside GDPR entirely (Recital 26).

The test is rigorous: you must consider all means "reasonably likely to be used" for re-identification, including future technological developments, cost of re-identification, and the availability of complementary datasets. The European Data Protection Board (EDPB) has set a high bar, and supervisory authorities have consistently found that datasets organizations believed were anonymous were, in fact, pseudonymous.

Practical Implication for Your ROPA

If your Records of Processing Activities exclude datasets on the assumption they're "anonymized," verify that assumption with a documented re-identification risk assessment. If you're wrong, those datasets should have been in your ROPA all along , and every processing activity involving them has been undocumented. Priverion's ROPA management includes data classification workflows that flag exactly this kind of gap across all group entities.

Criminal Conviction Data: Article 10

Data relating to criminal convictions and offences gets its own rule under Article 10. It's not a special category under Article 9, but processing is restricted to official authority or when authorized by EU or Member State law with appropriate safeguards.

For employers: background checks, criminal record disclosures, and even noting that an employee has a clean record all fall under Article 10. If your subsidiaries in different jurisdictions handle pre-employment screening differently, inconsistent treatment of Article 10 data is a common audit finding.

Children's Data: Enhanced Protections Under Article 8

When processing children's personal data based on consent for information society services, GDPR requires parental consent for children under 16 (though Member States can lower this to 13). The controller must make reasonable efforts to verify that consent is given by the holder of parental responsibility.

If your organization processes data from minors , educational platforms, family benefit programs, youth services , this adds a classification layer that your data mapping must reflect.

Mapping Personal Data Across a Multi-Entity Organization

Understanding the definition is the starting point. The real challenge for organizations with multiple subsidiaries is applying that definition consistently across every entity, every system, and every jurisdiction.

Here's the process that works , and what we've seen fail:

What Works: Centralized Taxonomy, Distributed Execution

Establish a group-wide data classification taxonomy , define personal data categories identically across all entities. "Health data" means the same thing in your Swiss subsidiary and your German one.
Map processing activities at the entity level , each subsidiary documents its own processing activities using the shared taxonomy. This ensures local accuracy with group-wide consistency.
Identify cross-entity data flows , where personal data moves between subsidiaries or to third parties, map those flows explicitly. Intra-group transfers still require a legal basis.
Automate recertification , when a classification changes (e.g., a new data type is added to HR systems), that change must propagate to every entity's ROPA. Manual email chains don't scale.
Review and audit regularly , data classification isn't a one-time exercise. New systems, new vendors, and regulatory guidance all change what counts as personal data.

What Fails: Decentralized Classification Without Oversight

When each subsidiary defines personal data independently , or when the group DPO has no visibility into subsidiary-level classifications , inconsistencies accumulate silently. The German subsidiary classifies dietary preferences as health data (correct under many DPA interpretations). The UK subsidiary classifies the same field as "standard employee data." The ROPA looks complete in both entities, but the group's compliance posture has a gap that any cross-border audit will find.

This is the exact problem that led to Priverion's founding: a 12-subsidiary enterprise managing GDPR compliance across 47 spreadsheets, with no consistent way to ensure the same data was classified the same way everywhere.

Key Product Capabilities

From Personal Data Mapping to Audit-Ready Evidence , Without the Spreadsheet Chaos

Correctly classifying personal data is only the starting point. What matters is how that classification flows through your entire compliance program , across every subsidiary, every processing activity, every jurisdiction.

ROPA Management

Automated Recertification Across Every Group Entity

When personal data classifications change , or new data types emerge , your Records of Processing Activities need to reflect it everywhere, not just in the subsidiary that noticed. Priverion automates ROPA recertification across all entities, so a classification update in one subsidiary propagates group-wide without manual chasing.

100% recertification rate

AXA achieved full automated ROPA recertification across all entities

DPIA / TIA Automation

AI-Assisted Impact Assessments That Flag What You Missed

Special category data hiding in HR systems. Pseudonymized datasets that still qualify as personal data. AI-assisted drafting and risk scoring surfaces the classifications that matter most , then your team reviews and decides. Every output stays within Swiss infrastructure, and no customer data is used for model training.

200+ hours saved

Medtec saved 200+ hours in ISO 27001 preparation using Priverion

Cross-Entity Data Mapping

One Consistent View of Personal Data Across All Subsidiaries

The same employee health record classified as "sensitive" in Germany and "standard" in another subsidiary creates the exact gap a supervisory authority will find. Cross-entity data mapping gives your group DPO a single, consistent taxonomy , so personal data is classified the same way everywhere, and your compliance posture holds under scrutiny.

60% less admin time

Aircraft manufacturer reduced compliance admin time by 60% in the first 6 months

Vendor Risk Assessments

Know Exactly What Personal Data Your Third Parties Touch

Your vendors process personal data on your behalf , and under GDPR, you remain accountable for it. Priverion centralizes vendor risk assessments across all group entities, tracks data processing agreements, and flags gaps before your next audit surfaces them. No more scattered vendor spreadsheets across subsidiaries.

100% vendor coverage

Zurzach Care achieved 100% vendor risk assessment coverage with Priverion

DSR Handling

Respond to Data Subject Requests With Complete Records

When a data subject exercises their right to access, the response is only as good as your personal data inventory. If you misclassified it, it is missing from the response , and that is a compliance failure. Priverion connects DSR workflows to your cross-entity data map, so every request returns the complete picture within your 30-day window.

24/7 DPO support

Swiss Data Sovereignty

All Your Compliance Data Stays in Switzerland

In a post-Schrems II landscape, where your compliance platform stores data is itself a compliance question. Priverion is Swiss-built and Swiss-hosted . European data residency with all processing within Swiss infrastructure. Your personal data classifications, ROPA records, and DPIA assessments never leave a jurisdiction recognized by the EU as providing adequate protection.

Operational in weeks

Priverion customers go live in weeks, not months , verified across customer deployments

Download the Personal Data Classification Checklist

Free PDF. No credit card. Just your work email.

Measurable Results From Organizations Like Yours

200+

Hours saved on ROPA management

Medtec reclaimed 200+ hours during ISO 27001 preparation by replacing manual record-keeping with automated recertification workflows.

60%

Less compliance admin time

Aircraft manufacturer cut compliance admin time by 60% in 6 months , with predictable pricing based on entities, not per-user expansion traps.

3 mo

Ahead of schedule on ISO 27001

Medtec accelerated their ISO 27001 certification timeline by 3 months using Priverion's audit-ready evidence packages and automated documentation.

Priverion vs. OneTrust

Built for mid-market reality, not enterprise theater

You don't need a platform built for 50,000 employees and a dedicated compliance department of 30. You need one that works for your team of three managing privacy across a dozen subsidiaries.

Priverion

Swiss data sovereignty , guaranteed

All data processed and stored within Swiss infrastructure. In a post-Schrems II world, this isn't a marketing checkbox . it's legal certainty for cross-border transfers.

One platform, one price, everything included

ROPA, DPIA, vendor risk, DSR, incident management, AI Register , all in one. Pricing based on number of entities and org size, not per-user or per-module traps.

Operational in weeks, not quarters

No six-month implementation projects. No army of consultants. Aircraft manufacturer achieved 60% reduction in compliance admin time within their first six months.

, Aircraft manufacturer, first 6 months post-deployment

Built for group-wide complexity

Cross-entity data mapping, automated ROPA recertification across subsidiaries, and a single dashboard view for your entire group. AXA achieved 100% ROPA recertification , fully automated.

, AXA, automated recertification across all entities

AI-assisted, human-decided

AI drafts DPIAs, suggests risk scores, and maps regulations , but every output is reviewed by your team before it becomes a compliance record. No customer data is ever used for model training.

Typical enterprise GRC platform

US-hosted infrastructure

Data stored in US or multi-region cloud infrastructure. European data residency often available as an add-on tier , but not guaranteed by default. Cross-border transfer risk remains your problem.

What Is Personal Data Under GDPR , And Why Getting It Wrong Puts Your Entire Compliance Program at Risk

The Legal Definition: Article 4(1) GDPR

Real-World Examples: What Counts and What Doesn't

Special Categories: Article 9 Data Requires Extra Protection

Legal Bases for Processing Special Categories

Pseudonymized vs. Anonymized Data: The Critical Distinction

Pseudonymized Data = Still Personal Data

Anonymized Data = Outside GDPR Scope

Criminal Conviction Data: Article 10

Children's Data: Enhanced Protections Under Article 8

Mapping Personal Data Across a Multi-Entity Organization

What Works: Centralized Taxonomy, Distributed Execution

What Fails: Decentralized Classification Without Oversight

From Personal Data Mapping to Audit-Ready Evidence , Without the Spreadsheet Chaos

Automated Recertification Across Every Group Entity

AI-Assisted Impact Assessments That Flag What You Missed

One Consistent View of Personal Data Across All Subsidiaries

Know Exactly What Personal Data Your Third Parties Touch

Respond to Data Subject Requests With Complete Records

All Your Compliance Data Stays in Switzerland

Measurable Results From Organizations Like Yours

Built for mid-market reality, not enterprise theater

Swiss data sovereignty , guaranteed

One platform, one price, everything included

Operational in weeks, not quarters

Built for group-wide complexity

AI-assisted, human-decided

US-hosted infrastructure

Module

Key Takeaways — What Is Personal Data Under GDPR?

Definitions

What is personal data?

What is pseudonymization?

What is anonymization?

What are special categories of data?

Frequently Asked Questions

Is an IP address personal data under GDPR?

Does GDPR apply to pseudonymized data?

What is the difference between personal data and sensitive data under GDPR?

Can aggregated or statistical data be personal data?

What fines can result from misclassifying personal data?

How does the CJEU Breyer ruling affect data classification?

What is the mosaic effect in GDPR data classification?

Statistics and Regulatory Context