Data sensitivity classification is a critical component of AWS security architecture that involves categorizing data based on its confidentiality requirements and potential impact if compromised. This systematic approach enables organizations to apply appropriate security controls proportional to the data's sensitivity level.
AWS solutions architects typically implement a tiered classification system with levels such as Public, Internal, Confidential, and Restricted. Public data requires minimal protection, while Restricted data demands the most stringent security measures including encryption at rest and in transit, strict access controls, and comprehensive audit logging.
In AWS environments, data classification influences several architectural decisions. Amazon Macie provides automated discovery and classification of sensitive data stored in S3 buckets, using machine learning to identify personally identifiable information (PII), financial data, and credentials. AWS Config rules can enforce compliance policies based on classification tags, ensuring resources handling sensitive data maintain required configurations.
For continuous improvement, architects should implement tagging strategies that reflect data sensitivity levels across all AWS resources. These tags integrate with AWS Identity and Access Management (IAM) policies to enforce attribute-based access control (ABAC), restricting access based on data classification rather than resource-specific permissions.
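As an illustration of the ABAC pattern described above, here is a minimal IAM policy sketch, assuming a hypothetical tag key named classification: object reads are allowed only when the caller's principal tag matches the object's tag. The condition keys aws:PrincipalTag and s3:ExistingObjectTag are real IAM condition keys; the tag key and value scheme are assumptions.

```python
import json

# ABAC policy sketch (assumed tag key "classification"): allow S3 object
# reads only when the caller's principal tag matches the object's tag.
abac_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "AllowReadWhenClassificationMatches",
            "Effect": "Allow",
            "Action": ["s3:GetObject"],
            "Resource": "*",
            "Condition": {
                "StringEquals": {
                    "s3:ExistingObjectTag/classification":
                        "${aws:PrincipalTag/classification}"
                }
            },
        }
    ],
}

print(json.dumps(abac_policy, indent=2))
```

Because the policy compares two tags rather than naming resources, new buckets and objects inherit the correct access rules as soon as they are tagged, with no policy changes.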
Encryption strategies should align with classification levels. AWS Key Management Service (KMS) enables different key policies for various sensitivity tiers, with customer-managed keys providing enhanced control for highly sensitive data. Cross-account access patterns and data residency requirements become more restrictive as sensitivity increases.
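To make the tiered key-policy idea concrete, here is a sketch of a customer-managed KMS key policy for a highly sensitive tier. The account ID and role name are hypothetical; the structure (a key-administration statement plus a narrow key-usage statement) follows the standard KMS key policy shape.

```python
import json

ACCOUNT_ID = "111122223333"  # hypothetical account ID

# Key policy sketch for a Restricted-tier customer-managed key: account
# admins manage the key, but only a dedicated data-owner role may use it
# for cryptographic operations.
restricted_key_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "KeyAdministration",
            "Effect": "Allow",
            "Principal": {"AWS": f"arn:aws:iam::{ACCOUNT_ID}:root"},
            "Action": "kms:*",
            "Resource": "*",
        },
        {
            "Sid": "RestrictedTierUseOnly",
            "Effect": "Allow",
            "Principal": {
                "AWS": f"arn:aws:iam::{ACCOUNT_ID}:role/restricted-data-owner"
            },
            "Action": ["kms:Encrypt", "kms:Decrypt", "kms:GenerateDataKey"],
            "Resource": "*",
        },
    ],
}

print(json.dumps(restricted_key_policy, indent=2))
```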
Monitoring and auditing requirements also scale with classification levels. AWS CloudTrail logs, Amazon CloudWatch alerts, and AWS Security Hub findings should be configured to provide heightened visibility for sensitive data access patterns. Regular classification reviews ensure evolving business requirements and regulatory changes are reflected in the security posture.
Effective data classification reduces costs by avoiding over-protection of non-sensitive data while ensuring critical information receives adequate safeguards, balancing security investments with actual risk levels.
Data Sensitivity Classification for AWS Solutions Architect Professional
Why Data Sensitivity Classification is Important
Data sensitivity classification is a foundational element of cloud security and compliance. In AWS environments, understanding how to classify data helps organizations:
• Meet regulatory requirements (GDPR, HIPAA, PCI-DSS, SOC 2)
• Apply appropriate security controls based on data value and risk
• Optimize costs by avoiding over-protection of non-sensitive data
• Enable proper access control and encryption strategies
• Facilitate incident response by knowing what data was potentially exposed
What is Data Sensitivity Classification?
Data sensitivity classification is the process of categorizing data based on its level of sensitivity and the impact that unauthorized disclosure, alteration, or destruction would have on the organization. Common classification levels include:
Public: Information that can be freely shared (marketing materials, public documentation)
Internal: Data meant for internal use only but would cause minimal harm if disclosed
Confidential: Sensitive business data requiring protection (financial records, employee data)
Restricted/Highly Confidential: Most sensitive data requiring maximum protection (PII, PHI, payment card data, trade secrets)
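The tiers above can be sketched as a simple lookup from classification level to baseline controls. The specific control names are illustrative assumptions, not an AWS-defined schema; the fail-closed default for unknown labels is a common design choice.

```python
# Hypothetical mapping from classification level to baseline controls,
# mirroring the four tiers described above.
REQUIRED_CONTROLS = {
    "public":       {"encryption": "SSE-S3"},
    "internal":     {"encryption": "SSE-S3"},
    "confidential": {"encryption": "SSE-KMS",
                     "access": "IAM least privilege"},
    "restricted":   {"encryption": "SSE-KMS (customer-managed key)",
                     "access": "IAM least privilege",
                     "logging": "CloudTrail data events",
                     "network": "VPC endpoint only"},
}

def controls_for(classification: str) -> dict:
    """Return baseline controls for a classification level.

    Unknown labels fail closed by defaulting to the strictest tier.
    """
    return REQUIRED_CONTROLS.get(classification.lower(),
                                 REQUIRED_CONTROLS["restricted"])

print(controls_for("Confidential"))
```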
How Data Sensitivity Classification Works in AWS
1. Amazon Macie
Amazon Macie uses machine learning to automatically discover, classify, and protect sensitive data in S3. It can identify:
• Personally Identifiable Information (PII)
• Financial data
• Credentials and secrets
• Custom data identifiers you define
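Macie custom data identifiers are regex-based, so it helps to see the kind of pattern you would register. The employee-ID format below (EMP- followed by six digits) is a made-up example, shown here in plain Python to demonstrate what the pattern would match.

```python
import re

# Hypothetical internal employee-ID format such as "EMP-123456" -- the
# kind of regex you might supply as a Macie custom data identifier.
EMPLOYEE_ID = re.compile(r"\bEMP-\d{6}\b")

def find_sensitive(text: str) -> list[str]:
    """Return all matches of the custom identifier in a text sample."""
    return EMPLOYEE_ID.findall(text)

sample = "Payroll record for EMP-004217 approved by EMP-990001."
print(find_sensitive(sample))  # ['EMP-004217', 'EMP-990001']
```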
3. AWS Config Rules
Create custom Config rules to ensure resources containing sensitive data have appropriate controls:
• Encryption enabled
• Proper access policies
• Logging configured
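The heart of a custom Config rule is its evaluation logic. Here is a sketch of that logic as a pure function, with the resource configuration simplified to a plain dict and made-up field names; a real rule's Lambda function would parse the configuration item from the Config event and report the result via put_evaluations.

```python
# Evaluation-logic sketch for a custom Config rule: a bucket is
# COMPLIANT only when encryption, logging, and a classification tag
# are all present. Field names are simplified for illustration.
def evaluate_bucket(config: dict) -> str:
    checks = [
        config.get("encryption_enabled", False),
        config.get("logging_enabled", False),
        "classification" in config.get("tags", {}),
    ]
    return "COMPLIANT" if all(checks) else "NON_COMPLIANT"

good = {"encryption_enabled": True, "logging_enabled": True,
        "tags": {"classification": "confidential"}}
bad = {"encryption_enabled": True, "tags": {}}

print(evaluate_bucket(good))  # COMPLIANT
print(evaluate_bucket(bad))   # NON_COMPLIANT
```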
4. Data Protection Controls by Classification
For Restricted Data:
• Server-side encryption with AWS KMS CMK
• VPC endpoints for private access
• CloudTrail data event logging
• Cross-account access restrictions
• MFA delete enabled

For Confidential Data:
• Server-side encryption (SSE-S3 or SSE-KMS)
• IAM policies restricting access
• S3 bucket policies with conditions

For Internal/Public Data:
• Basic encryption at rest
• Standard IAM controls
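Several of the Restricted-tier controls above land in the bucket policy. The sketch below (hypothetical bucket name) denies any request made over plain HTTP and any upload that does not specify SSE-KMS; the condition keys aws:SecureTransport and s3:x-amz-server-side-encryption are real.

```python
import json

BUCKET = "restricted-data-bucket"  # hypothetical bucket name

# Restricted-tier bucket policy sketch: deny non-TLS requests and deny
# uploads that do not use SSE-KMS.
restricted_bucket_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "DenyInsecureTransport",
            "Effect": "Deny",
            "Principal": "*",
            "Action": "s3:*",
            "Resource": [f"arn:aws:s3:::{BUCKET}",
                         f"arn:aws:s3:::{BUCKET}/*"],
            "Condition": {"Bool": {"aws:SecureTransport": "false"}},
        },
        {
            "Sid": "DenyUnencryptedUploads",
            "Effect": "Deny",
            "Principal": "*",
            "Action": "s3:PutObject",
            "Resource": f"arn:aws:s3:::{BUCKET}/*",
            "Condition": {
                "StringNotEquals": {
                    "s3:x-amz-server-side-encryption": "aws:kms"
                }
            },
        },
    ],
}

print(json.dumps(restricted_bucket_policy, indent=2))
```

Explicit Deny statements like these cannot be overridden by any Allow, which is why they suit the strictest tier.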
5. AWS Lake Formation
For data lakes, Lake Formation provides fine-grained access control at the column and row level based on data classification.
Implementation Best Practices
• Automate classification using Macie and custom Lambda functions
• Integrate classification into CI/CD pipelines
• Use AWS Organizations SCPs to enforce controls based on tags
• Implement data lifecycle policies aligned with classification
• Audit regularly using AWS Security Hub and AWS Config
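The tag-enforcement practice above can be sketched as an SCP that blocks EC2 instance launches lacking a classification tag. The tag key is an assumption; aws:RequestTag is a real condition key, and this pattern only works for actions that support tagging on creation.

```python
import json

# SCP sketch: deny launching EC2 instances unless the request tags the
# instance with some "classification" value (assumed tag key).
require_classification_scp = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "DenyUntaggedInstanceLaunch",
            "Effect": "Deny",
            "Action": "ec2:RunInstances",
            "Resource": "arn:aws:ec2:*:*:instance/*",
            "Condition": {
                "Null": {"aws:RequestTag/classification": "true"}
            },
        }
    ],
}

print(json.dumps(require_classification_scp, indent=2))
```

The Null condition evaluates to true when the tag is absent from the request, so untagged launches are denied organization-wide.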
Exam Tips: Answering Questions on Data Sensitivity Classification
1. Think Macie First for S3: When questions mention discovering or classifying sensitive data in S3 buckets, Amazon Macie is typically the correct answer.
2. Match Controls to Classification: Questions often test whether you can select appropriate security controls. Higher classification requires stricter controls (KMS CMK vs SSE-S3, VPC endpoints vs public endpoints).
3. Consider Compliance Requirements: If a question mentions HIPAA, PCI-DSS, or GDPR, think about what classification level that data requires and corresponding controls.
4. Tag-Based Policies: Look for answers involving resource tags combined with IAM policies or SCPs when the scenario requires enforcing controls based on data classification.
5. Cost Optimization Angle: Some questions frame classification as a cost issue: avoid applying expensive controls (such as a CloudHSM-backed KMS custom key store) to non-sensitive data.
6. Data Discovery vs Protection: Distinguish between tools that discover/classify data (Macie) versus tools that protect data (KMS, IAM, bucket policies).
7. Multi-Account Scenarios: In questions involving multiple AWS accounts, consider how data classification affects cross-account access patterns and centralized governance.
8. Automation Keywords: When questions ask for automated or continuous classification, look for answers combining Macie, EventBridge, Lambda, and Config.
9. Lake Formation for Analytics: For data lake scenarios requiring column-level or row-level security based on classification, AWS Lake Formation is the preferred solution.
10. Read Carefully for Scale: Large-scale classification across thousands of buckets suggests Macie with organization-wide deployment rather than manual approaches.