Data Store Management

Choosing optimal data stores, understanding data cataloging systems, managing data lifecycles, and designing data models with schema evolution on AWS.

This domain focuses on selecting and managing the right data storage solutions on AWS. It covers choosing storage services based on cost, performance, and access pattern requirements across Amazon Redshift, Amazon EMR, AWS Lake Formation, Amazon RDS, Amazon DynamoDB, and Amazon Kinesis. Candidates must understand data cataloging systems including the AWS Glue Data Catalog, Glue crawlers, schema discovery, partition synchronization, and business data catalogs with Amazon SageMaker Catalog. The domain also covers managing the lifecycle of data through S3 Lifecycle policies, storage tiering, data versioning, DynamoDB TTL, and data deletion for compliance. Additionally, it tests designing data models and schema evolution including schema design for Redshift and DynamoDB, schema conversion with AWS SCT and DMS, data lineage tracking, open table formats like Apache Iceberg, vector database concepts (HNSW, IVF), and optimization techniques such as indexing, partitioning, and compression. (26% of exam)
5 minutes 5 Questions

Data Store Management is a critical domain in the AWS Certified Data Engineer - Associate certification that focuses on designing, implementing, and maintaining various data storage solutions on AWS. It encompasses selecting appropriate data stores based on requirements such as performance, cost, s…

Concepts covered: Data Versioning and TTL Management, Amazon Redshift Architecture and Access Patterns, Amazon RDS and Relational Database Selection, Open Table Formats with Apache Iceberg, Data Migration with AWS Transfer Family, AWS Glue Data Catalog and Crawlers, S3 Lifecycle Policies and Storage Tiering, Schema Conversion and Evolution with AWS SCT and DMS, Choosing Storage Services for Cost and Performance, Amazon DynamoDB for NoSQL Data Storage, Data Lakes with Lake Formation and Amazon S3, Vector Databases and Indexing (HNSW, IVF), Federated Queries and Materialized Views in Redshift, Schema Discovery and Partition Synchronization, Data Models and Schema Design for Redshift and DynamoDB, Data Lineage and Optimization Techniques

Test mode:
More Data Store Management questions
720 questions (total)