Data Management

5 minutes 5 Questions

Data management in Amazon EMR refers to the processes of ingesting, storing, processing, and exporting data from your cluster. EMR provides multiple storage options, such as HDFS (Hadoop Distributed File System) for storing data locally on the instances, Amazon S3 for long-term, cost-effective stor…

Test mode:
AWS Certified Solutions Architect - Data Management Example Questions

Test your knowledge of Data Management

Question 1

A data analyst needs to access a 20 TB dataset on Amazon S3 infrequently and has a limited budget. The analyst does not need to access the entire dataset at once but requires fast access to subsets of data. Which solution should the company implement?

Question 2

A company will store sensitive customer records as objects in Amazon S3. The company requires encryption at rest with centrally managed keys, fine-grained access controls, and auditability of key usage. Which solution should the company implement?

Question 3

A company runs its ETL pipeline on a single Amazon EC2 instance. The jobs generate large intermediate datasets that are heavily read and written during processing. The intermediate data does not need to persist if the instance is stopped or terminated. The final results are written to Amazon S3. Which storage option should the company use for the temporary staging area?

More Data Management questions
15 questions (total)