DAS-C01 Amazon-Web-Services Exam Questions and Free Practice Test

Question 8

An IoT company wants to release a new device that will collect data to track sleep overnight on an intelligent mattress. Sensors will send data that will be uploaded to an Amazon S3 bucket. About 2 MB of data is generated each night for each bed. Data must be processed and summarized for each user, and the results need to be available as soon as possible. Part of the process consists of time windowing and other functions. Based on tests with a Python script, every run will require about 1 GB of memory and will complete within a couple of minutes.
Which solution will run the script in the MOST cost-effective way?

A. AWS Lambda with a Python script

B. AWS Glue with a Scala job

C. Amazon EMR with an Apache Spark script

D. AWS Glue with a PySpark job

Correct Answer: A
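A workload of about 2 MB of input, 1 GB of memory, and a couple of minutes per run fits within a single Lambda invocation. The following is a minimal sketch of what such a function might look like, not the company's actual script. The 5-minute tumbling window, the JSON layout of the upload (an array of [timestamp, value] pairs), and the helper names `summarize_windows` and `fetch_readings` are all assumptions made for illustration.

```python
import json
from collections import defaultdict
from statistics import mean
from urllib.parse import unquote_plus

WINDOW_SECONDS = 300  # assumed 5-minute tumbling window


def summarize_windows(readings, window_seconds=WINDOW_SECONDS):
    """Average (timestamp, value) pairs per tumbling window."""
    buckets = defaultdict(list)
    for timestamp, value in readings:
        # Round each timestamp down to the start of its window.
        window_start = int(timestamp // window_seconds) * window_seconds
        buckets[window_start].append(value)
    return {start: mean(values) for start, values in sorted(buckets.items())}


def fetch_readings(bucket, key):
    """Download one upload; assumes a JSON array of [timestamp, value] pairs."""
    import boto3  # imported lazily so the windowing logic can be tested offline

    body = boto3.client("s3").get_object(Bucket=bucket, Key=key)["Body"].read()
    return json.loads(body)


def lambda_handler(event, context):
    """Entry point for an S3 object-created notification."""
    record = event["Records"][0]["s3"]
    bucket = record["bucket"]["name"]
    key = unquote_plus(record["object"]["key"])  # object keys arrive URL-encoded
    return summarize_windows(fetch_readings(bucket, key))
```

Triggering the function from the S3 upload event means each bed's data is summarized as soon as it lands, and cost accrues only for the minutes each run takes.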

Question 9

A media analytics company consumes a stream of social media posts. The posts are sent to an Amazon Kinesis data stream partitioned on user_id. An AWS Lambda function retrieves the records and validates the content before loading the posts into an Amazon Elasticsearch cluster. The validation process needs to receive the posts for a given user in the order they were received. A data analyst has noticed that, during peak hours, the social media platform posts take more than an hour to appear in the Elasticsearch cluster.
What should the data analyst do to reduce this latency?

A. Migrate the validation process to Amazon Kinesis Data Firehose.

B. Migrate the Lambda consumers from standard data stream iterators to an HTTP/2 stream consumer.

C. Increase the number of shards in the stream.

D. Configure multiple Lambda functions to process the stream.

Correct Answer: C
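The ordering requirement rests on how Kinesis assigns records to shards: the partition key is hashed with MD5 to a 128-bit integer, and each shard owns a range of that hash space. The sketch below simulates this mapping, assuming the shards split the hash space evenly (real streams can have uneven ranges after resharding). The function name `shard_for` is hypothetical.

```python
import hashlib

MAX_HASH = 2**128  # size of the MD5 hash space


def shard_for(partition_key: str, shard_count: int) -> int:
    """Return the shard index for a key, assuming evenly split hash ranges."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    hash_value = int.from_bytes(digest, "big")
    return hash_value * shard_count // MAX_HASH


# Every post from one user lands on the same shard, so per-user order is kept,
# while different users spread across shards that can be consumed in parallel.
shards_used = {shard_for(f"user-{i}", 8) for i in range(1000)}
```

Because a given user_id always maps to one shard, adding shards raises the number of parallel consumers without interleaving a single user's posts.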

Question 10

A company is hosting an enterprise reporting solution with Amazon Redshift. The application provides reporting capabilities to three main groups: an executive group to access financial reports, a data analyst group to run long-running ad-hoc queries, and a data engineering group to run stored procedures and ETL processes. The executive team requires queries to run with optimal performance. The data engineering team expects queries to take minutes.
Which Amazon Redshift feature meets the requirements for this task?

A. Concurrency scaling

B. Short query acceleration (SQA)

C. Workload management (WLM)

D. Materialized views

Correct Answer: C
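For reference, workload management (option C) separates groups like these into queues with their own concurrency and memory shares. A manual WLM setup is supplied to Redshift as a JSON list through the `wlm_json_configuration` parameter. The sketch below builds such a list. The user group names and the specific concurrency and memory numbers are assumptions for illustration, not recommended values.

```python
import json

# Hypothetical Redshift user groups; one queue per group plus a default queue.
wlm_queues = [
    {"user_group": ["executives"], "query_concurrency": 5, "memory_percent_to_use": 40},
    {"user_group": ["data_engineers"], "query_concurrency": 3, "memory_percent_to_use": 30},
    {"user_group": ["analysts"], "query_concurrency": 2, "memory_percent_to_use": 20},
    # Queue with no user or query group: catches everything else (listed last).
    {"query_concurrency": 2, "memory_percent_to_use": 10},
]

# Serialized form that would be set as the parameter value.
wlm_json_configuration = json.dumps(wlm_queues)
```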

Question 11

A company is sending historical datasets to Amazon S3 for storage. A data engineer at the company wants to make these datasets available for analysis using Amazon Athena. The engineer also wants to encrypt the Athena query results in an S3 results location by using AWS solutions for encryption. The requirements for encrypting the query results are as follows:
Use custom keys for encryption of the primary dataset query results.
Use generic encryption for all other query results.
Provide an audit trail for the primary dataset queries that shows when the keys were used and by whom.
Which solution meets these requirements?

A. Use server-side encryption with S3 managed encryption keys (SSE-S3) for the primary dataset. Use SSE-S3 for the other datasets.

B. Use server-side encryption with customer-provided encryption keys (SSE-C) for the primary dataset. Use server-side encryption with S3 managed encryption keys (SSE-S3) for the other datasets.

C. Use server-side encryption with AWS KMS managed customer master keys (SSE-KMS CMKs) for the primary dataset. Use server-side encryption with S3 managed encryption keys (SSE-S3) for the other datasets.

D. Use client-side encryption with AWS Key Management Service (AWS KMS) customer managed keys for the primary dataset. Use S3 client-side encryption with client-side keys for the other datasets.

Correct Answer: C
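The stated requirements (custom keys and an audit trail for the primary dataset, generic encryption for everything else) map onto the result encryption settings that Athena accepts per query or per workgroup. Use of a KMS key is recorded in AWS CloudTrail, which is what provides the "when and by whom" trail. The sketch below builds the two `ResultConfiguration` dictionaries. The bucket paths, the key ARN, and the helper name `result_configuration` are placeholders assumed for illustration.

```python
def result_configuration(output_location, kms_key_arn=None):
    """Build an Athena ResultConfiguration: SSE-KMS if a key is given, else SSE-S3."""
    if kms_key_arn:
        encryption = {"EncryptionOption": "SSE_KMS", "KmsKey": kms_key_arn}
    else:
        encryption = {"EncryptionOption": "SSE_S3"}
    return {"OutputLocation": output_location, "EncryptionConfiguration": encryption}


# Placeholder values, not real resources.
primary = result_configuration(
    "s3://example-results/primary/",
    "arn:aws:kms:us-east-1:111122223333:key/EXAMPLE",
)
other = result_configuration("s3://example-results/other/")
```

Either dictionary could be passed as the `ResultConfiguration` argument of the boto3 Athena client's `start_query_execution` call.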

Question 12

A company wants to provide its data analysts with uninterrupted access to the data in its Amazon Redshift cluster. All data is streamed to an Amazon S3 bucket with Amazon Kinesis Data Firehose. An AWS Glue job that is scheduled to run every 5 minutes issues a COPY command to move the data into Amazon Redshift.
The amount of data delivered is uneven throughout the day, and cluster utilization is high during certain periods. The COPY command usually completes within a couple of seconds. However, when a load spike occurs, locks can exist and data can be missed. Currently, the AWS Glue job is configured to run without retries, with the timeout at 5 minutes and concurrency at 1.
How should a data analytics specialist configure the AWS Glue job to optimize fault tolerance and improve data availability in the Amazon Redshift cluster?

A. Increase the number of retries. Decrease the timeout value. Increase the job concurrency.

B. Keep the number of retries at 0. Decrease the timeout value. Increase the job concurrency.

C. Keep the number of retries at 0. Decrease the timeout value. Keep the job concurrency at 1.

D. Keep the number of retries at 0. Increase the timeout value. Keep the job concurrency at 1.

Correct Answer: B
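The three settings that the options vary correspond to fields on a Glue job definition: `MaxRetries`, `Timeout` (in minutes), and `ExecutionProperty.MaxConcurrentRuns`. The sketch below expresses the current configuration from the question and one altered configuration as fragments of a job update. The helper name `glue_job_settings` and the candidate values of 2 are assumptions for illustration. A real update call also needs fields such as the job's role and command.

```python
def glue_job_settings(max_retries, timeout_minutes, max_concurrent_runs):
    """Return the Glue job fields that control retries, timeout, and concurrency."""
    return {
        "MaxRetries": max_retries,
        "Timeout": timeout_minutes,  # minutes
        "ExecutionProperty": {"MaxConcurrentRuns": max_concurrent_runs},
    }


# Configuration described in the question: no retries, 5-minute timeout, one run at a time.
current = glue_job_settings(max_retries=0, timeout_minutes=5, max_concurrent_runs=1)

# Hypothetical candidate: shorter timeout and a second concurrent run allowed.
candidate = glue_job_settings(max_retries=0, timeout_minutes=2, max_concurrent_runs=2)
```

With a 5-minute schedule and a 5-minute timeout at concurrency 1, a run blocked by a lock can still be active when the next run is due, which is the situation the options try to avoid.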
