A data engineer has a Job that has a complex run schedule, and they want to transfer that schedule to other Jobs.
Rather than manually selecting each value in the scheduling form in Databricks, which of the following tools can the data engineer use to represent and submit the schedule programmatically?
Correct Answer:D
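Cron syntax lets a schedule be written once as text and reused across Jobs. As a minimal sketch, assuming a placeholder job ID and an illustrative schedule, a Quartz cron expression can be submitted in the `schedule` block of a Jobs API `update` request:

```json
{
  "job_id": 123,
  "new_settings": {
    "schedule": {
      "quartz_cron_expression": "0 30 7 * * ?",
      "timezone_id": "America/Los_Angeles",
      "pause_status": "UNPAUSED"
    }
  }
}
```

The same `schedule` block can then be posted against any other job ID, which avoids re-entering the values in the scheduling form.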
Which of the following is a benefit of the Databricks Lakehouse Platform embracing open source technologies?
Correct Answer:E
https://double.cloud/blog/posts/2023/01/break-free-from-vendor-lock-in-with-open-source-tech/
Which of the following must be specified when creating a new Delta Live Tables pipeline?
Correct Answer:E
https://docs.databricks.com/en/delta-live-tables/tutorial-pipelines.html
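A new Delta Live Tables pipeline must point at at least one source code library. As a sketch of the pipeline settings JSON, with the pipeline name, notebook path, and target schema as illustrative placeholders:

```json
{
  "name": "example-pipeline",
  "libraries": [
    {
      "notebook": {
        "path": "/Repos/team/dlt/ingest_notebook"
      }
    }
  ],
  "target": "example_schema"
}
```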
A Delta Live Table pipeline includes two datasets defined using STREAMING LIVE TABLE. Three datasets are defined against Delta Lake table sources using LIVE TABLE.
The pipeline is configured to run in Development mode using Continuous Pipeline Mode.
Assuming previously unprocessed data exists and all definitions are valid, what is the expected outcome after clicking Start to update the pipeline?
Correct Answer:E
You can optimize pipeline execution by switching between development and production modes. Use the environment toggle buttons in the Pipelines UI to switch between these two modes. By default, pipelines run in development mode.
When you run your pipeline in development mode, the Delta Live Tables system does the following:
Reuses a cluster to avoid the overhead of restarts. By default, clusters run for two hours when development mode is enabled. You can change this with the pipelines.clusterShutdown.delay setting in the pipeline's compute configuration.
Disables pipeline retries so you can immediately detect and fix errors.
In production mode, the Delta Live Tables system does the following:
Restarts the cluster for specific recoverable errors, including memory leaks and stale credentials.
Retries execution in the event of specific errors, for example, a failure to start a cluster.
https://docs.databricks.com/en/delta-live-tables/updates.html#optimize-execution
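The cluster shutdown delay can be set in the pipeline's JSON settings. As a sketch, assuming an example value of 60 minutes:

```json
{
  "configuration": {
    "pipelines.clusterShutdown.delay": "60m"
  }
}
```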
A data engineer is attempting to drop a Spark SQL table my_table and runs the following command:
DROP TABLE IF EXISTS my_table;
After running this command, the engineer notices that the data files and metadata files have been deleted from the file system.
Which of the following describes why all of these files were deleted?
Correct Answer:A
For managed tables, both the data files and the metadata are managed by the metastore, so both are deleted when the table is dropped. For external tables, the data is stored in an external location; dropping an external table removes only the metadata, while the data files remain.
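The difference can be sketched with two Spark SQL examples; the table names and storage path below are illustrative placeholders:

```sql
-- Managed table: the metastore owns both the metadata and the data files.
CREATE TABLE managed_table (id INT, name STRING);
DROP TABLE IF EXISTS managed_table;   -- metadata AND data files are deleted

-- External table: data lives at a caller-supplied location.
CREATE TABLE external_table (id INT, name STRING)
LOCATION 's3://example-bucket/path/to/data';
DROP TABLE IF EXISTS external_table;  -- only metadata is removed; data files remain
```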
A data engineer is designing a data pipeline. The source system generates files in a shared directory that is also used by other processes. As a result, the files should be kept as is and will accumulate in the directory. The data engineer needs to identify which files are new since the previous run in the pipeline, and set up the pipeline to only ingest those new files with each run.
Which of the following tools can the data engineer use to solve this problem?
Correct Answer:E
Auto Loader incrementally and efficiently processes new data files as they arrive in cloud storage without any additional setup.
https://docs.databricks.com/en/ingestion/auto-loader/index.html
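A minimal Auto Loader sketch in PySpark, runnable only on a Databricks cluster; the source directory, schema/checkpoint locations, and target table name are placeholders:

```python
# Incrementally ingest only new files with Auto Loader (the cloudFiles source).
# Already-processed files are tracked via the checkpoint, so existing files in
# the shared directory are left as is and only new arrivals are read.
df = (spark.readStream
      .format("cloudFiles")
      .option("cloudFiles.format", "json")
      .option("cloudFiles.schemaLocation", "/tmp/schema")  # where inferred schema is stored
      .load("/mnt/shared/source_dir"))

(df.writeStream
   .option("checkpointLocation", "/tmp/checkpoint")  # records which files were processed
   .trigger(availableNow=True)                       # process new files, then stop
   .toTable("bronze.ingested_files"))
```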