Databricks-Certified-Data-Engineer-Associate Databricks Certified Data Engineer Associate Exam Questions and Answers

Questions 4

Which of the following approaches should be used to send the Databricks Job owner an email in the case that the Job fails?

Options:

Manually programming in an alert system in each cell of the Notebook

Setting up an Alert in the Job page

Setting up an Alert in the Notebook

There is no way to notify the Job owner in the case of Job failure

MLflow Model Registry Webhooks

Questions 5

Which query is performing a streaming hop from raw data to a Bronze table?

Databricks-Certified-Data-Engineer-Associate Question 5

Options:

Option A

Option B

Option C

Option D

Questions 6

In which of the following scenarios should a data engineer select a Task in the Depends On field of a new Databricks Job Task?

Options:

When another task needs to be replaced by the new task

When another task needs to fail before the new task begins

When another task has the same dependency libraries as the new task

When another task needs to use as little compute resources as possible

When another task needs to successfully complete before the new task begins
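As a rough illustration of what a task dependency expresses, the sketch below models three tasks in the shape of a job's task list, where each task names its upstream tasks under `depends_on`, and derives a valid run order with Python's standard `graphlib`. The task keys (`ingest`, `clean`, `report`) are invented for the example, and the ordering logic is only a stand-in for the scheduler, not Databricks code.

```python
from graphlib import TopologicalSorter

# Hypothetical tasks, shaped like the "tasks" array of a job definition.
tasks = [
    {"task_key": "ingest"},
    {"task_key": "clean", "depends_on": [{"task_key": "ingest"}]},
    {"task_key": "report", "depends_on": [{"task_key": "clean"}]},
]


def run_order(task_list):
    """Return task keys in an order where every upstream task comes first."""
    graph = {
        t["task_key"]: {d["task_key"] for d in t.get("depends_on", [])}
        for t in task_list
    }
    return list(TopologicalSorter(graph).static_order())


print(run_order(tasks))
```

A task with no `depends_on` entry has no upstream requirement, so in this model it is free to start first.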

Questions 7

Which of the following is hosted completely in the control plane of the classic Databricks architecture?

Options:

Worker node

JDBC data source

Databricks web application

Databricks Filesystem

Driver node

Questions 8

Which of the following commands will return the number of null values in the member_id column?

Options:

SELECT count(member_id) FROM my_table;

SELECT count(member_id) - count_null(member_id) FROM my_table;

SELECT count_if(member_id IS NULL) FROM my_table;

SELECT null(member_id) FROM my_table;

SELECT count_null(member_id) FROM my_table;
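The behaviour these options hinge on is that `count(column)` skips NULLs while `count(*)` does not. The sketch below checks this with Python's built-in `sqlite3`, used here only as a stand-in engine so the example runs anywhere. `count_if` is a Spark SQL function and is not available in SQLite, so a `sum(CASE ...)` plays the role of the conditional count. The sample rows are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE my_table (member_id INTEGER)")
# Made-up sample data: two of the five rows have a NULL member_id.
conn.executemany(
    "INSERT INTO my_table VALUES (?)", [(1,), (None,), (3,), (None,), (5,)]
)

# count(column) ignores NULLs; count(*) counts every row.
non_null = conn.execute("SELECT count(member_id) FROM my_table").fetchone()[0]
total = conn.execute("SELECT count(*) FROM my_table").fetchone()[0]
# Conditional count, analogous in spirit to count_if(member_id IS NULL).
nulls = conn.execute(
    "SELECT sum(CASE WHEN member_id IS NULL THEN 1 ELSE 0 END) FROM my_table"
).fetchone()[0]

print(non_null, total, nulls)
```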

Questions 9

A data engineer is maintaining a data pipeline. Upon data ingestion, the data engineer notices that the source data is starting to have a lower level of quality. The data engineer would like to automate the process of monitoring the quality level.

Which of the following tools can the data engineer use to solve this problem?

Options:

Unity Catalog

Data Explorer

Delta Lake

Delta Live Tables

Auto Loader

Questions 10

What is stored in a Databricks customer's cloud account?

Options:

Data

Cluster management metadata

Databricks web application

Notebooks

Questions 11

A dataset has been defined using Delta Live Tables and includes an expectations clause:

CONSTRAINT valid_timestamp EXPECT (timestamp > '2020-01-01') ON VIOLATION FAIL UPDATE

What is the expected behavior when a batch of data containing data that violates these constraints is processed?

Options:

Records that violate the expectation cause the job to fail.

Records that violate the expectation are added to the target dataset and flagged as invalid in a field added to the target dataset.

Records that violate the expectation are dropped from the target dataset and recorded as invalid in the event log.

Records that violate the expectation are added to the target dataset and recorded as invalid in the event log.
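Delta Live Tables expectations can be declared with no `ON VIOLATION` clause, with `ON VIOLATION DROP ROW`, or with `ON VIOLATION FAIL UPDATE`. The plain-Python toy model below contrasts the three policies on a made-up batch. It does not use the DLT API: the function, exception class, and sample rows are all hypothetical and only mimic the documented behaviour.

```python
class UpdateFailed(Exception):
    """Raised by this toy model when a FAIL UPDATE expectation is violated."""


def apply_expectation(rows, predicate, on_violation=None):
    """Toy model of an expectation; returns (rows_written, violation_count)."""
    bad = [r for r in rows if not predicate(r)]
    if on_violation == "FAIL UPDATE" and bad:
        # The update is aborted instead of writing the batch.
        raise UpdateFailed(f"{len(bad)} record(s) violated the expectation")
    if on_violation == "DROP ROW":
        # Invalid rows are filtered out; the count stands in for logged metrics.
        return [r for r in rows if predicate(r)], len(bad)
    # No ON VIOLATION clause: keep every row and only count the violations.
    return list(rows), len(bad)


# Made-up batch; ISO-8601 date strings compare correctly as plain strings.
batch = [{"timestamp": "2021-06-01"}, {"timestamp": "2019-12-31"}]


def valid_timestamp(row):
    return row["timestamp"] > "2020-01-01"
```

Calling `apply_expectation(batch, valid_timestamp, "FAIL UPDATE")` raises `UpdateFailed` in this model, whereas the other two policies return a row list and a violation count.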

Questions 12

A data analyst has a series of queries in a SQL program. The data analyst wants this program to run every day. They only want the final query in the program to run on Sundays. They ask for help from the data engineering team to complete this task.

Which of the following approaches could be used by the data engineering team to complete this task?

Options:

They could submit a feature request with Databricks to add this functionality.

They could wrap the queries using PySpark and use Python’s control flow system to determine when to run the final query.

They could only run the entire program on Sundays.

They could automatically restrict access to the source table in the final query so that it is only accessible on Sundays.

They could redesign the data model to separate the data used in the final query into a new table.
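To make the idea of day-dependent control flow concrete, here is a minimal sketch in plain Python. The query strings are placeholders for the analyst's SQL, and `queries_to_run` is a hypothetical helper, not part of any Databricks API. In a notebook each selected statement could then be passed to `spark.sql`.

```python
from datetime import date


def queries_to_run(queries, today):
    """Return all queries on Sundays, and all but the last on other days."""
    # date.weekday() numbers Monday as 0, so Sunday is 6.
    is_sunday = today.weekday() == 6
    return list(queries) if is_sunday else list(queries[:-1])


# Hypothetical statements standing in for the analyst's SQL program.
program = [
    "SELECT 1 AS step_one",
    "SELECT 2 AS step_two",
    "SELECT 3 AS weekly_rollup",
]

for statement in queries_to_run(program, date.today()):
    print(statement)  # in a notebook this could be spark.sql(statement)
```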

Questions 13

A data architect has determined that a table of the following format is necessary:

Databricks-Certified-Data-Engineer-Associate Question 13

Which of the following code blocks uses SQL DDL commands to create an empty Delta table in the above format regardless of whether a table already exists with this name?

Databricks-Certified-Data-Engineer-Associate Question 13

Options:

Option A

Option B

Option C

Option D

Option E

Questions 14

A data engineer has been given a new record of data:

id STRING = 'a1'

rank INTEGER = 6

rating FLOAT = 9.4

Which of the following SQL commands can be used to append the new record to an existing Delta table my_table?

Options:

INSERT INTO my_table VALUES ('a1', 6, 9.4)

my_table UNION VALUES ('a1', 6, 9.4)

INSERT VALUES ( 'a1' , 6, 9.4) INTO my_table

UPDATE my_table VALUES ('a1', 6, 9.4)

UPDATE VALUES ('a1', 6, 9.4) my_table
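For readers who want to see an append in action, the sketch below runs standard `INSERT INTO ... VALUES` syntax against an in-memory SQLite table. SQLite is an assumption made so the example is runnable without a cluster, and it is not a Delta table. The pre-existing row is made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Column types mirror the record in the question; the starting row is made up.
conn.execute('CREATE TABLE my_table (id TEXT, "rank" INTEGER, rating REAL)')
conn.execute("INSERT INTO my_table VALUES ('z9', 1, 5.0)")

# Append the new record; existing rows are left untouched.
conn.execute("INSERT INTO my_table VALUES ('a1', 6, 9.4)")

rows = conn.execute("SELECT * FROM my_table ORDER BY rowid").fetchall()
print(rows)
```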

Questions 15

A new data engineering team, team, has been assigned to an ELT project. The new data engineering team will need full privileges on the database customers to fully manage the project.

Which of the following commands can be used to grant full permissions on the database to the new data engineering team?

Options:

GRANT USAGE ON DATABASE customers TO team;

GRANT ALL PRIVILEGES ON DATABASE team TO customers;

GRANT SELECT PRIVILEGES ON DATABASE customers TO teams;

GRANT SELECT CREATE MODIFY USAGE PRIVILEGES ON DATABASE customers TO team;

GRANT ALL PRIVILEGES ON DATABASE customers TO team;

Questions 16

A data analysis team has noticed that their Databricks SQL queries are running too slowly when connected to their always-on SQL endpoint. They claim that this issue is present when many members of the team are running small queries simultaneously. They ask the data engineering team for help. The data engineering team notices that each of the team’s queries uses the same SQL endpoint.

Which of the following approaches can the data engineering team use to improve the latency of the team’s queries?

Options:

They can increase the cluster size of the SQL endpoint.

They can increase the maximum bound of the SQL endpoint’s scaling range.

They can turn on the Auto Stop feature for the SQL endpoint.

They can turn on the Serverless feature for the SQL endpoint.

They can turn on the Serverless feature for the SQL endpoint and change the Spot Instance Policy to “Reliability Optimized.”

Questions 17

A data engineer has configured a Structured Streaming job to read from a table, manipulate the data, and then perform a streaming write into a new table.

The code block used by the data engineer is below:

Databricks-Certified-Data-Engineer-Associate Question 17

Which line of code should the data engineer use to fill in the blank if the data engineer only wants the query to execute a micro-batch to process data every 5 seconds?

Options:

trigger("5 seconds")

trigger(continuous="5 seconds")

trigger(once="5 seconds")

trigger(processingTime="5 seconds")
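The question's own code block is not reproduced in this listing, so the following is only a sketch of where a trigger setting sits in a PySpark streaming write. The function name, target table, and checkpoint path are placeholders, and `df` is assumed to be a streaming DataFrame created elsewhere (for example with `spark.readStream.table(...)`).

```python
def start_fixed_interval_stream(df, target_table, checkpoint_path):
    """Start a streaming write that fires a micro-batch on a fixed interval.

    `df` is assumed to be a streaming DataFrame; the other arguments are
    placeholder names chosen for this sketch.
    """
    return (
        df.writeStream
        .outputMode("append")
        .option("checkpointLocation", checkpoint_path)
        # processingTime sets the interval between micro-batches.
        .trigger(processingTime="5 seconds")
        .toTable(target_table)
    )
```

The call returns a streaming query handle that keeps running until it is stopped, which is the usual behaviour of a fixed-interval trigger.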

Questions 18

Which of the following is stored in the Databricks customer's cloud account?

Options:

Databricks web application

Cluster management metadata

Repos

Data

Notebooks

Questions 19

Which of the following Structured Streaming queries is performing a hop from a Silver table to a Gold table?

Options:

Databricks-Certified-Data-Engineer-Associate Question 19 Option 1

Questions 20

A data engineer has a Python variable table_name that they would like to use in a SQL query. They want to construct a Python code block that will run the query using table_name.

They have the following incomplete code block:

____(f"SELECT customer_id, spend FROM {table_name}")

Which of the following can be used to fill in the blank to successfully complete the task?

Options:

spark.delta.sql

spark.delta.table

spark.table

dbutils.sql

spark.sql
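The interpolation step itself can be checked without a cluster. In the sketch below the value of `table_name` is a made-up example, and the commented-out line assumes a Databricks notebook where a `spark` session is already defined.

```python
table_name = "sales"  # hypothetical table name

# The f-string substitutes the variable before the query text reaches Spark.
query = f"SELECT customer_id, spend FROM {table_name}"
print(query)

# In a Databricks notebook, where `spark` is predefined, the string could be run:
# df = spark.sql(query)
```

Because the table name is pasted into the SQL text as-is, it should come from a trusted source rather than from user input.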

Questions 21

Which of the following describes the type of workloads that are always compatible with Auto Loader?

Options:

Dashboard workloads

Streaming workloads

Machine learning workloads

Serverless workloads

Batch workloads

Questions 22

A data engineer needs access to a table new_table, but they do not have the correct permissions. They can ask the table owner for permission, but they do not know who the table owner is.

Which of the following approaches can be used to identify the owner of new_table?

Options:

Review the Permissions tab in the table's page in Data Explorer

All of these options can be used to identify the owner of the table

Review the Owner field in the table's page in Data Explorer

Review the Owner field in the table's page in the cloud storage solution

There is no way to identify the owner of the table

Questions 23

Which tool is used by Auto Loader to process data incrementally?

Options:

Spark Structured Streaming

Unity Catalog

Checkpointing

Databricks SQL

Questions 24

A data engineer has been using a Databricks SQL dashboard to monitor the cleanliness of the input data to a data analytics dashboard for a retail use case. The job has a Databricks SQL query that returns the number of store-level records where sales is equal to zero. The data engineer wants their entire team to be notified via a messaging webhook whenever this value is greater than 0.

Which of the following approaches can the data engineer use to notify their entire team via a messaging webhook whenever the number of stores with $0 in sales is greater than zero?

Options:

They can set up an Alert with a custom template.

They can set up an Alert with a new email alert destination.

They can set up an Alert with one-time notifications.

They can set up an Alert with a new webhook alert destination.

They can set up an Alert without notifications.

Questions 25

A data engineer has configured a Structured Streaming job to read from a table, manipulate the data, and then perform a streaming write into a new table.

The code block used by the data engineer is below:

Databricks-Certified-Data-Engineer-Associate Question 25

If the data engineer only wants the query to process all of the available data in as many batches as required, which of the following lines of code should the data engineer use to fill in the blank?

Options:

processingTime(1)

trigger(availableNow=True)

trigger(parallelBatch=True)

trigger(processingTime="once")

trigger(continuous="once")
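Since the question's code block is an image that is not reproduced here, the sketch below only illustrates how a run-until-caught-up streaming write might look in PySpark. The function name, target table, and checkpoint path are placeholders, `df` is assumed to be a streaming DataFrame created elsewhere, and `availableNow` requires a Spark version that supports it (3.3 or later).

```python
def backfill_available_data(df, target_table, checkpoint_path):
    """Process everything currently available, then let the query stop.

    `df` is assumed to be a streaming DataFrame; the other arguments are
    placeholder names chosen for this sketch.
    """
    query = (
        df.writeStream
        .option("checkpointLocation", checkpoint_path)
        # Consume the existing backlog, possibly over several micro-batches.
        .trigger(availableNow=True)
        .toTable(target_table)
    )
    # Block until the backlog has been processed and the query has stopped.
    query.awaitTermination()
    return query
```

Unlike an interval-based trigger, a query started this way ends on its own, so `awaitTermination()` returns once the backlog has been written.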

Questions 26

Which of the following tools is used by Auto Loader to process data incrementally?

Options:

Checkpointing

Spark Structured Streaming

Data Explorer

Unity Catalog

Databricks SQL

Questions 27

An engineering manager wants to monitor the performance of a recent project using a Databricks SQL query. For the first week following the project’s release, the manager wants the query results to be updated every minute. However, the manager is concerned that the compute resources used for the query will be left running and cost the organization a lot of money beyond the first week of the project’s release.

Which of the following approaches can the engineering team use to ensure the query does not cost the organization any money beyond the first week of the project’s release?

Options:

They can set a limit to the number of DBUs that are consumed by the SQL Endpoint.

They can set the query’s refresh schedule to end after a certain number of refreshes.

They cannot ensure the query does not cost the organization money beyond the first week of the project’s release.

They can set a limit to the number of individuals that are able to manage the query’s refresh schedule.

They can set the query’s refresh schedule to end on a certain date in the query scheduler.

Questions 28

Which of the following Git operations must be performed outside of Databricks Repos?

Options:

Commit

Pull

Push

Clone

Merge

Questions 29

A data engineer needs to apply custom logic to string column city in table stores for a specific use case. In order to apply this custom logic at scale, the data engineer wants to create a SQL user-defined function (UDF).

Which of the following code blocks creates this SQL UDF?

Options:

Databricks-Certified-Data-Engineer-Associate Question 29 Option 1

Questions 30

Which of the following describes the storage organization of a Delta table?

Options:

Delta tables are stored in a single file that contains data, history, metadata, and other attributes.

Delta tables store their data in a single file and all metadata in a collection of files in a separate location.

Delta tables are stored in a collection of files that contain data, history, metadata, and other attributes.

Delta tables are stored in a collection of files that contain only the data stored within the table.

Delta tables are stored in a single file that contains only the data stored within the table.

Questions 31

A data engineer needs access to a table new_table, but they do not have the correct permissions. They can ask the table owner for permission, but they do not know who the table owner is.

Which approach can be used to identify the owner of new_table?

Options:

There is no way to identify the owner of the table

Review the Owner field in the table's page in the cloud storage solution

Review the Permissions tab in the table's page in Data Explorer

Review the Owner field in the table’s page in Data Explorer

Questions 32

A data engineer wants to create a new table containing the names of customers that live in France.

They have written the following command:

Databricks-Certified-Data-Engineer-Associate Question 32

A senior data engineer mentions that it is organization policy to include a table property indicating that the new table includes personally identifiable information (PII).

Which of the following lines of code fills in the above blank to successfully complete the task?

Options:

There is no way to indicate whether a table contains PII.

"COMMENT PII"

TBLPROPERTIES PII

COMMENT "Contains PII"

PII

Exam Code: Databricks-Certified-Data-Engineer-Associate

Exam Name: Databricks Certified Data Engineer Associate Exam

Last Update: Jul 25, 2025

Questions: 108
