Summer Certification Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code = getmirror

Pass the Microsoft Certified: Azure Databricks Data Engineer DP-750 Questions and answers with ExamsMirror

Practice at least 50% of the questions to maximize your chances of passing.
Exam DP-750 Premium Access

View all detail and faqs for the DP-750 exam


0 Students Passed

0% Average Score

0% Same Questions
Viewing page 1 out of 1 pages
Viewing questions 1-10 out of questions
Questions # 1:

You have an Azure Databricks workspace that is enabled for Unity Catalog and contains a managed Delta table named Table1. Table1 stores customer data.

You need to implement a data retention solution that meets the following requirements:

Deleted data must be retained for 30 days to support audits.

Deleted data that is older than 30 days must be removed permanently.

The solution must minimize administrative effort.

Which two properties should you configure? Each correct answer presents part of the solution.

NOTE: Each correct selection is worth one point.

Options:

A.

delta.timeUntilArchived

B.

delta.deletedFileRetentionDuration

C.

delta.autoOptimize.autoCompact

D.

delta.logRetentionDuration

E.

delta.enableDeletionVectors

Questions # 2:

You use Databricks Asset Bundles to manage two jobs and an app.

You need to deploy the bundle to development and production environments. The solution must meet the following requirements

• Deploy the app to both environments.

• Deploy only one job to development.

• Minimize administrative effort.

What should you use?

Options:

A.

a resources node in a databricks.yml file

B.

separate databricks.yml files for each environment

C.

a variables node in a databricks.yml file

D.

a targets node in a databricks.yml file

Questions # 3:

You need to deploy Databricks Asset Bundles to a development environment. The solution must support automated and repeatable deployments across environments.

What should you use?

Options:

A.

the Azure Developer CLI (azd)

B.

Git folders

C.

the Databricks CLI

D.

the Azure Command-Line Interface (CLI)

Questions # 4:

You have an Azure Databricks workspace named Workspace1. You create a compute cluster named Cluser1 that will be used to ingest data.

You need to install the required libraries on Cluster 1. The solution must use Unity Catalog for access control. What should you do?

Options:

A.

Create a custom dependency management script and run the script from a Databricks notebook.

B.

Install the libraries by using pip3.

C.

Install the libraries on Cluster1 and manually restart the cluster.

D.

Upload the libraries to Workspace1 and install the libraries on Cluster1.

Questions # 5:

You need to complete the PySpark code for the Spark Structured Streaming pipelines. The solution must meet the data ingestion and processing requirements.

How should you complete the code segment? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

Question # 5

Options:

Questions # 6:

Which ingestion option should you recommend for each data source? To answer, drag the appropriate options to the correct data sources. Each option may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.

Question # 6

Options:

Questions # 7:

You need to develop the task logic for a new job in Lakeflow Jobs that processes telemetry data.

Each task must contain only the appropriate logic for its step in the pipeline. The solution must support the planned changes and meet the data ingestion and processing requirements.

What should you do?

Options:

A.

Use a single Databricks notebook task that performs ingestion, cleansing, and curation in one script.

B.

Create three tasks that each contains the identical logic and use task retries.

C.

Use a single SQL task that performs ingestion, cleansing, and curation by running merge commands.

D.

Create separate tasks for ingestion, cleansing, and curation.

Questions # 8:

Which SCD type should you use to support the planned data modeling changes? To answer, drag the appropriate types to the correct issues. Each type may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.

Question # 8

Options:

Questions # 9:

You need to configure compute for the ingestion of telemetry data. The solution must meet the data ingestion and processing requirements.

What should you do?

Options:

A.

Enable Photon acceleration for a job compute cluster.

B.

Move the ingestion pipelines to shared compute.

C.

Increase an all-purpose cluster to a larger fixed node type.

D.

Disable autoscaling for a job compute cluster.

Viewing page 1 out of 1 pages
Viewing questions 1-10 out of questions
TOP CODES

TOP CODES

Top selling exam codes in the certification world, popular, in demand and updated to help you pass on the first try.