
DP-203 Exam Dumps - Data Engineering on Microsoft Azure

Question # 17

You have an Azure Data Factory pipeline that is triggered hourly.

The pipeline has had 100% success for the past seven days.

The pipeline execution fails, and two retries that occur 15 minutes apart also fail. The third failure returns the following error.

What is a possible cause of the error?

A.

The parameter used to generate year=2021/month=01/day=10/hour=06 was incorrect.

B.

From 06:00 to 07:00 on January 10, 2021, there was no data in wwi/BIKES/CARBON.

C.

From 06:00 to 07:00 on January 10, 2021, the file format of data in wwi/BIKES/CARBON was incorrect.

D.

The pipeline was triggered too early.

Question # 18

You have an Azure subscription that contains an Azure Synapse Analytics serverless SQL pool. You run the following query in the pool.

For each of the following statements, select Yes if the statement is true. Otherwise, select No. NOTE: Each correct selection is worth one point.

Question # 19

You are developing an application that uses Azure Data Lake Storage Gen 2.

You need to recommend a solution to grant permissions to a specific application for a limited time period.

What should you include in the recommendation?

A.

Azure Active Directory (Azure AD) identities

B.

shared access signatures (SAS)

C.

account keys

D.

role assignments

Question # 20

You have an Azure Synapse Analytics dedicated SQL pool named pool1.

You plan to implement a star schema in pool1 and create a new table named DimCustomer by using the following code.

You need to ensure that DimCustomer has the necessary columns to support a Type 2 slowly changing dimension (SCD). Which two columns should you add? Each correct answer presents part of the solution. NOTE: Each correct selection is worth one point.

A.

[HistoricalSalesPerson] [nvarchar] (256) NOT NULL

B.

[EffectiveEndDate] [datetime] NOT NULL

C.

[PreviousModifiedDate] [datetime] NOT NULL

D.

[RowID] [bigint] NOT NULL

E.

[EffectiveStartDate] [datetime] NOT NULL
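
A Type 2 SCD preserves history by closing out the current row and inserting a new version, so the dimension table needs a validity interval on every row. A minimal sketch of the added columns (the rest of the DimCustomer definition is assumed, since the original CREATE TABLE code is not reproduced here):

```sql
-- Illustrative only: validity-interval columns that let DimCustomer
-- hold multiple historical versions of the same customer (Type 2 SCD).
-- A dedicated SQL pool requires a default or a staged reload to add
-- NOT NULL columns to a populated table.
ALTER TABLE dbo.DimCustomer ADD
    [EffectiveStartDate] [datetime] NOT NULL,  -- when this version became current
    [EffectiveEndDate]   [datetime] NOT NULL;  -- when this version was superseded
```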

Question # 21

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.

After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You are designing an Azure Stream Analytics solution that will analyze Twitter data.

You need to count the tweets in each 10-second window. The solution must ensure that each tweet is counted only once.

Solution: You use a hopping window that uses a hop size of 10 seconds and a window size of 10 seconds.

Does this meet the goal?

A.

Yes

B.

No
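
For context: a hopping window whose hop size equals its window size produces contiguous, non-overlapping windows, so each event falls into exactly one window. A hedged query sketch (the input name TwitterStream and the CreatedAt timestamp column are assumptions, not from the question):

```sql
SELECT System.Timestamp() AS WindowEnd, COUNT(*) AS TweetCount
FROM TwitterStream TIMESTAMP BY CreatedAt
GROUP BY HoppingWindow(second, 10, 10)  -- window size 10 s, hop 10 s
-- With hop = window size this is equivalent to:
-- GROUP BY TumblingWindow(second, 10)
```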

Question # 22

You have an Azure Databricks workspace that contains a Delta Lake dimension table named Table1. Table1 is a Type 2 slowly changing dimension (SCD) table. You need to apply updates from a source table to Table1. Which Apache Spark SQL operation should you use?

A.

CREATE

B.

UPDATE

C.

MERGE

D.

ALTER
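
For context, Delta Lake's MERGE statement can match incoming source rows against the current dimension rows and apply conditional updates and inserts in one atomic operation. A simplified sketch, assuming hypothetical column names (CustomerId, City, IsCurrent, EndDate) that the question does not specify:

```sql
-- Illustrative Type 2 SCD upsert against Table1; column names are
-- placeholders. A complete Type 2 load would also insert the new
-- version of each changed key (often via a staged union), which this
-- sketch omits for brevity.
MERGE INTO Table1 AS tgt
USING updates AS src
ON tgt.CustomerId = src.CustomerId AND tgt.IsCurrent = true
WHEN MATCHED AND tgt.City <> src.City THEN
  UPDATE SET tgt.IsCurrent = false, tgt.EndDate = current_date()
WHEN NOT MATCHED THEN
  INSERT (CustomerId, City, IsCurrent, EndDate)
  VALUES (src.CustomerId, src.City, true, null)
```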

Question # 23

You are designing a folder structure for the files in an Azure Data Lake Storage Gen2 account. The account has one container that contains three years of data.

You need to recommend a folder structure that meets the following requirements:

• Supports partition elimination for queries by Azure Synapse Analytics serverless SQL pools

• Supports fast data retrieval for data from the current month

• Simplifies data security management by department

Which folder structure should you recommend?

A.

\YYYY\MM\DD\Department\DataSource\DataFile_YYYYMMDD.parquet

B.

\Department\DataSource\YYYY\MM\DataFile_YYYYMMDD.parquet

C.

\DD\MM\YYYY\Department\DataSource\DataFile_DDMMYY.parquet

D.

\DataSource\Department\YYYYMM\DataFile_YYYYMMDD.parquet
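
For context, a serverless SQL pool can eliminate partitions when the date folders appear as wildcards in the BULK path and the query filters on them with filepath(). A hedged sketch (storage account, container, and folder names are placeholders):

```sql
-- filepath(n) returns the value matched by the n-th wildcard in the
-- BULK path, so filtering on it lets the serverless pool skip whole
-- folders (partition elimination).
SELECT TOP 10 *
FROM OPENROWSET(
        BULK 'https://account.dfs.core.windows.net/container/Department/DataSource/*/*/*.parquet',
        FORMAT = 'PARQUET') AS r
WHERE r.filepath(1) = '2021'   -- year folder
  AND r.filepath(2) = '01';    -- month folder
```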

Question # 24

You have an Azure data factory named ADM that contains a pipeline named Pipeline1.

Pipeline1 must execute every 30 minutes with a 15-minute offset.

You need to create a trigger for Pipeline1. The trigger must meet the following requirements:

• Backfill data from the beginning of the day to the current time.

• If Pipeline1 fails, ensure that the pipeline can re-execute within the same 30-minute period.

• Ensure that only one concurrent pipeline execution can occur.

• Minimize development and configuration effort.

Which type of trigger should you create?

A.

schedule

B.

event-based

C.

manual

D.

tumbling window
