Summer Special Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: scxmas70

Databricks-Certified-Data-Analyst-Associate Exam Dumps - Databricks Certified Data Analyst Associate Exam

Question # 4

A data analyst has been asked to produce a visualization that shows the flow of users through a website.

Which of the following is used for visualizing this type of flow?

A.

Heatmap

B.

IChoropleth

C.

Word Cloud

D.

Pivot Table

E.

Sankey

Full Access
Question # 5

A data analyst has created a Query in Databricks SQL, and now they want to create two data visualizations from that Query and add both of those data visualizations to the same Databricks SQL Dashboard.

Which of the following steps will they need to take when creating and adding both data visualizations to the Databricks SQL Dashboard?

A.

They will need to alter the Query to return two separate sets of results.

B.

They will need to add two separate visualizations to the dashboard based on the same Query.

C.

They will need to create two separate dashboards.

D.

They will need to decide on a single data visualization to add to the dashboard.

E.

They will need to copy the Query and create one data visualization per query.

Full Access
Question # 6

A data analyst has a managed table table_name in database database_name. They would now like to remove the table from the database and all of the data files associated with the table. The rest of the tables in the database must continue to exist.

Which of the following commands can the analyst use to complete the task without producing an error?

A.

DROP DATABASE database_name;

B.

DROP TABLE database_name.table_name;

C.

DELETE TABLE database_name.table_name;

D.

DELETE TABLE table_name FROM database_name;

E.

DROP TABLE table_name FROM database_name;

Full Access
Question # 7

A data engineering team has created a Structured Streaming pipeline that processes data in micro-batches and populates gold-level tables. The microbatches are triggered every minute.

A data analyst has created a dashboard based on this gold-level data. The project stakeholders want to see the results in the dashboard updated within one minute or less of new data becoming available within the gold-level tables.

Which of the following cautions should the data analyst share prior to setting up the dashboard to complete this task?

A.

The required compute resources could be costly

B.

The gold-level tables are not appropriately clean for business reporting

C.

The streaming data is not an appropriate data source for a dashboard

D.

The streaming cluster is not fault tolerant

E.

The dashboard cannot be refreshed that quickly

Full Access
Question # 8

Data professionals with varying titles use the Databricks SQL service as the primary touchpoint with the Databricks Lakehouse Platform. However, some users will use other services like Databricks Machine Learning or Databricks Data Science and Engineering.

Which of the following roles uses Databricks SQL as a secondary service while primarily using one of the other services?

A.

Business analyst

B.

SQL analyst

C.

Data engineer

D.

Business intelligence analyst

E.

Data analyst

Full Access
Question # 9

How can a data analyst determine if query results were pulled from the cache?

A.

Go to the Query History tab and click on the text of the query. The slideout shows if the results came from the cache.

B.

Go to the Alerts tab and check the Cache Status alert.

C.

Go to the Queries tab and click on Cache Status. The status will be green if the results from the last run came from the cache.

D.

Go to the SQL Warehouse (formerly SQL Endpoints) tab and click on Cache. The Cache file will show the contents of the cache.

E.

Go to the Data tab and click Last Query. The details of the query will show if the results came from the cache.

Full Access
Question # 10

A data analyst has been asked to count the number of customers in each region and has written the following query:

If there is a mistake in the query, which of the following describes the mistake?

A.

The query is using count('). which will count all the customers in the customers table, no matter the region.

B.

The query is missing a GROUP BY region clause.

C.

The query is using ORDER BY. which is not allowed in an aggregation.

D.

There are no mistakes in the query.

E.

The query is selecting region but region should only occur in the ORDER BY clause.

Full Access
Question # 11

A data analyst has been asked to use the below tablesales_tableto get the percentage rank of products within region by the sales:

The result of the query should look like this:

Which of the following queries will accomplish this task?

A)

B)

C)

D)

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Full Access
Question # 12

A data analysis team is working with the table_bronze SQL table as a source for one of its most complex projects. A stakeholder of the project notices that some of the downstream data is duplicative. The analysis team identifies table_bronze as the source of the duplication.

Which of the following queries can be used to deduplicate the data from table_bronze and write it to a new table table_silver?

A)

CREATE TABLE table­_silver AS

SELECT DISTINCT *

FROM table_bronze;

B)

CREATE TABLE table_silver AS

INSERT *

FROM table_bronze;

C)

CREATE TABLE table_silver AS

MERGE DEDUPLICATE *

FROM table_bronze;

D)

INSERT INTO TABLE table_silver

SELECT * FROM table_bronze;

E)

INSERT OVERWRITE TABLE table_silver

SELECT * FROM table_bronze;

A.

Option A

B.

Option B

C.

Option C

D.

Option D

E.

Option E

Full Access
Question # 13

A data analyst creates a Databricks SQL Query where the result set has the following schema:

region STRING

number_of_customer INT

When the analyst clicks on the "Add visualization" button on the SQL Editor page, which of the following types of visualizations will be selected by default?

A.

Violin Chart

B.

Line Chart

C.

IBar Chart

D.

Histogram

E.

There is no default. The user must choose a visualization type.

Full Access