DA0-001 Exam Dumps - CompTIA Data+ Certification Exam

Question # 4

Which of the following is an example of a data-mining ETL tool?

A.

SSIS

B.

Stata

C.

SPSS

D.

Cognos

Question # 5

Which of the following is an example of a flat file?

A.

CSV file

B.

PDF file

C.

JSON file

D.

JPEG file

Question # 6

The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company’s year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?

A.

Q2 2020 and Q4 2019

B.

YTD 2020 and YTD 2019

C.

Q2 2020 and Q2 2019

D.

Q2 2020 and Q2 2021

Question # 7

Given the diagram below:

Which of the following data schemas is shown?

A.

Key-value pairs

B.

Online transactional processing

C.

Data Lake

D.

Relational database

Question # 8

Which of the following is a best practice when updating a legacy data source?

A.

Placing old data in new fields

B.

Keeping only the most recent data

C.

Creating a codebook to document field changes

D.

Removing the data source from production

Question # 9

An analyst needs to conduct a quick analysis. Which of the following is the FIRST step the analyst should perform with the data?

A.

Conduct an exploratory analysis and use descriptive statistics.

B.

Conduct a trend analysis and use a scatter chart.

C.

Conduct a link analysis and illustrate the connection points.

D.

Conduct an initial analysis and use a Pareto chart.

Question # 10

Joseph is interpreting a left-skewed distribution of test scores. Joe scored at the mean, Alfonso scored at the median, and Gaby scored at the end of the tail.

Who had the highest score?

A.

Joseph

B.

Joe

C.

Alfonso

D.

Gaby
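
The relationship between the mean, the median, and the tail in a left-skewed distribution can be checked numerically. The sketch below uses Python's standard `statistics` module on a made-up list of scores (the values are an assumption for illustration, not data from the question):

```python
import statistics

# Hypothetical test scores with a long tail of low values (left skew).
scores = [20, 55, 70, 80, 85, 88, 90, 92, 95]

mean_score = statistics.mean(scores)      # pulled toward the low tail
median_score = statistics.median(scores)  # middle value, less affected by the tail
tail_score = min(scores)                  # the end of the left tail

print(mean_score, median_score, tail_score)
```

In this sample the tail value is the lowest, the mean sits above it, and the median is higher still, which is the ordering typically seen in left-skewed data.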

Question # 11

Which of the following is a characteristic of a relational database?

A.

It utilizes key-value pairs.

B.

It has undefined fields.

C.

It is structured in nature.

D.

It uses minimal memory.

Question # 12

The process of performing initial investigations on data to spot outliers, discover patterns, and test assumptions with statistical insight and graphical visualization is called:

A.

a t-test.

B.

a performance analysis.

C.

an exploratory data analysis.

D.

a link analysis.

Question # 13

Given the table below:

Which of the following variables can be considered inconsistent, and how many distinct values should the variable have?

A.

Name, one

B.

Gender, two

C.

Level, three

D.

Code, four

E.

Region, five

Question # 14

A survey asks participants to rate a company on a scale of one to ten. Which of the following best describes the rating variable?

A.

Continuous

B.

Ordinal

C.

Categorical

D.

Nominal

Question # 15

An analyst must obtain the average daily sales for the following week:

Which of the following must the analyst perform to obtain this value?

A.

Data normalization

B.

Data append

C.

Data aggregation

D.

Data blending

Question # 16

An analyst has generated a report that includes the number of months in the first two quarters of 2019 when sales exceeded $50,000:

Which of the following functions did the analyst use to generate the data in the Sales_indicator column?

A.

Aggregate

B.

Logical

C.

Date

D.

Sort
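
The report table is not reproduced here, so the following is only a sketch of how a per-row indicator column like `Sales_indicator` could be derived with a conditional test. The monthly figures and the names `monthly_sales` and `THRESHOLD` are assumptions made for illustration:

```python
# Hypothetical monthly sales figures for January through June 2019.
monthly_sales = {
    "Jan": 42_000, "Feb": 51_500, "Mar": 49_999,
    "Apr": 63_200, "May": 50_000, "Jun": 58_750,
}
THRESHOLD = 50_000

# Conditional test per row: 1 when sales exceed the threshold, otherwise 0.
sales_indicator = {
    month: 1 if sales > THRESHOLD else 0
    for month, sales in monthly_sales.items()
}

# Counting the flagged months gives the figure the report summarizes.
months_over = sum(sales_indicator.values())
print(sales_indicator, months_over)
```

Note that a month with exactly $50,000 is not flagged, because the condition is "exceeded" rather than "reached".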

Question # 17

Which of the following concepts should be applied if a data set with 40 fields needs to be pared down to 20 fields and contains similar data across multiple fields?

A.

Duplication

B.

Consolidation

C.

Compliance

D.

Standardization

Question # 18

Given the following report:

Which of the following components need to be added to ensure the report is point-in-time and static? (Select two).

A.

A control group for the phrases

B.

A summary of the KPIs

C.

Filter buttons for the status

D.

The date when the report was last accessed

E.

The time period the report covers

F.

The date on which the report was run

Question # 19

A data analyst was asked to create a chart that shows the relationship between study hours and exam scores for each student using the data sets in the table below:

Which of the following charts would BEST represent the relationship between the variables?

A.

A histogram

B.

A scatter plot

C.

A heat map

D.

A bar chart

Question # 20

Analytics reports should follow corporate style guidelines.

A.

True.

B.

False.

Question # 21

Taylor wants to investigate how manufacturing, marketing, and sales expenditures impact overall profitability for her company.

Which of the following systems is the most appropriate?

A.

OLTP.

B.

OLAP.

C.

Data warehouse.

D.

Data mart.

Question # 22

Which of the following is the first step an analyst should perform upon receiving a business request for analysis?

A.

Determine the data needs and sources for analysis.

B.

Initiate the analysis for exploratory data analysis.

C.

Review the business questions to understand the scope.

D.

Finalize the methodology to solve the problem.

Question # 23

Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?

A.

Rephrase the business requirement.

B.

Determine the data necessary for the analysis.

C.

Build a mock dashboard/presentation layout.

D.

Perform exploratory data analysis.

Question # 24

During data profiling, an analyst decides to recode the status column in the following data set:

Which of the following data concerns explains why the analyst wants to take this action?

A.

Redundancy

B.

Duplication

C.

Invalidity

D.

Inconsistency

Question # 25

A data analyst has been asked to create an ad-hoc sales report for the Chief Executive Officer (CEO).

Which of the following should be included in the report?

A.

The sales representatives' home addresses.

B.

Line-item SKU numbers.

C.

YTD total sales.

D.

The customers' first and last names.

Question # 26

Which one of the following would not normally be considered a summary statistic?

A.

z-score.

B.

Mean.

C.

Variance.

D.

Standard deviation.
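
To make the distinction concrete, the sketch below computes the mean, variance, and standard deviation (one number each for the whole data set) alongside z-scores (one number per observation). The `values` list is a made-up example, and the population forms `pvariance` and `pstdev` are chosen here only for simplicity:

```python
import statistics

values = [4, 8, 6, 5, 7]  # hypothetical observations

# Each of these describes the data set as a whole.
mean = statistics.mean(values)
variance = statistics.pvariance(values)
stdev = statistics.pstdev(values)

# A z-score is computed for every individual observation.
z_scores = [(v - mean) / stdev for v in values]

print(mean, variance, round(stdev, 4))
print([round(z, 4) for z in z_scores])
```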

Question # 27

A Chief Executive Officer (CEO) is requesting more up-to-date sales data for improved visibility prior to month-end. An analyst must determine the frequency of a sales report that was previously distributed on an as-needed basis. Which of the following would be the most appropriate frequency for this report?

A.

Monthly

B.

Quarterly

C.

Weekly

D.

Every other month

Question # 28

Which of the following is an example of a discrete data type?

A.

8in (20cm)

B.

5 kids

C.

2.5mi (4km)

D.

10.7lbs (4.9kg)

Question # 29

A table in a hospital database has a column for patient height in inches and a column for patient height in centimeters. This is an example of:

A.

dependent data.

B.

duplicate data.

C.

invalid data.

D.

redundant data.

Question # 30

A data analyst has been asked to create a sales report that calculates the rolling 12-month average for sales. If the report will be published on November 1, 2020, which of the following months should the report cover?

A.

October 1, 2019 to October 31, 2020

B.

October 31, 2020 to November 1, 2021

C.

November 1, 2019 to October 31, 2020

D.

October 31, 2019 to October 31, 2020
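
The date arithmetic behind a rolling 12-month window can be sketched with Python's `datetime` module. The helper name `rolling_12_month_window` is hypothetical, and the sketch assumes the report covers the 12 complete months immediately before a publication date that falls on the first of a month:

```python
from datetime import date, timedelta


def rolling_12_month_window(publish_date: date) -> tuple[date, date]:
    """Return (start, end) of the 12 complete months before publish_date.

    Assumes publication on the first day of a month.
    """
    if publish_date.day != 1:
        raise ValueError("this sketch assumes publication on the first of a month")
    start = publish_date.replace(year=publish_date.year - 1)  # same month, prior year
    end = publish_date - timedelta(days=1)                    # last day of the prior month
    return start, end


start, end = rolling_12_month_window(date(2020, 11, 1))
print(start, end)
```

A window that starts one month earlier would span 13 months, and one that starts on the last day of a month would include a stray extra day.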

Question # 31

Which of the following summary statements upholds integrity in data reporting?

A.

Sales are approximately equal for Product A and Product B across all strategies.

B.

Strategy 4 provides the best sales in comparison to other strategies.

C.

While Strategy 2 does not result in the highest sales of Product D, over all products it appears to be the most effective.

D.

Product D should be promoted more than the other products in all strategies.

Question # 32

While reviewing survey data, a research analyst notices data is missing from all the responses to a single question. Which of the following methods would BEST address this issue?

A.

Replace missing data.

B.

Remove duplicate data.

C.

Replace redundant data.

D.

Remove invalid data.

Question # 33

Consider the following dataset which contains information about houses that are for sale:

Which of the following string manipulation commands will combine the address and region name columns to create a full address?

full_address
-------------------------
85 Turner St, Northern Metropolitan
25 Bloomburg St, Northern Metropolitan
5 Charles St, Northern Metropolitan
40 Federation La, Northern Metropolitan
55a Park St, Northern Metropolitan

A.

SELECT CONCAT(address, ' , ' , regionname) AS full_address FROM melb LIMIT 5;

B.

SELECT CONCAT(address, '-' , regionname) AS full_address FROM melb LIMIT 5;

C.

SELECT CONCAT(regionname, ' , ' , address) AS full_address FROM melb LIMIT 5;

D.

SELECT CONCAT(regionname, '-' , address) AS full_address FROM melb LIMIT 5;
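
String concatenation in a query can be tried out locally with Python's built-in `sqlite3` module. This is a sketch under assumptions: the in-memory `melb` table is rebuilt from two rows of the sample output above, and SQLite's `||` operator stands in for `CONCAT`, which older SQLite versions do not provide:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE melb (address TEXT, regionname TEXT)")
conn.executemany(
    "INSERT INTO melb VALUES (?, ?)",
    [
        ("85 Turner St", "Northern Metropolitan"),
        ("25 Bloomburg St", "Northern Metropolitan"),
    ],
)

# The order of the operands and the separator string decide the output shape.
rows = conn.execute(
    "SELECT address || ', ' || regionname AS full_address FROM melb LIMIT 5"
).fetchall()
full_addresses = [row[0] for row in rows]
print(full_addresses)
conn.close()
```

Swapping the operands or changing the separator to `'-'` changes every resulting string, which is how the four options differ.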

Question # 34

A financial analyst is creating a daily billing report for a company. One night, the company's data warehouse did not update the data, which caused the data to be reported incorrectly the next day. Which of the following documentation elements should the analyst add to catch this error?

A.

Version number

B.

Data refresh

C.

Frequently asked questions tab

D.

Summary

Question # 35

Emma is working in a data warehouse and finds that a finance fact table links to an organization dimension, which in turn links to a currency dimension that is not linked to the fact table.

What type of design pattern is the data warehouse using?

A.

Star.

B.

Sun.

C.

Snowflake.

D.

Comet.

Question # 36

An analyst is creating a resource to improve users' experience when they select specific records based on particular dates. Which of the following should the analyst use to create a resource that best meets user needs?

A.

Drop-down menu

B.

Date range

C.

Text field

D.

Frequency

Question # 37

Which of the following is the correct extension for a tab-delimited spreadsheet file?

A.

tap

B.

tar

C.

sv

D.

az

Question # 38

Which of the following best describes a business analytics tool with interactive visualization and business capabilities and an interface that is simple enough for end users to create their own reports and dashboards?

A.

Python

B.

R

C.

Microsoft Power BI

D.

SAS

Question # 39

The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company's year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?

A.

Q2 2020 and Q4 2019

B.

YTD 2020 and YTD 2019

C.

Q2 2020 and Q2 2019

D.

Q2 2020 and Q2 2021

Question # 40

Alex wants to use data from his corporate sales, CRM, and shipping systems to try to predict future sales.

Which of the following systems is the most appropriate?

Choose the best answer.

A.

Data mart.

B.

OLAP.

C.

Data Warehouse.

D.

OLTP.

Question # 41

Given the following data table:

Which of the following are appropriate reasons to undertake data cleansing? (Select two).

A.

Non-parametric data

B.

Missing data

C.

Duplicate data

D.

Invalid data

E.

Redundant data

F.

Normalized data

Question # 42

Consider this dataset showing the retirement age of 11 people, in whole years:

54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60

This table shows a simple frequency distribution of the retirement age data.

A.

56

B.

55

C.

57

D.

54
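
The frequency table itself is not reproduced above, but it can be rebuilt from the 11 listed values. The sketch below does that with `collections.Counter` and, because the question stem does not survive in this copy, also computes both the mode and the median rather than assuming which one was asked for:

```python
from collections import Counter
import statistics

ages = [54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60]

# Simple frequency distribution: each distinct age and how often it occurs.
frequency = Counter(ages)
for age in sorted(frequency):
    print(age, frequency[age])

mode_age = statistics.mode(ages)      # most frequent value
median_age = statistics.median(ages)  # middle (6th) of the 11 sorted values
print(mode_age, median_age)
```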

Question # 43

Which of the following best describes the law of large numbers?

A.

As a sample size decreases, its standard deviation gets closer to the average of the whole population.

B.

As a sample size grows, its mean gets closer to the average of the whole population.

C.

As a sample size decreases, its mean gets closer to the average of the whole population.

D.

When a sample size doubles, the sample is indicative of the whole population.
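
The law of large numbers can be observed with a small simulation. This sketch assumes a population of uniform random draws on [0, 1), whose true mean is 0.5, and uses a fixed seed so that repeated runs produce the same samples; the sample sizes are arbitrary choices:

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable

POPULATION_MEAN = 0.5  # true mean of uniform draws on [0, 1)

# Sample mean for increasingly large samples.
sample_means = {}
for n in (10, 1_000, 100_000):
    sample = [random.random() for _ in range(n)]
    sample_means[n] = statistics.fmean(sample)

for n, sample_mean in sample_means.items():
    print(n, round(sample_mean, 4), round(abs(sample_mean - POPULATION_MEAN), 4))
```

The gap between the sample mean and 0.5 tends to shrink as `n` grows, although any single small sample can land close to the population mean by chance.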

Question # 44

A company's human resources department has asked a data analyst to categorize the income of all employees into five salary bands:

Which of the following types of functions would be the most appropriate to use?

A.

Statistical

B.

Aggregate

C.

Logical

D.

Mathematical

Question # 45

What category of data stewardship work is focused on ensuring that the organization respects the wishes of data subjects?

A.

Data quality.

B.

Data privacy.

C.

Data security.

D.

Regulatory compliance.

Question # 46

What SQL command is used to delete an entire table from a database?

A.

DROP.

B.

MODIFY.

C.

DELETE.

D.

ALTER.
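
The difference between removing rows and removing a table can be demonstrated with Python's built-in `sqlite3` module. The `orders` table and the `table_exists` helper are hypothetical names used only for this sketch, and the catalog lookup through `sqlite_master` is specific to SQLite:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, qty INTEGER)")  # hypothetical table
conn.execute("INSERT INTO orders VALUES (1, 10)")


def table_exists(name: str) -> bool:
    """Check SQLite's catalog for a table with the given name."""
    row = conn.execute(
        "SELECT 1 FROM sqlite_master WHERE type = 'table' AND name = ?", (name,)
    ).fetchone()
    return row is not None


conn.execute("DELETE FROM orders")  # removes the rows, keeps the table definition
exists_after_delete = table_exists("orders")

conn.execute("DROP TABLE orders")   # removes the table itself
exists_after_drop = table_exists("orders")

print(exists_after_delete, exists_after_drop)
```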

Question # 47

Standardized tests are given to students in the middle of each month, and the results are ready by the end of the month. The superintendent needs a quick view of test performance. Which of the following would be the best recommendation to meet the superintendent's requirements?

A.

A dashboard with a continuous data stream and saved searches

B.

A report of test scores by classroom, emailed to the superintendent at the end of the month

C.

A report of test scores with pie charts showing student performance

D.

A dashboard with a scheduled delivery, the ability to filter scores by school, and bar charts for comparison

Question # 48

Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?

A.

Simple random

B.

Cluster

C.

Systematic

D.

Stratified
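
Drawing randomly from every subgroup can be sketched in a few lines. The subgroup names, their sizes, the 10% sampling fraction, and the helper name `sample_each_subgroup` are all assumptions made for illustration:

```python
import random

random.seed(42)  # fixed seed so the run is repeatable

# Hypothetical population, already divided into subgroups.
subgroups = {
    "freshman": list(range(0, 100)),
    "sophomore": list(range(100, 180)),
    "junior": list(range(180, 240)),
}


def sample_each_subgroup(groups: dict[str, list[int]], fraction: float) -> dict[str, list[int]]:
    """Draw a simple random sample of the given fraction from every subgroup."""
    return {
        name: random.sample(members, max(1, round(len(members) * fraction)))
        for name, members in groups.items()
    }


sample = sample_each_subgroup(subgroups, 0.1)
print({name: len(picked) for name, picked in sample.items()})
```

Every subgroup contributes members to the result. A design that instead picks a few whole subgroups at random and skips the rest is a different sampling method.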

Question # 49

A sales team wants visibility of current sales numbers, pipeline, and team performance. The team would also like to see calculations of individuals’ earned commissions and projected commissions based on sales, but they want that information to be kept confidential. Which of the following would be the BEST way to provide this visibility?

A.

Create a dashboard displaying a data refresh date so users know the current sales numbers and configure permissions to control access.

B.

Create a dashboard for sales numbers, pipeline, and team and individual performance for the management team.

C.

Create a dashboard with filters for the overall team, individuals, and management. Users can filter to see the data they want.

D.

Create a dashboard with views for team, individuals, and management. Configure permissions to control access.

Question # 50

Which of the following is an example of a flat file?

A.

CSV file

B.

PDF file

C.

JSON file

D.

JPEG file

Question # 51

A user receives a large custom report to track company sales across various date ranges. The user then completes a series of manual calculations for each date range. Which of the following should an analyst suggest so the user has a dynamic, seamless experience?

A.

Create multiple reports, one for each needed date range.

B.

Build calculations into the report so they are done automatically.

C.

Add macros to the report to speed up the filtering and calculations process.

D.

Create a dashboard with a date range picker and calculations built in.

Question # 52

An analyst reviews the following data:

7

3

5

2

3

7

7

10

Which of the following is the value of the mode?

A.

3

B.

5

C.

7

D.

10
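
The mode of the eight values listed in the question can be verified with Python's standard `statistics` module. `multimode` is used alongside `mode` because it also reports ties, should more than one value share the highest count:

```python
import statistics

data = [7, 3, 5, 2, 3, 7, 7, 10]

modes = statistics.multimode(data)  # every value that shares the highest count
mode = statistics.mode(data)        # a single most frequent value

print(modes, mode)
```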

Question # 53

An analyst notices changes in sales ratios when analyzing a quarterly report. Which of the following is the analyst conducting?

A.

A gap analysis

B.

A link analysis

C.

A trend analysis

D.

A statistical analysis

Question # 54

Which of the following is used for calculations and pivot tables?

A.

IBM SPSS

B.

SAS

C.

Microsoft Excel

D.

Domo

Question # 55

A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be the MOST efficient way to deliver this report?

A.

A workbook with multiple tabs for each region

B.

A daily email with snapshots of regional summaries

C.

A static report with a different page for every filtered view

D.

A dashboard with filters at the top that the user can toggle

Question # 56

Given the diagram below:

Which of the following steps is missing?

A.

Remove redundant data.

B.

Validate the data types.

C.

Connect to the data API.

D.

Normalize the data.

Question # 57

A data analyst must separate the column shown below into multiple columns for each component of the name:

Which of the following data manipulation techniques should the analyst perform?

A.

Imputing

B.

Transposing

C.

Parsing

D.

Concatenating
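
The name column is not reproduced here, so the following is only a sketch of splitting one text column into several. It assumes each value has exactly three space-separated parts in the order first, middle, last; the sample names are invented:

```python
# Hypothetical values; the real column layout is not shown in the question.
full_names = ["Ana Maria Lopez", "John Quincy Adams"]

parsed = []
for full_name in full_names:
    # Split on the first two spaces to get exactly three components.
    first, middle, last = full_name.split(" ", 2)
    parsed.append({"first": first, "middle": middle, "last": last})

print(parsed)
```

Real name data is rarely this regular, so a production version would need rules for missing middle names, suffixes, and multi-word surnames.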

Question # 58

Given the data below:

In which of the following file formats is the data presented?

A.

Xs

B.

CSV

C.

RIF

D.

XML

Question # 59

Which of the following is a control measure for preventing a data breach?

A.

Data transmission

B.

Data attribution

C.

Data retention

D.

Data encryption

Question # 60

A data analyst needs to perform a full outer join of a customer's orders using the tables below:

Which of the following is the mean of the order quantity?

A.

73.5

B.

76.5

C.

78.8

D.

81.5

Question # 61

Which one of the following values will appear first if they are sorted in descending order?

A.

Aaron.

B.

Molly.

C.

Xavier.

D.

Adam.
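
Descending order on text values can be checked directly with Python's `sorted`, which compares strings character by character. The list holds the four names from the options:

```python
names = ["Aaron", "Molly", "Xavier", "Adam"]

# reverse=True sorts from Z toward A.
descending = sorted(names, reverse=True)
print(descending)
```

"Adam" sorts ahead of "Aaron" in descending order because the second characters differ and "d" comes after "a".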

Question # 62

Andy is a pricing analyst for a retailer. Using a hypothesis test, he wants to assess whether people who receive electronic coupons spend more on average.

What should Andy's null hypothesis be?

A.

People who receive electronic coupons spend more on average.

B.

People who receive electronic coupons spend less on average.

C.

People who receive electronic coupons do not spend more on average.

D.

People who do not receive electronic coupons spend more on average.

Question # 63

An analyst has conducted a review of business questions. Which of the following should the analyst do next to conduct an analysis?

A.

Determine the data needs and review the observations.

B.

Determine the data needs and sources for analysis.

C.

Determine the data needs and schedule interviews.

D.

Determine the data needs and begin the analysis.

Question # 64

Different people manually type a series of handwritten surveys into an online database. Which of the following issues will MOST likely arise with this data? (Choose two.)

A.

Data accuracy

B.

Data constraints

C.

Data attribute limitations

D.

Data bias

E.

Data consistency

F.

Data manipulation

Question # 65

A data set was recorded using multimedia technology. Which of the following is a necessary step on the way to interpretation?

A.

Structural equation modeling

B.

Transcription

C.

Sequential analysis

D.

Sampling

Question # 66

Given the following graph:

Which of the following summary statements upholds integrity in data reporting?

A.

Sales are approximately equal for Product A and Product B across all strategies.

B.

Strategy 4 provides the best sales in comparison to other strategies.

C.

While Strategy 2 does not result in the highest sales of Product D, over all products it appears to be the most effective.

D.

Product D should be promoted more than the other products in all strategies.

Question # 67

An analyst has written the following code:

SELECT *

FROM Cust_table

WHERE age > 60 AND City = "New York"

Which of the following criteria is the analyst retrieving?

A.

All customers older than age 60 in New York state

B.

All customers aged 60 and older in New York state

C.

All customers older than age 60 in New York City

D.

All customers younger than age 60 in New York City
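
The effect of the two filter conditions can be tested against a small in-memory table using Python's built-in `sqlite3` module. The rows and the `name` column are invented for this sketch, and the string literal is written with single quotes, which is the standard SQL form:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Cust_table (name TEXT, age INTEGER, City TEXT)")
conn.executemany(
    "INSERT INTO Cust_table VALUES (?, ?, ?)",
    [
        ("Ann", 60, "New York"),  # exactly 60: fails age > 60
        ("Bob", 61, "New York"),  # passes both conditions
        ("Cy", 75, "Albany"),     # old enough, but a different city
        ("Dee", 30, "New York"),  # right city, too young
    ],
)

rows = conn.execute(
    "SELECT * FROM Cust_table WHERE age > 60 AND City = 'New York'"
).fetchall()
matched_names = [row[0] for row in rows]
print(matched_names)
conn.close()
```

The strict `>` excludes a customer who is exactly 60, and the filter applies to the `City` column, so a match elsewhere in the same state does not qualify.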

Question # 68

A research analyst wants to determine whether the data being analyzed is connected to other datapoints. Which of the following is the BEST type of analysis to conduct?

A.

Trend analysis

B.

Performance analysis

C.

Link analysis

D.

Exploratory analysis

Question # 69

Which of the following techniques is used to quantify data?

A.

Decoding

B.

Enumeration

C.

Coding

D.

Structure

Question # 70

A data analyst is developing a data dictionary that aligns with a company's data management processes and policies. Which of the following best describes what should be included in the data dictionary?

A.

Information containing the links to business data

B.

Information explaining the business methodologies

C.

Information containing definitions of the business data

D.

Information describing the data analysis phases

Question # 71

An employer needs to maintain adequate office staffing during the winter and wants to track storm data. Which of the following data collection methods should the employer use?

A.

Web scraping

B.

Public databases

C.

Observations

D.

Weather surveys

Question # 72

A data analyst needs to create a dashboard to help identify trends in the data sets. Which of the following is an appropriate consideration for dashboard development?

A.

Data sources and attributes

B.

Frequently asked questions

C.

A report from the data source

D.

A comparison of data sets

Question # 73

A data analyst needs to create a dashboard using the company's yearly revenue data sets. Which of the following would be the best way to plot the information to show the top-performing region?

A.

A line chart

B.

A waterfall chart

C.

A heat map

D.

A stacked bar chart

Question # 74

An analyst is building a new dashboard for a user. After an initial conversation with the user, the analyst created a mock-up of the dashboard. Which of the following best explains why the analyst created the mock-up?

A.

To identify the dimensions and measures

B.

To send to the client after deploying the dashboard to production

C.

To confirm important details before dashboard development begins

D.

To receive client approval for the final dashboard design

Question # 75

Which of the following will MOST likely be streamed live?

A.

Machine data

B.

Key-value pairs

C.

Delimited rows

D.

Flat files

Question # 76

A data analyst needs to create a master file that includes customer information from the tables below:

Given the three tables above, the analyst wants to filter down the information prior to joining it together. In which of the following orders should this data manipulation be approached for the most efficient result?

A.

Merge, append, deduplicate

B.

Merge, deduplicate, append

C.

Deduplicate, append, merge

D.

Append, deduplicate, merge

Question # 77

Which of the following would be used to store unstructured data from different sources?

A.

A data lake

B.

A database management system

C.

A database

D.

A data warehouse

Question # 78

Angela is aggregating data from a CRM system with data from an employee system.

While performing an initial quality check, she realizes that her employee ID is not associated with her identifier in the CRM system.

What kind of issue is Angela facing?

Choose the best answer.

A.

ETL process.

B.

Record linkage.

C.

ELT process.

D.

System integration.
