Summer Sale Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: v4s65

DY0-001 Exam Dumps - CompTIA DataX Exam

Go to page:
Question # 9

A data scientist is presenting the recommendations from a monthslong modeling and experiment process to the company’s Chief Executive Officer. Which of the following is the best set of artifacts to include in the presentation?

A.

Methods, data overview, results, recommendations, and charts

B.

Results, recommendations, justifications, and clear charts

C.

Recommendation, charts, justifications, code reviews, and results

D.

Methodology, code snippets, findings, data tables, and p-values

Full Access
Question # 10

Which of the following measures would a data scientist most likely use to calculate the similarity of two text strings?

A.

Word cloud

B.

Edit distance

C.

String indexing

D.

k-nearest neighbors

Full Access
Question # 11

A computer vision model is trained to identify cats on a training set that is composed of both cat and dog images. The model predicts a picture of a cat is a dog. Which of the following describes this error?

A.

Error due to reality

B.

False positive error

C.

Sampling error

D.

Type II error

Full Access
Question # 12

Which of the following distance metrics for KNN is best described as a straight line?

A.

Radial

B.

Euclidean

C.

Cosine

D.

Manhattan

Full Access
Question # 13

A data scientist needs to determine whether product sales are impacted by other contributing factors. The client has provided the data scientist with sales and other variables in the data set.

The data scientist decides to test potential models that include other information.

INSTRUCTIONS

Part 1

Use the information provided in the table to select the appropriate regression model.

Part 2

Review the summary output and variable table to determine which variable is statistically significant.

If at any time you would like to bring back the initial state of the simulation, please click the Reset All button.

Full Access
Question # 14

Which of the following modeling tools is appropriate for solving a scheduling problem?

A.

One-armed bandit

B.

Constrained optimization

C.

Decision tree

D.

Gradient descent

Full Access
Question # 15

Which of the following JOINS would generate the largest amount of data?

A.

RIGHT JOIN

B.

LEFT JOIN

C.

CROSS JOIN

D.

INNER JOIN

Full Access
Question # 16

A team is building a spam detection system. The team wants a probability-based identification method without complex, in-depth training from the historical data set. Which of the following methods would best serve this purpose?

A.

Logistic regression

B.

Random forest

C.

Naive Bayes

D.

Linear regression

Full Access
Go to page: