Summer Sale Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: v4s65

DY0-001 Exam Dumps - CompTIA DataX Exam

Go to page:
Question # 17

Which of the following best describes the minimization of the residual term in a ridge linear regression?

A.

|e|

B.

e

C.

e²

D.

0

Full Access
Question # 18

A data analyst is examining the correlation matrix of a new data set to identify issues that could adversely impact model performance. Which of the following is the analyst most likely checking for?

A.

Undersampling

B.

Multicollinearity

C.

Oversampling

D.

Overfitting

Full Access
Question # 19

A data scientist would like to model a complex phenomenon using a large data set composed of categorical, discrete, and continuous variables. After completing exploratory data analysis, the data scientist is reasonably certain that no linear relationship exists between the predictors and the target. Although the phenomenon is complex, the data scientist still wants to maintain the highest possible degree of interpretability in the final model. Which of the following algorithms best meets this objective?

A.

Artificial neural network

B.

Decision tree

C.

Multiple linear regression

D.

Random forest

Full Access
Question # 20

The most likely concern with a one-feature, machine-learning model is high error due to:

A.

bias

B.

dimensionality

C.

variance

D.

probability

Full Access
Question # 21

A data analyst wants to use compression on an analyzed data set and send it to a new destination for further processing. Which of the following issues will most likely occur?

A.

Library dependency will be missing.

B.

Server CPU usage will be too high.

C.

Operating system support will be missing.

D.

Server memory usage will be too high.

Full Access
Question # 22

A data scientist is standardizing a large data set that contains website addresses. A specific string inside some of the web addresses needs to be extracted. Which of the following is the best method for extracting the desired string from the text data?

A.

Regular expressions

B.

Named-entity recognition

C.

Large language model

D.

Find and replace

Full Access
Question # 23

A data scientist is building a model to predict customer credit scores based on information collected from reporting agencies. The model needs to automatically adjust its parameters to adapt to recent changes in the information collected. Which of the following is the best model to use?

A.

Decision tree

B.

Random forest

C.

Linear discriminant analysis

D.

XGBoost

Full Access
Question # 24

Which of the following belong in a presentation to the senior management team and/or C-suite executives? (Choose two.)

A.

Full literature reviews

B.

Code snippets

C.

Final recommendations

D.

High-level results

E.

Detailed explanations of statistical tests

F.

Security keys and login information

Full Access
Go to page: