# E20-065 Exam Dumps - Advanced Analytics Specialist Exam for Data Scientists

Question # 4

What runs more efficiently because of Apache Tez?

A.

Pig and Hive

B.

Hive and HBase

C.

Yarn and Spark

D.

All MapReduce jobs

Question # 5

You conduct a TFIDF analysis on 3 documents containing raw text and derive TFIDF ("data", document y) = 1.908. You know that the term "data" only appears in document 2.

What is the TF of "data" in document 2?

A.

2 based on the following reasoning:

TFIDF = TF*IDF = 1.908

You then know that IDF will equal LOG (3^2)=0.954

Therefore, TFIDF=TF*0.954 = 1.908

TF will then round to 2

B.

4 based on the following reasoning:

TFIDF = TF*IDF = 1.908

You then know that IDF will equal LOG (3/1)=0.477

Therefore, TFIDF=TF*0.477 = 1.908

TF will then round to 4

C.

6 based on the following reasoning:

TFIDF = TF*IDF = 1.908

You then know that IDF will equal 3/1=3

Therefore, TFIDF=TF/3 = 1.908

TF will then round to 6

D.

11 based on the following reasoning:

TFIDF = TF*IDF = 1.908

You then know that IDF will equal LOG(3/2)=0.176

Therefore, TFIDF=TF*0.176 = 1.908

TF will then round to 11
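The arithmetic behind the options can be checked with a short sketch. It assumes the common definition IDF = log10(N / df), where N is the number of documents and df is the number of documents containing the term; the base-10 logarithm is an assumption inferred from the values quoted in the options, and the variable names are illustrative only.

```python
import math

# Values taken from the question text.
n_documents = 3          # size of the corpus
docs_with_term = 1       # "data" appears only in document 2
tfidf = 1.908            # given TFIDF("data", document y)

# Assumed definition: IDF = log10(N / df).
idf = math.log10(n_documents / docs_with_term)

# TFIDF = TF * IDF, so TF = TFIDF / IDF.
tf = tfidf / idf
tf_rounded = round(tf)

print(f"IDF = {idf:.3f}, TF = {tf:.3f}, rounded TF = {tf_rounded}")
```

Under a different IDF convention (natural logarithm, or smoothing terms added to N and df) the numbers would change, so the result only holds for the definition assumed here.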

Question # 6

What are three of the eight visual variables?

A.

Selection, orientation, and mark

B.

Size, separation, and orientation

C.

Position, size, and orientation

D.

Position, texture, and selection

Question # 7

In a connected, undirected graph of 5 nodes with 10 edges, how many more edges need to be added to make the clustering coefficient of every node equal 1?

A.

0

B.

5

C.

10

D.

15
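One way to reason about this question is to compare the edge count with the maximum possible for a simple graph, n(n-1)/2, and to compute the local clustering coefficient directly. The sketch below assumes a simple graph (no self-loops or repeated edges); the helper `local_clustering` is a hypothetical name written for this illustration, not a library function.

```python
from itertools import combinations

def local_clustering(adj, node):
    """Fraction of pairs of a node's neighbours that are themselves connected."""
    neighbours = adj[node]
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(1 for u, v in combinations(neighbours, 2) if v in adj[u])
    return links / (k * (k - 1) / 2)

n = 5
# A simple undirected graph on n nodes can hold at most n*(n-1)/2 edges.
max_edges = n * (n - 1) // 2

# Build the 5-node graph that contains every possible edge.
adj = {i: set() for i in range(n)}
for u, v in combinations(range(n), 2):
    adj[u].add(v)
    adj[v].add(u)

edge_count = sum(len(s) for s in adj.values()) // 2
coefficients = [local_clustering(adj, i) for i in range(n)]
missing_edges = max_edges - edge_count

print(max_edges, edge_count, coefficients, missing_edges)
```

Under the simple-graph assumption, a 5-node graph with 10 edges already contains every possible edge, so each node's neighbours are all connected to one another.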

Question # 8

Which graph structure would best model the relationship between job seekers and employers?

A.

Bipartite

B.

Weighted

C.

Directed acyclic

D.

Ranked
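To make the graph structures concrete, the sketch below models job seekers and employers as two groups of nodes with edges only between the groups, then tests that property with a breadth-first two-colouring. The node names and the `is_bipartite` helper are hypothetical, chosen only for illustration.

```python
from collections import deque

def is_bipartite(adj):
    """Return True if the nodes can be split into two groups with no edge inside a group."""
    colour = {}
    for start in adj:
        if start in colour:
            continue
        colour[start] = 0
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in colour:
                    # Place each neighbour in the opposite group.
                    colour[v] = 1 - colour[u]
                    queue.append(v)
                elif colour[v] == colour[u]:
                    # Two adjacent nodes landed in the same group.
                    return False
    return True

# Hypothetical applications: every edge joins a seeker to an employer.
applications = [
    ("seeker_a", "employer_x"),
    ("seeker_a", "employer_y"),
    ("seeker_b", "employer_x"),
]
adj = {}
for seeker, employer in applications:
    adj.setdefault(seeker, set()).add(employer)
    adj.setdefault(employer, set()).add(seeker)

job_market_is_bipartite = is_bipartite(adj)

# A triangle, by contrast, cannot be split into two such groups.
triangle = {1: {2, 3}, 2: {1, 3}, 3: {1, 2}}
triangle_is_bipartite = is_bipartite(triangle)

print(job_market_is_bipartite, triangle_is_bipartite)
```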

Question # 9

What process must address acoustic ambiguity in NLP?

A.

Part-of-speech tagging

B.

Word sense disambiguation

C.

Speech recognition

D.

Discourse