Weekend Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code = simple70

Pass the Databricks Certification Databricks-Certified-Professional-Data-Scientist Questions and answers with ExamsMirror

Practice at least 50% of the questions to maximize your chances of passing.
Exam Databricks-Certified-Professional-Data-Scientist Premium Access

View all detail and faqs for the Databricks-Certified-Professional-Data-Scientist exam


429 Students Passed

90% Average Score

96% Same Questions
Viewing page 1 out of 5 pages
Viewing questions 1-10 out of questions
Questions # 1:

Refer to the exhibit.

Question # 1

You are building a decision tree. In this exhibit, four variables are listed with their respective values of info-gain.

Based on this information, on which attribute would you expect the next split to be in the decision tree?

Options:

A.

Credit Score

B.

Age

C.

Income

D.

Gender

Questions # 2:

Which of the following are advantages of the Support Vector machines?

Options:

A.

Effective in high dimensional spaces.

B.

it is memory efficient

C.

possible to specify custom kernels

D.

Effective in cases where number of dimensions is greater than the number of samples

E.

Number of features is much greater than the number of samples, the method still give good performances

F.

SVMs directly provide probability estimates

Questions # 3:

What describes a true limitation of Logistic Regression method?

Options:

A.

It does not handle redundant variables well.

B.

It does not handle missing values well.

C.

It does not handle correlated variables well.

D.

It does not have explanatory values.

Questions # 4:

Select the correct algorithm of unsupervised algorithm

Options:

A.

K-Nearest Neighbors

B.

K-Means

C.

Support Vector Machines

D.

Naive Bayes

Questions # 5:

Let's say you have two cases as below for the movie ratings

1. You recommend to a user a movie with four stars and he really doesn't like it and he'd rate it two stars

2. You recommend a movie with three stars but the user loves it (he'd rate it five stars). So which statement correctly applies?

Options:

A.

In both cases, the contribution to the RMSE is the same

B.

In both cases, the contribution to the RMSE is the different

C.

In both cases, the contribution to the RMSE, could varies

D.

None of the above

Questions # 6:

You are working with the Clustering solution of the customer datasets. There are almost 40 variables are available for each customer and almost 1.00,0000 customer's data is available. You want to reduce the number of variables for clustering, what would you do?

Options:

A.

You will randomly reduce the number of variables

B.

You will find the correlation among the variables and from their variables are not co-related will be discarded.

C.

You will find the correlation among the variables and from the highly co-related variables, you will be considering only one or two variables from it.

D.

You cannot discard any variable for creating clusters.

E.

You can combine several variables in one variable

Questions # 7:

Select the correct objectives of principal component analysis

Options:

A.

To reduce the dimensionality of the data set

B.

To identify new meaningful underlying variables

C.

To discover the dimensionality of the data set

D.

Only 1 and 2

E.

All 1, 2 and 3

Questions # 8:

Which of the following are point estimation methods?

Options:

A.

MAP

B.

MLE

C.

MMSE

Questions # 9:

You are working on a Data Science project and during the project you have been gibe a responsibility to interview all the stakeholders in the project. In which phase of the project you are?

Options:

A.

Discovery

B.

Data Preparations

C.

Creating Models

D.

Executing Models

E.

Creating visuals from the outcome

F.

Operationnalise the models

Questions # 10:

You are creating a Classification process where input is the income, education and current debt of a customer, what could be the possible output of this process.

Options:

A.

Probability of the customer default on loan repayment

B.

Percentage of the customer loan repayment capability

C.

Percentage of the customer should be given loan or not

D.

The output might be a risk class, such as "good", "acceptable", "average", or "unacceptable".

Viewing page 1 out of 5 pages
Viewing questions 1-10 out of questions
TOP CODES

TOP CODES

Top selling exam codes in the certification world, popular, in demand and updated to help you pass on the first try.