Databricks-Machine-Learning-Professional Databricks Certified Machine Learning Professional Questions and Answers

Questions 4

Which of the following MLflow Model Registry use cases requires the use of an HTTP Webhook?

Options:

A.

Starting a testing job when a new model is registered

B.

Updating data in a source table for a Databricks SQL dashboard when a model version transitions to the Production stage

C.

Sending an email alert when an automated testing Job fails

D.

None of these use cases require the use of an HTTP Webhook

E.

Sending a message to a Slack channel when a model version transitions stages

Questions 5

Which of the following is a simple statistic to monitor for categorical feature drift?

Options:

A.

Mode

B.

None of these

C.

Mode, number of unique values, and percentage of missing values

D.

Percentage of missing values

E.

Number of unique values
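
Each statistic named in the options above is cheap to compute for a categorical column. The following is a minimal sketch in plain Python, not a Databricks or MLflow API: the helper name `categorical_summary`, the use of `None` for missing values, and the two sample windows are all assumptions made for illustration.

```python
from collections import Counter
from typing import Optional, Sequence


def categorical_summary(values: Sequence[Optional[str]]) -> dict:
    """Summarize a categorical column (hypothetical helper, not a library API)."""
    # Missing values are assumed to be encoded as None.
    present = [v for v in values if v is not None]
    counts = Counter(present)
    mode = counts.most_common(1)[0][0] if counts else None
    pct_missing = 100.0 * (len(values) - len(present)) / len(values) if values else 0.0
    return {"mode": mode, "n_unique": len(counts), "pct_missing": pct_missing}


# Made-up reference and current windows of one categorical feature.
baseline = ["red", "red", "blue", None]
current = ["blue", "blue", "green", "green", "blue", None, None, None]

baseline_stats = categorical_summary(baseline)
current_stats = categorical_summary(current)

# Names of the statistics that differ between the two windows.
changed = {k for k in baseline_stats if baseline_stats[k] != current_stats[k]}
```

Comparing the two summaries flags which statistics moved between windows; in practice a tolerance would probably be applied to the missing-value percentage rather than an exact comparison.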

Questions 6

Which of the following operations in Feature Store Client fs can be used to return a Spark DataFrame of a data set associated with a Feature Store table?

Options:

A.

fs.create_table

B.

fs.write_table

C.

fs.get_table

D.

There is no way to accomplish this task with fs

E.

fs.read_table

Questions 7

A data scientist has developed a scikit-learn model sklearn_model and they want to log the model using MLflow.

They write the following incomplete code block:

[Image: code block for Question 7, not reproduced]

Which of the following lines of code can be used to fill in the blank so the code block can successfully complete the task?

Options:

A.

mlflow.spark.track_model(sklearn_model, "model")

B.

mlflow.sklearn.log_model(sklearn_model, "model")

C.

mlflow.spark.log_model(sklearn_model, "model")

D.

mlflow.sklearn.load_model("model")

E.

mlflow.sklearn.track_model(sklearn_model, "model")

Questions 8

A machine learning engineer wants to programmatically create a new Databricks Job whose schedule depends on the result of some automated tests in a machine learning pipeline.

Which of the following Databricks tools can be used to programmatically create the Job?

Options:

A.

MLflow APIs

B.

AutoML APIs

C.

MLflow Client

D.

Jobs cannot be created programmatically

E.

Databricks REST APIs
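
For context on the last option, a job-creation call against the Databricks Jobs REST API could be prepared roughly as follows. This is a sketch under stated assumptions: the workspace URL, token, job name, notebook path, and cluster ID are placeholders, the helper `build_create_job_request` is hypothetical, and the request is only constructed, never sent.

```python
import json
import urllib.request


def build_create_job_request(host: str, token: str, tests_passed: bool) -> urllib.request.Request:
    """Build (but do not send) a POST to the Jobs API 2.1 create endpoint."""
    # Assumed policy for illustration: run hourly if tests passed, daily otherwise.
    cron = "0 0 * * * ?" if tests_passed else "0 0 0 * * ?"
    payload = {
        "name": "retrain-model",  # placeholder job name
        "tasks": [
            {
                "task_key": "retrain",
                "notebook_task": {"notebook_path": "/Repos/ml/retrain"},  # placeholder path
                "existing_cluster_id": "0000-000000-example",  # placeholder cluster ID
            }
        ],
        "schedule": {"quartz_cron_expression": cron, "timezone_id": "UTC"},
    }
    return urllib.request.Request(
        url=f"{host}/api/2.1/jobs/create",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Authorization": f"Bearer {token}", "Content-Type": "application/json"},
        method="POST",
    )


# Placeholder host and token; nothing is sent over the network here.
request = build_create_job_request(
    "https://example.cloud.databricks.com", "dapi-EXAMPLE", tests_passed=True
)
```

Passing `request` to `urllib.request.urlopen` would submit it; the schedule is chosen from the test result before the job is created, which mirrors the scenario in the question.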

Questions 9

A data scientist would like to enable MLflow Autologging for all machine learning libraries used in a notebook. They want to ensure that MLflow Autologging is used no matter what version of the Databricks Runtime for Machine Learning is used to run the notebook and no matter what workspace-wide configurations are selected in the Admin Console.

Which of the following lines of code can they use to accomplish this task?

Options:

A.

mlflow.sklearn.autolog()

B.

mlflow.spark.autolog()

C.

spark.conf.set("autologging", True)

D.

It is not possible to automatically log MLflow runs.

E.

mlflow.autolog()

Questions 10

Which of the following Databricks-managed MLflow capabilities is a centralized model store?

Options:

A.

Models

B.

Model Registry

C.

Model Serving

D.

Feature Store

E.

Experiments

Questions 11

A data scientist has developed a scikit-learn random forest model model, but they have not yet logged model with MLflow. They want to obtain the input schema and the output schema of the model so they can document what type of data is expected as input.

Which of the following MLflow operations can be used to perform this task?

Options:

A.

mlflow.models.schema.infer_schema

B.

mlflow.models.signature.infer_signature

C.

mlflow.models.Model.get_input_schema

D.

mlflow.models.Model.signature

E.

There is no way to obtain the input schema and the output schema of an unlogged model.

Questions 12

A data scientist has computed updated feature values for all primary key values stored in the Feature Store table features. In addition, feature values for some new primary key values have also been computed. The updated feature values are stored in the DataFrame features_df. They want to replace all data in features with the newly computed data.

Which of the following code blocks can they use to perform this task using the Feature Store Client fs?

A)

[Image: code block for option A, not reproduced]

B)

[Image: code block for option B, not reproduced]

C)

[Image: code block for option C, not reproduced]

D)

[Image: code block for option D, not reproduced]

E)

[Image: code block for option E, not reproduced]

Options:

A.

Option A

B.

Option B

C.

Option C

D.

Option D

E.

Option E

Questions 13

Which of the following describes concept drift?

Options:

A.

Concept drift is when there is a change in the distribution of an input variable

B.

Concept drift is when there is a change in the distribution of a target variable

C.

Concept drift is when there is a change in the relationship between input variables and target variables

D.

Concept drift is when there is a change in the distribution of the predicted target given by the model

E.

None of these describe concept drift
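
The options above separate changes in a variable's distribution from changes in how inputs relate to the target. The sketch below uses made-up numbers and a hypothetical fixed model to show the second case: the inputs are identical in both periods, yet the model's error grows because the input-to-target relationship has moved.

```python
from statistics import mean


def model(x: float) -> float:
    """A fixed model, assumed to have been fitted when y = 2x held."""
    return 2.0 * x


inputs = [1.0, 2.0, 3.0, 4.0]  # identical inputs in both periods
targets_before = [2.0 * x for x in inputs]  # relationship at training time
targets_after = [2.0 * x + 3.0 for x in inputs]  # relationship after it shifted


def mean_abs_error(xs, ys) -> float:
    """Mean absolute error of the fixed model on paired inputs and targets."""
    return mean(abs(model(x) - y) for x, y in zip(xs, ys))


mae_before = mean_abs_error(inputs, targets_before)
mae_after = mean_abs_error(inputs, targets_after)
```

Because the inputs never change, monitoring input distributions alone would not reveal this shift; it only shows up once predictions are compared with fresh labels.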

Questions 14

A machine learning engineering team wants to build a continuous pipeline for data preparation of a machine learning application. The team would like the data to be fully processed and made ready for inference in a series of equal-sized batches.

Which of the following tools can be used to provide this type of continuous processing?

Options:

A.

Spark UDFs

B.

Structured Streaming

C.

MLflow

D.

Delta Lake

E.

AutoML

Questions 15

A machine learning engineering team has written predictions computed in a batch job to a Delta table for querying. However, the team has noticed that the querying is running slowly. The team has already tuned the size of the data files. Upon investigating, the team has concluded that the rows meeting the query condition are sparsely located throughout each of the data files.

Based on the scenario, which of the following optimization techniques could speed up the query by colocating similar records while considering values in multiple columns?

Options:

A.

Z-Ordering

B.

Bin-packing

C.

Write as a Parquet file

D.

Data skipping

E.

Tuning the file size
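
The idea of colocating records by several columns at once can be illustrated with a Z-order (Morton) curve, which interleaves the bits of two values into a single sort key. This is a conceptual sketch only: the function `interleave_bits` is hypothetical, the 4x4 grid is made up, and Delta Lake's actual implementation may differ.

```python
def interleave_bits(x: int, y: int, bits: int = 8) -> int:
    """Interleave the low `bits` bits of x and y into one Morton key."""
    z = 0
    for i in range(bits):
        z |= ((x >> i) & 1) << (2 * i)  # bits of x go to even positions
        z |= ((y >> i) & 1) << (2 * i + 1)  # bits of y go to odd positions
    return z


# A made-up 4x4 grid of (x, y) column values.
points = [(x, y) for x in range(4) for y in range(4)]

# Sorting by the interleaved key keeps rows that are close in both columns
# near each other, so they tend to land in the same files.
z_sorted = sorted(points, key=lambda p: interleave_bits(*p))
```

With this ordering the first four rows form the 2x2 block with the smallest values in both columns, whereas sorting by `x` alone would spread each range of `y` across the whole table.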

Questions 16

A data scientist is using MLflow to track their machine learning experiment. As a part of each MLflow run, they are performing hyperparameter tuning. The data scientist would like to have one parent run for the tuning process with a child run for each unique combination of hyperparameter values.

They are using the following code block:

[Image: code block for Question 16, not reproduced]

The code block is not nesting the runs in MLflow as they expected.

Which of the following changes does the data scientist need to make to the above code block so that it successfully nests the child runs under the parent run in MLflow?

Options:

A.

Indent the child run blocks within the parent run block

B.

Add the nested=True argument to the parent run

C.

Remove the nested=True argument from the child runs

D.

Provide the same name to the run_name parameter for all three run blocks

E.

Add the nested=True argument to the parent run and remove the nested=True arguments from the child runs

Questions 17

A machine learning engineer wants to move their model version model_version for the MLflow Model Registry model model from the Staging stage to the Production stage using MLflow Client client.

Which of the following code blocks can they use to accomplish the task?

A)

[Image: code block for option A, not reproduced]

B)

[Image: code block for option B, not reproduced]

C)

[Image: code block for option C, not reproduced]

D)

[Image: code block for option D, not reproduced]

E)

[Image: code block for option E, not reproduced]

Options:

A.

Option A

B.

Option B

C.

Option C

D.

Option D

E.

Option E

Questions 18

A data scientist set up a machine learning pipeline to automatically log a data visualization with each run. They now want to view the visualizations in Databricks.

Which of the following locations in Databricks will show these data visualizations?

Options:

A.

The MLflow Model Registry Model page

B.

The Artifacts section of the MLflow Experiment page

C.

Logged data visualizations cannot be viewed in Databricks

D.

The Artifacts section of the MLflow Run page

E.

The Figures section of the MLflow Run page

Exam Name: Databricks Certified Machine Learning Professional
Last Update: Nov 15, 2024
Questions: 60
