DP-100 Designing and Implementing a Data Science Solution on Azure Questions and Answers

Questions 4

You develop and train a machine learning model to predict fraudulent transactions for a hotel booking website.

Traffic to the site varies considerably. The site experiences heavy traffic on Monday and Friday and much lower traffic on other days. Holidays are also high web traffic days. You need to deploy the model as an Azure Machine Learning real-time web service endpoint on compute that can dynamically scale up and down to support demand. Which deployment compute option should you use?

Options:

attached Azure Databricks cluster

Azure Container Instance (ACI)

Azure Kubernetes Service (AKS) inference cluster

Azure Machine Learning Compute Instance

attached virtual machine in a different region

Buy Now

Questions 5

You create a binary classification model using Azure Machine Learning Studio.

You must use a Receiver Operating Characteristic (RO C) curve and an F1 score to evaluate the model.

You need to create the required business metrics.

How should you complete the experiment? To answer, select the appropriate options in the dialog box in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 5

Options:

Buy Now

Questions 6

You are preparing to use the Azure ML SDK to run an experiment and need to create compute. You run the following code:

DP-100 Question 6

For each of the following statements, select Yes if the statement is true. Otherwise, select No.

NOTE: Each correct selection is worth one point.

DP-100 Question 6

Options:

Buy Now

Questions 7

O: 199 HOTSPOT

You are using the Hyperdrive feature in Azure Machine Learning to train a model.

You configure the Hyperdrive experiment by running the following code:

DP-100 Question 7

For each of the following statements, select Yes if the statement is true. Otherwise, select No.

NOTE: Each correct selection is worth one point.

DP-100 Question 7

Options:

Buy Now

Questions 8

You are creating a machine learning model in Python. The provided dataset contains several numerical columns and one text column. The text column represents a product's category. The product category will always be one of the following:

Bikes
Cars
Vans
Boats

You are building a regression model using the scikit-learn Python package.

You need to transform the text data to be compatible with the scikit-learn Python package.

How should you complete the code segment? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 8

Options:

Buy Now

Questions 9

You are developing a machine learning, experiment by using Azure. The following images show the input and output of a machine learning experiment:

DP-100 Question 9

Use the drop-down menus to select the answer choice that answers each question based on the information presented in the graphic.

NOTE: Each correct selection is worth one point.

DP-100 Question 9

Options:

Buy Now

Questions 10

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.

After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You are creating a new experiment in Azure Learning learning Studio.

One class has a much smaller number of observations than the other classes in the training

You need to select an appropriate data sampling strategy to compensate for the class imbalance.

Solution: You use the Synthetic Minority Oversampling Technique (SMOTE) sampling mode.

Does the solution meet the goal?

Options:

Yes

Buy Now

Questions 11

You create an Azure Machine Learning workspace named workspaces. You create a Python SDK v2 notebook to perform custom model training in workspace1. You need to run the notebook from Azure Machine Learning Studio in workspace1. What should you provision first?

Options:

default storage account

real-time endpoint

Azure Machine Learning compute cluster

Azure Machine Learning compute instance

Buy Now

Questions 12

You train and publish a machine teaming model.

You need to run a pipeline that retrains the model based on a trigger from an external system.

What should you configure?

Options:

Azure Data Catalog

Azure Batch

Azure logic App

Buy Now

Questions 13

You are creating an experiment by using Azure Machine Learning Studio.

You must divide the data into four subsets for evaluation. There is a high degree of missing values in the data. You must prepare the data for analysis.

You need to select appropriate methods for producing the experiment.

Which three modules should you run in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

NOTE: More than one order of answer choices is correct. You will receive credit for any of the correct orders you select.

DP-100 Question 13

Options:

Buy Now

Questions 14

After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You are creating a model to predict the price of a student’s artwork depending on the following variables: the student’s length of education, degree type, and art form.

You start by creating a linear regression model.

You need to evaluate the linear regression model.

Solution: Use the following metrics: Mean Absolute Error, Root Mean Absolute Error, Relative Absolute Error, Relative Squared Error, and the Coefficient of Determination.

Does the solution meet the goal?

Options:

Yes

Buy Now

Questions 15

You train a machine learning model by using Aunt Machine Learning.

You use the following training script m Python to log an accuracy value.

DP-100 Question 15

You must use a Python script to define a sweep job.

You need to provide the primary metric and goal you want hyper parameter tuning to optimize.

How should you complete the Python script? To answer select the appropriate options in the answer area

NOTE: Each correct selection is worth one point.

DP-100 Question 15

Options:

Buy Now

Questions 16

You write five Python scripts that must be processed in the order specified in Exhibit A – which allows the same modules to run in parallel, but will wait for modules with dependencies.

You must create an Azure Machine Learning pipeline using the Python SDK, because you want to script to create the pipeline to be tracked in your version control system. You have created five PythonScriptSteps and have named the variables to match the module names.

DP-100 Question 16

You need to create the pipeline shown. Assume all relevant imports have been done.

Which Python code segment should you use?

DP-100 Question 16

Options:

Option A

Option B

Option C

Option D

Buy Now

Questions 17

You plan to build a team data science environment. Data for training models in machine learning pipelines will

be over 20 GB in size.

You have the following requirements:

Models must be built using Caffe2 or Chainer frameworks.
Data scientists must be able to use a data science environment to build the machine learning pipelines and train models on their personal devices in both connected and disconnected network environments.
Personal devices must support updating machine learning pipelines when connected to a network.

You need to select a data science environment.

Which environment should you use?

Options:

Azure Machine Learning Service

Azure Machine Learning Studio

Azure Databricks

Azure Kubernetes Service (AKS)

Buy Now

Questions 18

You use the Azure Machine Learning service to create a tabular dataset named training.data. You plan to use this dataset in a training script.

You create a variable that references the dataset using the following code:

training_ds = workspace.datasets.get("training_data")

You define an estimator to run the script.

You need to set the correct property of the estimator to ensure that your script can access the training.data dataset

Which property should you set?

DP-100 Question 18

Options:

Option A

Option B

Option C

Option D

Buy Now

Questions 19

You create an Azure Machine Learning workspace. You train a classification model by using automated machine learning (automated ML) in Azure Machine Learning studio. The training data contains multiple classes that have significantly different numbers of samples.

You must use a metric type to avoid labeling negative samples as positive and an averaging method that will minimize the class imbalance.

You need to configure the metric type and the averaging method.

Which configurations should you use? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 19

Options:

Buy Now

Questions 20

You plan to create a speech recognition deep learning model.

The model must support the latest version of Python.

You need to recommend a deep learning framework for speech recognition to include in the Data Science Virtual Machine (DSVM).

What should you recommend?

Options:

Apache Drill

Tensorflow

Rattle

Weka

Buy Now

Questions 21

You are implementing hyperparameter tuning for a model training from a notebook. The notebook is in an Azure Machine Learning workspace. You add code that imports all relevant Python libraries.

You must configure Bayesian sampling over the search space for the num_hidden_layers and batch_size hyperparameters.

You need to complete the following Python code to configure Bayesian sampling.

Which code segments should you use? To answer, select the appropriate options in the answer area

NOTE: Each correct selection is worth one point.

Options:

Buy Now

Questions 22

DP-100 Question 22

You need to record the row count as a metric named row_count that can be returned using the get_metrics method of the Run object after the experiment run completes. Which code should you use?

Options:

run.upload_file(‘row_count’, ‘./data.csv’)

run.log(‘row_count’, rows)

run.tag(‘row_count’, rows)

run.log_table(‘row_count’, rows)

run.log_row(‘row_count’, rows)

Buy Now

Questions 23

: 212

You register a model that you plan to use in a batch inference pipeline.

The batch inference pipeline must use a ParallelRunStep step to process files in a file dataset. The script has the ParallelRunStep step runs must process six input files each time the inferencing function is called.

You need to configure the pipeline.

Which configuration setting should you specify in the ParallelRunConfig object for the PrallelRunStep step?

Options:

process_count_per_node= "6"

node_count= "6"

mini_batch_size= "6"

error_threshold= "6"

Buy Now

Questions 24

You use Azure Machine Learning to deploy a model as a real-time web service.

You need to create an entry script for the service that ensures that the model is loaded when the service starts and is used to score new data as it is received.

Which functions should you include in the script? To answer, drag the appropriate functions to the correct actions. Each function may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content

NOTE: Each correct selection is worth one point.

DP-100 Question 24

Options:

Buy Now

Questions 25

You manage an Azure Machine Learning workspace.

You must define the execution environments for your jobs and encapsulate the dependencies for your code.

You need to configure the environment from a Docker build context.

How should you complete the rode segment? To answer, select the appropriate option in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 25

Options:

Buy Now

Questions 26

You create an Azure Machine Learning workspace named ML-workspace. You also create an Azure Databricks workspace named DB-workspace. DB-workspace contains a cluster named DB-cluster.

You must use DB-cluster to run experiments from notebooks that you import into DB-workspace.

You need to use ML-workspace to track MLflow metrics and artifacts generated by experiments running on DB-cluster. The solution must minimize the need for custom code.

What should you do?

Options:

From DB-cluster, configure the Advanced Logging option.

From DB-workspace. configure the Link Azure ML workspace option.

From ML-workspace. create an attached compute.

From ML-workspace. create a compute cluster.

Buy Now

Questions 27

You need to define a process for penalty event detection.

Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

DP-100 Question 27

Options:

Buy Now

Questions 28

You use Azure Machine Learning designer to create a real-time service endpoint. You have a single Azure Machine Learning service compute resource. You train the model and prepare the real-time pipeline for deployment You need to publish the inference pipeline as a web service. Which compute type should you use?

Options:

HDInsight

Azure Databricks

Azure Kubernetes Services

the existing Machine Learning Compute resource

a new Machine Learning Compute resource

Buy Now

Questions 29

You need to build a feature extraction strategy for the local models.

How should you complete the code segment? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 29

Options:

Buy Now

Questions 30

You need to modify the inputs for the global penalty event model to address the bias and variance issue.

Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

DP-100 Question 30

Options:

Buy Now

Questions 31

You need to define a process for penalty event detection.

Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

DP-100 Question 31

Options:

Buy Now

Questions 32

You need to implement a feature engineering strategy for the crowd sentiment local models.

What should you do?

Options:

Apply an analysis of variance (ANOVA).

Apply a Pearson correlation coefficient.

Apply a Spearman correlation coefficient.

Apply a linear discriminant analysis.

Buy Now

Questions 33

You need to define an evaluation strategy for the crowd sentiment models.

Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

DP-100 Question 33

Options:

Buy Now

Questions 34

You need to define an evaluation strategy for the crowd sentiment models.

Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

DP-100 Question 34

Options:

Buy Now

Questions 35

You need to implement a new cost factor scenario for the ad response models as illustrated in the

performance curve exhibit.

Which technique should you use?

Options:

Set the threshold to 0.5 and retrain if weighted Kappa deviates +/- 5% from 0.45.

Set the threshold to 0.05 and retrain if weighted Kappa deviates +/- 5% from 0.5.

Set the threshold to 0.2 and retrain if weighted Kappa deviates +/- 5% from 0.6.

Set the threshold to 0.75 and retrain if weighted Kappa deviates +/- 5% from 0.15.

Buy Now

Questions 36

You need to implement a scaling strategy for the local penalty detection data.

Which normalization type should you use?

Options:

Streaming

Weight

Batch

Cosine

Buy Now

Questions 37

You need to define a modeling strategy for ad response.

Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

DP-100 Question 37

Options:

Buy Now

Questions 38

You need to resolve the local machine learning pipeline performance issue. What should you do?

Options:

Increase Graphic Processing Units (GPUs).

Increase the learning rate.

Increase the training iterations,

Increase Central Processing Units (CPUs).

Buy Now

Questions 39

You need to select an environment that will meet the business and data requirements.

Which environment should you use?

Options:

Azure HDInsight with Spark MLlib

Azure Cognitive Services

Azure Machine Learning Studio

Microsoft Machine Learning Server

Buy Now

Questions 40

You need to implement a model development strategy to determine a user’s tendency to respond to an ad.

Which technique should you use?

Options:

Use a Relative Expression Split module to partition the data based on centroid distance.

Use a Relative Expression Split module to partition the data based on distance travelled to the event.

Use a Split Rows module to partition the data based on distance travelled to the event.

Use a Split Rows module to partition the data based on centroid distance.

Buy Now

Questions 41

You need to use the Python language to build a sampling strategy for the global penalty detection models.

How should you complete the code segment? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 41

Options:

Buy Now

Questions 42

You need to identify the methods for dividing the data according, to the testing requirements.

Which properties should you select? To answer, select the appropriate option-, m the answer area. NOTE: Each correct selection is worth one point.

DP-100 Question 42

Options:

Buy Now

Questions 43

You need to select a feature extraction method.

Which method should you use?

Options:

Spearman correlation

Mutual information

Mann-Whitney test

Pearson’s correlation

Buy Now

Questions 44

You need to configure the Permutation Feature Importance module for the model training requirements.

What should you do? To answer, select the appropriate options in the dialog box in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 44

Options:

Buy Now

Questions 45

You need to replace the missing data in the AccessibilityToHighway columns.

How should you configure the Clean Missing Data module? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 45

Options:

Buy Now

Questions 46

You need to identify the methods for dividing the data according to the testing requirements.

Which properties should you select? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 46

Options:

Buy Now

Questions 47

You need to correct the model fit issue.

Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

DP-100 Question 47

Options:

Buy Now

Questions 48

You need to configure the Edit Metadata module so that the structure of the datasets match.

Which configuration options should you select? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 48

Options:

Buy Now

Questions 49

You need to set up the Permutation Feature Importance module according to the model training requirements.

Which properties should you select? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 49

Options:

Buy Now

Questions 50

You need to implement early stopping criteria as suited in the model training requirements.

Which three code segments should you use to develop the solution? To answer, move the appropriate code segments from the list of code segments to the answer area and arrange them in the correct order.

NOTE: More than one order of answer choices is correct. You will receive credit for any of the correct orders you select.

DP-100 Question 50

Options:

Buy Now

Questions 51

You need to visually identify whether outliers exist in the Age column and quantify the outliers before the outliers are removed.

Which three Azure Machine Learning Studio modules should you use in sequence? To answer, move the appropriate modules from the list of modules to the answer area and arrange them in the correct order.

DP-100 Question 51

Options:

Buy Now

Questions 52

You need to produce a visualization for the diagnostic test evaluation according to the data visualization requirements.

Which three modules should you recommend be used in sequence? To answer, move the appropriate modules from the list of modules to the answer area and arrange them in the correct order.

DP-100 Question 52

Options:

Buy Now

Questions 53

You need to configure the Feature Based Feature Selection module based on the experiment requirements and datasets.

How should you configure the module properties? To answer, select the appropriate options in the dialog box in the answer area.

NOTE: Each correct selection is worth one point.

DP-100 Question 53

Options:

Buy Now

Exam Code: DP-100

Exam Name: Designing and Implementing a Data Science Solution on Azure

Last Update: May 16, 2024

Questions: 407

PDF + Testing Engine

$70 ~~$174.99~~

Testing Engine

$54 ~~$134.99~~

PDF (Q&A)

$48 ~~$119.99~~

Weekend Sale Limited Time 60% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 713PS592

marks4sure logo

Navigation:

DP-100 Designing and Implementing a Data Science Solution on Azure Questions and Answers

Options:

Answer:

Explanation:

Options:

Answer:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Options:

Answer:

Explanation:

Options:

Answer:

Options:

Answer:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Options:

Answer:

Explanation:

Options:

Answer:

Options:

Answer:

Explanation:

Options:

Answer:

Options:

Answer:

Options:

Answer:

Options:

Answer:

Explanation: