A word processing company is developing an automatic text correction tool. A machine learning algorithm was used to develop the auto text correction feature. The testers have discovered that when they start typing “Isle of Wight” it fills in “Isle of Eight”. Several UAT testers have accepted this change without noticing. What type of bias is this?
Consider a natural language processing (NLP) algorithm that attempts to predict the next word that you would like to type in a text message. An update to the algorithm has been created that should increase the accuracy of the predictions based on user typing patterns. The old algorithm was rated for accuracy by the users. Then, after the new update was released, the users rated the updated algorithm. A statistical test was used to compare the two versions of the algorithm to see whether or not the update should remain in place.
This is an example of what type of testing?
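The comparison described in this scenario can be sketched as a two-sample permutation test on the mean user rating. This is only a minimal illustration: the rating values, the function name `permutation_test`, and the choice of a permutation test (rather than whatever statistical test the scenario's team actually used) are all assumptions. It also treats the two sets of ratings as independent samples.

```python
import random

def permutation_test(old_ratings, new_ratings, n_resamples=5000, seed=0):
    """Two-sided permutation test on the difference of mean ratings.

    Returns (observed difference, estimated p-value).
    """
    rng = random.Random(seed)
    n_old = len(old_ratings)
    observed = sum(new_ratings) / len(new_ratings) - sum(old_ratings) / n_old
    pooled = list(old_ratings) + list(new_ratings)
    extreme = 0
    for _ in range(n_resamples):
        # Randomly reassign ratings to the "old" and "new" groups.
        rng.shuffle(pooled)
        old_part, new_part = pooled[:n_old], pooled[n_old:]
        diff = sum(new_part) / len(new_part) - sum(old_part) / len(old_part)
        if abs(diff) >= abs(observed):
            extreme += 1
    return observed, extreme / n_resamples

# Hypothetical 1-5 accuracy ratings collected before and after the update.
old_version = [3, 4, 2, 3, 3, 4, 2, 3, 3, 2]
new_version = [4, 5, 4, 4, 3, 5, 4, 4, 5, 4]

observed_diff, p_value = permutation_test(old_version, new_version)
print(f"mean rating change: {observed_diff:.2f}, p-value: {p_value:.4f}")
```

A small p-value suggests the rating difference is unlikely to be due to chance alone, which is the kind of evidence used to decide whether the update stays in place.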
Which of the following characteristics of AI-based systems make it more difficult to ensure they are safe?
Pairwise testing can be used in the context of self-driving cars for controlling an explosion in the number of combinations of parameters.
Which ONE of the following options is LEAST likely to be a reason for this incredible growth of parameters?
SELECT ONE OPTION
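To illustrate the scale of the problem, the sketch below compares the number of exhaustive parameter combinations with the number of value pairs that a pairwise test suite has to cover. The parameter names and values are hypothetical and far smaller than what a real self-driving system would involve.

```python
from itertools import combinations, product
from math import prod

def exhaustive_count(parameters):
    """Number of test cases needed to cover every combination of values."""
    return prod(len(values) for values in parameters.values())

def required_pairs(parameters):
    """All value pairs (across two different parameters) that pairwise testing must cover."""
    pairs = set()
    for (name_a, vals_a), (name_b, vals_b) in combinations(parameters.items(), 2):
        for a, b in product(vals_a, vals_b):
            pairs.add(((name_a, a), (name_b, b)))
    return pairs

# Hypothetical environment parameters for a driving scenario.
parameters = {
    "weather": ["clear", "rain", "snow"],
    "road": ["highway", "urban", "rural"],
    "lighting": ["day", "night"],
    "traffic": ["light", "heavy"],
}

print("exhaustive combinations:", exhaustive_count(parameters))
print("value pairs to cover:", len(required_pairs(parameters)))
```

Each test case covers several pairs at once, so a pairwise suite needs far fewer cases than the exhaustive count, and the gap widens quickly as parameters are added.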
Which statement about using AI to analyze reported defects is MOST correct?
Choose ONE option (1 out of 4)
Which statement regarding the use of training, validation, and test data sets is correct?
Choose ONE option (1 out of 4)
Which ONE of the following models BEST describes a way to model defect prediction by looking at the history of bugs in modules by using code quality metrics of modules of historical versions as input?
SELECT ONE OPTION
Which of the following is a problem with AI-generated test cases that are generated from the requirements?
A system was developed to screen patients' X-rays for potential malignancy (skin cancer). A workflow system has been developed to screen for multiple cancers by chaining together several individually trained ML models.
Testing the pipeline could involve multiple kinds of tests (I - III):
I. Pairwise testing of combinations
II. Testing each individual model for accuracy
III. A/B testing of different sequences of models
Which ONE of the following options contains the kinds of tests that would be MOST APPROPRIATE to include in the strategy for optimal detection?
SELECT ONE OPTION
Which ONE of the following tests is MOST likely to describe a useful test to help detect different kinds of biases in an ML pipeline?
SELECT ONE OPTION
Which of the following approaches would help overcome testing challenges associated with probabilistic and non-deterministic AI-based systems?
An engine manufacturing facility wants to apply machine learning to detect faulty bolts. Which of the following would result in bias in the model?
Written requirements are given in text documents. Which ONE of the following options is the BEST way to generate test cases from these requirements?
SELECT ONE OPTION
Which ONE of the following combinations of Training, Validation, Testing data is used during the process of learning/creating the model?
SELECT ONE OPTION
Which ONE of the following characteristics is the LEAST likely to cause safety-related issues for an AI system?
SELECT ONE OPTION
A software component uses machine learning to recognize the digits from a scan of handwritten numbers. In the scenario above, which type of Machine Learning (ML) is this an example of?
SELECT ONE OPTION
An ML engineer is trying to determine the correctness of a new open-source implementation, "X", of a supervised regression algorithm. R-squared is one of the functional performance metrics used to determine the quality of the model.
Which ONE of the following would be an APPROPRIATE strategy to achieve this goal?
SELECT ONE OPTION
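For reference, R-squared compares the residual sum of squares of the model's predictions with the total sum of squares around the mean of the observed values. A minimal sketch follows; the function name `r_squared` and the sample values are hypothetical and not tied to implementation "X".

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Hypothetical observed values and predictions from some regression model.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]

print(f"R-squared: {r_squared(observed, predicted):.3f}")
```

A value of 1 means the predictions match the observations exactly, while a model that always predicts the mean of the observations scores 0.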
Which of the following statements about explainable AI is correct?
Choose ONE option (1 out of 4)
Which ONE of the following statements describes a CORRECT adversarial example in the context of machine learning systems that work as image classifiers?
SELECT ONE OPTION
Which ONE of the following options is the MOST APPROPRIATE stage of the ML workflow to set model and algorithm hyperparameters?
SELECT ONE OPTION
Which data-labeling approach uses a two-step process where labeling is first done by a tool and then verified or completed by a human?
Choose ONE option (1 out of 4)
Upon testing a model used to detect rotten tomatoes, the following data was observed by the test engineer, based on a certain number of tomato images.

For this confusion matrix, which combination of values for accuracy, recall, and specificity, respectively, is CORRECT?
SELECT ONE OPTION
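The three metrics can be derived from the four cells of a binary confusion matrix as sketched below. The counts used here are hypothetical placeholders, not the values from the question, and "rotten" is assumed to be the positive class.

```python
def confusion_metrics(tp, fp, fn, tn):
    """Return (accuracy, recall, specificity) for a binary confusion matrix."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total      # share of all images classified correctly
    recall = tp / (tp + fn)           # share of rotten tomatoes that were detected
    specificity = tn / (tn + fp)      # share of good tomatoes correctly passed
    return accuracy, recall, specificity

# Hypothetical counts for 100 tomato images (positive class = rotten).
accuracy, recall, specificity = confusion_metrics(tp=40, fp=10, fn=20, tn=30)
print(f"accuracy={accuracy:.2f}, recall={recall:.2f}, specificity={specificity:.2f}")
```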
A bank wants to use an algorithm to determine which applicants should be given a loan. The bank hires a data scientist to construct a logistic regression model to predict whether the applicant will repay the loan or not. The bank has enough data on past customers to randomly split the data into a training dataset and a test/validation dataset. A logistic regression model is constructed on the training dataset using the following independent variables:
Gender
Marital status
Number of dependents
Education
Income
Loan amount
Loan term
Credit score
The model reveals that those with higher credit scores and larger total incomes are more likely to repay their loans. The data scientist has suggested that there might be bias present in the model based on previous models created for other banks.
Given this information, what is the best test approach to check for potential bias in the model?
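One way to probe for this kind of bias is to compare the model's predicted outcomes across groups of a sensitive attribute such as Gender. The sketch below is a minimal illustration under stated assumptions: the record layout, the field names `gender` and `approved`, and the data are hypothetical, and a gap in rates is a prompt for further investigation rather than proof of bias.

```python
from collections import defaultdict

def approval_rates(records, attribute):
    """Share of positive predictions for each value of the given attribute."""
    counts = defaultdict(lambda: [0, 0])  # group -> [approved, total]
    for record in records:
        group = record[attribute]
        counts[group][0] += record["approved"]
        counts[group][1] += 1
    return {group: approved / total for group, (approved, total) in counts.items()}

# Hypothetical model predictions on the held-out data (1 = predicted to repay).
predictions = [
    {"gender": "female", "approved": 1},
    {"gender": "female", "approved": 0},
    {"gender": "female", "approved": 0},
    {"gender": "female", "approved": 0},
    {"gender": "male", "approved": 1},
    {"gender": "male", "approved": 1},
    {"gender": "male", "approved": 1},
    {"gender": "male", "approved": 0},
]

rates = approval_rates(predictions, "gender")
ratio = min(rates.values()) / max(rates.values())
print(rates, f"lowest-to-highest rate ratio: {ratio:.2f}")
```

The same comparison can be repeated for other attributes, for example marital status, and for error rates rather than approval rates.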
Before deployment of an AI-based system, a developer is expected to demonstrate in a test environment how decisions are made. Which of the following characteristics does decision making fall under?
You have access to the training data that was used to train an AI-based system. You can review this information and use it as a guideline when creating your tests. What type of characteristic is this?
A mobile app start-up company is implementing an AI-based chat assistant for e-commerce customers. In the process of planning the testing, the team realizes that the specifications are insufficient.
Which testing approach should be used to test this system?
"Splendid Healthcare" has started developing a cancer detection system based on ML. The type of cancer they plan on detecting has 2% prevalence rate in the population of a particular geography. It is required that the model performs well for both normal and cancer patients.
Which ONE of the following combinations requires MAXIMIZATION?
SELECT ONE OPTION
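With a 2% prevalence, the two classes are heavily imbalanced, and individual metrics can paint very different pictures. The sketch below evaluates a trivial, hypothetical classifier that labels every patient as normal on an assumed population of 1,000 patients.

```python
def evaluate(labels, predictions):
    """Accuracy, recall, and specificity from parallel lists of 0/1 values."""
    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y == 1 and p == 1)
    tn = sum(1 for y, p in pairs if y == 0 and p == 0)
    fp = sum(1 for y, p in pairs if y == 0 and p == 1)
    fn = sum(1 for y, p in pairs if y == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "recall": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }

# Assumed population: 1,000 patients, 2% with cancer (label 1).
labels = [1] * 20 + [0] * 980
# Hypothetical degenerate model: predicts "normal" (0) for everyone.
always_normal = [0] * 1000

metrics = evaluate(labels, always_normal)
print(metrics)
```

Accuracy comes out at 98% even though no cancer patient is detected, which is why accuracy alone is a poor guide for this system.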
Which statement regarding testing transparency, explainability, or interpretability is MOST correct?
Choose ONE option (1 out of 4)
A wildlife conservation group would like to use a neural network to classify images of different animals. The algorithm is going to be used on a social media platform to automatically pick out pictures of the chosen animal of the month. This month’s animal is set to be a wolf. The test team has already observed that the algorithm could classify a picture of a dog as being a wolf because of the similar characteristics between dogs and wolves. To handle such instances, the team is planning to train the model with additional images of wolves and dogs so that the model is able to better differentiate between the two.
What test method should you use to verify that the model has improved after the additional training?
Which of the following statements about reinforcement learning is correct?
Choose ONE option (1 out of 4)
Which of the following statements regarding experience-based testing for AI-based systems is correct?
Choose ONE option (1 out of 4)
Which option describes a reasonable application of A/B testing for a self-learning system after it has changed its behavior due to user input?
Choose ONE option (1 out of 4)