Cyber Monday Sale Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: pass65

CT-AI ISTQB Certified Tester AI Testing Exam Questions and Answers

Questions 4

A word processing company is developing an automatic text correction tool. A machine learning algorithm was used to develop the auto text correction feature. The testers have discovered that when they start typing “Isle of Wight” it fills in “Isle of Eight”. Several UAT testers have accepted this change without noticing. What type of bias is this?

Options:

A.

Geographical/Locality

B.

Automation/Complacency

C.

Complacency/Disregard

D.

Ignorance/Cognitive

Buy Now
Questions 5

Consider a natural language processing (NLP) algorithm that attempts to predict the next word that you would like to type in a text message. An update to the algorithm has been created that should increase the accuracy of the predictions based on user typing patterns. The old algorithm was rated for accuracy by the users. Then, after the new update was released, the users rated the updated algorithm. A statistical test was used to compare the two versions of the algorithm to see whether or not the update should remain in place.

This is an example of what type of testing?

Options:

A.

Metamorphic testing

B.

A/B testing

C.

Exploratory testing

D.

Pairwise testing

Buy Now
Questions 6

Which of the following characteristics of AI-based systems make it more difficult to ensure they are safe?

Options:

A.

Simplicity

B.

Sustainability

C.

Non-determinism

D.

Robustness

Buy Now
Questions 7

Pairwise testing can be used in the context of self-driving cars for controlling an explosion in the number of combinations of parameters.

Which ONE of the following options is LEAST likely to be a reason for this incredible growth of parameters?

SELECT ONE OPTION

Options:

A.

Different Road Types

B.

Different weather conditions

C.

ML model metrics to evaluate the functional performance

D.

Different features like ADAS, Lane Change Assistance etc.

Buy Now
Questions 8

Which statement about using AI to analyze reported defects is MOST correct?

Choose ONE option (1 out of 4)

Options:

A.

ML models trained with critical defect tickets can identify defects that cause serious consequences.

B.

ML models can support duplicate defect identification when checking defect criticality.

C.

ML models can identify categories for a reported defect during assignment.

D.

ML models identify developers who should handle a defect based on ticket content.

Buy Now
Questions 9

Which statement regarding the use of training, validation, and test data sets is correct?

Choose ONE option (1 out of 4)

Options:

A.

If only limited data is available, validation and test data sets can be combined in multiple ways during training.

B.

If limited data is available, it may be better to work without a separate test data set.

C.

Optimally, the data should be distributed equally between the training, validation, and test data sets.

D.

The data in the test data set must be equivalent to the data in the training data sets and to the data in the validation data sets.

Buy Now
Questions 10

Which ONE of the following models BEST describes a way to model defect prediction by looking at the history of bugs in modules by using code quality metrics of modules of historical versions as input?

SELECT ONE OPTION

Options:

A.

Identifying the relationship between developers and the modules developed by them.

B.

Search of similar code based on natural language processing.

C.

Clustering of similar code modules to predict based on similarity.

D.

Using a classification model to predict the presence of a defect by using code quality metrics as the input data.

Buy Now
Questions 11

Which of the following is a problem with AI-generated test cases that are generated from the requirements?

Options:

A.

They are slow and will usually not be able to execute in the time allowed

B.

They are defect-prone because they are unable to detect nuances in the requirements

C.

They make debugging more complicated because the number of steps is usually high in order to induce the target failure

D.

They are usually missing the expected results, so verification is difficult or must resort to only detecting significant failures

Buy Now
Questions 12

A system was developed for screening the X-rays of patients for potential malignancy detection (skin cancer). A workflow system has been developed to screen multiple cancers by using several individually trained ML models chained together in the workflow.

Testing the pipeline could involve multiple kind of tests (I - III):

I.Pairwise testing of combinations

II.Testing each individual model for accuracy

III.A/B testing of different sequences of models

Which ONE of the following options contains the kinds of tests that would be MOST APPROPRIATE to include in the strategy for optimal detection?

SELECT ONE OPTION

Options:

A.

Only III

B.

I and II

C.

I and III

D.

Only II

Buy Now
Questions 13

Which ONE of the following tests is MOST likely to describe a useful test to help detect different kinds of biases in ML pipeline?

SELECT ONE OPTION

Options:

A.

Testing the distribution shift in the training data for inappropriate bias.

B.

Test the model during model evaluation for data bias.

C.

Testing the data pipeline for any sources for algorithmic bias.

D.

Check the input test data for potential sample bias.

Buy Now
Questions 14

Which of the following approaches would help overcome testing challenges associated with probabilistic and non-deterministic AI-based systems?

Options:

A.

Run the test several times to ensure that the AI always returns the same correct test result

B.

Decompose the system test into multiple data ingestion tests to determine if the AI system is getting a sufficient volume of input data

C.

Decompose the system test into multiple data ingestion tests to determine if the AI system is getting precise and accurate input data

D.

Run the test several times to generate a statistically valid test result to ensure that an appropriate number of answers are accurate

Buy Now
Questions 15

An engine manufacturing facility wants to apply machine learning to detect faulty bolts. Which of the following would result in bias in the model?

Options:

A.

Selecting training data purposely excluding specific faulty conditions

B.

Selecting training data by purposely including all known faulty conditions

C.

Selecting testing data from a different dataset than the training dataset

D.

Selecting testing data from a boat manufacturer's bolt longevity data

Buy Now
Questions 16

Written requirements are given in text documents, which ONE of the following options is the BEST way to generate test cases from these requirements?

SELECT ONE OPTION

Options:

A.

Natural language processing on textual requirements

B.

Analyzing source code for generating test cases

C.

Machine learning on logs of execution

D.

GUI analysis by computer vision

Buy Now
Questions 17

Which ONE of the following combinations of Training, Validation, Testing data is used during the process of learning/creating the model?

SELECT ONE OPTION

Options:

A.

Training data - validation data - test data

B.

Training data - validation data

C.

Training data • test data

D.

Validation data - test data

Buy Now
Questions 18

Which ONE of the following characteristics is the least likely to cause safety related issues for an Al system?

SELECT ONE OPTION

Options:

A.

Non-determinism

B.

Robustness

C.

High complexity

D.

Self-learning

Buy Now
Questions 19

A software component uses machine learning to recognize the digits from a scan of handwritten numbers. In the scenario above, which type of Machine Learning (ML) is this an example of?

SELECT ONE OPTION

Options:

A.

Reinforcement learning

B.

Regression

C.

Classification

D.

Clustering

Buy Now
Questions 20

A ML engineer is trying to determine the correctness of the new open-source implementation *X", of a supervised regression algorithm implementation. R-Square is one of the functional performance metrics used to determine the quality of the model.

Which ONE of the following would be an APPROPRIATE strategy to achieve this goal?

SELECT ONE OPTION

Options:

A.

Add 10% of the rows randomly and create another model and compare the R-Square scores of both the model.

B.

Train various models by changing the order of input features and verify that the R-Square score of these models vary significantly.

C.

Compare the R-Square score of the model obtained using two different implementations that utilize two different programming languages while using the same algorithm and the same training and testing data.

D.

Drop 10% of the rows randomly and create another model and compare the R-Square scores of both the models.

Buy Now
Questions 21

Which of the following statements about explainable AI is correct?

Choose ONE option (1 out of 4)

Options:

A.

Interpretability refers to how easily users can determine whether the result provided by the AI-based system is correct

B.

Explainability refers to how easily the algorithms and training data needed to create the model can be determined

C.

According to The Royal Society, one reason for explainable AI is to increase user confidence in the system

D.

According to The Royal Society, one reason for explainable AI is to eliminate the need for risk and vulnerability assessments

Buy Now
Questions 22

Which ONE of the following statements is a CORRECT adversarial example in the context of machine learning systems that are working on image classifiers.

SELECT ONE OPTION

Options:

A.

Black box attacks based on adversarial examples create an exact duplicate model of the original.

B.

These attack examples cause a model to predict the correct class with slightly less accuracy even though they look like the original image.

C.

These attacks can't be prevented by retraining the model with these examples augmented to the training data.

D.

These examples are model specific and are not likely to cause another model trained on same task to fail.

Buy Now
Questions 23

Which ONE of the following options is the MOST APPROPRIATE stage of the ML workflow to set model and algorithm hyperparameters?

SELECT ONE OPTION

Options:

A.

Evaluating the model

B.

Deploying the model

C.

Tuning the model

D.

Data testing

Buy Now
Questions 24

Which data-labeling approach uses a two-step process where labeling is first done by a tool and then verified or completed by a human?

Choose ONE option (1 out of 4)

Options:

A.

Internal data labeling

B.

Crowdsourced data labeling

C.

Outsourced data labeling

D.

AI-assisted data labeling

Buy Now
Questions 25

Upon testing a model used to detect rotten tomatoes, the following data was observed by the test engineer, based on certain number of tomato images.

CT-AI Question 25

For this confusion matrix which combinations of values of accuracy, recall, and specificity respectively is CORRECT?

SELECT ONE OPTION

Options:

A.

0.87.0.9. 0.84

B.

1,0.87,0.84

C.

1,0.9, 0.8

D.

0.84.1,0.9

Buy Now
Questions 26

A bank wants to use an algorithm to determine which applicants should be given a loan. The bank hires a data scientist to construct a logistic regression model to predict whether the applicant will repay the loan or not. The bank has enough data on past customers to randomly split the data into a training dataset and a test/validation dataset. A logistic regression model is constructed on the training dataset using the following independent variables:

    Gender

    Marital status

    Number of dependents

    Education

    Income

    Loan amount

    Loan term

    Credit score

The model reveals that those with higher credit scores and larger total incomes are more likely to repay their loans. The data scientist has suggested that there might be bias present in the model based on previous models created for other banks.

Given this information, what is the best test approach to check for potential bias in the model?

Options:

A.

Experience-based testing should be used to confirm that the training data set is operationally relevant. This can include applying exploratory data analysis (EDA) to check for bias within the training data set.

B.

Back-to-back testing should be used to compare the model created using the training data set to another model created using the test data set. If the two models significantly differ, it will indicate there is bias in the original model.

C.

Acceptance testing should be used to make sure the algorithm is suitable for the customer. The team can re-work the acceptance criteria such that the algorithm is sure to correctly predict the remaining applicants that have been set aside for the validation dataset ensuring no bias is present.

D.

A/B testing should be used to verify that the test data set does not detect any bias that might have been introduced by the original training data. If the two models significantly differ, it will indicate there is bias in the original model.

Buy Now
Questions 27

Before deployment of an AI-based system, a developer is expected to demonstrate in a test environment how decisions are made. Which of the following characteristics does decision making fall under?

Options:

A.

Explainability

B.

Autonomy

C.

Self-learning

D.

Non-determinism

Buy Now
Questions 28

You have access to the training data that was used to train an AI-based system. You can review this information and use it as a guideline when creating your tests. What type of characteristic is this?

Options:

A.

Autonomy

B.

Explorability

C.

Transparency

D.

Accessibility

Buy Now
Questions 29

A mobile app start-up company is implementing an AI-based chat assistant for e-commerce customers. In the process of planning the testing, the team realizes that the specifications are insufficient.

Which testing approach should be used to test this system?

Options:

A.

Exploratory testing

B.

Static analysis

C.

Equivalence partitioning

D.

State transition testing

Buy Now
Questions 30

"Splendid Healthcare" has started developing a cancer detection system based on ML. The type of cancer they plan on detecting has 2% prevalence rate in the population of a particular geography. It is required that the model performs well for both normal and cancer patients.

Which ONE of the following combinations requires MAXIMIZATION?

SELECT ONE OPTION

Options:

A.

Maximize precision and accuracy

B.

Maximize accuracy and recall

C.

Maximize recall and precision

D.

Maximize specificity number of classes

Buy Now
Questions 31

Which statement regarding testing transparency, explainability, or interpretability is MOST correct?

Choose ONE option (1 out of 4)

Options:

A.

Tests for explainability and transparency are comparable to exploratory testing and can be performed with little information about development

B.

Since different users have different backgrounds, interpretability testing depends on the comprehensibility of the ML algorithm

C.

Dynamic testing is one way to quantify explainability; however, each method is specific to a particular model type

D.

LIME can precisely state the decisive reason for a change in the output

Buy Now
Questions 32

A wildlife conservation group would like to use a neural network to classify images of different animals. The algorithm is going to be used on a social media platform to automatically pick out pictures of the chosen animal of the month. This month’s animal is set to be a wolf. The test team has already observed that the algorithm could classify a picture of a dog as being a wolf because of the similar characteristics between dogs and wolves. To handle such instances, the team is planning to train the model with additional images of wolves and dogs so that the model is able to better differentiate between the two.

What test method should you use to verify that the model has improved after the additional training?

Options:

A.

Metamorphic testing because the application domain is not clearly understood at this point

B.

Adversarial testing to verify that no incorrect images have been used in the training

C.

Pairwise testing using combinatorics to look at a long list of photo parameters

D.

Back-to-back testing using the version of the model before training and the new version of the model after being trained with additional images

Buy Now
Questions 33

Which of the following statements about reinforcement learning is correct?

Choose ONE option (1 out of 4)

Options:

A.

The agent creates a model of the environment from labeled data during training

B.

The approach is suitable when the application doesnotrequire interaction with the environment

C.

The agent’s training is based on a reward function that rewards successful attempts

D.

From experience, the agent learns theoptimal reward function

Buy Now
Questions 34

Which of the following statements regarding experience-based testing for AI-based systems is correct?

Choose ONE option (1 out of 4)

Options:

A.

Intuitive test case design for AI-based systems involves interactive, hypothesis-driven examination of data for correlations or developmental trends.

B.

In checklist-based testing of AI-based systems, the existing test cases are dynamically adapted, for example based on metamorphic testing.

C.

Exploratory testing is often used for AI-based systems because there are often insufficient specifications or problems with the test oracle for AI-based systems.

D.

Tour refers to intuitive test case design for AI-based systems based on multiple, sequential test cases using systematically biased training data.

Buy Now
Questions 35

Which option describes a reasonable application of AIB testing for a self-learning system after it has changed its behavior due to user input?

Choose ONE option (1 out of 4)

Options:

A.

Generating test cases for the system before and after the change, since neither has a test oracle

B.

Comparing outputs before and after the change using different inputs

C.

Comparing outputs before and after the change using identical inputs

D.

Comparing outputs of a non-self-learning system with those of the changed self-learning system

Buy Now
Questions 36

Which of the following is an example of overfitting?

Options:

A.

The model is not able to generalize to accommodate new types of data

B.

The model is too simplistic for the data

C.

The model is missing relationships between the inputs and outputs

D.

The model discards data it considers to be noise or outliers

Buy Now
Exam Code: CT-AI
Exam Name: ISTQB Certified Tester AI Testing Exam
Last Update: Dec 2, 2025
Questions: 120

PDF + Testing Engine

$63.52  $181.49

Testing Engine

$50.57  $144.49
buy now CT-AI testing engine

PDF (Q&A)

$43.57  $124.49
buy now CT-AI pdf