AIP-210 CertNexus Certified Artificial Intelligence Practitioner (CAIP) Questions and Answers
You create a prediction model with 96% accuracy. While the model ' s true positive rate (TPR) is performing well at 99%, the true negative rate (TNR) is only 50%. Your supervisor tells you that the TNR needs to be higher, even if it decreases the TPR. Upon further inspection, you notice that the vast majority of your data is truly positive.
What method could help address your issue?
You and your team need to process large datasets of images as fast as possible for a machine learning task. The project will also use a modular framework with extensible code and an active developer community. Which of the following would BEST meet your needs?
What is the open framework designed to help detect, respond to, and remediate threats in ML systems?
Given a feature set with rows that contain missing continuous values, and assuming the data is normally distributed, what is the best way to fill in these missing features?
Which of the following sentences is TRUE about the definition of cloud models for machine learning pipelines?
An AI system recommends New Year ' s resolutions. It has an ML pipeline without monitoring components. What retraining strategy would be BEST for this pipeline?
Which of the following describes a benefit of machine learning for solving business problems?
Which two of the following decrease technical debt in ML systems? (Select two.)
A data scientist is tasked to extract business intelligence from primary data captured from the public. Which of the following is the most important aspect that the scientist cannot forget to include?
Which of the following statements are true regarding highly interpretable models? (Select two.)
Your dependent variable Y is a count, ranging from 0 to infinity. Because Y is approximately log-normally distributed, you decide to log-transform the data prior to performing a linear regression.
What should you do before log-transforming Y?
You are developing a prediction model. Your team indicates they need an algorithm that is fast and requires low memory and low processing power. Assuming the following algorithms have similar accuracy on your data, which is most likely to be an ideal choice for the job?
We are using the k-nearest neighbors algorithm to classify the new data points. The features are on different scales.
Which method can help us to solve this problem?
Which of the following metrics is being captured when performing principal component analysis?
Which of the following scenarios is an example of entanglement in ML pipelines?
In which of the following scenarios is lasso regression preferable over ridge regression?
Which two techniques are used to build personas in the ML development lifecycle? (Select two.)
Which two encoders can be used to transform categorical data into numerical features? (Select two.)
Which of the following occurs when a data segment is collected in such a way that some members of the intended statistical population are less likely to be included than others?
