# AWS Certified Machine Learning - Specialty: Question 426

**Type:** multiple_choice
**Topics:** topic_1

## Question

A Data Scientist is developing a binary classifier to predict whether a patient has a particular disease based on a series of test results. The Data Scientist has data on 400 patients randomly selected from the population. The disease is seen in 3% of the population.
Which cross-validation strategy should the Data Scientist adopt?

## Correct Answer

B (stratified k-fold cross-validation)

## Explanation

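**B.** Stratified k-fold cross-validation keeps the class distribution in each fold approximately equal to the distribution in the complete training dataset. With 400 patients and a 3% prevalence, only about 12 positive cases are expected, so plain (unstratified) k-fold splitting could leave some folds with very few or no positive cases, which makes the per-fold evaluation unreliable.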

**Reference:** examtopics_top_comment
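
The stratification described in the explanation can be sketched in plain Python. This is a minimal illustration, not the exam's or any library's implementation: the function name `stratified_k_fold`, the choice of k = 5, the seed, and the synthetic labels (exactly 12 positives out of 400, the expected count at 3% prevalence) are all assumptions made for the example.

```python
import random


def stratified_k_fold(labels, k, seed=0):
    """Split indices into k folds, spreading each class evenly across folds.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    folds = [[] for _ in range(k)]
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Deal indices round-robin so that, within a class,
        # fold counts differ by at most one.
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)
    return folds


# Assumed synthetic data: 400 patients, 12 positives (3% of 400).
labels = [1] * 12 + [0] * 388

folds = stratified_k_fold(labels, k=5)  # k=5 is an assumption
positives_per_fold = [sum(labels[i] for i in fold) for fold in folds]
fold_sizes = [len(fold) for fold in folds]

print("positives per fold:", positives_per_fold)
print("fold sizes:", fold_sizes)
```

Because the positives are dealt out per class, every fold receives two or three of the 12 positive cases, whereas a purely random split gives no such guarantee. In practice, scikit-learn's `StratifiedKFold` (in `sklearn.model_selection`) provides this behavior without hand-written code.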

---
Source: https://hiexam.net/q/amazon/AWS-Certified-Machine-Learning---Specialty/426  