Missing answers
Amaya’s company surveyed a large group of people. After collecting and digitizing all of the data, she noticed that many participants didn’t answer one particular question.
It was clear to Amaya that not disclosing the answer to the question was as important as the answer itself, so she wanted to consider this when preparing the data.
Amaya is ready to build a machine learning model but must first deal with the missing answers.
Which of the following should be the best strategy that Amaya should pursue?
Amaya should remove the column that contains the missing values.
Amaya should add a new column to the dataset to flag rows with missing values and then replace those values with a reasonable answer.
Amaya should replace the missing values with a reasonable answer.
Amaya should predict the missing values using a separate machine learning model.