Missing Data Imputation
Data Preprocessing Missing Data Imputation Why are their Missing values?? Survey--Depression Survey They hesitate to put down the information Survey informations are not that valid Men--salary Women---age People may have died----NAN What are the different types of Missing Data? Missing Completely at Random, MCAR: A variable is missing completely at random (MCAR) if the probability of being missing is the same for all the observations. When data is MCAR, there is absolutely no relationship between the data missing and any other values, observed or missing, within the dataset. In other words, those missing data points are a random subset of the data. There is nothing systematic going on that makes some data more likely to be missing than other. e.g. df [ df [ 'Embarked' ] . isnull ()] PassengerId Survived Pclass Name Sex Age SibSp Parch Ticket Fare Cabin Embarked 61 62 1 1 Icard, Miss. Amelie female 38.0 0 0 113572 80.0 B28 NaN 829 830 1 1 Stone, Mrs. George Nelson (Mar...