Infer — Static Imputation
-
Static Imputation resembles the Replace By (Mean/Modal Imputation) method but differs in three important aspects:
-
The buttons under
Infer
are available whenever a variable with missing values is selected in the Data Panel.
- While Replace By (Mean/Modal Imputation) is deterministic, Static Imputation performs random draws from the marginal distributions of the observed data and saves these randomly drawn values as “placeholder values.”
- The imputation is only performed internally, and BayesiaLab still “remembers” exactly which observations are missing.
- Whereas Replace By (Mean/Modal Imputation) can be applied to individual variables, any of the options under Infer apply to all variables with missing values, with the exception of those that have already been processed by Filter or Replace By (Mean/Modal Imputation).
Although this probabilistic imputation method is not optimal at the observation/individual level (it is not the rational decision for minimizing the prediction error), it is optimal at the dataset/population level.
As illustrated below, drawing the imputed values from the current distribution keeps the distributions of variables pre and post-processing the same. As a result, Static Imputation returns distributions that match the ones produced by Filter but without deleting any observations. As no records are discarded, Static Imputation does not introduce any additional biases. However, the distributions of X2 (MAR) and X4 (MNAR) remain strongly biased.