
Direct and Unbiased Multiple Imputation Methods for Missing Values of Categorical Variables


Abstract


Missing data is a common problem in statistical analyses. To make use of the information in incompletely observed data, missing values can be imputed so that standard statistical methods can be applied. Variables with missing values are often categorical, and the missing pattern may not be monotone. Commonly used imputation methods for data with a non-monotone missing pattern currently do not allow direct inclusion of categorical variables; instead, categorical variables are converted to numerical variables before imputation. For many applications, the imputed numerical values must then be converted back to categorical values. However, this conversion introduces bias that can seriously affect subsequent analyses. In this paper, we propose two direct imputation methods for categorical variables with a non-monotone missing pattern: the direct imputation approach combined with the expectation-maximization algorithm, and the direct imputation approach combined with a new algorithm, the imputation-maximization algorithm. Simulation studies show that both methods perform better than the method using variable conversion. An application to real data compares the direct imputation method with the method using variable conversion.
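
The conversion bias described above can be illustrated with a small simulation. The sketch below is not the paper's method: it uses neither the expectation-maximization nor the imputation-maximization algorithm, and it considers a single variable only. It assumes a hypothetical three-category variable coded 0, 1, 2 with made-up frequencies, imputes it once by drawing from a fitted normal distribution and rounding back to a category, and once by drawing directly from the observed category frequencies. All function names and numbers are illustrative assumptions.

```python
import random
from collections import Counter
from statistics import mean, pstdev


def impute_via_conversion(observed, n_missing, rng):
    """Treat category codes as numbers, draw from a fitted normal, round back."""
    mu, sigma = mean(observed), pstdev(observed)
    lo, hi = min(observed), max(observed)
    draws = (rng.gauss(mu, sigma) for _ in range(n_missing))
    # Round to the nearest code and clip to the valid range of categories.
    return [min(hi, max(lo, round(x))) for x in draws]


def impute_directly(observed, n_missing, rng):
    """Draw categories from their observed relative frequencies."""
    counts = Counter(observed)
    cats = sorted(counts)
    return rng.choices(cats, weights=[counts[c] for c in cats], k=n_missing)


def proportions(values, cats):
    """Return the share of each category in `values`."""
    counts = Counter(values)
    return {c: counts[c] / len(values) for c in cats}


rng = random.Random(0)
cats = [0, 1, 2]
# Hypothetical observed data: the middle category is rare.
observed = [0] * 500 + [1] * 100 + [2] * 400

truth = proportions(observed, cats)
converted = proportions(impute_via_conversion(observed, 20000, rng), cats)
direct = proportions(impute_directly(observed, 20000, rng), cats)

for c in cats:
    print(c, round(truth[c], 3), round(converted[c], 3), round(direct[c], 3))
```

In this toy setting the rounded normal draws place far too many imputed values in the rare middle category, while the direct draws track the observed frequencies closely. The example only shows why converting back to categories can distort a distribution; it says nothing about how large the effect is in the paper's simulations.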
