透過您的圖書館登入
IP:18.218.234.83
  • 期刊

缺失資料在因素分析上的處理方法之研究

Missing Data Techniques for Factor Analysis

摘要


因素分析常用來研究問卷及量表。當資料缺失過多或缺失機制為非完全隨機時,分析所得的共同因素個數或因素負荷常有偏差。本研究使用「台灣教育長期追蹤資料庫」,將其中的完整資料視為基準資料,並根據原有缺失結構,建構一至五倍缺失比率的資料集,以探討因素分析對缺失插補的敏感度。研究者比較了四種缺失處理法,包括:可用個體法、完整個體法、邏輯斯迴歸插補法與蒙第卡羅-馬可夫鏈(Monte Carlo Markov Chain, MCMC)插補法。結果顯示,缺失比率愈高時,所估計出來的變異數矩陣與基準資料的矩陣差異愈大。可用個體法在缺失比率較高時,萃取的共同因子的個數比基準資料多。在因素負荷上,可用個體法的誤差最嚴重,而完整個體法雖然和其他兩種插補法的誤差接近,不過會因缺失比率的增加與基準的誤差而隨之變大。研究者建議在缺失比率20%~30%或以上時,使用邏輯斯迴歸插補法或是蒙第卡羅-馬可夫鏈插補法後再進行因素分析會有較小的誤差。

並列摘要


Factor analysis is frequently employed to analyze scales and questionnaires. However, when the proportion of missing data is high or the missing data are not random, the number of factors extracted can be biased. We used the Taiwan Education Panel Survey (TEPS) and constructed 5 data sets with different missing proportions to assess the effects of missingness on factor analysis imputation. Complete observed data were used as a baseline for comparison. We compared the 4 treatments: available case method (AC), the complete case method (CC), MCMC single imputation (MCMC), and step-wise logistic regression single imputation (LR). The results show that the higher the missing proportion, the greater the discrepancy between the covariance matrix of the constructed data set and that of the baseline. For the AC method, the higher the proportion of missing data, the more the number of extracted factors exceeds that of the baseline. The AC method possessed the largest bias in factor loadings. The bias in factor loading of the CC method increased as the missing portion also increased. Thus, we recommend not applying the list-wise deletion method for factor analysis when the missing proportion is 20% or more.

參考文獻


張苙雲()。,未出版。
Allison, P. D.(2000).Multiple imputation for missing data: A cautionary tale.Sociological Methods and Research.28(3),301-309.
Allison, P. D.(2003).Missing data techniques for structural equation modeling.Journal of Abnormal Psychology.112(4),545-557.
Anderson, T. W.(2003).An introduction to multivariate statistical analysis.New York, NY:John Wiley & Sons.
Bernaards, C. A.,Sijtsma, K.(2000).Influence of imputation and EM methods on factor analysis when item non-response in questionnaire data is non-ignorable.Multivariate Behavioral Research.35(3)

被引用紀錄


江文楷(2013)。大量資料遺失情形下 缺失資料處理方法效用之研究- 以 「 青少年家庭中的親職壓力與親子衝突 」 研究資料為例〔碩士論文,國立臺北大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0023-1108201321263900
洪靜茹(2013)。缺失資料處理對多變量變異數分析(MANOVA)的影響〔碩士論文,國立臺北大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0023-0602201318131300
廖志峰(2013)。兩波成對資料之缺失資料處理方法對交叉延宕分析影響之研究-以「自我決定歷程與愛情依戀對約會暴力為之影響」研究資料為例〔碩士論文,國立臺北大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0023-1508201321351700

延伸閱讀