透過您的圖書館登入
IP:18.188.108.54
  • 學位論文

遺失資料之多重插補法模擬比較研究

Simulation Study for Missing Data with Multiple Imputation

指導教授 : 劉清

摘要


本研究重點為探討拔靴法(bootstrap method)、資料擴增法(data augmentation)與多重插補法(multiple imputation)對遺失資料(missing data)進行區間估計的成效。一般處理遺失值的方式有很多種,但插補法則較多統計學者使用,插補法可分為單一插補法(single imputation)及多重插補法。在早期因為單一插補法較為方便,所以較為多人所使用,而今日許多軟體漸漸提供多重插補法,讓使用者可以爲遺失值填入多個比單一插補法更有效的值。本研究利用模擬資料,同時用三種方法進行平均數的區間估計,以證明多重插補法的參數估計能力,可以達到與拔靴法的同樣效果。 本研究模擬三種型態的多元常態分布資料,並且設定不同的遺失比例,分別進行拔靴法、資料擴增法與多重插補三種方法的信賴區間估計。結果發現拔靴法在信賴區間估計上,無論樣本大小或是遺失資料比例為何,其估計能力都有一定的水準;資料擴增法則是在小樣本的信賴區間估計較大樣本好,但無論遺失比例為何,資料擴增法都是三種方法中出現最多錯誤估算的;多重插補法的估計效果和拔靴法相當,無論樣本大小或是遺失比例為何,信賴區間幾乎都涵蓋母體參數,尤其在遺失比例高達50%時,多重插補法依舊能準確地估計母體參數,顯示多重插補法所產生的參數估計是可以信賴的。

並列摘要


The purpose of this study is to compare the difference between the method of bootstrap, a data augmentation and multiple imputation for estimating the confidence interval of missing data. In general, there are several methods dealing with missing data, but imputation method is usually used by statisticians. The imputation method can be divided into single imputation and multiple imputation. In early years, single imputation is more convenient. Now lots of software provides the procedure of multiple imputation. The confidence intervals for mean were established by the three methods. Then we show that the multiple imputation is as efficient as bootstrap method. We simulate data from multivariate normal distribution with three different sample sizes, and set different missing rates it was found that the coverage probability for bootstrap method is approximate to the confidence coefficient. The data augmentation is inferior to both bootstrap method and multiple imputation. The result of the multiple imputation is similar to the bootstrap method. The multiple imputation still estimates the parameters accurately even for high missing rate.

參考文獻


陳信木、林佳瑩(1996)。調查資料之遺漏值的處理-以熱卡插補法為例。調查研究,第3卷:75-106。
Barnard J, and Rubin D.B. (1999), Small sample degrees of freedom with multiple imputation, Biometrika 86: 948-955.
Efron, B. (1979), Bootstrap methods: another Look at the Jackknife, The Annals of Statistics 7: 1-26.
Efron, B. (1994), Missing data, imputation, and the bootstrap, Journal of the American Statistical Association 89: 463-478.
Little, R.J.A., and Rubin, D.B. (1987), Statistical Analysis with Missing data, New York: J. Wiley.

被引用紀錄


程于庭(2013)。遺失資料插補法在最適資產配置投資組合上之應用與比較-以臺灣證券市場為例〔碩士論文,淡江大學〕。華藝線上圖書館。https://doi.org/10.6846/TKU.2013.00602
劉畢琳(2010)。多重插補法在非完整資料統計分析上之應用〔碩士論文,國立臺灣大學〕。華藝線上圖書館。https://doi.org/10.6342/NTU.2010.00875
洪靜茹(2013)。缺失資料處理對多變量變異數分析(MANOVA)的影響〔碩士論文,國立臺北大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0023-0602201318131300

延伸閱讀