PM2.5時空資料的降維分解模型與預測

本論文所感興趣的研究議題為PM2.5預測分析，近年來空氣汙染問題日益嚴重，PM2.5為一個重要的空氣汙染物指標，其預測尤其重要。本文以Airbox 計畫所提供的PM2.5資料集做預測分析，該資料集有著觀測站數多、觀測時間不規律和觀測誤差過大等問題，使分析極具挑戰。本文提供了一種降維分解模型來進行該資料的分析與預測，模型分為兩部分包含固定效應的平均結構與時空隨機效應，分別以降維後再投影至空間與時間的基底等方式建模，並在時空隨機效應裡加入時間的動態結構，進而藉由kalman filter輔助獲得空間與時間上的PM2.5一步或多步預測值與預測誤差。最後應用PM2.5資料集演示模型預測的結果。

關鍵字

降維分解；時空資料；動態結構

並列摘要

In recent years, air pollution becomes a serious problem in Taiwan, in particular PM2.5 plays an important role to affect the public health. This thesis studies the topic of PM2.5 forecast. The data used in this study is from AirBox Project which collects high-frequency data from more than one thousand small measurement devices using IoT technologies. The data are available instantaneously but very irregular in time, having excessive observation errors and many missing data. This study suggests a reduced-rank decomposition model to analyze AirBox data. The model consists two parts. The mean structure of daily pattern is specified via a linear combination of products of spatial eigen-functions and temporal (hourly) eigen-functions obtained via singular value decomposition. The dependence structure is specified via the fixed rank spatial-temporal random effect model. For parameter estimation, the method of moments is used. Given the model with estimated parameters, the kalman filter is used to generate the map of the best linear spatial prediction and their prediction errors for the one-step-ahead and multi-step-ahead PM2.5 values. The methodology is demonstrated using the data at south Taiwan.

並列關鍵字

Reduced-rank ； spatial and temporal data ； stata space model

參考文獻

[1] Chen, T-L, Huang, S-Y, Hung, H., Tu, I-P. (2014). An introduction to multilinear principal component analysis. Journal of the Chinese Statistical Association, 52, 24-43.

Google Scholar

[2] Crainiceanu, C.M., Caffo, B.S., Luo, S., Zipunnikov, V.M., and Punjabi, N.M. (2011). Population value decomposition, a framework for the analysis of image populations. Journal of the American Statistical Association, 106, 775–790.

Google Scholar

[3] Cressie, N. and Johannesson, G. (2008). Fixed rank kriging for very large spatial data sets. Journal of the Royal Statistical Society Series B, 70, 209-226.

Google Scholar

[4] Cressie, N., Shi, T. and Kang, E. L. (2010). Fixed rank filtering for spatio-temporal data. Journal of Computational and Graphical Statistics, 19, 724-745.

Google Scholar

[5] Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D. and Altman, R. B. (2001). Missing value estimation methods for DNA microarrays. Bioinformatics, 17, 520–525.

Google Scholar

國際替代計量

PM2.5時空資料的降維分解模型與預測

全文下載

主題瀏覽