透過您的圖書館登入
IP:216.73.216.210
  • 學位論文

群性資料之潛藏排序法

Latent Seriation Models for Cluster Data

指導教授 : 江金倉

摘要


本論文針對所觀測群體未知的健康程度或生理機能狀況提出排序方法,其主要考量之資料結構為一群性資料。在此,利用潛藏因子作為個體實際的表徵。對此排序問題,一些適當且廣為使用的潛藏因子及群性量測資料聯合機率模型將作為預測個體潛藏因子數值依據。至於排序方法的評估,我們利用參數化自助重取法產生的潛藏因子數值估計排序指標。此外,更進一步提供簡便及快速計算之估計流程。最後,我們藉助模擬檢視所提出之排序及評估方法,並將此方法應用於一流行病學研究之長期追蹤資料上。

並列摘要


Based on cluster measurements, our research interest mainly focuses on seriating the uncertainty in the degree of health or functioning of the body for collected subjects. For this problem, a latent variable is used to represent an unobserved seriation. In this thesis, some adequately and widely used joint models of a latent variable and cluster measurements are proposed to predict the most possible occurring value of a latent variable, which is taken in our seriation procedure. Since a latent variable is considered in modeling, a popular expectation and maximization (EM) algorithm is implemented for the estimation of parameters in the observed likelihood function. Moreover, a parametric bootstrapping method is considered to generate latent values and bootstrap samples, which are used to estimate seriation indices such as the correlation and concordance proportion in the evaluation of seriation. To examine the performance of the developed procedures, a class of simulations is conducted. From the numerical studies, we further detect that the evaluation indices computed based on the maximum likelihood estimators or the true parameters are very close although the accuracy of estimators relies on the sample size. Thus, a computationally efficient approach is proposed to estimate seriation indices. Finally, the seriation and evaluation procedures are applied to a CD4 depletion study.

參考文獻


Breslow, N. E. and Clayton, D. G. (1993). Approximate
inference in generalized linear mixed models. Journal of the
American Statistical Association. 88, 125-134.
generalized linear mixed models with a single component of
dispersion. Biometrika. 82, 81-92.

延伸閱讀


國際替代計量