透過您的圖書館登入
IP:3.141.8.247
  • 學位論文

結合生物知識的橢圓排序導引階層分群樹於基因微陣列資料的群集分析

Incorporating Biological Knowledge into Hierarchical Clustering Tree into Rank-Two Ellipse Seriation in Gene Expression Profiles

指導教授 : 吳漢銘

摘要


橢圓排序導引階層分群樹(HCT-R2E)應用在基因表現資料的矩陣視覺 化及群集分析上,是一種很有效的方法。它可以同時對基因表現資料提供較一致的局部群集和較佳的全域群組狀態。然而和傳統的數理式的群集分析一樣,橢圓排序導引階層分群樹方法僅利用到基因微陣列表現資料卻未考慮到把這些已知基因功能的屬性結合到分群演算裡。在本研究中,我們結合微陣列資料之基因所代表的生物知識,計算一個新的距離尺度,當作橢圓排序導引階層分群樹法使用的距離尺度。新的距離尺度的採用可以同時獲得群集後基因表現的相似性與基因功能屬性的同一性。以結合生物知識為基礎的橢圓排序導引階層分群樹法應用在酵母菌細胞週期和老鼠腦細胞這兩種微陣列資料,我們發現結果不僅保存原本橢圓排序導引階層分群樹法所具有的分群排序性質,也同時提供更相關及有意義的生物註解資訊去幫助識別基因的功 能屬性。

並列摘要


The hierarchical clustering tree (HCT) guided by a rank-two ellipse seriation (R2E) is an effective method to identify coherent local clusters and better global grouping patterns simultaneously in gene expression profiles. Like most other mathematical clustering methods, the HCT-R2E conducted only on the statistical characteristics of gene expression data while the known gene functions was not considered in the clustering process. In this study, we incorporate these information to create a new distance metric for HCT-R2E. The new distance metric captures both expression pattern similarities and biological function agreements. With cases studies on the microarray data of the yeast cell-cycle and mouse mesencephalon data. we shown the biological knowledge-based HCT-R2E not only preserves the desirable properties of its own its own but also identifies genes that are more relevant and meaningful to biological annotations.

參考文獻


Bhattacherjee, V., Mukhopadhyay, P., Singh, S., Johnson, C., Philipose,JT., Warner, CP., Greene, RM., Pisano, MM., 2007. Neural crest and mesoderm lineage-dependent gene expression in orofacial development. Differentiation, 75
Chu, S., DeRisi, J., Eisen, M., Mulholland, J., Botstein, D., Brown, P.O., Herskowitz, I., 1998. The transcriptional program of sporulation in budding yeast. Science, 282, 699-705.
Eisen, M.B., Spellman, P.T., Brown, P.O., Botstein, D., 1998. Cluster analysis and display of genome-wide expression patterns. PNAS, 95:14863-14868.
Fang, Z., Yang, J., Li, Y., Qingming., Luo., Liu, L., 2006. Knowledge guided analysis of microarray data. Journal of Biomedical Informatics, 39(4), 401 - 411.
Grzegorz, M.B., Member, IEEE., Susmita, D., Somnath, D., 2006. Biologically supervised hierarchical clustering algorithms for gene expression data. Conf Proc IEEE Eng Med Bio Soc, 5515-5518.

被引用紀錄


羅凱威(2018)。以資料視覺化進行探索性資料分析:以探討多重用藥相關因素為例〔碩士論文,國立臺灣大學〕。華藝線上圖書館。https://doi.org/10.6342/NTU201800369

延伸閱讀