透過您的圖書館登入
IP:18.190.152.38
  • 學位論文

病毒分類之研究-藉由基因體編碼序列

A Study of Virus Classication via Genomic Coding Sequence

指導教授 : 王經篤
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


傳統的病毒分類方法是根據型態學、核酸型態(DNA 或RNA) 、核酸是單鏈/雙鏈、或病毒入侵的物種。本研究從基因體的觀點, 根據編碼序列(CDS) 來做病毒分類。其中CDS 是用來定義蛋白質的編碼區域。根據『相似的CDS之轉譯序列擁有相近的生物功能』的簡單假設, 我們以E-value(E)值為評量標準, 利用blast 程式將CDS 序列分成不同的群組, 然後根據這些群組, 將病毒轉換成具有代表性的向量, 以便做進一步分類處理。實驗資料來源是選擇過的22科(family) 病毒, 包含1,596種病毒。利用著名的SVM 分類器,經由5-fold 交叉驗證, 當E-value 值為39時, 得到最佳的正確率為93.8% 。本研究對於比較基因體研究提供一個新的病毒分類方法。

並列摘要


Traditional approaches for virus classification based on morphology, the type of nucleic acid genome (DNA or RNA), the nucleic acid being single/double stranded, or the species the virus invaded. In this study, from the genomic point of view, we have virus classification via translated coding sequences (CDS). Note that the translated CDS feature is used to define a protein coding region. Under an assumption, for simplicity, that similar translated CDS sequences conserve similar biological functions, the translated CDS sequences were clustered into distinct groups via a criteria of the E-value(E) of blast program such that the instances of viruses could be transferred into representative vectors for further classification processes. The experimental resources included 22 virus family consisting of 1,596 viruses. The best accuracy was as high as 93.8% when E=39 achieved by the well known SVM classifier through 5-fold cross-validation. This study provided a new approach to virus classifiation for comparative genomic analysis.

參考文獻


[9] 鄭清俊. 應用類神經網路與支援向量機於目標客戶選取. 2005.
[15] David H. Bos and David Posada. Using models of nucleotide evolution to build phylogenetic trees. Developmental and Comparative Immunology, 29:211–227, 2005.
[18] Jonathan Pevsner. Bioinformatics and Functional Genomics. Wiley List, 2003.
[20] David W. Mount. Bioinformatics: Sequence and Genome Analysis. Cold Spring Harbor Laboratory Press, 2 edition, 2004.
[21] Jing-Doo Wang and Jig-Fu Huang. A Study of Virus Classification via Genomic DNA Sequences. In 2011 Conference of Information Technology and Application in Outlying Islands, 2011.

延伸閱讀