  • Thesis

Mixed Sequence Reader: a program for detecting genomic variant sequences by analyzing heterozygous DNA chromatograms

Mixed Sequence Reader (MSR) program for analyzing DNA sequences with heterozygous base calling chromatography to detect genomic variations

Advisor: 唐傳義

Abstract


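When reverse transcription polymerase chain reaction (RT-PCR) products that contain single nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), or paralogous genes are sequenced directly, the result is a heterozygous fluorescence chromatogram. Indels and STRs can be detected easily, without a reference sequence database, using existing software such as Indelligent or ShiftDetector. Detecting other genomic variants nevertheless remains a challenge, because suitable tools for analyzing heterozygous fluorescence chromatogram data are lacking. In this study, we developed a free web tool, Mixed Sequence Reader (MSR), that directly analyzes heterozygous fluorescence chromatogram data in ABI file format. The two mixed sequences can be identified and separated by alignment against a reference sequence database. Our results show that MSR can be used to: (1) determine the actual physical position of indels and STRs in the reference sequence and calculate the STR repeat count; (2) predict microsatellite combination patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (3) identify the virus types in multiple infections using currently known human papillomavirus (HPV) databases; and (4) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralogs.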

Parallel Abstract


When PCR products containing single nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), or paralogous genes are sequenced directly, the result is heterozygous base-calling fluorescence chromatogram data. Indels and STRs can be detected easily with the currently available Indelligent or ShiftDetector programs, without searching reference sequences. However, the detection of other genomic variants remains a challenge because appropriate tools for analyzing heterozygous base-calling fluorescence chromatogram data are lacking. In this study, we developed the free, web-based “Mixed Sequence Reader (MSR)”, which directly analyzes heterozygous base-calling fluorescence chromatogram data in .abi file format against reference sequences. The heterozygous sequences can be identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used for: (i) physically locating indel and STR sequences by searching the NCBI reference sequences, and determining the copy number of STRs; (ii) predicting combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determining human papillomavirus (HPV) genotypes by searching current viral databases in cases of multiple infections; and (iv) estimating the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3.
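
The idea of reading a mixed chromatogram as two distinct sequences can be illustrated with a minimal Python sketch. This is not the MSR algorithm, which the abstract does not describe. It assumes the mixed trace has already been base-called into IUPAC ambiguity codes (for example, R for a mixed A/G peak), that a reference of the same length is already aligned to it, and that the two sequences differ only by substitutions, with no indel-induced shift. The names `IUPAC` and `split_mixed` are hypothetical.

```python
# Hypothetical illustration only; not the MSR implementation.
# Two-base IUPAC ambiguity codes stand in for mixed chromatogram peaks.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
}


def split_mixed(mixed: str, reference: str) -> tuple[str, str]:
    """Split mixed base calls into two sequences using an aligned reference."""
    if len(mixed) != len(reference):
        raise ValueError("mixed and reference must have the same length")
    first, second = [], []
    for code, ref_base in zip(mixed.upper(), reference.upper()):
        bases = IUPAC[code]          # one or two candidate bases at this position
        if len(bases) == 1:          # unambiguous call: both sequences agree
            first.append(bases)
            second.append(bases)
        elif ref_base in bases:      # mixed call: reference base goes to `first`
            first.append(ref_base)
            second.append(bases.replace(ref_base, ""))
        else:                        # neither base matches the reference
            first.append(bases[0])
            second.append(bases[1])
    return "".join(first), "".join(second)


reference_like, variant = split_mixed("ACRTYG", "ACATCG")
print(reference_like, variant)  # ACATCG ACGTTG
```

Under these assumptions the first returned sequence follows the reference wherever possible, and the second carries the alternative base at each mixed position. Handling indels or STRs would additionally require resolving the offset between the two superimposed traces, which this sketch does not attempt.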
