透過您的圖書館登入
IP:3.134.78.106
  • 學位論文

癌症基因體變異視覺化整合分析工具

A web tool for visual summary of mutations in cancer cohorts

指導教授 : 呂平江

摘要


CoMut plot是廣泛應用於癌症基因體研究,利用條狀圖將群體間突變頻率最高的基因由高到低排列出來,同時也可以清楚找出基因體突變最多的個體。此外,也用熱點圖來呈現個體的每個特定基因上突變的程度與變異的種類。最後利用程式語言將這些圖縫合在一起,僅用一張圖就能呈現研究群體中個體間的基因體變異圖譜。整個分析過程需經過檔案格式轉換、變體位點註釋、顯著變異基因預測、變異種類統計分析、突變特徵分析。目前已有少數軟體工具雖然可以分析資料並繪製出綜合圖,但存在著幾個缺點:1.不支援主流的檔案格式(如:VCF格式) 2.缺乏預測顯著變異基因與突變特徵的功能 3.缺乏跨癌症群體比較功能。而且有些需要使用者使用程式語言才能繪製分析,這對於沒有程式語言背景的生物研究者是一個門檻。 因此,我們開發了一個網頁工具: CoMutPlotter,使用者不需生物資訊背景就能夠自行操作,上傳研究群體的癌症基因體突變資料,進行全自動分析與產生CoMut plot圖表。CoMutPlotter支援多種基因體變異資料格式(TSV, MAF,VCF),變體位點經過基因功能註解、癌症驅動基因找尋和突變特徵辨認等分析流程,最後將所有結果整合繪製成CoMut plot。而我們也提供使用者將自己的資料與現有癌症基因體資料庫(TCGA/ICGC)的資料庫做比對,讓使用者可以比較不同國家的癌症資料差異,而所有分析結果的圖表都可以提供使用者下載。

並列摘要


CoMut plot is a visual summary of mutational patterns in cancer cohorts, which is usually used in cancer research. This plot summarizes gene mutation rate and sample mutation burden along with their relevant clinical details. To date, there are two web-based tools cBioPortal and iCoMut, which allow users select only TCGA and ICGC data to create involute visualizations. For custom data analysis, only certain command-line packages with limit of specific file format are available now. It is difficult for non-bioinformatics researchers to generate the CoMut plot from their custom data by themself. In order to solve the needs for custom data to achieve CoMut plot, and moreover let user compare with TCGA/ICGC data.We create CoMutPlotter, an easy-of-use and automatic web-based tool for the production of publication quality graphs. CoMutPlotter is supported for various file format without annotation to CoMut plot and annotation report. It also provides the comparison of mutation patterns between custom data and TCGA/ICGC project, detection of top driver gene in cohort and contributions of COSMIC mutational signatures in individual samples.

並列關鍵字

CoMut plot WES Annotation Mutational Signature

參考文獻


1. The Cancer Genome Atlas Research, N., et al., Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature, 2008. 455: p. 1061.
2. Zhang, J., et al., International Cancer Genome Consortium Data Portal--a one-stop shop for cancer genomics data. Database : the journal of biological databases and curation, 2011. 2011: p. bar026-bar026.
3. Tomczak, K., P. Czerwińska, and M. Wiznerowicz, The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge. Contemporary oncology (Poznan, Poland), 2015. 19(1A): p. A68-A77.
4. Wang, K., M. Li, and H. Hakonarson, ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res, 2010. 38(16): p. e164.
5. McLaren, W., et al., The Ensembl Variant Effect Predictor. Genome Biol, 2016. 17(1): p. 122.

延伸閱讀