透過您的圖書館登入
IP:18.221.222.47
  • 期刊

Classify Cancer Subtype by Using Gene Expression Data with Deeep Learning

摘要


Cancer is a disease that seriously threatens human life, and the study of cancer subtype classification has become the focus of current research. Gene expression profiles are an effective and widely used data in cancer research, but the sparse high-dimensional features lead to suboptimal results of classification methods. In this paper, we propose a deep learning method combining fully connected layer (FC layer) and convolutional neural network (CNN): FCDN to learn its key features from sparse high-dimensional data for cancer classification. Specifically, in the process of nonlinear dimensionality reduction, the key features are learned from the sparse global features, thereby overcoming the high-dimensional sparsity challenge. In the experiments, we compare the performance of FCDN with other four classification methods on high-dimensional datasets. The results show that the overall performance of FCDN is better than other methods, and it can obtain more ideal classification results.

參考文獻


Chawla N V , Bowyer K W , Hall L O , et al. SMOTE: Synthetic Minority Over-sampling Technique[J]. 2011.
Chen R , Yang L , Goodison S , et al. Deep learning approach to identifying cancer subtypes using high-dimensional genomic data[J]. Bioinformatics, 2019, 36(5).
Curtis, C., Shah, S. P ., Chin, S.-F., Turashvili, G., Rueda, O. M., Dunning, M. J.,Speed, D., Lynch, A. G., Samarajiwa, S., Y uan, Y ., et al. (2012). The genomicand transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.Nature, 486(7403), 346–352.
D.S. Huang , C.H. Zheng , Independent component analysis-based penalized dis- criminant method for tumor classification using gene expression data, Bioin- formatics 22 (15) (2006) 1855–1862.
Hay A M . The derivation of global estimates from a confusion matrix[J]. International Journal of Remote Sensing, 1988, 9(8):1395-1398.

延伸閱讀