透過您的圖書館登入
IP:3.145.143.239
  • 期刊

Identification of Protein-protein Interactions Based on Weighted Sparse Representation

摘要


Protein plays an important role in the cellular process of an organism, and its function is demonstrated by protein interaction. Rich information on protein interactions can facilitate the treatment of diseases and the development of drugs, so accurate prediction of protein interactions is of great significance. High‐flux biological experiments can be used to predict new protein pairs, but they are expensive and time‐consuming to operate and do not meet the demand for such information. With the rise of machine learning algorithms and the increasingly powerful computing power, the use of scientific computing models to predict each other has become the first choice. This paper mainly studies the application of weighted sparse representation classifiers under protein sequence feature coding. First of all, the composition, transfer and distribution of the physical and chemical properties of amino acids are selected to encode the amino acid sequence. Secondly, according to the characteristic importance of random forest, the feature operator de‐dimensionally de‐noises. Finally, for the features extracted in this paper, a weighted sparse representation classifier with strong noise resistance is used to classify the feature set. The results of the 50% cross‐validation were: accuracy 96.97%, sensitivity 97.51%, accuracy 96.43%, Matthews correlation coefficient 93.91%, Predictive results are better than existing machine learning models.

參考文獻


Archakov AI, Govorun VM, Dubanov AV, Ivanov YD, Veselovsky AV, Lewi P, et al. Protein-protein interactions as a target for drugs in proteomics[J]. Proteomics, 2003,3:380-391.
Foltman M, Sanchez-Diaz A. Studying Protein–Protein Interactions in Budding Yeast Using Co-immunoprecipitation[J]. Methods in Molecular Biology, 2016, 1369: 239-256.
Kawahashi Y, Doi N, Takashima H, Tsuda C, Oishi Y, Oyama R, et al. In vitro protein microarrays for detecting protein-protein interactions: application of a new method for fluorescence labeling of proteins[J]. Proteomics, 2003; 3:1236-1243.
Shen J, Zhang J, Luo X, et al. Predicting protein–protein interactions based only on sequences information[J]. Proceedings of the National Academy of Sciences, 2007, 104(11): 4337-4341.
Guo Y, Yu L, Wen Z, et al. Using support vector machine combined with auto covariance to predict protein–protein interactions from protein sequences[J]. Nucleic Acids Research, 2008, 36(9): 3025-3030.

延伸閱讀