skip to main content

iPPI-Esml: An ensemble classifier for identifying the interactions of proteins by incorporating their physicochemical properties and wavelet transforms into PseAAC

Jia, Jianhua ; Liu, Zi ; Xiao, Xuan ; Liu, Bingxiang ; Chou, Kuo-Chen

Journal of theoretical biology, 2015-07, Vol.377, p.47-56 [Periódico revisado por pares]

England: Elsevier Ltd

Texto completo disponível

Citações Citado por
  • Título:
    iPPI-Esml: An ensemble classifier for identifying the interactions of proteins by incorporating their physicochemical properties and wavelet transforms into PseAAC
  • Autor: Jia, Jianhua ; Liu, Zi ; Xiao, Xuan ; Liu, Bingxiang ; Chou, Kuo-Chen
  • Assuntos: Algorithms ; Amino Acids - chemistry ; Animals ; Chemistry, Physical ; Computational Biology - methods ; Ensemble classifier ; Fusion ; Physicochemical properties ; Protein Binding ; Protein Interaction Mapping - methods ; Proteins - chemistry ; Pseudo amino acid composition ; Random forests ; Voting system ; Wavelet Analysis ; Wavelets transforms
  • É parte de: Journal of theoretical biology, 2015-07, Vol.377, p.47-56
  • Notas: ObjectType-Article-1
    SourceType-Scholarly Journals-1
    ObjectType-Feature-2
    content type line 23
  • Descrição: A cell contains thousands of proteins. Many important functions of cell are carried out through the proteins therein. Proteins rarely function alone. Most of their functions essential to life are associated with various types of protein–protein interactions (PPIs). Therefore, knowledge of PPIs is fundamental for both basic research and drug development. With the avalanche of proteins sequences generated in the postgenomic age, it is highly desired to develop computational methods for timely acquiring this kind of knowledge. Here, a new predictor, called “iPPI-Emsl”, is developed. In the predictor, a protein sample is formulated by incorporating the following two types of information into the general form of PseAAC (pseudo amino acid composition): (1) the physicochemical properties derived from the constituent amino acids of a protein; and (2) the wavelet transforms derived from the numerical series along a protein chain. The operation engine to run the predictor is an ensemble classifier formed by fusing seven individual random forest engines via a voting system. It is demonstrated with the benchmark dataset from Saccharomyces cerevisiae as well as the dataset from Helicobacter pylori that the new predictor achieves remarkably higher success rates than any of the existing predictors in this area. The new predictor׳ web-server has been established at http://www.jci-bioinfo.cn/iPPI-Esml. For the convenience of most experimental scientists, we have further provided a step-by-step guide, by which users can easily get their desired results without the need to follow the complicated mathematics involved during its development. [Display omitted] •Protein–protein interactions.•Discrete wavelet transform approach.•Ensemble classifier formed by fusing seven individual random forest.•Web-server predictor.
  • Editor: England: Elsevier Ltd
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.