Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.

Detalhes bibliográficos
Autor(a) principal: MORAES, F. R. de
Data de Publicação: 2014
Outros Autores: NESHICH, I. A. P., MAZONI, I., YANO, I. H., PEREIRA, J. G. C., SALIM, J. A., JARDINE, J. G., NESHICH, G.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo: http://www.alice.cnptia.embrapa.br/alice/handle/doc/981215
Resumo: Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now integrated into the BlueStar STING suite of programs. Consequently, the prediction of protein-protein interfaces for all proteins available in the PDB is possible through STING_interfaces module, accessible at the following website: (http://www.cbi.cnptia.embrapa.br/SMS/predictions/index.html).
id EMBR_2f8bdd4614aac864ce9433fbf6b5657b
oai_identifier_str oai:www.alice.cnptia.embrapa.br:doc/981215
network_acronym_str EMBR
network_name_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str 2154
spelling Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.Interações proteína-proteínaBiologiaProteinsBiological sciencesProtein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now integrated into the BlueStar STING suite of programs. Consequently, the prediction of protein-protein interfaces for all proteins available in the PDB is possible through STING_interfaces module, accessible at the following website: (http://www.cbi.cnptia.embrapa.br/SMS/predictions/index.html).FÁBIO R. DE MORAES, IB/Unicamp, CNPTIA; IZABELLA A. P. NESHICH, IB/Unicamp, CNPTIA; IVAN MAZONI, IB/Unicamp, CNPTIA; INÁCIO HENRIQUE YANO, CNPTIA; JOSÉ G. C. PEREIRA, IB/Unicamp, CNPTIA; JOSÉ A. SALIM, Unicamp; JOSE GILBERTO JARDINE, CNPTIA; GORAN NESHICH, CNPTIA.MORAES, F. R. deNESHICH, I. A. P.MAZONI, I.YANO, I. H.PEREIRA, J. G. C.SALIM, J. A.JARDINE, J. G.NESHICH, G.2014-02-25T11:11:11Z2014-02-25T11:11:11Z2014-02-2520142014-05-20T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlePlos One, San Francisco, v. 9, n. 1, p. 1-15, Jan. 2014.10.1371/journal.pone.0087107http://www.alice.cnptia.embrapa.br/alice/handle/doc/981215enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2017-08-16T00:49:59Zoai:www.alice.cnptia.embrapa.br:doc/981215Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542017-08-16T00:49:59falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542017-08-16T00:49:59Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.
title Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.
spellingShingle Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.
MORAES, F. R. de
Interações proteína-proteína
Biologia
Proteins
Biological sciences
title_short Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.
title_full Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.
title_fullStr Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.
title_full_unstemmed Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.
title_sort Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.
author MORAES, F. R. de
author_facet MORAES, F. R. de
NESHICH, I. A. P.
MAZONI, I.
YANO, I. H.
PEREIRA, J. G. C.
SALIM, J. A.
JARDINE, J. G.
NESHICH, G.
author_role author
author2 NESHICH, I. A. P.
MAZONI, I.
YANO, I. H.
PEREIRA, J. G. C.
SALIM, J. A.
JARDINE, J. G.
NESHICH, G.
author2_role author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv FÁBIO R. DE MORAES, IB/Unicamp, CNPTIA; IZABELLA A. P. NESHICH, IB/Unicamp, CNPTIA; IVAN MAZONI, IB/Unicamp, CNPTIA; INÁCIO HENRIQUE YANO, CNPTIA; JOSÉ G. C. PEREIRA, IB/Unicamp, CNPTIA; JOSÉ A. SALIM, Unicamp; JOSE GILBERTO JARDINE, CNPTIA; GORAN NESHICH, CNPTIA.
dc.contributor.author.fl_str_mv MORAES, F. R. de
NESHICH, I. A. P.
MAZONI, I.
YANO, I. H.
PEREIRA, J. G. C.
SALIM, J. A.
JARDINE, J. G.
NESHICH, G.
dc.subject.por.fl_str_mv Interações proteína-proteína
Biologia
Proteins
Biological sciences
topic Interações proteína-proteína
Biologia
Proteins
Biological sciences
description Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now integrated into the BlueStar STING suite of programs. Consequently, the prediction of protein-protein interfaces for all proteins available in the PDB is possible through STING_interfaces module, accessible at the following website: (http://www.cbi.cnptia.embrapa.br/SMS/predictions/index.html).
publishDate 2014
dc.date.none.fl_str_mv 2014-02-25T11:11:11Z
2014-02-25T11:11:11Z
2014-02-25
2014
2014-05-20T11:11:11Z
dc.type.driver.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv Plos One, San Francisco, v. 9, n. 1, p. 1-15, Jan. 2014.
10.1371/journal.pone.0087107
http://www.alice.cnptia.embrapa.br/alice/handle/doc/981215
identifier_str_mv Plos One, San Francisco, v. 9, n. 1, p. 1-15, Jan. 2014.
10.1371/journal.pone.0087107
url http://www.alice.cnptia.embrapa.br/alice/handle/doc/981215
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv cg-riaa@embrapa.br
_version_ 1794503387477180416