Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains

Detalhes bibliográficos
Autor(a) principal: Rodrigues,Thiago de Souza
Data de Publicação: 2004
Outros Autores: Pacífico,Lucila Grossi Gonçalves, Teixeira,Santuza Maria Ribeiro, Oliveira,Sérgio Costa, Braga,Antônio de Pádua
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Genetics and Molecular Biology
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400032
Resumo: A new scheme for representing proteins of different lengths in number of amino acids that can be presented to a fixed number of inputs Artificial Neural Networks (ANNs) speel-out classification is described. K-Means's clustering of the new vectors with subsequent classification was then possible with the dimension reduction technique Principal Component Analysis applied previously. The new representation scheme was applied to a set of 112 antigens sequences from several parasitic helminths, selected in the National Center for Biotechnology Information and classified into fourth different groups. This bioinformatic tool permitted the establishment of a good correlation with domains that are already well characterized, regardless of the differences between the sequences that were confirmed by the PFAM database. Additionally, sequences were grouped according to their similarity, confirmed by hierarchical clustering using ClustalW.
id SBG-1_9f6f17cb547c35d9099b5341a616af7f
oai_identifier_str oai:scielo:S1415-47572004000400032
network_acronym_str SBG-1
network_name_str Genetics and Molecular Biology
repository_id_str
spelling Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domainsbioinformaticsartificial neural networksclusteringhelminth antigendomainA new scheme for representing proteins of different lengths in number of amino acids that can be presented to a fixed number of inputs Artificial Neural Networks (ANNs) speel-out classification is described. K-Means's clustering of the new vectors with subsequent classification was then possible with the dimension reduction technique Principal Component Analysis applied previously. The new representation scheme was applied to a set of 112 antigens sequences from several parasitic helminths, selected in the National Center for Biotechnology Information and classified into fourth different groups. This bioinformatic tool permitted the establishment of a good correlation with domains that are already well characterized, regardless of the differences between the sequences that were confirmed by the PFAM database. Additionally, sequences were grouped according to their similarity, confirmed by hierarchical clustering using ClustalW.Sociedade Brasileira de Genética2004-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400032Genetics and Molecular Biology v.27 n.4 2004reponame:Genetics and Molecular Biologyinstname:Sociedade Brasileira de Genética (SBG)instacron:SBG10.1590/S1415-47572004000400032info:eu-repo/semantics/openAccessRodrigues,Thiago de SouzaPacífico,Lucila Grossi GonçalvesTeixeira,Santuza Maria RibeiroOliveira,Sérgio CostaBraga,Antônio de Páduaeng2005-01-14T00:00:00Zoai:scielo:S1415-47572004000400032Revistahttp://www.gmb.org.br/ONGhttps://old.scielo.br/oai/scielo-oai.php||editor@gmb.org.br1678-46851415-4757opendoar:2005-01-14T00:00Genetics and Molecular Biology - Sociedade Brasileira de Genética (SBG)false
dc.title.none.fl_str_mv Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains
title Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains
spellingShingle Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains
Rodrigues,Thiago de Souza
bioinformatics
artificial neural networks
clustering
helminth antigen
domain
title_short Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains
title_full Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains
title_fullStr Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains
title_full_unstemmed Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains
title_sort Clustering and artificial neural networks: classification of variable lengths of Helminth antigens in set of domains
author Rodrigues,Thiago de Souza
author_facet Rodrigues,Thiago de Souza
Pacífico,Lucila Grossi Gonçalves
Teixeira,Santuza Maria Ribeiro
Oliveira,Sérgio Costa
Braga,Antônio de Pádua
author_role author
author2 Pacífico,Lucila Grossi Gonçalves
Teixeira,Santuza Maria Ribeiro
Oliveira,Sérgio Costa
Braga,Antônio de Pádua
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Rodrigues,Thiago de Souza
Pacífico,Lucila Grossi Gonçalves
Teixeira,Santuza Maria Ribeiro
Oliveira,Sérgio Costa
Braga,Antônio de Pádua
dc.subject.por.fl_str_mv bioinformatics
artificial neural networks
clustering
helminth antigen
domain
topic bioinformatics
artificial neural networks
clustering
helminth antigen
domain
description A new scheme for representing proteins of different lengths in number of amino acids that can be presented to a fixed number of inputs Artificial Neural Networks (ANNs) speel-out classification is described. K-Means's clustering of the new vectors with subsequent classification was then possible with the dimension reduction technique Principal Component Analysis applied previously. The new representation scheme was applied to a set of 112 antigens sequences from several parasitic helminths, selected in the National Center for Biotechnology Information and classified into fourth different groups. This bioinformatic tool permitted the establishment of a good correlation with domains that are already well characterized, regardless of the differences between the sequences that were confirmed by the PFAM database. Additionally, sequences were grouped according to their similarity, confirmed by hierarchical clustering using ClustalW.
publishDate 2004
dc.date.none.fl_str_mv 2004-01-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400032
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400032
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/S1415-47572004000400032
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Sociedade Brasileira de Genética
publisher.none.fl_str_mv Sociedade Brasileira de Genética
dc.source.none.fl_str_mv Genetics and Molecular Biology v.27 n.4 2004
reponame:Genetics and Molecular Biology
instname:Sociedade Brasileira de Genética (SBG)
instacron:SBG
instname_str Sociedade Brasileira de Genética (SBG)
instacron_str SBG
institution SBG
reponame_str Genetics and Molecular Biology
collection Genetics and Molecular Biology
repository.name.fl_str_mv Genetics and Molecular Biology - Sociedade Brasileira de Genética (SBG)
repository.mail.fl_str_mv ||editor@gmb.org.br
_version_ 1752122379415322624