Database limitations for studying the human gut microbiome

Detalhes bibliográficos
Autor(a) principal: Dias, Camila K.
Data de Publicação: 2020
Outros Autores: Starke, Robert, Pylro, Victor S., Morais, Daniel K.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UFLA
Texto Completo: http://repositorio.ufla.br/jspui/handle/1/46398
Resumo: Background. In the last twenty years, new methodologies have made possible the gathering of large amounts of data concerning the genetic information and metabolic functions associated to the human gut microbiome. In spite of that, processing all this data available might not be the simplest of tasks, which could result in an excess of information awaiting proper annotation. This assessment intended on evaluating how well respected databases could describe a mock human gut microbiome. Methods. In this work, we critically evaluate the output of the cross–reference between the Uniprot Knowledge Base (Uniprot KB) and the Kyoto Encyclopedia of Genes and Genomes Orthologs (KEGG Orthologs) or the evolutionary genealogy of genes: Nonsupervised Orthologous groups (EggNOG) databases regarding a list of species that were previously found in the human gut microbiome. Results. From a list which contemplates 131 species and 52 genera, 53 species and 40 genera had corresponding entries for KEGG Database and 82 species and 47 genera had corresponding entries for EggNOG Database. Moreover, we present the KEGG Orthologs (KOs) and EggNOG Orthologs (NOGs) entries associated to the search as their distribution over species and genera and lists of functions that appeared in many species or genera, the ‘‘core’’ functions of the human gut microbiome. We also present the relative abundance of KOs and NOGs throughout phyla and genera. Lastly, we expose a variance found between searches with different arguments on the database entries. Inferring functionality based on cross-referencing UniProt and KEGG or EggNOG can be lackluster due to the low number of annotated species in Uniprot and due to the lower number of functions affiliated to the majority of these species. Additionally, the EggNOG database showed greater performance for a cross-search with Uniprot about a mock human gut microbiome. Notwithstanding, efforts targeting cultivation, single-cell sequencing or the reconstruction of high-quality metagenomeassembled genomes (MAG) and their annotation are needed to allow the use of these databases for inferring functionality in human gut microbiome studies.
id UFLA_b8ba08814f78f4658b97c13fea47fcf8
oai_identifier_str oai:localhost:1/46398
network_acronym_str UFLA
network_name_str Repositório Institucional da UFLA
repository_id_str
spelling Database limitations for studying the human gut microbiomeHuman microbiomeGut microbiomeFunctional diversityBioinformaticsComputational biologyDatabasesMicrobioma humanoDiversidade funcionalBioinformáticaBiologia computacionalBackground. In the last twenty years, new methodologies have made possible the gathering of large amounts of data concerning the genetic information and metabolic functions associated to the human gut microbiome. In spite of that, processing all this data available might not be the simplest of tasks, which could result in an excess of information awaiting proper annotation. This assessment intended on evaluating how well respected databases could describe a mock human gut microbiome. Methods. In this work, we critically evaluate the output of the cross–reference between the Uniprot Knowledge Base (Uniprot KB) and the Kyoto Encyclopedia of Genes and Genomes Orthologs (KEGG Orthologs) or the evolutionary genealogy of genes: Nonsupervised Orthologous groups (EggNOG) databases regarding a list of species that were previously found in the human gut microbiome. Results. From a list which contemplates 131 species and 52 genera, 53 species and 40 genera had corresponding entries for KEGG Database and 82 species and 47 genera had corresponding entries for EggNOG Database. Moreover, we present the KEGG Orthologs (KOs) and EggNOG Orthologs (NOGs) entries associated to the search as their distribution over species and genera and lists of functions that appeared in many species or genera, the ‘‘core’’ functions of the human gut microbiome. We also present the relative abundance of KOs and NOGs throughout phyla and genera. Lastly, we expose a variance found between searches with different arguments on the database entries. Inferring functionality based on cross-referencing UniProt and KEGG or EggNOG can be lackluster due to the low number of annotated species in Uniprot and due to the lower number of functions affiliated to the majority of these species. Additionally, the EggNOG database showed greater performance for a cross-search with Uniprot about a mock human gut microbiome. Notwithstanding, efforts targeting cultivation, single-cell sequencing or the reconstruction of high-quality metagenomeassembled genomes (MAG) and their annotation are needed to allow the use of these databases for inferring functionality in human gut microbiome studies.PeerJ, Inc2021-05-27T17:26:45Z2021-05-27T17:26:45Z2020info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfDIAS, C. K. et al. Database limitations for studying the human gut microbiome. PeerJ Computer Science, [S. l.], v. 6, e289, 2020. DOI: 10.7717/peerj-cs.289.http://repositorio.ufla.br/jspui/handle/1/46398PeerJ Computer Sciencereponame:Repositório Institucional da UFLAinstname:Universidade Federal de Lavras (UFLA)instacron:UFLAAttribution 4.0 Internationalhttp://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessDias, Camila K.Starke, RobertPylro, Victor S.Morais, Daniel K.eng2021-05-27T17:26:46Zoai:localhost:1/46398Repositório InstitucionalPUBhttp://repositorio.ufla.br/oai/requestnivaldo@ufla.br || repositorio.biblioteca@ufla.bropendoar:2021-05-27T17:26:46Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA)false
dc.title.none.fl_str_mv Database limitations for studying the human gut microbiome
title Database limitations for studying the human gut microbiome
spellingShingle Database limitations for studying the human gut microbiome
Dias, Camila K.
Human microbiome
Gut microbiome
Functional diversity
Bioinformatics
Computational biology
Databases
Microbioma humano
Diversidade funcional
Bioinformática
Biologia computacional
title_short Database limitations for studying the human gut microbiome
title_full Database limitations for studying the human gut microbiome
title_fullStr Database limitations for studying the human gut microbiome
title_full_unstemmed Database limitations for studying the human gut microbiome
title_sort Database limitations for studying the human gut microbiome
author Dias, Camila K.
author_facet Dias, Camila K.
Starke, Robert
Pylro, Victor S.
Morais, Daniel K.
author_role author
author2 Starke, Robert
Pylro, Victor S.
Morais, Daniel K.
author2_role author
author
author
dc.contributor.author.fl_str_mv Dias, Camila K.
Starke, Robert
Pylro, Victor S.
Morais, Daniel K.
dc.subject.por.fl_str_mv Human microbiome
Gut microbiome
Functional diversity
Bioinformatics
Computational biology
Databases
Microbioma humano
Diversidade funcional
Bioinformática
Biologia computacional
topic Human microbiome
Gut microbiome
Functional diversity
Bioinformatics
Computational biology
Databases
Microbioma humano
Diversidade funcional
Bioinformática
Biologia computacional
description Background. In the last twenty years, new methodologies have made possible the gathering of large amounts of data concerning the genetic information and metabolic functions associated to the human gut microbiome. In spite of that, processing all this data available might not be the simplest of tasks, which could result in an excess of information awaiting proper annotation. This assessment intended on evaluating how well respected databases could describe a mock human gut microbiome. Methods. In this work, we critically evaluate the output of the cross–reference between the Uniprot Knowledge Base (Uniprot KB) and the Kyoto Encyclopedia of Genes and Genomes Orthologs (KEGG Orthologs) or the evolutionary genealogy of genes: Nonsupervised Orthologous groups (EggNOG) databases regarding a list of species that were previously found in the human gut microbiome. Results. From a list which contemplates 131 species and 52 genera, 53 species and 40 genera had corresponding entries for KEGG Database and 82 species and 47 genera had corresponding entries for EggNOG Database. Moreover, we present the KEGG Orthologs (KOs) and EggNOG Orthologs (NOGs) entries associated to the search as their distribution over species and genera and lists of functions that appeared in many species or genera, the ‘‘core’’ functions of the human gut microbiome. We also present the relative abundance of KOs and NOGs throughout phyla and genera. Lastly, we expose a variance found between searches with different arguments on the database entries. Inferring functionality based on cross-referencing UniProt and KEGG or EggNOG can be lackluster due to the low number of annotated species in Uniprot and due to the lower number of functions affiliated to the majority of these species. Additionally, the EggNOG database showed greater performance for a cross-search with Uniprot about a mock human gut microbiome. Notwithstanding, efforts targeting cultivation, single-cell sequencing or the reconstruction of high-quality metagenomeassembled genomes (MAG) and their annotation are needed to allow the use of these databases for inferring functionality in human gut microbiome studies.
publishDate 2020
dc.date.none.fl_str_mv 2020
2021-05-27T17:26:45Z
2021-05-27T17:26:45Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv DIAS, C. K. et al. Database limitations for studying the human gut microbiome. PeerJ Computer Science, [S. l.], v. 6, e289, 2020. DOI: 10.7717/peerj-cs.289.
http://repositorio.ufla.br/jspui/handle/1/46398
identifier_str_mv DIAS, C. K. et al. Database limitations for studying the human gut microbiome. PeerJ Computer Science, [S. l.], v. 6, e289, 2020. DOI: 10.7717/peerj-cs.289.
url http://repositorio.ufla.br/jspui/handle/1/46398
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv Attribution 4.0 International
http://creativecommons.org/licenses/by/4.0/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution 4.0 International
http://creativecommons.org/licenses/by/4.0/
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv PeerJ, Inc
publisher.none.fl_str_mv PeerJ, Inc
dc.source.none.fl_str_mv PeerJ Computer Science
reponame:Repositório Institucional da UFLA
instname:Universidade Federal de Lavras (UFLA)
instacron:UFLA
instname_str Universidade Federal de Lavras (UFLA)
instacron_str UFLA
institution UFLA
reponame_str Repositório Institucional da UFLA
collection Repositório Institucional da UFLA
repository.name.fl_str_mv Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA)
repository.mail.fl_str_mv nivaldo@ufla.br || repositorio.biblioteca@ufla.br
_version_ 1815439092067860480