Database limitations for studying the human gut microbiome
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UFLA |
Texto Completo: | http://repositorio.ufla.br/jspui/handle/1/46398 |
Resumo: | Background. In the last twenty years, new methodologies have made possible the gathering of large amounts of data concerning the genetic information and metabolic functions associated to the human gut microbiome. In spite of that, processing all this data available might not be the simplest of tasks, which could result in an excess of information awaiting proper annotation. This assessment intended on evaluating how well respected databases could describe a mock human gut microbiome. Methods. In this work, we critically evaluate the output of the cross–reference between the Uniprot Knowledge Base (Uniprot KB) and the Kyoto Encyclopedia of Genes and Genomes Orthologs (KEGG Orthologs) or the evolutionary genealogy of genes: Nonsupervised Orthologous groups (EggNOG) databases regarding a list of species that were previously found in the human gut microbiome. Results. From a list which contemplates 131 species and 52 genera, 53 species and 40 genera had corresponding entries for KEGG Database and 82 species and 47 genera had corresponding entries for EggNOG Database. Moreover, we present the KEGG Orthologs (KOs) and EggNOG Orthologs (NOGs) entries associated to the search as their distribution over species and genera and lists of functions that appeared in many species or genera, the ‘‘core’’ functions of the human gut microbiome. We also present the relative abundance of KOs and NOGs throughout phyla and genera. Lastly, we expose a variance found between searches with different arguments on the database entries. Inferring functionality based on cross-referencing UniProt and KEGG or EggNOG can be lackluster due to the low number of annotated species in Uniprot and due to the lower number of functions affiliated to the majority of these species. Additionally, the EggNOG database showed greater performance for a cross-search with Uniprot about a mock human gut microbiome. Notwithstanding, efforts targeting cultivation, single-cell sequencing or the reconstruction of high-quality metagenomeassembled genomes (MAG) and their annotation are needed to allow the use of these databases for inferring functionality in human gut microbiome studies. |
id |
UFLA_b8ba08814f78f4658b97c13fea47fcf8 |
---|---|
oai_identifier_str |
oai:localhost:1/46398 |
network_acronym_str |
UFLA |
network_name_str |
Repositório Institucional da UFLA |
repository_id_str |
|
spelling |
Database limitations for studying the human gut microbiomeHuman microbiomeGut microbiomeFunctional diversityBioinformaticsComputational biologyDatabasesMicrobioma humanoDiversidade funcionalBioinformáticaBiologia computacionalBackground. In the last twenty years, new methodologies have made possible the gathering of large amounts of data concerning the genetic information and metabolic functions associated to the human gut microbiome. In spite of that, processing all this data available might not be the simplest of tasks, which could result in an excess of information awaiting proper annotation. This assessment intended on evaluating how well respected databases could describe a mock human gut microbiome. Methods. In this work, we critically evaluate the output of the cross–reference between the Uniprot Knowledge Base (Uniprot KB) and the Kyoto Encyclopedia of Genes and Genomes Orthologs (KEGG Orthologs) or the evolutionary genealogy of genes: Nonsupervised Orthologous groups (EggNOG) databases regarding a list of species that were previously found in the human gut microbiome. Results. From a list which contemplates 131 species and 52 genera, 53 species and 40 genera had corresponding entries for KEGG Database and 82 species and 47 genera had corresponding entries for EggNOG Database. Moreover, we present the KEGG Orthologs (KOs) and EggNOG Orthologs (NOGs) entries associated to the search as their distribution over species and genera and lists of functions that appeared in many species or genera, the ‘‘core’’ functions of the human gut microbiome. We also present the relative abundance of KOs and NOGs throughout phyla and genera. Lastly, we expose a variance found between searches with different arguments on the database entries. Inferring functionality based on cross-referencing UniProt and KEGG or EggNOG can be lackluster due to the low number of annotated species in Uniprot and due to the lower number of functions affiliated to the majority of these species. Additionally, the EggNOG database showed greater performance for a cross-search with Uniprot about a mock human gut microbiome. Notwithstanding, efforts targeting cultivation, single-cell sequencing or the reconstruction of high-quality metagenomeassembled genomes (MAG) and their annotation are needed to allow the use of these databases for inferring functionality in human gut microbiome studies.PeerJ, Inc2021-05-27T17:26:45Z2021-05-27T17:26:45Z2020info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfDIAS, C. K. et al. Database limitations for studying the human gut microbiome. PeerJ Computer Science, [S. l.], v. 6, e289, 2020. DOI: 10.7717/peerj-cs.289.http://repositorio.ufla.br/jspui/handle/1/46398PeerJ Computer Sciencereponame:Repositório Institucional da UFLAinstname:Universidade Federal de Lavras (UFLA)instacron:UFLAAttribution 4.0 Internationalhttp://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessDias, Camila K.Starke, RobertPylro, Victor S.Morais, Daniel K.eng2021-05-27T17:26:46Zoai:localhost:1/46398Repositório InstitucionalPUBhttp://repositorio.ufla.br/oai/requestnivaldo@ufla.br || repositorio.biblioteca@ufla.bropendoar:2021-05-27T17:26:46Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA)false |
dc.title.none.fl_str_mv |
Database limitations for studying the human gut microbiome |
title |
Database limitations for studying the human gut microbiome |
spellingShingle |
Database limitations for studying the human gut microbiome Dias, Camila K. Human microbiome Gut microbiome Functional diversity Bioinformatics Computational biology Databases Microbioma humano Diversidade funcional Bioinformática Biologia computacional |
title_short |
Database limitations for studying the human gut microbiome |
title_full |
Database limitations for studying the human gut microbiome |
title_fullStr |
Database limitations for studying the human gut microbiome |
title_full_unstemmed |
Database limitations for studying the human gut microbiome |
title_sort |
Database limitations for studying the human gut microbiome |
author |
Dias, Camila K. |
author_facet |
Dias, Camila K. Starke, Robert Pylro, Victor S. Morais, Daniel K. |
author_role |
author |
author2 |
Starke, Robert Pylro, Victor S. Morais, Daniel K. |
author2_role |
author author author |
dc.contributor.author.fl_str_mv |
Dias, Camila K. Starke, Robert Pylro, Victor S. Morais, Daniel K. |
dc.subject.por.fl_str_mv |
Human microbiome Gut microbiome Functional diversity Bioinformatics Computational biology Databases Microbioma humano Diversidade funcional Bioinformática Biologia computacional |
topic |
Human microbiome Gut microbiome Functional diversity Bioinformatics Computational biology Databases Microbioma humano Diversidade funcional Bioinformática Biologia computacional |
description |
Background. In the last twenty years, new methodologies have made possible the gathering of large amounts of data concerning the genetic information and metabolic functions associated to the human gut microbiome. In spite of that, processing all this data available might not be the simplest of tasks, which could result in an excess of information awaiting proper annotation. This assessment intended on evaluating how well respected databases could describe a mock human gut microbiome. Methods. In this work, we critically evaluate the output of the cross–reference between the Uniprot Knowledge Base (Uniprot KB) and the Kyoto Encyclopedia of Genes and Genomes Orthologs (KEGG Orthologs) or the evolutionary genealogy of genes: Nonsupervised Orthologous groups (EggNOG) databases regarding a list of species that were previously found in the human gut microbiome. Results. From a list which contemplates 131 species and 52 genera, 53 species and 40 genera had corresponding entries for KEGG Database and 82 species and 47 genera had corresponding entries for EggNOG Database. Moreover, we present the KEGG Orthologs (KOs) and EggNOG Orthologs (NOGs) entries associated to the search as their distribution over species and genera and lists of functions that appeared in many species or genera, the ‘‘core’’ functions of the human gut microbiome. We also present the relative abundance of KOs and NOGs throughout phyla and genera. Lastly, we expose a variance found between searches with different arguments on the database entries. Inferring functionality based on cross-referencing UniProt and KEGG or EggNOG can be lackluster due to the low number of annotated species in Uniprot and due to the lower number of functions affiliated to the majority of these species. Additionally, the EggNOG database showed greater performance for a cross-search with Uniprot about a mock human gut microbiome. Notwithstanding, efforts targeting cultivation, single-cell sequencing or the reconstruction of high-quality metagenomeassembled genomes (MAG) and their annotation are needed to allow the use of these databases for inferring functionality in human gut microbiome studies. |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020 2021-05-27T17:26:45Z 2021-05-27T17:26:45Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
DIAS, C. K. et al. Database limitations for studying the human gut microbiome. PeerJ Computer Science, [S. l.], v. 6, e289, 2020. DOI: 10.7717/peerj-cs.289. http://repositorio.ufla.br/jspui/handle/1/46398 |
identifier_str_mv |
DIAS, C. K. et al. Database limitations for studying the human gut microbiome. PeerJ Computer Science, [S. l.], v. 6, e289, 2020. DOI: 10.7717/peerj-cs.289. |
url |
http://repositorio.ufla.br/jspui/handle/1/46398 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
Attribution 4.0 International http://creativecommons.org/licenses/by/4.0/ info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Attribution 4.0 International http://creativecommons.org/licenses/by/4.0/ |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
PeerJ, Inc |
publisher.none.fl_str_mv |
PeerJ, Inc |
dc.source.none.fl_str_mv |
PeerJ Computer Science reponame:Repositório Institucional da UFLA instname:Universidade Federal de Lavras (UFLA) instacron:UFLA |
instname_str |
Universidade Federal de Lavras (UFLA) |
instacron_str |
UFLA |
institution |
UFLA |
reponame_str |
Repositório Institucional da UFLA |
collection |
Repositório Institucional da UFLA |
repository.name.fl_str_mv |
Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA) |
repository.mail.fl_str_mv |
nivaldo@ufla.br || repositorio.biblioteca@ufla.br |
_version_ |
1815439092067860480 |