Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.
Autor(a) principal: | |
---|---|
Data de Publicação: | 2007 |
Outros Autores: | , , , , , , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
Texto Completo: | http://www.alice.cnptia.embrapa.br/alice/handle/doc/814 |
Resumo: | Abstract. An effective strategy for managing protein databases is to provide mechanisms to transform raw data into consistent, accurate and reliable information. Such mechanisms will greatly reduce operational inefficiencies and improve one's ability to better handle scientific objectives and interpret the research results. To achieve this challenging goal for the STING project, we introduce Sting_RDB, a relational database of structural parameters for protein analysis with support for data warehousing and data mining. In this article, we highlight the main features of Sting_RDB and show how a user can explore it for efficient and biologically relevant queries. Considering its importance for molecular biologists, effort has been made to advance Sting_RDB toward data quality assessment. To the best of our knowledge, Sting_RDB is one of the most comprehensive data repositories for protein analysis, now also capable of providing its users with a data quality indicator. This paper differs from our previous study in many aspects. First, we introduce Sting_RDB, a relational database with mechanisms for efficient and relevant queries using SQL. Sting_rdb evolved from the earlier, text (flat file)-based database, in which data consistency and integrity was not guaranteed. Second, we provide support for data warehousing and mining. Third, the data quality indicator was introduced. Finally and probably most importantly, complex queries that could not be posed on a text-based database, are now easily implemented. |
id |
EMBR_6d0f50a0d3318262372755ce7b017a1f |
---|---|
oai_identifier_str |
oai:www.alice.cnptia.embrapa.br:doc/814 |
network_acronym_str |
EMBR |
network_name_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository_id_str |
2154 |
spelling |
Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.BioinformáticaAnálise de estrutura de proteínasMineração de dadosBase de dados StingData miningData warehousingProteínaBioinformaticsProteinsDatabasesAbstract. An effective strategy for managing protein databases is to provide mechanisms to transform raw data into consistent, accurate and reliable information. Such mechanisms will greatly reduce operational inefficiencies and improve one's ability to better handle scientific objectives and interpret the research results. To achieve this challenging goal for the STING project, we introduce Sting_RDB, a relational database of structural parameters for protein analysis with support for data warehousing and data mining. In this article, we highlight the main features of Sting_RDB and show how a user can explore it for efficient and biologically relevant queries. Considering its importance for molecular biologists, effort has been made to advance Sting_RDB toward data quality assessment. To the best of our knowledge, Sting_RDB is one of the most comprehensive data repositories for protein analysis, now also capable of providing its users with a data quality indicator. This paper differs from our previous study in many aspects. First, we introduce Sting_RDB, a relational database with mechanisms for efficient and relevant queries using SQL. Sting_rdb evolved from the earlier, text (flat file)-based database, in which data consistency and integrity was not guaranteed. Second, we provide support for data warehousing and mining. Third, the data quality indicator was introduced. Finally and probably most importantly, complex queries that could not be posed on a text-based database, are now easily implemented.STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA; PAULA REGINA KUSER FALCAO, CNPTIA; MICHEL EDUARDO BELEZA YAMAGISHI, CNPTIA; EDGARD HENRIQUE DOS SANTOS, CNPTIA; FABIO DANILO VIEIRA, CNPTIA; JOSE GILBERTO JARDINE, CNPTIA; GORAN NESHICH, CNPTIA.OLIVEIRA, S. R. de M.ALMEIDA, G. V.SOUZA, K. R. R.RODRIGUES, D. N.KUSER-FALCÃO, P. R.YAMAGISHI, M. E. B.SANTOS, E. H. dosVIEIRA, F. D.JARDINE, J. G.NESHICH, G.2011-04-10T11:11:11Z2011-04-10T11:11:11Z2007-12-0720072017-05-11T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleGenetics and Molecular Research, v. 6, n. 4, p. 911-922, 2007.http://www.alice.cnptia.embrapa.br/alice/handle/doc/814enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2017-05-12T01:37:50Zoai:www.alice.cnptia.embrapa.br:doc/814Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542017-05-12T01:37:50falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542017-05-12T01:37:50Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false |
dc.title.none.fl_str_mv |
Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining. |
title |
Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining. |
spellingShingle |
Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining. OLIVEIRA, S. R. de M. Bioinformática Análise de estrutura de proteínas Mineração de dados Base de dados Sting Data mining Data warehousing Proteína Bioinformatics Proteins Databases |
title_short |
Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining. |
title_full |
Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining. |
title_fullStr |
Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining. |
title_full_unstemmed |
Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining. |
title_sort |
Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining. |
author |
OLIVEIRA, S. R. de M. |
author_facet |
OLIVEIRA, S. R. de M. ALMEIDA, G. V. SOUZA, K. R. R. RODRIGUES, D. N. KUSER-FALCÃO, P. R. YAMAGISHI, M. E. B. SANTOS, E. H. dos VIEIRA, F. D. JARDINE, J. G. NESHICH, G. |
author_role |
author |
author2 |
ALMEIDA, G. V. SOUZA, K. R. R. RODRIGUES, D. N. KUSER-FALCÃO, P. R. YAMAGISHI, M. E. B. SANTOS, E. H. dos VIEIRA, F. D. JARDINE, J. G. NESHICH, G. |
author2_role |
author author author author author author author author author |
dc.contributor.none.fl_str_mv |
STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA; PAULA REGINA KUSER FALCAO, CNPTIA; MICHEL EDUARDO BELEZA YAMAGISHI, CNPTIA; EDGARD HENRIQUE DOS SANTOS, CNPTIA; FABIO DANILO VIEIRA, CNPTIA; JOSE GILBERTO JARDINE, CNPTIA; GORAN NESHICH, CNPTIA. |
dc.contributor.author.fl_str_mv |
OLIVEIRA, S. R. de M. ALMEIDA, G. V. SOUZA, K. R. R. RODRIGUES, D. N. KUSER-FALCÃO, P. R. YAMAGISHI, M. E. B. SANTOS, E. H. dos VIEIRA, F. D. JARDINE, J. G. NESHICH, G. |
dc.subject.por.fl_str_mv |
Bioinformática Análise de estrutura de proteínas Mineração de dados Base de dados Sting Data mining Data warehousing Proteína Bioinformatics Proteins Databases |
topic |
Bioinformática Análise de estrutura de proteínas Mineração de dados Base de dados Sting Data mining Data warehousing Proteína Bioinformatics Proteins Databases |
description |
Abstract. An effective strategy for managing protein databases is to provide mechanisms to transform raw data into consistent, accurate and reliable information. Such mechanisms will greatly reduce operational inefficiencies and improve one's ability to better handle scientific objectives and interpret the research results. To achieve this challenging goal for the STING project, we introduce Sting_RDB, a relational database of structural parameters for protein analysis with support for data warehousing and data mining. In this article, we highlight the main features of Sting_RDB and show how a user can explore it for efficient and biologically relevant queries. Considering its importance for molecular biologists, effort has been made to advance Sting_RDB toward data quality assessment. To the best of our knowledge, Sting_RDB is one of the most comprehensive data repositories for protein analysis, now also capable of providing its users with a data quality indicator. This paper differs from our previous study in many aspects. First, we introduce Sting_RDB, a relational database with mechanisms for efficient and relevant queries using SQL. Sting_rdb evolved from the earlier, text (flat file)-based database, in which data consistency and integrity was not guaranteed. Second, we provide support for data warehousing and mining. Third, the data quality indicator was introduced. Finally and probably most importantly, complex queries that could not be posed on a text-based database, are now easily implemented. |
publishDate |
2007 |
dc.date.none.fl_str_mv |
2007-12-07 2007 2011-04-10T11:11:11Z 2011-04-10T11:11:11Z 2017-05-11T11:11:11Z |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
Genetics and Molecular Research, v. 6, n. 4, p. 911-922, 2007. http://www.alice.cnptia.embrapa.br/alice/handle/doc/814 |
identifier_str_mv |
Genetics and Molecular Research, v. 6, n. 4, p. 911-922, 2007. |
url |
http://www.alice.cnptia.embrapa.br/alice/handle/doc/814 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa) instacron:EMBRAPA |
instname_str |
Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
instacron_str |
EMBRAPA |
institution |
EMBRAPA |
reponame_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
collection |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository.name.fl_str_mv |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
repository.mail.fl_str_mv |
cg-riaa@embrapa.br |
_version_ |
1794503435994791936 |