Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.

Detalhes bibliográficos
Autor(a) principal: OLIVEIRA, S. R. de M.
Data de Publicação: 2007
Outros Autores: ALMEIDA, G. V., SOUZA, K. R. R., RODRIGUES, D. N., KUSER-FALCÃO, P. R., YAMAGISHI, M. E. B., SANTOS, E. H. dos, VIEIRA, F. D., JARDINE, J. G., NESHICH, G.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo: http://www.alice.cnptia.embrapa.br/alice/handle/doc/814
Resumo: Abstract. An effective strategy for managing protein databases is to provide mechanisms to transform raw data into consistent, accurate and reliable information. Such mechanisms will greatly reduce operational inefficiencies and improve one's ability to better handle scientific objectives and interpret the research results. To achieve this challenging goal for the STING project, we introduce Sting_RDB, a relational database of structural parameters for protein analysis with support for data warehousing and data mining. In this article, we highlight the main features of Sting_RDB and show how a user can explore it for efficient and biologically relevant queries. Considering its importance for molecular biologists, effort has been made to advance Sting_RDB toward data quality assessment. To the best of our knowledge, Sting_RDB is one of the most comprehensive data repositories for protein analysis, now also capable of providing its users with a data quality indicator. This paper differs from our previous study in many aspects. First, we introduce Sting_RDB, a relational database with mechanisms for efficient and relevant queries using SQL. Sting_rdb evolved from the earlier, text (flat file)-based database, in which data consistency and integrity was not guaranteed. Second, we provide support for data warehousing and mining. Third, the data quality indicator was introduced. Finally and probably most importantly, complex queries that could not be posed on a text-based database, are now easily implemented.
id EMBR_6d0f50a0d3318262372755ce7b017a1f
oai_identifier_str oai:www.alice.cnptia.embrapa.br:doc/814
network_acronym_str EMBR
network_name_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str 2154
spelling Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.BioinformáticaAnálise de estrutura de proteínasMineração de dadosBase de dados StingData miningData warehousingProteínaBioinformaticsProteinsDatabasesAbstract. An effective strategy for managing protein databases is to provide mechanisms to transform raw data into consistent, accurate and reliable information. Such mechanisms will greatly reduce operational inefficiencies and improve one's ability to better handle scientific objectives and interpret the research results. To achieve this challenging goal for the STING project, we introduce Sting_RDB, a relational database of structural parameters for protein analysis with support for data warehousing and data mining. In this article, we highlight the main features of Sting_RDB and show how a user can explore it for efficient and biologically relevant queries. Considering its importance for molecular biologists, effort has been made to advance Sting_RDB toward data quality assessment. To the best of our knowledge, Sting_RDB is one of the most comprehensive data repositories for protein analysis, now also capable of providing its users with a data quality indicator. This paper differs from our previous study in many aspects. First, we introduce Sting_RDB, a relational database with mechanisms for efficient and relevant queries using SQL. Sting_rdb evolved from the earlier, text (flat file)-based database, in which data consistency and integrity was not guaranteed. Second, we provide support for data warehousing and mining. Third, the data quality indicator was introduced. Finally and probably most importantly, complex queries that could not be posed on a text-based database, are now easily implemented.STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA; PAULA REGINA KUSER FALCAO, CNPTIA; MICHEL EDUARDO BELEZA YAMAGISHI, CNPTIA; EDGARD HENRIQUE DOS SANTOS, CNPTIA; FABIO DANILO VIEIRA, CNPTIA; JOSE GILBERTO JARDINE, CNPTIA; GORAN NESHICH, CNPTIA.OLIVEIRA, S. R. de M.ALMEIDA, G. V.SOUZA, K. R. R.RODRIGUES, D. N.KUSER-FALCÃO, P. R.YAMAGISHI, M. E. B.SANTOS, E. H. dosVIEIRA, F. D.JARDINE, J. G.NESHICH, G.2011-04-10T11:11:11Z2011-04-10T11:11:11Z2007-12-0720072017-05-11T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleGenetics and Molecular Research, v. 6, n. 4, p. 911-922, 2007.http://www.alice.cnptia.embrapa.br/alice/handle/doc/814enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2017-05-12T01:37:50Zoai:www.alice.cnptia.embrapa.br:doc/814Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542017-05-12T01:37:50falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542017-05-12T01:37:50Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.
title Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.
spellingShingle Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.
OLIVEIRA, S. R. de M.
Bioinformática
Análise de estrutura de proteínas
Mineração de dados
Base de dados Sting
Data mining
Data warehousing
Proteína
Bioinformatics
Proteins
Databases
title_short Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.
title_full Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.
title_fullStr Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.
title_full_unstemmed Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.
title_sort Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.
author OLIVEIRA, S. R. de M.
author_facet OLIVEIRA, S. R. de M.
ALMEIDA, G. V.
SOUZA, K. R. R.
RODRIGUES, D. N.
KUSER-FALCÃO, P. R.
YAMAGISHI, M. E. B.
SANTOS, E. H. dos
VIEIRA, F. D.
JARDINE, J. G.
NESHICH, G.
author_role author
author2 ALMEIDA, G. V.
SOUZA, K. R. R.
RODRIGUES, D. N.
KUSER-FALCÃO, P. R.
YAMAGISHI, M. E. B.
SANTOS, E. H. dos
VIEIRA, F. D.
JARDINE, J. G.
NESHICH, G.
author2_role author
author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA; PAULA REGINA KUSER FALCAO, CNPTIA; MICHEL EDUARDO BELEZA YAMAGISHI, CNPTIA; EDGARD HENRIQUE DOS SANTOS, CNPTIA; FABIO DANILO VIEIRA, CNPTIA; JOSE GILBERTO JARDINE, CNPTIA; GORAN NESHICH, CNPTIA.
dc.contributor.author.fl_str_mv OLIVEIRA, S. R. de M.
ALMEIDA, G. V.
SOUZA, K. R. R.
RODRIGUES, D. N.
KUSER-FALCÃO, P. R.
YAMAGISHI, M. E. B.
SANTOS, E. H. dos
VIEIRA, F. D.
JARDINE, J. G.
NESHICH, G.
dc.subject.por.fl_str_mv Bioinformática
Análise de estrutura de proteínas
Mineração de dados
Base de dados Sting
Data mining
Data warehousing
Proteína
Bioinformatics
Proteins
Databases
topic Bioinformática
Análise de estrutura de proteínas
Mineração de dados
Base de dados Sting
Data mining
Data warehousing
Proteína
Bioinformatics
Proteins
Databases
description Abstract. An effective strategy for managing protein databases is to provide mechanisms to transform raw data into consistent, accurate and reliable information. Such mechanisms will greatly reduce operational inefficiencies and improve one's ability to better handle scientific objectives and interpret the research results. To achieve this challenging goal for the STING project, we introduce Sting_RDB, a relational database of structural parameters for protein analysis with support for data warehousing and data mining. In this article, we highlight the main features of Sting_RDB and show how a user can explore it for efficient and biologically relevant queries. Considering its importance for molecular biologists, effort has been made to advance Sting_RDB toward data quality assessment. To the best of our knowledge, Sting_RDB is one of the most comprehensive data repositories for protein analysis, now also capable of providing its users with a data quality indicator. This paper differs from our previous study in many aspects. First, we introduce Sting_RDB, a relational database with mechanisms for efficient and relevant queries using SQL. Sting_rdb evolved from the earlier, text (flat file)-based database, in which data consistency and integrity was not guaranteed. Second, we provide support for data warehousing and mining. Third, the data quality indicator was introduced. Finally and probably most importantly, complex queries that could not be posed on a text-based database, are now easily implemented.
publishDate 2007
dc.date.none.fl_str_mv 2007-12-07
2007
2011-04-10T11:11:11Z
2011-04-10T11:11:11Z
2017-05-11T11:11:11Z
dc.type.driver.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv Genetics and Molecular Research, v. 6, n. 4, p. 911-922, 2007.
http://www.alice.cnptia.embrapa.br/alice/handle/doc/814
identifier_str_mv Genetics and Molecular Research, v. 6, n. 4, p. 911-922, 2007.
url http://www.alice.cnptia.embrapa.br/alice/handle/doc/814
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv cg-riaa@embrapa.br
_version_ 1794503435994791936