Is a genome a codeword of an error-correcting code?

Detalhes bibliográficos
Autor(a) principal: FARIA, L. C. B.
Data de Publicação: 2012
Outros Autores: ROCHA, A. S. L., KLEINSCHMIDT, J. H., SILVA-FILHO, M. C., BIM, E., HERAI, R. H., YAMAGISHI, M. E. B., PALAZZO JÚNIOR, R.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo: http://www.alice.cnptia.embrapa.br/alice/handle/doc/925637
Resumo: Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.
id EMBR_d2ede18ccd338ff2d59e70bab7ef6a09
oai_identifier_str oai:www.alice.cnptia.embrapa.br:doc/925637
network_acronym_str EMBR
network_name_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str 2154
spelling Is a genome a codeword of an error-correcting code?Sequência de DNABiologyBiologiaGenomeNucleotide sequencesSince a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.LUZINETE C. B. FARIA, Unicamp; ANDRÉA S. L. ROCHA, Unicamp; JOÃO H. KLEINSCHMIDT, UFABC; MÁRCIO C. SILVA-FILHO, Esalq/USP; EDSON BIM, Unicamp; ROBERTO H. HERAI, University of California San Diego; MICHEL EDUARDO BELEZA YAMAGISHI, CNPTIA; REGINALDO PALAZZO JÚNIOR, Unicamp.FARIA, L. C. B.ROCHA, A. S. L.KLEINSCHMIDT, J. H.SILVA-FILHO, M. C.BIM, E.HERAI, R. H.YAMAGISHI, M. E. B.PALAZZO JÚNIOR, R.2012-05-29T11:11:11Z2012-05-29T11:11:11Z2012-05-29T11:11:11Z2012-05-29T11:11:11Z2012-05-2920122012-08-30T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlePLoS ONE, San Francisco, v. 7, n. 5, p. 1-9, May 2012.http://www.alice.cnptia.embrapa.br/alice/handle/doc/925637enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2017-08-16T00:09:14Zoai:www.alice.cnptia.embrapa.br:doc/925637Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542017-08-16T00:09:14falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542017-08-16T00:09:14Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv Is a genome a codeword of an error-correcting code?
title Is a genome a codeword of an error-correcting code?
spellingShingle Is a genome a codeword of an error-correcting code?
FARIA, L. C. B.
Sequência de DNA
Biology
Biologia
Genome
Nucleotide sequences
title_short Is a genome a codeword of an error-correcting code?
title_full Is a genome a codeword of an error-correcting code?
title_fullStr Is a genome a codeword of an error-correcting code?
title_full_unstemmed Is a genome a codeword of an error-correcting code?
title_sort Is a genome a codeword of an error-correcting code?
author FARIA, L. C. B.
author_facet FARIA, L. C. B.
ROCHA, A. S. L.
KLEINSCHMIDT, J. H.
SILVA-FILHO, M. C.
BIM, E.
HERAI, R. H.
YAMAGISHI, M. E. B.
PALAZZO JÚNIOR, R.
author_role author
author2 ROCHA, A. S. L.
KLEINSCHMIDT, J. H.
SILVA-FILHO, M. C.
BIM, E.
HERAI, R. H.
YAMAGISHI, M. E. B.
PALAZZO JÚNIOR, R.
author2_role author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv LUZINETE C. B. FARIA, Unicamp; ANDRÉA S. L. ROCHA, Unicamp; JOÃO H. KLEINSCHMIDT, UFABC; MÁRCIO C. SILVA-FILHO, Esalq/USP; EDSON BIM, Unicamp; ROBERTO H. HERAI, University of California San Diego; MICHEL EDUARDO BELEZA YAMAGISHI, CNPTIA; REGINALDO PALAZZO JÚNIOR, Unicamp.
dc.contributor.author.fl_str_mv FARIA, L. C. B.
ROCHA, A. S. L.
KLEINSCHMIDT, J. H.
SILVA-FILHO, M. C.
BIM, E.
HERAI, R. H.
YAMAGISHI, M. E. B.
PALAZZO JÚNIOR, R.
dc.subject.por.fl_str_mv Sequência de DNA
Biology
Biologia
Genome
Nucleotide sequences
topic Sequência de DNA
Biology
Biologia
Genome
Nucleotide sequences
description Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.
publishDate 2012
dc.date.none.fl_str_mv 2012-05-29T11:11:11Z
2012-05-29T11:11:11Z
2012-05-29T11:11:11Z
2012-05-29T11:11:11Z
2012-05-29
2012
2012-08-30T11:11:11Z
dc.type.driver.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv PLoS ONE, San Francisco, v. 7, n. 5, p. 1-9, May 2012.
http://www.alice.cnptia.embrapa.br/alice/handle/doc/925637
identifier_str_mv PLoS ONE, San Francisco, v. 7, n. 5, p. 1-9, May 2012.
url http://www.alice.cnptia.embrapa.br/alice/handle/doc/925637
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv cg-riaa@embrapa.br
_version_ 1794503363520364544