Is a genome a codeword of an error-correcting code?
Main author: | FARIA, L. C. B. |
---|---|
Publication date: | 2012 |
Other authors: | ROCHA, A. S. L.; KLEINSCHMIDT, J. H.; SILVA-FILHO, M. C.; BIM, E.; HERAI, R. H.; YAMAGISHI, M. E. B.; PALAZZO JÚNIOR, R. |
Document type: | Article |
Language: | eng |
Source title: | Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
Full text: | http://www.alice.cnptia.embrapa.br/alice/handle/doc/925637 |
Abstract: | Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction. |
id |
EMBR_d2ede18ccd338ff2d59e70bab7ef6a09 |
---|---|
oai_identifier_str |
oai:www.alice.cnptia.embrapa.br:doc/925637 |
network_acronym_str |
EMBR |
network_name_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository_id_str |
2154 |
title |
Is a genome a codeword of an error-correcting code? |
author |
FARIA, L. C. B. |
author_facet |
FARIA, L. C. B.; ROCHA, A. S. L.; KLEINSCHMIDT, J. H.; SILVA-FILHO, M. C.; BIM, E.; HERAI, R. H.; YAMAGISHI, M. E. B.; PALAZZO JÚNIOR, R. |
author_role |
author |
author2 |
ROCHA, A. S. L.; KLEINSCHMIDT, J. H.; SILVA-FILHO, M. C.; BIM, E.; HERAI, R. H.; YAMAGISHI, M. E. B.; PALAZZO JÚNIOR, R. |
author2_role |
author author author author author author author |
dc.contributor.none.fl_str_mv |
LUZINETE C. B. FARIA, Unicamp; ANDRÉA S. L. ROCHA, Unicamp; JOÃO H. KLEINSCHMIDT, UFABC; MÁRCIO C. SILVA-FILHO, Esalq/USP; EDSON BIM, Unicamp; ROBERTO H. HERAI, University of California San Diego; MICHEL EDUARDO BELEZA YAMAGISHI, CNPTIA; REGINALDO PALAZZO JÚNIOR, Unicamp. |
topic |
DNA sequence; Biology; Genome; Nucleotide sequences |
publishDate |
2012 |
dc.date.none.fl_str_mv |
2012-05-29T11:11:11Z 2012-05-29T11:11:11Z 2012-05-29T11:11:11Z 2012-05-29T11:11:11Z 2012-05-29 2012 2012-08-30T11:11:11Z |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
PLoS ONE, San Francisco, v. 7, n. 5, p. 1-9, May 2012. http://www.alice.cnptia.embrapa.br/alice/handle/doc/925637 |
url |
http://www.alice.cnptia.embrapa.br/alice/handle/doc/925637 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa) instacron:EMBRAPA |
instname_str |
Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
instacron_str |
EMBRAPA |
institution |
EMBRAPA |
reponame_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
collection |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository.name.fl_str_mv |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
repository.mail.fl_str_mv |
cg-riaa@embrapa.br |
_version_ |
1794503363520364544 |