Evaluation of genome similarities using a wavelet-domain approach

Detalhes bibliográficos
Autor(a) principal: Ferreira, Leila Maria
Data de Publicação: 2020
Outros Autores: Sáfadi, Thelma, Ferreira, Juliano Lino
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UFLA
Texto Completo: http://repositorio.ufla.br/jspui/handle/1/43458
Resumo: INTRODUCTION: Tuberculosis is listed among the top 10 causes of deaths worldwide. The resistant strains causing this disease have been considered to be responsible for public health emergencies and health security threats. As stated by the World Health Organization (WHO), around 558,000 different cases coupled with resistance to rifampicin (the most operative first-line drug) have been estimated to date. Therefore, in order to detect the resistant strains using the genomes of Mycobacterium tuberculosis (MTB), we propose a new methodology for the analysis of genomic similarities that associate the different levels of decomposition of the genome (discrete non-decimated wavelet transform) and the Hurst exponent. METHODS: The signals corresponding to the ten analyzed sequences were obtained by assessing GC content, and then these signals were decomposed using the discrete non-decimated wavelet transform along with the Daubechies wavelet with four null moments at five levels of decomposition. The Hurst exponent was calculated at each decomposition level using five different methods. The cluster analysis was performed using the results obtained for the Hurst exponent. RESULTS: The aggregated variance, differenced aggregated variance, and aggregated absolute value methods presented the formation of three groups, whereas the Peng and R/S methods presented the formation of two groups. The aggregated variance method exhibited the best results with respect to the group formation between similar strains. CONCLUSION: The evaluation of Hurst exponent associated with discrete non-decimated wavelet transform can be used as a measure of similarity between genome sequences, thus leading to a refinement in the analysis.
id UFLA_5cfe2226cfd5933c1f4b953ae14d6178
oai_identifier_str oai:localhost:1/43458
network_acronym_str UFLA
network_name_str Repositório Institucional da UFLA
repository_id_str
spelling Evaluation of genome similarities using a wavelet-domain approachGC contentHurst exponentMycobacterium tuberculosisDiscrete non-decimated wavelet transformTransformada não-decimada de ondaletasMétodo de variância agregadaExpoente de HurstINTRODUCTION: Tuberculosis is listed among the top 10 causes of deaths worldwide. The resistant strains causing this disease have been considered to be responsible for public health emergencies and health security threats. As stated by the World Health Organization (WHO), around 558,000 different cases coupled with resistance to rifampicin (the most operative first-line drug) have been estimated to date. Therefore, in order to detect the resistant strains using the genomes of Mycobacterium tuberculosis (MTB), we propose a new methodology for the analysis of genomic similarities that associate the different levels of decomposition of the genome (discrete non-decimated wavelet transform) and the Hurst exponent. METHODS: The signals corresponding to the ten analyzed sequences were obtained by assessing GC content, and then these signals were decomposed using the discrete non-decimated wavelet transform along with the Daubechies wavelet with four null moments at five levels of decomposition. The Hurst exponent was calculated at each decomposition level using five different methods. The cluster analysis was performed using the results obtained for the Hurst exponent. RESULTS: The aggregated variance, differenced aggregated variance, and aggregated absolute value methods presented the formation of three groups, whereas the Peng and R/S methods presented the formation of two groups. The aggregated variance method exhibited the best results with respect to the group formation between similar strains. CONCLUSION: The evaluation of Hurst exponent associated with discrete non-decimated wavelet transform can be used as a measure of similarity between genome sequences, thus leading to a refinement in the analysis.Sociedade Brasileira de Medicina Tropical - SBMT2020-10-19T21:54:35Z2020-10-19T21:54:35Z2020-05info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfFERREIRA, L. M.; SÁFADI, T.; FERREIRA, J. L. Evaluation of genome similarities using a wavelet-domain approach. Revista da Sociedade Brasileira de Medicina Tropical, Uberaba, v. 53, e20190470, 2020. DOI: https://doi.org/10.1590/0037-8682-0470-2019.http://repositorio.ufla.br/jspui/handle/1/43458Revista da Sociedade Brasileira de Medicina Tropicalreponame:Repositório Institucional da UFLAinstname:Universidade Federal de Lavras (UFLA)instacron:UFLAAttribution 4.0 Internationalhttp://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessFerreira, Leila MariaSáfadi, ThelmaFerreira, Juliano Linoeng2020-10-19T21:54:59Zoai:localhost:1/43458Repositório InstitucionalPUBhttp://repositorio.ufla.br/oai/requestnivaldo@ufla.br || repositorio.biblioteca@ufla.bropendoar:2020-10-19T21:54:59Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA)false
dc.title.none.fl_str_mv Evaluation of genome similarities using a wavelet-domain approach
title Evaluation of genome similarities using a wavelet-domain approach
spellingShingle Evaluation of genome similarities using a wavelet-domain approach
Ferreira, Leila Maria
GC content
Hurst exponent
Mycobacterium tuberculosis
Discrete non-decimated wavelet transform
Transformada não-decimada de ondaletas
Método de variância agregada
Expoente de Hurst
title_short Evaluation of genome similarities using a wavelet-domain approach
title_full Evaluation of genome similarities using a wavelet-domain approach
title_fullStr Evaluation of genome similarities using a wavelet-domain approach
title_full_unstemmed Evaluation of genome similarities using a wavelet-domain approach
title_sort Evaluation of genome similarities using a wavelet-domain approach
author Ferreira, Leila Maria
author_facet Ferreira, Leila Maria
Sáfadi, Thelma
Ferreira, Juliano Lino
author_role author
author2 Sáfadi, Thelma
Ferreira, Juliano Lino
author2_role author
author
dc.contributor.author.fl_str_mv Ferreira, Leila Maria
Sáfadi, Thelma
Ferreira, Juliano Lino
dc.subject.por.fl_str_mv GC content
Hurst exponent
Mycobacterium tuberculosis
Discrete non-decimated wavelet transform
Transformada não-decimada de ondaletas
Método de variância agregada
Expoente de Hurst
topic GC content
Hurst exponent
Mycobacterium tuberculosis
Discrete non-decimated wavelet transform
Transformada não-decimada de ondaletas
Método de variância agregada
Expoente de Hurst
description INTRODUCTION: Tuberculosis is listed among the top 10 causes of deaths worldwide. The resistant strains causing this disease have been considered to be responsible for public health emergencies and health security threats. As stated by the World Health Organization (WHO), around 558,000 different cases coupled with resistance to rifampicin (the most operative first-line drug) have been estimated to date. Therefore, in order to detect the resistant strains using the genomes of Mycobacterium tuberculosis (MTB), we propose a new methodology for the analysis of genomic similarities that associate the different levels of decomposition of the genome (discrete non-decimated wavelet transform) and the Hurst exponent. METHODS: The signals corresponding to the ten analyzed sequences were obtained by assessing GC content, and then these signals were decomposed using the discrete non-decimated wavelet transform along with the Daubechies wavelet with four null moments at five levels of decomposition. The Hurst exponent was calculated at each decomposition level using five different methods. The cluster analysis was performed using the results obtained for the Hurst exponent. RESULTS: The aggregated variance, differenced aggregated variance, and aggregated absolute value methods presented the formation of three groups, whereas the Peng and R/S methods presented the formation of two groups. The aggregated variance method exhibited the best results with respect to the group formation between similar strains. CONCLUSION: The evaluation of Hurst exponent associated with discrete non-decimated wavelet transform can be used as a measure of similarity between genome sequences, thus leading to a refinement in the analysis.
publishDate 2020
dc.date.none.fl_str_mv 2020-10-19T21:54:35Z
2020-10-19T21:54:35Z
2020-05
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv FERREIRA, L. M.; SÁFADI, T.; FERREIRA, J. L. Evaluation of genome similarities using a wavelet-domain approach. Revista da Sociedade Brasileira de Medicina Tropical, Uberaba, v. 53, e20190470, 2020. DOI: https://doi.org/10.1590/0037-8682-0470-2019.
http://repositorio.ufla.br/jspui/handle/1/43458
identifier_str_mv FERREIRA, L. M.; SÁFADI, T.; FERREIRA, J. L. Evaluation of genome similarities using a wavelet-domain approach. Revista da Sociedade Brasileira de Medicina Tropical, Uberaba, v. 53, e20190470, 2020. DOI: https://doi.org/10.1590/0037-8682-0470-2019.
url http://repositorio.ufla.br/jspui/handle/1/43458
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv Attribution 4.0 International
http://creativecommons.org/licenses/by/4.0/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution 4.0 International
http://creativecommons.org/licenses/by/4.0/
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Sociedade Brasileira de Medicina Tropical - SBMT
publisher.none.fl_str_mv Sociedade Brasileira de Medicina Tropical - SBMT
dc.source.none.fl_str_mv Revista da Sociedade Brasileira de Medicina Tropical
reponame:Repositório Institucional da UFLA
instname:Universidade Federal de Lavras (UFLA)
instacron:UFLA
instname_str Universidade Federal de Lavras (UFLA)
instacron_str UFLA
institution UFLA
reponame_str Repositório Institucional da UFLA
collection Repositório Institucional da UFLA
repository.name.fl_str_mv Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA)
repository.mail.fl_str_mv nivaldo@ufla.br || repositorio.biblioteca@ufla.br
_version_ 1815439087073492992