Evaluation of genome similarities using a wavelet-domain approach
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UFLA |
Texto Completo: | http://repositorio.ufla.br/jspui/handle/1/43458 |
Resumo: | INTRODUCTION: Tuberculosis is listed among the top 10 causes of deaths worldwide. The resistant strains causing this disease have been considered to be responsible for public health emergencies and health security threats. As stated by the World Health Organization (WHO), around 558,000 different cases coupled with resistance to rifampicin (the most operative first-line drug) have been estimated to date. Therefore, in order to detect the resistant strains using the genomes of Mycobacterium tuberculosis (MTB), we propose a new methodology for the analysis of genomic similarities that associate the different levels of decomposition of the genome (discrete non-decimated wavelet transform) and the Hurst exponent. METHODS: The signals corresponding to the ten analyzed sequences were obtained by assessing GC content, and then these signals were decomposed using the discrete non-decimated wavelet transform along with the Daubechies wavelet with four null moments at five levels of decomposition. The Hurst exponent was calculated at each decomposition level using five different methods. The cluster analysis was performed using the results obtained for the Hurst exponent. RESULTS: The aggregated variance, differenced aggregated variance, and aggregated absolute value methods presented the formation of three groups, whereas the Peng and R/S methods presented the formation of two groups. The aggregated variance method exhibited the best results with respect to the group formation between similar strains. CONCLUSION: The evaluation of Hurst exponent associated with discrete non-decimated wavelet transform can be used as a measure of similarity between genome sequences, thus leading to a refinement in the analysis. |
id |
UFLA_5cfe2226cfd5933c1f4b953ae14d6178 |
---|---|
oai_identifier_str |
oai:localhost:1/43458 |
network_acronym_str |
UFLA |
network_name_str |
Repositório Institucional da UFLA |
repository_id_str |
|
spelling |
Evaluation of genome similarities using a wavelet-domain approachGC contentHurst exponentMycobacterium tuberculosisDiscrete non-decimated wavelet transformTransformada não-decimada de ondaletasMétodo de variância agregadaExpoente de HurstINTRODUCTION: Tuberculosis is listed among the top 10 causes of deaths worldwide. The resistant strains causing this disease have been considered to be responsible for public health emergencies and health security threats. As stated by the World Health Organization (WHO), around 558,000 different cases coupled with resistance to rifampicin (the most operative first-line drug) have been estimated to date. Therefore, in order to detect the resistant strains using the genomes of Mycobacterium tuberculosis (MTB), we propose a new methodology for the analysis of genomic similarities that associate the different levels of decomposition of the genome (discrete non-decimated wavelet transform) and the Hurst exponent. METHODS: The signals corresponding to the ten analyzed sequences were obtained by assessing GC content, and then these signals were decomposed using the discrete non-decimated wavelet transform along with the Daubechies wavelet with four null moments at five levels of decomposition. The Hurst exponent was calculated at each decomposition level using five different methods. The cluster analysis was performed using the results obtained for the Hurst exponent. RESULTS: The aggregated variance, differenced aggregated variance, and aggregated absolute value methods presented the formation of three groups, whereas the Peng and R/S methods presented the formation of two groups. The aggregated variance method exhibited the best results with respect to the group formation between similar strains. CONCLUSION: The evaluation of Hurst exponent associated with discrete non-decimated wavelet transform can be used as a measure of similarity between genome sequences, thus leading to a refinement in the analysis.Sociedade Brasileira de Medicina Tropical - SBMT2020-10-19T21:54:35Z2020-10-19T21:54:35Z2020-05info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfFERREIRA, L. M.; SÁFADI, T.; FERREIRA, J. L. Evaluation of genome similarities using a wavelet-domain approach. Revista da Sociedade Brasileira de Medicina Tropical, Uberaba, v. 53, e20190470, 2020. DOI: https://doi.org/10.1590/0037-8682-0470-2019.http://repositorio.ufla.br/jspui/handle/1/43458Revista da Sociedade Brasileira de Medicina Tropicalreponame:Repositório Institucional da UFLAinstname:Universidade Federal de Lavras (UFLA)instacron:UFLAAttribution 4.0 Internationalhttp://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessFerreira, Leila MariaSáfadi, ThelmaFerreira, Juliano Linoeng2020-10-19T21:54:59Zoai:localhost:1/43458Repositório InstitucionalPUBhttp://repositorio.ufla.br/oai/requestnivaldo@ufla.br || repositorio.biblioteca@ufla.bropendoar:2020-10-19T21:54:59Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA)false |
dc.title.none.fl_str_mv |
Evaluation of genome similarities using a wavelet-domain approach |
title |
Evaluation of genome similarities using a wavelet-domain approach |
spellingShingle |
Evaluation of genome similarities using a wavelet-domain approach Ferreira, Leila Maria GC content Hurst exponent Mycobacterium tuberculosis Discrete non-decimated wavelet transform Transformada não-decimada de ondaletas Método de variância agregada Expoente de Hurst |
title_short |
Evaluation of genome similarities using a wavelet-domain approach |
title_full |
Evaluation of genome similarities using a wavelet-domain approach |
title_fullStr |
Evaluation of genome similarities using a wavelet-domain approach |
title_full_unstemmed |
Evaluation of genome similarities using a wavelet-domain approach |
title_sort |
Evaluation of genome similarities using a wavelet-domain approach |
author |
Ferreira, Leila Maria |
author_facet |
Ferreira, Leila Maria Sáfadi, Thelma Ferreira, Juliano Lino |
author_role |
author |
author2 |
Sáfadi, Thelma Ferreira, Juliano Lino |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Ferreira, Leila Maria Sáfadi, Thelma Ferreira, Juliano Lino |
dc.subject.por.fl_str_mv |
GC content Hurst exponent Mycobacterium tuberculosis Discrete non-decimated wavelet transform Transformada não-decimada de ondaletas Método de variância agregada Expoente de Hurst |
topic |
GC content Hurst exponent Mycobacterium tuberculosis Discrete non-decimated wavelet transform Transformada não-decimada de ondaletas Método de variância agregada Expoente de Hurst |
description |
INTRODUCTION: Tuberculosis is listed among the top 10 causes of deaths worldwide. The resistant strains causing this disease have been considered to be responsible for public health emergencies and health security threats. As stated by the World Health Organization (WHO), around 558,000 different cases coupled with resistance to rifampicin (the most operative first-line drug) have been estimated to date. Therefore, in order to detect the resistant strains using the genomes of Mycobacterium tuberculosis (MTB), we propose a new methodology for the analysis of genomic similarities that associate the different levels of decomposition of the genome (discrete non-decimated wavelet transform) and the Hurst exponent. METHODS: The signals corresponding to the ten analyzed sequences were obtained by assessing GC content, and then these signals were decomposed using the discrete non-decimated wavelet transform along with the Daubechies wavelet with four null moments at five levels of decomposition. The Hurst exponent was calculated at each decomposition level using five different methods. The cluster analysis was performed using the results obtained for the Hurst exponent. RESULTS: The aggregated variance, differenced aggregated variance, and aggregated absolute value methods presented the formation of three groups, whereas the Peng and R/S methods presented the formation of two groups. The aggregated variance method exhibited the best results with respect to the group formation between similar strains. CONCLUSION: The evaluation of Hurst exponent associated with discrete non-decimated wavelet transform can be used as a measure of similarity between genome sequences, thus leading to a refinement in the analysis. |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020-10-19T21:54:35Z 2020-10-19T21:54:35Z 2020-05 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
FERREIRA, L. M.; SÁFADI, T.; FERREIRA, J. L. Evaluation of genome similarities using a wavelet-domain approach. Revista da Sociedade Brasileira de Medicina Tropical, Uberaba, v. 53, e20190470, 2020. DOI: https://doi.org/10.1590/0037-8682-0470-2019. http://repositorio.ufla.br/jspui/handle/1/43458 |
identifier_str_mv |
FERREIRA, L. M.; SÁFADI, T.; FERREIRA, J. L. Evaluation of genome similarities using a wavelet-domain approach. Revista da Sociedade Brasileira de Medicina Tropical, Uberaba, v. 53, e20190470, 2020. DOI: https://doi.org/10.1590/0037-8682-0470-2019. |
url |
http://repositorio.ufla.br/jspui/handle/1/43458 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
Attribution 4.0 International http://creativecommons.org/licenses/by/4.0/ info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Attribution 4.0 International http://creativecommons.org/licenses/by/4.0/ |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Sociedade Brasileira de Medicina Tropical - SBMT |
publisher.none.fl_str_mv |
Sociedade Brasileira de Medicina Tropical - SBMT |
dc.source.none.fl_str_mv |
Revista da Sociedade Brasileira de Medicina Tropical reponame:Repositório Institucional da UFLA instname:Universidade Federal de Lavras (UFLA) instacron:UFLA |
instname_str |
Universidade Federal de Lavras (UFLA) |
instacron_str |
UFLA |
institution |
UFLA |
reponame_str |
Repositório Institucional da UFLA |
collection |
Repositório Institucional da UFLA |
repository.name.fl_str_mv |
Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA) |
repository.mail.fl_str_mv |
nivaldo@ufla.br || repositorio.biblioteca@ufla.br |
_version_ |
1815439087073492992 |