Inference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed data

Detalhes bibliográficos
Autor(a) principal: VEDOVATTO, Thiago
Data de Publicação: 2019
Tipo de documento: Tese
Idioma: eng
Título da fonte: Repositório Institucional da UFPE
Texto Completo: https://repositorio.ufpe.br/handle/123456789/33110
Resumo: The Cauchy-Rayleigh (CR) distribution has been successfully used to describe asymmetrical and heavy-tail events from radar imagery. Employing such model to describe lifetime data may then seem attractive, but drawbacks arise: its probability density function does not cover non-modal behavior as well as its hazard rate function (hrf) assumes only one form. To outperform this difficulty, it is investigated the exponentiated Cauchy-Rayleigh (ECR) distribution. This byparameteric model is flexible enough to accommodate hrf with decreasing, decreasing-increasing-decreasing and upside-down bathtub forms. Several closed-form mathematical expressions for the ECR model are obtained: median, mode, some moments, (h;Φ)-entropies and Fisher information matrix. Their non-existence respective cases are also determined. It is proposed two estimators for the ECR parameters: maximum likelihood (ML) and percentile-based methods. Both of this methods may be biased for small and moderate sample sizes. To overcome it we furnish a expression for its second-order bias according to Cox and Snell (1968) and propose a third bias-corrected ML estimator. Further discussions about hypotheses-based inference and estimation formulas on censored-data are furnished as well. A simulation study is done to assess the estimators performance. The estimates existence and uniqueness are not guaranteed, thus procedures to constrained estimation are developed to overcome this trouble. Notes about hypothesis tests are given under censored and uncensored schemes. An application in a survival dataset illustrates the proposed model usefulness. Results point out that the ECR distribution may outperform classical lifetime biparametric models, such as the gamma, Birnbaum-Saunders, Weibull and log-normal laws, before heavy-tail data. The likelihood ratio test are compared against entropy-based tests as urban texture detector using a San Francisco synthetic aperture radar imagery. This same image is also the final application target which consist of compare segmentation algorithms based on CR and ECR entropies densities finite mixture. It is recurred to a result due Pardo et al. (1997) which provides the (h;Φ)-entropies asymptotic distribution. The finite mixture log-likelihood function is maximized using the Expectation-Maximization algorithm.
id UFPE_f285c27b4fabb5fe30f1c68e22377f48
oai_identifier_str oai:repositorio.ufpe.br:123456789/33110
network_acronym_str UFPE
network_name_str Repositório Institucional da UFPE
repository_id_str 2221
spelling VEDOVATTO, Thiagohttp://lattes.cnpq.br/1555798555635250http://lattes.cnpq.br/9853084384672692NASCIMENTO, Abraão David Costa do2019-09-17T21:33:49Z2019-09-17T21:33:49Z2019-02-25https://repositorio.ufpe.br/handle/123456789/33110The Cauchy-Rayleigh (CR) distribution has been successfully used to describe asymmetrical and heavy-tail events from radar imagery. Employing such model to describe lifetime data may then seem attractive, but drawbacks arise: its probability density function does not cover non-modal behavior as well as its hazard rate function (hrf) assumes only one form. To outperform this difficulty, it is investigated the exponentiated Cauchy-Rayleigh (ECR) distribution. This byparameteric model is flexible enough to accommodate hrf with decreasing, decreasing-increasing-decreasing and upside-down bathtub forms. Several closed-form mathematical expressions for the ECR model are obtained: median, mode, some moments, (h;Φ)-entropies and Fisher information matrix. Their non-existence respective cases are also determined. It is proposed two estimators for the ECR parameters: maximum likelihood (ML) and percentile-based methods. Both of this methods may be biased for small and moderate sample sizes. To overcome it we furnish a expression for its second-order bias according to Cox and Snell (1968) and propose a third bias-corrected ML estimator. Further discussions about hypotheses-based inference and estimation formulas on censored-data are furnished as well. A simulation study is done to assess the estimators performance. The estimates existence and uniqueness are not guaranteed, thus procedures to constrained estimation are developed to overcome this trouble. Notes about hypothesis tests are given under censored and uncensored schemes. An application in a survival dataset illustrates the proposed model usefulness. Results point out that the ECR distribution may outperform classical lifetime biparametric models, such as the gamma, Birnbaum-Saunders, Weibull and log-normal laws, before heavy-tail data. The likelihood ratio test are compared against entropy-based tests as urban texture detector using a San Francisco synthetic aperture radar imagery. This same image is also the final application target which consist of compare segmentation algorithms based on CR and ECR entropies densities finite mixture. It is recurred to a result due Pardo et al. (1997) which provides the (h;Φ)-entropies asymptotic distribution. The finite mixture log-likelihood function is maximized using the Expectation-Maximization algorithm.IFGA distribuição de Cauchy-Rayleigh (CR) tem sido usada com sucesso para descrever dados assimétricos e eventos com caudas pesadas de imagens de radar. Empregar tal modelo para descrever dados de sobrevivência poder ser atrativo, mas inconvenientes surgem: sua função de densidade de probabilidade não abriga comportamento amodal bem como sua função de taxa de falha (hrf) assume apenas uma forma. Para superar essa dificuldade, é investigada a distribuição Cauchy-Rayleigh exponencializada (ECR). Este modelo biparamétrico é flexivel o bastante para acomodar hrf com formas decrescente, decrescente-crescente-decrescente e banheira invertida. Várias expressões matemáticas em forma fechada para o modelo ECR são obtidas: mediana, moda, alguns momentos, (h;Φ)-entropias e matriz de informação de Fisher. Seus respectivos casos de não existência também são determinados. São propostos dois estimadores para os parâmetros da ECR: métodos de máxima verossimilhança (ML) e estimação quantílica. Ambos os métodos podem ser viesados para tamanhos de amostra pequenos e moderados. Para superar isto, fornecemos uma expressão para o viés de segunda ordem de acordo com Cox e Snell (1968) e propusemos um estimador de máxima verossimilhança com viés de terceira ordem corrigido. Discussões adicionais sobre inferência baseada em hipóteses e fórmulas de estimação em dados censurados são fornecidas. Um estudo de simulação é feito para aferir a performance dos estimadores. A existência e unicidade das estimativas não é garantida, assim procedimentos para estimação restrita são desenvolvidos para superar esse problema. Notas sobre teste de hipótese são dadas considerando esquemas censurados e não-censurados. Uma aplicação em dados de sobrevida ilustra a utilidade do modelo proposto. Os resultados apontam que a distribuição ECR pode superar modelos biparamétricos de sobrevivência clássicos como gama, Birnbaum-Saunders, Weibull e log-normal. O teste da razão de verossimilhança é comparado com testes baseados em entropias como detector de texturas urbanas em uma imagem synthetic aperture radar da cidade de São Francisco. Esta mesma imagem é também alvo da aplicação final que consiste em comparar algorítmos de segmentação baseados em misturas finitas das densidades das entropias dos modelos CR e ECR. Recorremos a um resultado de Pardo et al. (1997) o qual nos dá distribuição assintótica das (h;Φ)-entropias. A função de log-verossimilhança das misturas finitas é maximizada usando o algorítmo EM.engUniversidade Federal de PernambucoPrograma de Pos Graduacao em EstatisticaUFPEBrasilAttribution-NonCommercial-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessEstatísticaInferênciaProbabilidadeInference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed dataInference, information theory, and segmentation based on the extended Cauchy-Rayleigh model: applications to heavy-tailed datainfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisdoutoradoreponame:Repositório Institucional da UFPEinstname:Universidade Federal de Pernambuco (UFPE)instacron:UFPETHUMBNAILTESE Thiago VedoVatto.pdf.jpgTESE Thiago VedoVatto.pdf.jpgGenerated Thumbnailimage/jpeg1372https://repositorio.ufpe.br/bitstream/123456789/33110/5/TESE%20Thiago%20VedoVatto.pdf.jpg1a7056fb3c0fe4a5fc31dcec6ba41a08MD55ORIGINALTESE Thiago VedoVatto.pdfTESE Thiago VedoVatto.pdfapplication/pdf9122405https://repositorio.ufpe.br/bitstream/123456789/33110/1/TESE%20Thiago%20VedoVatto.pdf443713a6f9ffc10db020c09dfa3750d1MD51CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8811https://repositorio.ufpe.br/bitstream/123456789/33110/2/license_rdfe39d27027a6cc9cb039ad269a5db8e34MD52LICENSElicense.txtlicense.txttext/plain; charset=utf-82310https://repositorio.ufpe.br/bitstream/123456789/33110/3/license.txtbd573a5ca8288eb7272482765f819534MD53TEXTTESE Thiago VedoVatto.pdf.txtTESE Thiago VedoVatto.pdf.txtExtracted texttext/plain297273https://repositorio.ufpe.br/bitstream/123456789/33110/4/TESE%20Thiago%20VedoVatto.pdf.txtb310509a573b67fb17d17e5b375dc48eMD54123456789/331102021-06-13 09:52:24.456oai:repositorio.ufpe.br:123456789/33110TGljZW7Dp2EgZGUgRGlzdHJpYnVpw6fDo28gTsOjbyBFeGNsdXNpdmEKClRvZG8gZGVwb3NpdGFudGUgZGUgbWF0ZXJpYWwgbm8gUmVwb3NpdMOzcmlvIEluc3RpdHVjaW9uYWwgKFJJKSBkZXZlIGNvbmNlZGVyLCDDoCBVbml2ZXJzaWRhZGUgRmVkZXJhbCBkZSBQZXJuYW1idWNvIChVRlBFKSwgdW1hIExpY2Vuw6dhIGRlIERpc3RyaWJ1acOnw6NvIE7Do28gRXhjbHVzaXZhIHBhcmEgbWFudGVyIGUgdG9ybmFyIGFjZXNzw612ZWlzIG9zIHNldXMgZG9jdW1lbnRvcywgZW0gZm9ybWF0byBkaWdpdGFsLCBuZXN0ZSByZXBvc2l0w7NyaW8uCgpDb20gYSBjb25jZXNzw6NvIGRlc3RhIGxpY2Vuw6dhIG7Do28gZXhjbHVzaXZhLCBvIGRlcG9zaXRhbnRlIG1hbnTDqW0gdG9kb3Mgb3MgZGlyZWl0b3MgZGUgYXV0b3IuCl9fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fXwoKTGljZW7Dp2EgZGUgRGlzdHJpYnVpw6fDo28gTsOjbyBFeGNsdXNpdmEKCkFvIGNvbmNvcmRhciBjb20gZXN0YSBsaWNlbsOnYSBlIGFjZWl0w6EtbGEsIHZvY8OqIChhdXRvciBvdSBkZXRlbnRvciBkb3MgZGlyZWl0b3MgYXV0b3JhaXMpOgoKYSkgRGVjbGFyYSBxdWUgY29uaGVjZSBhIHBvbMOtdGljYSBkZSBjb3B5cmlnaHQgZGEgZWRpdG9yYSBkbyBzZXUgZG9jdW1lbnRvOwpiKSBEZWNsYXJhIHF1ZSBjb25oZWNlIGUgYWNlaXRhIGFzIERpcmV0cml6ZXMgcGFyYSBvIFJlcG9zaXTDs3JpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGUEU7CmMpIENvbmNlZGUgw6AgVUZQRSBvIGRpcmVpdG8gbsOjbyBleGNsdXNpdm8gZGUgYXJxdWl2YXIsIHJlcHJvZHV6aXIsIGNvbnZlcnRlciAoY29tbyBkZWZpbmlkbyBhIHNlZ3VpciksIGNvbXVuaWNhciBlL291IGRpc3RyaWJ1aXIsIG5vIFJJLCBvIGRvY3VtZW50byBlbnRyZWd1ZSAoaW5jbHVpbmRvIG8gcmVzdW1vL2Fic3RyYWN0KSBlbSBmb3JtYXRvIGRpZ2l0YWwgb3UgcG9yIG91dHJvIG1laW87CmQpIERlY2xhcmEgcXVlIGF1dG9yaXphIGEgVUZQRSBhIGFycXVpdmFyIG1haXMgZGUgdW1hIGPDs3BpYSBkZXN0ZSBkb2N1bWVudG8gZSBjb252ZXJ0w6otbG8sIHNlbSBhbHRlcmFyIG8gc2V1IGNvbnRlw7pkbywgcGFyYSBxdWFscXVlciBmb3JtYXRvIGRlIGZpY2hlaXJvLCBtZWlvIG91IHN1cG9ydGUsIHBhcmEgZWZlaXRvcyBkZSBzZWd1cmFuw6dhLCBwcmVzZXJ2YcOnw6NvIChiYWNrdXApIGUgYWNlc3NvOwplKSBEZWNsYXJhIHF1ZSBvIGRvY3VtZW50byBzdWJtZXRpZG8gw6kgbyBzZXUgdHJhYmFsaG8gb3JpZ2luYWwgZSBxdWUgZGV0w6ltIG8gZGlyZWl0byBkZSBjb25jZWRlciBhIHRlcmNlaXJvcyBvcyBkaXJlaXRvcyBjb250aWRvcyBuZXN0YSBsaWNlbsOnYS4gRGVjbGFyYSB0YW1iw6ltIHF1ZSBhIGVudHJlZ2EgZG8gZG9jdW1lbnRvIG7Do28gaW5mcmluZ2Ugb3MgZGlyZWl0b3MgZGUgb3V0cmEgcGVzc29hIG91IGVudGlkYWRlOwpmKSBEZWNsYXJhIHF1ZSwgbm8gY2FzbyBkbyBkb2N1bWVudG8gc3VibWV0aWRvIGNvbnRlciBtYXRlcmlhbCBkbyBxdWFsIG7Do28gZGV0w6ltIG9zIGRpcmVpdG9zIGRlCmF1dG9yLCBvYnRldmUgYSBhdXRvcml6YcOnw6NvIGlycmVzdHJpdGEgZG8gcmVzcGVjdGl2byBkZXRlbnRvciBkZXNzZXMgZGlyZWl0b3MgcGFyYSBjZWRlciDDoApVRlBFIG9zIGRpcmVpdG9zIHJlcXVlcmlkb3MgcG9yIGVzdGEgTGljZW7Dp2EgZSBhdXRvcml6YXIgYSB1bml2ZXJzaWRhZGUgYSB1dGlsaXrDoS1sb3MgbGVnYWxtZW50ZS4gRGVjbGFyYSB0YW1iw6ltIHF1ZSBlc3NlIG1hdGVyaWFsIGN1am9zIGRpcmVpdG9zIHPDo28gZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3UgY29udGXDumRvIGRvIGRvY3VtZW50byBlbnRyZWd1ZTsKZykgU2UgbyBkb2N1bWVudG8gZW50cmVndWUgw6kgYmFzZWFkbyBlbSB0cmFiYWxobyBmaW5hbmNpYWRvIG91IGFwb2lhZG8gcG9yIG91dHJhIGluc3RpdHVpw6fDo28gcXVlIG7Do28gYSBVRlBFLCBkZWNsYXJhIHF1ZSBjdW1wcml1IHF1YWlzcXVlciBvYnJpZ2HDp8O1ZXMgZXhpZ2lkYXMgcGVsbyByZXNwZWN0aXZvIGNvbnRyYXRvIG91IGFjb3Jkby4KCkEgVUZQRSBpZGVudGlmaWNhcsOhIGNsYXJhbWVudGUgbyhzKSBub21lKHMpIGRvKHMpIGF1dG9yIChlcykgZG9zIGRpcmVpdG9zIGRvIGRvY3VtZW50byBlbnRyZWd1ZSBlIG7Do28gZmFyw6EgcXVhbHF1ZXIgYWx0ZXJhw6fDo28sIHBhcmEgYWzDqW0gZG8gcHJldmlzdG8gbmEgYWzDrW5lYSBjKS4KRepositório InstitucionalPUBhttps://repositorio.ufpe.br/oai/requestattena@ufpe.bropendoar:22212021-06-13T12:52:24Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE)false
dc.title.pt_BR.fl_str_mv Inference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed data
dc.title.alternative.pt_BR.fl_str_mv Inference, information theory, and segmentation based on the extended Cauchy-Rayleigh model: applications to heavy-tailed data
title Inference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed data
spellingShingle Inference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed data
VEDOVATTO, Thiago
Estatística
Inferência
Probabilidade
title_short Inference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed data
title_full Inference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed data
title_fullStr Inference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed data
title_full_unstemmed Inference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed data
title_sort Inference, information theory and segmentation based on an extended Cauchy-Rayleigh distribution : applications to heavy tailed data
author VEDOVATTO, Thiago
author_facet VEDOVATTO, Thiago
author_role author
dc.contributor.authorLattes.pt_BR.fl_str_mv http://lattes.cnpq.br/1555798555635250
dc.contributor.advisorLattes.pt_BR.fl_str_mv http://lattes.cnpq.br/9853084384672692
dc.contributor.author.fl_str_mv VEDOVATTO, Thiago
dc.contributor.advisor1.fl_str_mv NASCIMENTO, Abraão David Costa do
contributor_str_mv NASCIMENTO, Abraão David Costa do
dc.subject.por.fl_str_mv Estatística
Inferência
Probabilidade
topic Estatística
Inferência
Probabilidade
description The Cauchy-Rayleigh (CR) distribution has been successfully used to describe asymmetrical and heavy-tail events from radar imagery. Employing such model to describe lifetime data may then seem attractive, but drawbacks arise: its probability density function does not cover non-modal behavior as well as its hazard rate function (hrf) assumes only one form. To outperform this difficulty, it is investigated the exponentiated Cauchy-Rayleigh (ECR) distribution. This byparameteric model is flexible enough to accommodate hrf with decreasing, decreasing-increasing-decreasing and upside-down bathtub forms. Several closed-form mathematical expressions for the ECR model are obtained: median, mode, some moments, (h;Φ)-entropies and Fisher information matrix. Their non-existence respective cases are also determined. It is proposed two estimators for the ECR parameters: maximum likelihood (ML) and percentile-based methods. Both of this methods may be biased for small and moderate sample sizes. To overcome it we furnish a expression for its second-order bias according to Cox and Snell (1968) and propose a third bias-corrected ML estimator. Further discussions about hypotheses-based inference and estimation formulas on censored-data are furnished as well. A simulation study is done to assess the estimators performance. The estimates existence and uniqueness are not guaranteed, thus procedures to constrained estimation are developed to overcome this trouble. Notes about hypothesis tests are given under censored and uncensored schemes. An application in a survival dataset illustrates the proposed model usefulness. Results point out that the ECR distribution may outperform classical lifetime biparametric models, such as the gamma, Birnbaum-Saunders, Weibull and log-normal laws, before heavy-tail data. The likelihood ratio test are compared against entropy-based tests as urban texture detector using a San Francisco synthetic aperture radar imagery. This same image is also the final application target which consist of compare segmentation algorithms based on CR and ECR entropies densities finite mixture. It is recurred to a result due Pardo et al. (1997) which provides the (h;Φ)-entropies asymptotic distribution. The finite mixture log-likelihood function is maximized using the Expectation-Maximization algorithm.
publishDate 2019
dc.date.accessioned.fl_str_mv 2019-09-17T21:33:49Z
dc.date.available.fl_str_mv 2019-09-17T21:33:49Z
dc.date.issued.fl_str_mv 2019-02-25
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://repositorio.ufpe.br/handle/123456789/33110
url https://repositorio.ufpe.br/handle/123456789/33110
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Universidade Federal de Pernambuco
dc.publisher.program.fl_str_mv Programa de Pos Graduacao em Estatistica
dc.publisher.initials.fl_str_mv UFPE
dc.publisher.country.fl_str_mv Brasil
publisher.none.fl_str_mv Universidade Federal de Pernambuco
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFPE
instname:Universidade Federal de Pernambuco (UFPE)
instacron:UFPE
instname_str Universidade Federal de Pernambuco (UFPE)
instacron_str UFPE
institution UFPE
reponame_str Repositório Institucional da UFPE
collection Repositório Institucional da UFPE
bitstream.url.fl_str_mv https://repositorio.ufpe.br/bitstream/123456789/33110/5/TESE%20Thiago%20VedoVatto.pdf.jpg
https://repositorio.ufpe.br/bitstream/123456789/33110/1/TESE%20Thiago%20VedoVatto.pdf
https://repositorio.ufpe.br/bitstream/123456789/33110/2/license_rdf
https://repositorio.ufpe.br/bitstream/123456789/33110/3/license.txt
https://repositorio.ufpe.br/bitstream/123456789/33110/4/TESE%20Thiago%20VedoVatto.pdf.txt
bitstream.checksum.fl_str_mv 1a7056fb3c0fe4a5fc31dcec6ba41a08
443713a6f9ffc10db020c09dfa3750d1
e39d27027a6cc9cb039ad269a5db8e34
bd573a5ca8288eb7272482765f819534
b310509a573b67fb17d17e5b375dc48e
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE)
repository.mail.fl_str_mv attena@ufpe.br
_version_ 1802310704966926336