Delayed Transfer Entropy applied to Big Data

Jonas Rossi Dourado

Delayed Transfer Entropy applied to Big Data

Detalhes bibliográficos
Autor(a) principal:	Jonas Rossi Dourado
Data de Publicação:	2018
Tipo de documento:	Dissertação
Idioma:	eng
Título da fonte:	Biblioteca Digital de Teses e Dissertações da USP
Texto Completo:	https://doi.org/10.11606/D.18.2019.tde-19022019-134228
Resumo:	Recent popularization of technologies such as Smartphones, Wearables, Internet of Things, Social Networks and Video streaming increased data creation. Dealing with extensive data sets led the creation of term big data, often defined as when data volume, acquisition rate or representation demands nontraditional approaches to data analysis or requires horizontal scaling for data processing. Analysis is the most important Big Data phase, where it has the objective of extracting meaningful and often hidden information. One example of Big Data hidden information is causality, which can be inferred with Delayed Transfer Entropy (DTE). Despite DTE wide applicability, it has a high demanding processing power which is aggravated with large datasets as those found in big data. This research optimized DTE performance and modified existing code to enable DTE execution on a computer cluster. With big data trend in sight, this results may enable bigger datasets analysis or better statistical evidence.

Metadados do item

id	USP_ba554cdff312cf6004872e22fa0d48d8
oai_identifier_str	oai:teses.usp.br:tde-19022019-134228
network_acronym_str	USP
network_name_str	Biblioteca Digital de Teses e Dissertações da USP
repository_id_str	2721
spelling	info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesis Delayed Transfer Entropy applied to Big Data Delayed Transfer Entropy aplicado a Big Data 2018-11-30Carlos Dias MacielAlexandre Cláudio Botazzo DelbemAilton Akira ShinodaJonas Rossi DouradoUniversidade de São PauloEngenharia ElétricaUSPBR Análise de Big Data Big Data analysis Causalidade Causality Cluster heterogêneo de computadores Delayed Transfer Entropy Delayed Transfer Entropy Estratégias de paralelismo Heterogeneous computer cluster Parallelism strategies Surrogate Surrogate Recent popularization of technologies such as Smartphones, Wearables, Internet of Things, Social Networks and Video streaming increased data creation. Dealing with extensive data sets led the creation of term big data, often defined as when data volume, acquisition rate or representation demands nontraditional approaches to data analysis or requires horizontal scaling for data processing. Analysis is the most important Big Data phase, where it has the objective of extracting meaningful and often hidden information. One example of Big Data hidden information is causality, which can be inferred with Delayed Transfer Entropy (DTE). Despite DTE wide applicability, it has a high demanding processing power which is aggravated with large datasets as those found in big data. This research optimized DTE performance and modified existing code to enable DTE execution on a computer cluster. With big data trend in sight, this results may enable bigger datasets analysis or better statistical evidence. A recente popularização de tecnologias como Smartphones, Wearables, Internet das Coisas, Redes Sociais e streaming de Video aumentou a criação de dados. A manipulação de grande quantidade de dados levou a criação do termo Big Data, muitas vezes definido como quando o volume, a taxa de aquisição ou a representação dos dados demanda abordagens não tradicionais para analisar ou requer uma escala horizontal para o processamento de dados. A análise é a etapa de Big Data mais importante, tendo como objetivo extrair informações relevantes e às vezes escondidas. Um exemplo de informação escondida é a causalidade, que pode ser inferida utilizando Delayed Transfer Entropy (DTE). Apesar do DTE ter uma grande aplicabilidade, ele possui uma grande demanda computacional, esta última, é agravada devido a grandes bases de dados como as encontradas em Big Data. Essa pesquisa otimizou e modificou o código existente para permitir a execução de DTE em um cluster de computadores. Com a tendência de Big Data em vista, esse resultado pode permitir bancos de dados maiores ou melhores evidências estatísticas. https://doi.org/10.11606/D.18.2019.tde-19022019-134228info:eu-repo/semantics/openAccessengreponame:Biblioteca Digital de Teses e Dissertações da USPinstname:Universidade de São Paulo (USP)instacron:USP2023-12-21T19:22:41Zoai:teses.usp.br:tde-19022019-134228Biblioteca Digital de Teses e Dissertaçõeshttp://www.teses.usp.br/PUBhttp://www.teses.usp.br/cgi-bin/mtd2br.plvirginia@if.usp.br\|\| atendimento@aguia.usp.br\|\|virginia@if.usp.bropendoar:27212023-12-22T12:53:52.767667Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)false
dc.title.en.fl_str_mv	Delayed Transfer Entropy applied to Big Data
dc.title.alternative.pt.fl_str_mv	Delayed Transfer Entropy aplicado a Big Data
title	Delayed Transfer Entropy applied to Big Data
spellingShingle	Delayed Transfer Entropy applied to Big Data Jonas Rossi Dourado
title_short	Delayed Transfer Entropy applied to Big Data
title_full	Delayed Transfer Entropy applied to Big Data
title_fullStr	Delayed Transfer Entropy applied to Big Data
title_full_unstemmed	Delayed Transfer Entropy applied to Big Data
title_sort	Delayed Transfer Entropy applied to Big Data
author	Jonas Rossi Dourado
author_facet	Jonas Rossi Dourado
author_role	author
dc.contributor.advisor1.fl_str_mv	Carlos Dias Maciel
dc.contributor.referee1.fl_str_mv	Alexandre Cláudio Botazzo Delbem
dc.contributor.referee2.fl_str_mv	Ailton Akira Shinoda
dc.contributor.author.fl_str_mv	Jonas Rossi Dourado
contributor_str_mv	Carlos Dias Maciel Alexandre Cláudio Botazzo Delbem Ailton Akira Shinoda
description	Recent popularization of technologies such as Smartphones, Wearables, Internet of Things, Social Networks and Video streaming increased data creation. Dealing with extensive data sets led the creation of term big data, often defined as when data volume, acquisition rate or representation demands nontraditional approaches to data analysis or requires horizontal scaling for data processing. Analysis is the most important Big Data phase, where it has the objective of extracting meaningful and often hidden information. One example of Big Data hidden information is causality, which can be inferred with Delayed Transfer Entropy (DTE). Despite DTE wide applicability, it has a high demanding processing power which is aggravated with large datasets as those found in big data. This research optimized DTE performance and modified existing code to enable DTE execution on a computer cluster. With big data trend in sight, this results may enable bigger datasets analysis or better statistical evidence.
publishDate	2018
dc.date.issued.fl_str_mv	2018-11-30
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://doi.org/10.11606/D.18.2019.tde-19022019-134228
url	https://doi.org/10.11606/D.18.2019.tde-19022019-134228
dc.language.iso.fl_str_mv	eng
language	eng
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Universidade de São Paulo
dc.publisher.program.fl_str_mv	Engenharia Elétrica
dc.publisher.initials.fl_str_mv	USP
dc.publisher.country.fl_str_mv	BR
publisher.none.fl_str_mv	Universidade de São Paulo
dc.source.none.fl_str_mv	reponame:Biblioteca Digital de Teses e Dissertações da USP instname:Universidade de São Paulo (USP) instacron:USP
instname_str	Universidade de São Paulo (USP)
instacron_str	USP
institution	USP
reponame_str	Biblioteca Digital de Teses e Dissertações da USP
collection	Biblioteca Digital de Teses e Dissertações da USP
repository.name.fl_str_mv	Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)
repository.mail.fl_str_mv	virginia@if.usp.br\|\| atendimento@aguia.usp.br\|\|virginia@if.usp.br
_version_	1794502853733122048

Delayed Transfer Entropy applied to Big Data

Registros relacionados