Análise multimodal em blogs brasileiros
Autor(a) principal: | |
---|---|
Data de Publicação: | 2019 |
Tipo de documento: | Dissertação |
Idioma: | por |
Título da fonte: | Biblioteca Digital de Teses e Dissertações da PUC_RS |
Texto Completo: | http://tede2.pucrs.br/tede2/handle/tede/9049 |
Resumo: | The use of social media is increasingly present in our lives. It is through images, texts and videos that humans try to communicate on social networks and expose their opinions in the face of everyday events. Due to the increased volume of data transmitted over the Internet, it is difficult to perform a human analysis of the media without the use of computer resources. Scientific communities, with various motivations, such as: analyzing feelings in text, in images, detecting opinions in blogs, among others, feel challenged to discover characteristics to be extracted from these contents, being an example of the analysis of emotions in blogs. Although the area of classification of feelings through texts and images is under development, there are still several challenges. The main challenge is to build algorithms and methods that can infer subtle and subjective feelings as humans perceive them. This paper presents the corpus Cross-media Brazilian Blog, a dataset that was built based on BlogSet-BR. In addition, it was built the Ground Truth of these data (based on the opinions of subjects) about the feelings perceived in the texts and images of these blogs, which in this work become available for use. Some technologies used to predict sentiment in text and images have been tested in the Cross-Media Brazilian Blog corpus and compared with Ground Truth. In addition to the analyzes performed on the texts, a research was conducted specifically on contradictory posts, i.e. when the image is positive and the text is negative, or vice versa, when present on the same blog. Results indicate that methodologies for detecting feelings in blogs can be customized to detect conflicting posts and be able to better identify feelings in social media posts. |
id |
P_RS_ef2eed3cc313b74ccc9a132ac9a98213 |
---|---|
oai_identifier_str |
oai:tede2.pucrs.br:tede/9049 |
network_acronym_str |
P_RS |
network_name_str |
Biblioteca Digital de Teses e Dissertações da PUC_RS |
repository_id_str |
|
spelling |
Musse, Soraia Raupphttp://lattes.cnpq.br/2302314954133011http://lattes.cnpq.br/8168273804205288Molin, Greice Pinho Dal2019-12-02T12:24:56Z2019-08-30http://tede2.pucrs.br/tede2/handle/tede/9049The use of social media is increasingly present in our lives. It is through images, texts and videos that humans try to communicate on social networks and expose their opinions in the face of everyday events. Due to the increased volume of data transmitted over the Internet, it is difficult to perform a human analysis of the media without the use of computer resources. Scientific communities, with various motivations, such as: analyzing feelings in text, in images, detecting opinions in blogs, among others, feel challenged to discover characteristics to be extracted from these contents, being an example of the analysis of emotions in blogs. Although the area of classification of feelings through texts and images is under development, there are still several challenges. The main challenge is to build algorithms and methods that can infer subtle and subjective feelings as humans perceive them. This paper presents the corpus Cross-media Brazilian Blog, a dataset that was built based on BlogSet-BR. In addition, it was built the Ground Truth of these data (based on the opinions of subjects) about the feelings perceived in the texts and images of these blogs, which in this work become available for use. Some technologies used to predict sentiment in text and images have been tested in the Cross-Media Brazilian Blog corpus and compared with Ground Truth. In addition to the analyzes performed on the texts, a research was conducted specifically on contradictory posts, i.e. when the image is positive and the text is negative, or vice versa, when present on the same blog. Results indicate that methodologies for detecting feelings in blogs can be customized to detect conflicting posts and be able to better identify feelings in social media posts.O uso de mídias sociais está cada vez mais presente em nossas vidas. É através de imagens, textos e vídeos que os seres humanos tentam se comunicar nas redes sociais e expor suas opiniões diante dos acontecimentos cotidianos. Devido ao aumento do volume de dados transmitidos pela internet, torna-se difícil realizar uma análise humana da mídia sem o uso de recursos computacionais. As comunidades científicas, com diversas motivações, tais como: análisar sentimentos em texto, em imagens, detectar opiniões em blogs, dentre outras, sentem-se desafiadas a descobrirem características a serem extraídas desses conteúdos, sendo um exemplo a análise de emoções em blogs. Embora a área de classificação de sentimentos através de textos e imagens esteja em desenvolvimento, ainda existem vários desafios. O principal desafio é construir algoritmos e métodos que possam inferir sentimentos sutis e subjetivos como os humanos os percebem. Neste trabalho é apresentado o corpus Cross-media Brazilian Blog, um conjunto de dados que foi construído com base no BlogSet-BR. Além disso, construiu-se o Ground Truth desses dados (com base nas opiniões de sujeitos) sobre os sentimentos percebidos nos textos e nas imagens destes blogs, que neste trabalho se tornam disponíveis para uso. Algumas tecnologias utilizadas para prever o sentimento em textos e em imagens foram testadas no corpus Cross-media Brazilian Blog e comparadas com o Ground Truth e são apresentadas e discutidas neste trabalho. Em adição às análises realizadas sobre os textos, realizou-se uma pesquisa especificamente sobre posts contraditórios, ou seja, quando a imagem é positiva e o texto é negativo, ou vice-versa, quando presentes no mesmo blog. Resultados indicam que metodologias para detecção de sentimentos em blogs podem ser customizadas para detectar postagens contraditórias e serem capazes de melhor identificar sentimentos nas postagens de mídia social.Submitted by PPG Ciência da Computação (ppgcc@pucrs.br) on 2019-11-29T20:49:57Z No. of bitstreams: 1 GREICE PINHO DAL MOLIN_dis.pdf: 6505329 bytes, checksum: c908b30be027a916434e59620a2f7197 (MD5)Approved for entry into archive by Sarajane Pan (sarajane.pan@pucrs.br) on 2019-12-02T12:15:38Z (GMT) No. of bitstreams: 1 GREICE PINHO DAL MOLIN_dis.pdf: 6505329 bytes, checksum: c908b30be027a916434e59620a2f7197 (MD5)Made available in DSpace on 2019-12-02T12:24:56Z (GMT). No. of bitstreams: 1 GREICE PINHO DAL MOLIN_dis.pdf: 6505329 bytes, checksum: c908b30be027a916434e59620a2f7197 (MD5) Previous issue date: 2019-08-30application/pdfhttp://tede2.pucrs.br:80/tede2/retrieve/177344/GREICE%20PINHO%20DAL%20MOLIN_dis.pdf.jpgporPontifícia Universidade Católica do Rio Grande do SulPrograma de Pós-Graduação em Ciência da ComputaçãoPUCRSBrasilEscola PolitécnicaCross-Media BlogsetText Sentiment AnalysisImage Sentiment AnalysisLexiconsDomain ContradictionAnálise de Sentimentos em TextoAnálise de Sentimentos em ImagensCorpusLéxicosContradição Entre DomíniosCNNCIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAOAnálise multimodal em blogs brasileirosinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisTrabalho não apresenta restrição para publicação-4570527706994352458500500-862078257083325301info:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da PUC_RSinstname:Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS)instacron:PUC_RSTHUMBNAILGREICE PINHO DAL MOLIN_dis.pdf.jpgGREICE PINHO DAL MOLIN_dis.pdf.jpgimage/jpeg5532http://tede2.pucrs.br/tede2/bitstream/tede/9049/4/GREICE+PINHO+DAL+MOLIN_dis.pdf.jpg6c1e8743093e40ff2c86428457a6097bMD54TEXTGREICE PINHO DAL MOLIN_dis.pdf.txtGREICE PINHO DAL MOLIN_dis.pdf.txttext/plain197263http://tede2.pucrs.br/tede2/bitstream/tede/9049/3/GREICE+PINHO+DAL+MOLIN_dis.pdf.txtdeb134206b34b1c8db011776f4af08b1MD53ORIGINALGREICE PINHO DAL MOLIN_dis.pdfGREICE PINHO DAL MOLIN_dis.pdfapplication/pdf6505329http://tede2.pucrs.br/tede2/bitstream/tede/9049/2/GREICE+PINHO+DAL+MOLIN_dis.pdfc908b30be027a916434e59620a2f7197MD52LICENSElicense.txtlicense.txttext/plain; charset=utf-8590http://tede2.pucrs.br/tede2/bitstream/tede/9049/1/license.txt220e11f2d3ba5354f917c7035aadef24MD51tede/90492019-12-02 12:00:21.287oai:tede2.pucrs.br:tede/9049QXV0b3JpemE/P28gcGFyYSBQdWJsaWNhPz9vIEVsZXRyP25pY2E6IENvbSBiYXNlIG5vIGRpc3Bvc3RvIG5hIExlaSBGZWRlcmFsIG4/OS42MTAsIGRlIDE5IGRlIGZldmVyZWlybyBkZSAxOTk4LCBvIGF1dG9yIEFVVE9SSVpBIGEgcHVibGljYT8/byBlbGV0cj9uaWNhIGRhIHByZXNlbnRlIG9icmEgbm8gYWNlcnZvIGRhIEJpYmxpb3RlY2EgRGlnaXRhbCBkYSBQb250aWY/Y2lhIFVuaXZlcnNpZGFkZSBDYXQ/bGljYSBkbyBSaW8gR3JhbmRlIGRvIFN1bCwgc2VkaWFkYSBhIEF2LiBJcGlyYW5nYSA2NjgxLCBQb3J0byBBbGVncmUsIFJpbyBHcmFuZGUgZG8gU3VsLCBjb20gcmVnaXN0cm8gZGUgQ05QSiA4ODYzMDQxMzAwMDItODEgYmVtIGNvbW8gZW0gb3V0cmFzIGJpYmxpb3RlY2FzIGRpZ2l0YWlzLCBuYWNpb25haXMgZSBpbnRlcm5hY2lvbmFpcywgY29ucz9yY2lvcyBlIHJlZGVzID9zIHF1YWlzIGEgYmlibGlvdGVjYSBkYSBQVUNSUyBwb3NzYSBhIHZpciBwYXJ0aWNpcGFyLCBzZW0gP251cyBhbHVzaXZvIGFvcyBkaXJlaXRvcyBhdXRvcmFpcywgYSB0P3R1bG8gZGUgZGl2dWxnYT8/byBkYSBwcm9kdT8/byBjaWVudD9maWNhLgo=Biblioteca Digital de Teses e Dissertaçõeshttp://tede2.pucrs.br/tede2/PRIhttps://tede2.pucrs.br/oai/requestbiblioteca.central@pucrs.br||opendoar:2019-12-02T14:00:21Biblioteca Digital de Teses e Dissertações da PUC_RS - Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS)false |
dc.title.por.fl_str_mv |
Análise multimodal em blogs brasileiros |
title |
Análise multimodal em blogs brasileiros |
spellingShingle |
Análise multimodal em blogs brasileiros Molin, Greice Pinho Dal Cross-Media Blogset Text Sentiment Analysis Image Sentiment Analysis Lexicons Domain Contradiction Análise de Sentimentos em Texto Análise de Sentimentos em Imagens Corpus Léxicos Contradição Entre Domínios CNN CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO |
title_short |
Análise multimodal em blogs brasileiros |
title_full |
Análise multimodal em blogs brasileiros |
title_fullStr |
Análise multimodal em blogs brasileiros |
title_full_unstemmed |
Análise multimodal em blogs brasileiros |
title_sort |
Análise multimodal em blogs brasileiros |
author |
Molin, Greice Pinho Dal |
author_facet |
Molin, Greice Pinho Dal |
author_role |
author |
dc.contributor.advisor1.fl_str_mv |
Musse, Soraia Raupp |
dc.contributor.advisor1Lattes.fl_str_mv |
http://lattes.cnpq.br/2302314954133011 |
dc.contributor.authorLattes.fl_str_mv |
http://lattes.cnpq.br/8168273804205288 |
dc.contributor.author.fl_str_mv |
Molin, Greice Pinho Dal |
contributor_str_mv |
Musse, Soraia Raupp |
dc.subject.eng.fl_str_mv |
Cross-Media Blogset Text Sentiment Analysis Image Sentiment Analysis Lexicons Domain Contradiction |
topic |
Cross-Media Blogset Text Sentiment Analysis Image Sentiment Analysis Lexicons Domain Contradiction Análise de Sentimentos em Texto Análise de Sentimentos em Imagens Corpus Léxicos Contradição Entre Domínios CNN CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO |
dc.subject.por.fl_str_mv |
Análise de Sentimentos em Texto Análise de Sentimentos em Imagens Corpus Léxicos Contradição Entre Domínios CNN |
dc.subject.cnpq.fl_str_mv |
CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO |
description |
The use of social media is increasingly present in our lives. It is through images, texts and videos that humans try to communicate on social networks and expose their opinions in the face of everyday events. Due to the increased volume of data transmitted over the Internet, it is difficult to perform a human analysis of the media without the use of computer resources. Scientific communities, with various motivations, such as: analyzing feelings in text, in images, detecting opinions in blogs, among others, feel challenged to discover characteristics to be extracted from these contents, being an example of the analysis of emotions in blogs. Although the area of classification of feelings through texts and images is under development, there are still several challenges. The main challenge is to build algorithms and methods that can infer subtle and subjective feelings as humans perceive them. This paper presents the corpus Cross-media Brazilian Blog, a dataset that was built based on BlogSet-BR. In addition, it was built the Ground Truth of these data (based on the opinions of subjects) about the feelings perceived in the texts and images of these blogs, which in this work become available for use. Some technologies used to predict sentiment in text and images have been tested in the Cross-Media Brazilian Blog corpus and compared with Ground Truth. In addition to the analyzes performed on the texts, a research was conducted specifically on contradictory posts, i.e. when the image is positive and the text is negative, or vice versa, when present on the same blog. Results indicate that methodologies for detecting feelings in blogs can be customized to detect conflicting posts and be able to better identify feelings in social media posts. |
publishDate |
2019 |
dc.date.accessioned.fl_str_mv |
2019-12-02T12:24:56Z |
dc.date.issued.fl_str_mv |
2019-08-30 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://tede2.pucrs.br/tede2/handle/tede/9049 |
url |
http://tede2.pucrs.br/tede2/handle/tede/9049 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.program.fl_str_mv |
-4570527706994352458 |
dc.relation.confidence.fl_str_mv |
500 500 |
dc.relation.cnpq.fl_str_mv |
-862078257083325301 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Pontifícia Universidade Católica do Rio Grande do Sul |
dc.publisher.program.fl_str_mv |
Programa de Pós-Graduação em Ciência da Computação |
dc.publisher.initials.fl_str_mv |
PUCRS |
dc.publisher.country.fl_str_mv |
Brasil |
dc.publisher.department.fl_str_mv |
Escola Politécnica |
publisher.none.fl_str_mv |
Pontifícia Universidade Católica do Rio Grande do Sul |
dc.source.none.fl_str_mv |
reponame:Biblioteca Digital de Teses e Dissertações da PUC_RS instname:Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS) instacron:PUC_RS |
instname_str |
Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS) |
instacron_str |
PUC_RS |
institution |
PUC_RS |
reponame_str |
Biblioteca Digital de Teses e Dissertações da PUC_RS |
collection |
Biblioteca Digital de Teses e Dissertações da PUC_RS |
bitstream.url.fl_str_mv |
http://tede2.pucrs.br/tede2/bitstream/tede/9049/4/GREICE+PINHO+DAL+MOLIN_dis.pdf.jpg http://tede2.pucrs.br/tede2/bitstream/tede/9049/3/GREICE+PINHO+DAL+MOLIN_dis.pdf.txt http://tede2.pucrs.br/tede2/bitstream/tede/9049/2/GREICE+PINHO+DAL+MOLIN_dis.pdf http://tede2.pucrs.br/tede2/bitstream/tede/9049/1/license.txt |
bitstream.checksum.fl_str_mv |
6c1e8743093e40ff2c86428457a6097b deb134206b34b1c8db011776f4af08b1 c908b30be027a916434e59620a2f7197 220e11f2d3ba5354f917c7035aadef24 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 |
repository.name.fl_str_mv |
Biblioteca Digital de Teses e Dissertações da PUC_RS - Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS) |
repository.mail.fl_str_mv |
biblioteca.central@pucrs.br|| |
_version_ |
1799765343551356928 |