Medindo valores humanos por meio de processamento de linguagem natural

Detalhes bibliográficos
Autor(a) principal: Nascimento, Anderson Mesquita do
Data de Publicação: 2019
Tipo de documento: Tese
Idioma: por
Título da fonte: Biblioteca Digital de Teses e Dissertações da UFPB
Texto Completo: https://repositorio.ufpb.br/jspui/handle/123456789/20026
Resumo: The study of human values plays a central role in Social Psychology field. Human values are defined as abstract characteristics that serve as a guiding principle in the lives of individuals. Since the second half of the last century, some theoretical models have been proposed that sought to identify how human values are organized. Among these, the Functionalist Theory of Human Values emerged, assuming that the structure of human values is defined by two main functions: guiding behavior and cognitively expressing the needs of individuals. Regarding the methodological strategy for measuring human values, it has been based almost exclusively on self-report measures. However, recent technological advances have allowed the development of analysis strategies that make possible to extract relevant psychological characteristics from quantitative data derived from textual bases, a field known as natural language processing. The present thesis aims to test the hypothesis that the use of natural language processing is adequate to measure human values from lexical indicators (words). This thesis is divided into three articles. The first is a theoretical paper that sought to identify the main aspects of the nature of human values that influence their measurement. In the second paper we used the closed vocabulary strategy to analyze 33,941 speeches of federal deputies in the Brazilian Legislative Chamber between 2011 and 2014. In this, human values were measured from a predefined vocabulary of words, selected from judge selection process. To develop this vocabulary, an initial set of 100,886 words was used to achieve a final list of 24 lexical indicators, four for each subfunction. The results of the second paper showed that the lexical indicators of each subfunction presented a higher co-occurrence index with indicators of the same evaluative subfunction than with others, t (17) = 4.12, p = 0.001. In addition, the mean test-rest correlation of the evaluative subfunctions over the intervals between 2011 and 2014 was 0.70 indicating the temporal stability of the proposed vocabulary. Finally, multilevel regression analyzes have shown that gender and party ideology have an effect on the prevalence of lexical values indicators in deputies' speech. The aim of the third paper was to investigate which language characters are most related to different types of basic values, based on the functionalist theory of values. For this purpose, both Linguistic Inquiry and Word Count and Open Vocabulary Differential Language Analysis approaches were used to analyze 1,110,080 tweets from 1,883 participants (80.4% female), which answered the 18 items of the basic values questionnaire. The results showed that each of the evaluative subfunctions presented positive associations with language characters that support their face validity and point out to relationships with behavior previously found in the literature. In the pattern of negative relationships, there was a predominance of language suggestive of negative affects, emotional instability, and personal distress for almost all evaluative subfunctions. The findings suggest that the language of Twitter can be used to characterize the values of individuals. The present thesis is expected to contribute to the measurement of human values via textual data, to complement those derived from self-report measures and to allow the analysis of natural language databases available to researchers in large volume (e.g. text messages from social media).
id UFPB_f96baa24c33eb93db4e5de4dedb466fb
oai_identifier_str oai:repositorio.ufpb.br:123456789/20026
network_acronym_str UFPB
network_name_str Biblioteca Digital de Teses e Dissertações da UFPB
repository_id_str
spelling Medindo valores humanos por meio de processamento de linguagem naturalValores humanosProcessamento de linguagem naturalDados textuaisHuman valuesNatural language processingTextual dataCNPQ::CIENCIAS HUMANAS::PSICOLOGIA::PSICOLOGIA SOCIALThe study of human values plays a central role in Social Psychology field. Human values are defined as abstract characteristics that serve as a guiding principle in the lives of individuals. Since the second half of the last century, some theoretical models have been proposed that sought to identify how human values are organized. Among these, the Functionalist Theory of Human Values emerged, assuming that the structure of human values is defined by two main functions: guiding behavior and cognitively expressing the needs of individuals. Regarding the methodological strategy for measuring human values, it has been based almost exclusively on self-report measures. However, recent technological advances have allowed the development of analysis strategies that make possible to extract relevant psychological characteristics from quantitative data derived from textual bases, a field known as natural language processing. The present thesis aims to test the hypothesis that the use of natural language processing is adequate to measure human values from lexical indicators (words). This thesis is divided into three articles. The first is a theoretical paper that sought to identify the main aspects of the nature of human values that influence their measurement. In the second paper we used the closed vocabulary strategy to analyze 33,941 speeches of federal deputies in the Brazilian Legislative Chamber between 2011 and 2014. In this, human values were measured from a predefined vocabulary of words, selected from judge selection process. To develop this vocabulary, an initial set of 100,886 words was used to achieve a final list of 24 lexical indicators, four for each subfunction. The results of the second paper showed that the lexical indicators of each subfunction presented a higher co-occurrence index with indicators of the same evaluative subfunction than with others, t (17) = 4.12, p = 0.001. In addition, the mean test-rest correlation of the evaluative subfunctions over the intervals between 2011 and 2014 was 0.70 indicating the temporal stability of the proposed vocabulary. Finally, multilevel regression analyzes have shown that gender and party ideology have an effect on the prevalence of lexical values indicators in deputies' speech. The aim of the third paper was to investigate which language characters are most related to different types of basic values, based on the functionalist theory of values. For this purpose, both Linguistic Inquiry and Word Count and Open Vocabulary Differential Language Analysis approaches were used to analyze 1,110,080 tweets from 1,883 participants (80.4% female), which answered the 18 items of the basic values questionnaire. The results showed that each of the evaluative subfunctions presented positive associations with language characters that support their face validity and point out to relationships with behavior previously found in the literature. In the pattern of negative relationships, there was a predominance of language suggestive of negative affects, emotional instability, and personal distress for almost all evaluative subfunctions. The findings suggest that the language of Twitter can be used to characterize the values of individuals. The present thesis is expected to contribute to the measurement of human values via textual data, to complement those derived from self-report measures and to allow the analysis of natural language databases available to researchers in large volume (e.g. text messages from social media).Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPESO estudo dos valores humanos ocupa um lugar central dentro da Psicologia Social. Os valores humanos são definidos como características abstratas que servem como princípio-guia na vida dos indivíduos. Desde a segunda metade do século passado, foram propostos alguns modelos teóricos que buscaram identificar de que maneira os valores humanos estão organizados. Dentre estes, a Teoria Funcionalista dos Valores Humanos emergiu, partindo do pressuposto de que a estrutura dos valores humanos é definida por duas funções principais: guiar o comportamento e expressar cognitivamente as necessidades dos indivíduos. No tocante à estratégia metodológica para a mensuração dos valores humanos, esta tem se pautado quase que exclusivamente em medidas de autorrelato. No entanto, avanços tecnológicos recentes têm permitido o desenvolvimento de estratégias de análise que possibilitam extrair características psicológicas relevantes de dados quantitativos oriundos de bases textuais, campo conhecido como processamento de linguagem natural. A presente tese tem como objetivo geral testar a hipótese de que o uso do processamento de linguagem natural é adequado para mensurar os valores humanos a partir de indicadores léxicos (palavras). Esta tese encontra-se dividida em três artigos. O primeiro trata-se de um artigo teórico que buscou identificar quais os aspectos principais da natureza dos valores humanos que influenciam em sua mensuração. No segundo artigo foi utilizada a estratégia de vocabulário fechado para analisar 33.941 discursos de deputados federais na Câmara Legislativa Brasileira entre os anos de 2011 e 2014. Nesta, os valores humanos foram mensurados a partir de um vocabulário de palavras pré-definido, selecionadas a partir de um processo de análise de juízes. Para o desenvolvimento deste vocabulário, partiu-se de um conjunto inicial de 100.886 palavras para chegar uma lista final de 24 indicadores léxicos, quatro para cada subfunção. Os resultados do segundo artigo mostraram que os indicadores léxicos de cada subfunção apresentaram maior índice de co-ocorrência com indicadores da mesma subfunção valorativa do que com outras, t (17) = 4,12, p = 0,001. Ademais, a média da correlação teste-resteste das subfunções valorativas ao longo dos intervalos entre 2011 e 2014 foi de 0,70 dando indícios da estabilidade temporal do vocabulário proposto. Por fim, análises de regressão multinível demonstraram haver efeito do gênero e de ideologia partidária na prevalência dos indicadores léxicos de valores nos discursos dos deputados. O objetivo do terceiro estudo foi investigar quais caracteres de linguagem tem maior relação com diferentes tipos de valores básicos, usando como base a teoria funcionalista dos valores. Para tal, foram utilizadas tanto abordagens de vocabulário fechado (Linguistic Inquiry and Word Count) e vocabulário aberto (Differential Language Analysis) para analisar 1.110.080 tweets de 1.883 participantes (80,4% do sexo feminino), os quais responderam aos 18 itens do questionário de valores básicos. Os resultados mostraram que cada uma das subfunções valorativas apresentou associações positivas com caracteres de linguagem que dão suporte a sua validade de face e que apontam para relações com o comportamento previamente encontradas na literatura. No padrão das relações negativas, houve uma predominância de linguagem sugestiva de afetos negativos, instabilidade emocional e angústia pessoal para quase todas as subfunções valorativas. Os achados sugerem que a linguagem do Twitter pode ser utilizada para caracterizar os valores dos indivíduos. Espera-se que a presente tese contribua para a mensuração dos valores humanos através de dados textuais, de modo a complementar aqueles provenientes de medidas de autorrelato e permitindo a análise de bases de dados de linguagem natural disponíveis aos pesquisadores em grande volume (e.g. mensagens de redes sociais).Universidade Federal da ParaíbaBrasilPsicologia SocialPrograma de Pós-Graduação em Psicologia SocialUFPBGouveia, Valdiney Velosohttp://lattes.cnpq.br/6960379064948678Barbosa, Yuri de Almeida Malheiroshttp://lattes.cnpq.br/6396235096236217Nascimento, Anderson Mesquita do2021-05-11T19:38:56Z2020-09-192021-05-11T19:38:56Z2019-09-19info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesishttps://repositorio.ufpb.br/jspui/handle/123456789/20026porhttp://creativecommons.org/licenses/by-nd/3.0/br/info:eu-repo/semantics/embargoedAccessreponame:Biblioteca Digital de Teses e Dissertações da UFPBinstname:Universidade Federal da Paraíba (UFPB)instacron:UFPB2021-06-21T14:28:02Zoai:repositorio.ufpb.br:123456789/20026Biblioteca Digital de Teses e Dissertaçõeshttps://repositorio.ufpb.br/PUBhttp://tede.biblioteca.ufpb.br:8080/oai/requestdiretoria@ufpb.br|| diretoria@ufpb.bropendoar:2021-06-21T14:28:02Biblioteca Digital de Teses e Dissertações da UFPB - Universidade Federal da Paraíba (UFPB)false
dc.title.none.fl_str_mv Medindo valores humanos por meio de processamento de linguagem natural
title Medindo valores humanos por meio de processamento de linguagem natural
spellingShingle Medindo valores humanos por meio de processamento de linguagem natural
Nascimento, Anderson Mesquita do
Valores humanos
Processamento de linguagem natural
Dados textuais
Human values
Natural language processing
Textual data
CNPQ::CIENCIAS HUMANAS::PSICOLOGIA::PSICOLOGIA SOCIAL
title_short Medindo valores humanos por meio de processamento de linguagem natural
title_full Medindo valores humanos por meio de processamento de linguagem natural
title_fullStr Medindo valores humanos por meio de processamento de linguagem natural
title_full_unstemmed Medindo valores humanos por meio de processamento de linguagem natural
title_sort Medindo valores humanos por meio de processamento de linguagem natural
author Nascimento, Anderson Mesquita do
author_facet Nascimento, Anderson Mesquita do
author_role author
dc.contributor.none.fl_str_mv Gouveia, Valdiney Veloso
http://lattes.cnpq.br/6960379064948678
Barbosa, Yuri de Almeida Malheiros
http://lattes.cnpq.br/6396235096236217
dc.contributor.author.fl_str_mv Nascimento, Anderson Mesquita do
dc.subject.por.fl_str_mv Valores humanos
Processamento de linguagem natural
Dados textuais
Human values
Natural language processing
Textual data
CNPQ::CIENCIAS HUMANAS::PSICOLOGIA::PSICOLOGIA SOCIAL
topic Valores humanos
Processamento de linguagem natural
Dados textuais
Human values
Natural language processing
Textual data
CNPQ::CIENCIAS HUMANAS::PSICOLOGIA::PSICOLOGIA SOCIAL
description The study of human values plays a central role in Social Psychology field. Human values are defined as abstract characteristics that serve as a guiding principle in the lives of individuals. Since the second half of the last century, some theoretical models have been proposed that sought to identify how human values are organized. Among these, the Functionalist Theory of Human Values emerged, assuming that the structure of human values is defined by two main functions: guiding behavior and cognitively expressing the needs of individuals. Regarding the methodological strategy for measuring human values, it has been based almost exclusively on self-report measures. However, recent technological advances have allowed the development of analysis strategies that make possible to extract relevant psychological characteristics from quantitative data derived from textual bases, a field known as natural language processing. The present thesis aims to test the hypothesis that the use of natural language processing is adequate to measure human values from lexical indicators (words). This thesis is divided into three articles. The first is a theoretical paper that sought to identify the main aspects of the nature of human values that influence their measurement. In the second paper we used the closed vocabulary strategy to analyze 33,941 speeches of federal deputies in the Brazilian Legislative Chamber between 2011 and 2014. In this, human values were measured from a predefined vocabulary of words, selected from judge selection process. To develop this vocabulary, an initial set of 100,886 words was used to achieve a final list of 24 lexical indicators, four for each subfunction. The results of the second paper showed that the lexical indicators of each subfunction presented a higher co-occurrence index with indicators of the same evaluative subfunction than with others, t (17) = 4.12, p = 0.001. In addition, the mean test-rest correlation of the evaluative subfunctions over the intervals between 2011 and 2014 was 0.70 indicating the temporal stability of the proposed vocabulary. Finally, multilevel regression analyzes have shown that gender and party ideology have an effect on the prevalence of lexical values indicators in deputies' speech. The aim of the third paper was to investigate which language characters are most related to different types of basic values, based on the functionalist theory of values. For this purpose, both Linguistic Inquiry and Word Count and Open Vocabulary Differential Language Analysis approaches were used to analyze 1,110,080 tweets from 1,883 participants (80.4% female), which answered the 18 items of the basic values questionnaire. The results showed that each of the evaluative subfunctions presented positive associations with language characters that support their face validity and point out to relationships with behavior previously found in the literature. In the pattern of negative relationships, there was a predominance of language suggestive of negative affects, emotional instability, and personal distress for almost all evaluative subfunctions. The findings suggest that the language of Twitter can be used to characterize the values of individuals. The present thesis is expected to contribute to the measurement of human values via textual data, to complement those derived from self-report measures and to allow the analysis of natural language databases available to researchers in large volume (e.g. text messages from social media).
publishDate 2019
dc.date.none.fl_str_mv 2019-09-19
2020-09-19
2021-05-11T19:38:56Z
2021-05-11T19:38:56Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://repositorio.ufpb.br/jspui/handle/123456789/20026
url https://repositorio.ufpb.br/jspui/handle/123456789/20026
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv http://creativecommons.org/licenses/by-nd/3.0/br/
info:eu-repo/semantics/embargoedAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by-nd/3.0/br/
eu_rights_str_mv embargoedAccess
dc.publisher.none.fl_str_mv Universidade Federal da Paraíba
Brasil
Psicologia Social
Programa de Pós-Graduação em Psicologia Social
UFPB
publisher.none.fl_str_mv Universidade Federal da Paraíba
Brasil
Psicologia Social
Programa de Pós-Graduação em Psicologia Social
UFPB
dc.source.none.fl_str_mv reponame:Biblioteca Digital de Teses e Dissertações da UFPB
instname:Universidade Federal da Paraíba (UFPB)
instacron:UFPB
instname_str Universidade Federal da Paraíba (UFPB)
instacron_str UFPB
institution UFPB
reponame_str Biblioteca Digital de Teses e Dissertações da UFPB
collection Biblioteca Digital de Teses e Dissertações da UFPB
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da UFPB - Universidade Federal da Paraíba (UFPB)
repository.mail.fl_str_mv diretoria@ufpb.br|| diretoria@ufpb.br
_version_ 1801842859334172672