Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro

Alves, Fernando Cabral


Bibliographic Details
Main Author:	Alves, Fernando Cabral
Publication Date:	2017
Format:	Master thesis
Language:	por
Source:	Biblioteca Digital de Teses e Dissertações da UFPB
Download full:	https://repositorio.ufpb.br/jspui/handle/123456789/14271
Summary:	The present work belongs to the body of research that seeks to represent and investigate linguistic systems using mathematical models. In this context, a Maximum Entropy model of phonotactics developed by Hayes and Wilson (2008) exhibited a high level of accuracy against experimental data when applied to English, outperforming other phonotactic modelling proposals. Nevertheless, despite these good results, we are unaware of any work in Brazil that makes use of this model or of Maximum Entropy models in general. Since the model is universal (i.e. applicable to any language), our objective is to measure the accuracy of the model when applied to Brazilian Portuguese. The text is divided into three chapters. In the first chapter, we describe in detail the model to be tested. In the second, we present the methodology employed to: i) apply the phonotactic model to Brazilian Portuguese; and ii) collect experimental data against which to measure the accuracy of the model predictions obtained in i). The methodological procedures involved the creation of two pieces of software, one for automated phonological transcription of Brazilian Portuguese and another for carrying out magnitude estimation experiments. Finally, in chapter three we present the results. In the two applications, the correlation between model predictions and experimental data, measured by the Pearson coefficient, was found to be around 0 and 0.5, thus showing a much weaker linear dependence than that found for English (0.946).

Item metadata

id	UFPB_e54f5ee270345dba4baad811fb9988b4
oai_identifier_str	oai:repositorio.ufpb.br:123456789/14271
network_acronym_str	UFPB
network_name_str	Biblioteca Digital de Teses e Dissertações da UFPB
repository_id_str
spelling	Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro Fonotática Entropia máxima Aprendizado de máquina Estimação de magnitude Phonotactics Maximum entropy Machine learning Magnitude estimation CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA The present work belongs to the body of research that seeks to represent and investigate linguistic systems using mathematical models. In this context, a Maximum Entropy model of phonotactics and phonotactic learning developed by Hayes and Wilson (2008) exhibited a high correlation with experimental data when applied to English, outperforming other phonotactic modelling proposals (Hayes & Wilson, 2008, p. 401). Nevertheless, despite these good results, we are unaware of any work in Brazil that makes use of this model or of Maximum Entropy models in general. Since the model is universal (i.e. applicable to any language), our objective is to measure the accuracy of the model when applied to Brazilian Portuguese. The text is divided into three chapters. In the first chapter, we describe in detail the model to be tested. In the second, we present the methodology employed to: i) apply the phonotactic model to Brazilian Portuguese; and ii) collect experimental data against which to measure the accuracy of the model predictions obtained in i). The methodological procedures involved the creation of two pieces of software, one for automated phonological transcription of Brazilian Portuguese and another for carrying out magnitude estimation experiments. Finally, in chapter three we present the results. In the two applications, the correlation between model predictions and experimental data, measured by the Pearson coefficient, was found to be around 0 and 0.5, thus showing a much weaker linear dependence than that found for English (0.946). Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES Universidade Federal da Paraíba Brasil Linguística Programa de Pós-Graduação em Linguística UFPB Lucena, Rubens Marques de http://lattes.cnpq.br/1376297327951154 Alves, Fernando Cabral 2019-05-16T16:32:52Z 2019-05-16 2019-05-16T16:32:52Z 2017-03-29 info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/masterThesis https://repositorio.ufpb.br/jspui/handle/123456789/14271 por Attribution-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nd/3.0/br/ info:eu-repo/semantics/openAccess reponame:Biblioteca Digital de Teses e Dissertações da UFPB instname:Universidade Federal da Paraíba (UFPB) instacron:UFPB 2019-05-17T06:04:55Z oai:repositorio.ufpb.br:123456789/14271 Biblioteca Digital de Teses e Dissertações https://repositorio.ufpb.br/ PUB http://tede.biblioteca.ufpb.br:8080/oai/request diretoria@ufpb.br opendoar:2019-05-17T06:04:55 Biblioteca Digital de Teses e Dissertações da UFPB - Universidade Federal da Paraíba (UFPB) false
dc.title.none.fl_str_mv	Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro
title	Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro
spellingShingle	Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro Alves, Fernando Cabral Fonotática Entropia máxima Aprendizado de máquina Estimação de magnitude Phonotactics Maximum entropy Machine learning Magnitude estimation CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA
title_short	Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro
title_full	Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro
title_fullStr	Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro
title_full_unstemmed	Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro
title_sort	Acurácia de um modelo fonotático de entropia máxima aplicado ao português brasileiro
author	Alves, Fernando Cabral
author_facet	Alves, Fernando Cabral
author_role	author
dc.contributor.none.fl_str_mv	Lucena, Rubens Marques de http://lattes.cnpq.br/1376297327951154
dc.contributor.author.fl_str_mv	Alves, Fernando Cabral
dc.subject.por.fl_str_mv	Fonotática Entropia máxima Aprendizado de máquina Estimação de magnitude Phonotactics Maximum entropy Machine learning Magnitude estimation CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA
topic	Fonotática Entropia máxima Aprendizado de máquina Estimação de magnitude Phonotactics Maximum entropy Machine learning Magnitude estimation CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA
description	The present work belongs to the body of research that seeks to represent and investigate linguistic systems using mathematical models. In this context, a Maximum Entropy model of phonotactics developed by Hayes and Wilson (2008) exhibited a high level of accuracy against experimental data when applied to English, outperforming other phonotactic modelling proposals. Nevertheless, despite these good results, we are unaware of any work in Brazil that makes use of this model or of Maximum Entropy models in general. Since the model is universal (i.e. applicable to any language), our objective is to measure the accuracy of the model when applied to Brazilian Portuguese. The text is divided into three chapters. In the first chapter, we describe in detail the model to be tested. In the second, we present the methodology employed to: i) apply the phonotactic model to Brazilian Portuguese; and ii) collect experimental data against which to measure the accuracy of the model predictions obtained in i). The methodological procedures involved the creation of two pieces of software, one for automated phonological transcription of Brazilian Portuguese and another for carrying out magnitude estimation experiments. Finally, in chapter three we present the results. In the two applications, the correlation between model predictions and experimental data, measured by the Pearson coefficient, was found to be around 0 and 0.5, thus showing a much weaker linear dependence than that found for English (0.946).
publishDate	2017
dc.date.none.fl_str_mv	2017-03-29 2019-05-16T16:32:52Z 2019-05-16 2019-05-16T16:32:52Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://repositorio.ufpb.br/jspui/handle/123456789/14271
url	https://repositorio.ufpb.br/jspui/handle/123456789/14271
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	Attribution-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nd/3.0/br/ info:eu-repo/semantics/openAccess
rights_invalid_str_mv	Attribution-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nd/3.0/br/
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Universidade Federal da Paraíba Brasil Linguística Programa de Pós-Graduação em Linguística UFPB
publisher.none.fl_str_mv	Universidade Federal da Paraíba Brasil Linguística Programa de Pós-Graduação em Linguística UFPB
dc.source.none.fl_str_mv	reponame:Biblioteca Digital de Teses e Dissertações da UFPB instname:Universidade Federal da Paraíba (UFPB) instacron:UFPB
instname_str	Universidade Federal da Paraíba (UFPB)
instacron_str	UFPB
institution	UFPB
reponame_str	Biblioteca Digital de Teses e Dissertações da UFPB
collection	Biblioteca Digital de Teses e Dissertações da UFPB
repository.name.fl_str_mv	Biblioteca Digital de Teses e Dissertações da UFPB - Universidade Federal da Paraíba (UFPB)
repository.mail.fl_str_mv	diretoria@ufpb.br
_version_	1798963960557338624


Similar Items