Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis

Bibliographic details
Main author: Kauffmann, Carlos Henrique
Publication date: 2020
Document type: Thesis
Language: por
Source title: Biblioteca Digital de Teses e Dissertações da PUC_SP
Full text: https://tede2.pucsp.br/handle/handle/23128
Abstract: The goal of this study is to analyze linguistic variation across the fictional prose of Brazilian writer Machado de Assis from the perspective of style, grounded theoretically and methodologically in Corpus Linguistics (Berber Sardinha, 2004), in order to highlight the author's main communicative and thematic aesthetic dimensions. To this end, two corpora were collected: Machado de Assis's Literary Corpus (CLIMA), which comprises 9 novels and 76 short stories written by him (859,521 words), and the Comparable Literary Corpus (CLIC), a reference corpus consisting of 92 works by 23 writers from the 1850-1910 period (3,386,037 words). Both were tagged for part of speech, and their feature counts were normalized per 1,000 words. A list of the most frequent words and lemmas in the CLIMA corpus was also compiled. Several multidimensional analyses (MDAs) were then performed to extract co-occurring groups of linguistic features, or of words and lemmas, at work in textual variation, the so-called dimensions of functional or lexical variation (Biber, 1988; Berber Sardinha & Veirano Pinto, 2014, 2019). Standardized dimension scores were computed for each text, enabling comparative and contrastive stylistic analyses. Two additive MDAs are presented first, which compare the work of Machado de Assis with that of writers of the period and with the functional dimensions of current Brazilian Portuguese (Berber Sardinha, Kauffmann & Acunzo, 2014a). Next, three complete MDAs were performed: two functional ones, to verify linguistic variation across the reference corpus and within the individual work of Machado de Assis, and one lexical MDA of the CLIMA corpus, focused on identifying its main lexical dimensions. Additionally, a canonical correlation analysis was carried out to verify associations between the functional and lexical dimensions identified in Machado de Assis's work, leading to the emergence of three aesthetic dimensions, named "Formal introspective romanticism", "Sentimental oralized narrative", and "Dramatic representation".
id PUC_SP-1_eba8b2a84f163c13cfcb426dd71b18d8
oai_identifier_str oai:repositorio.pucsp.br:handle/23128
network_acronym_str PUC_SP-1
network_name_str Biblioteca Digital de Teses e Dissertações da PUC_SP
repository_id_str
spelling Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES
http://tede2.pucsp.br/tede/retrieve/52051/Carlos%20Henrique%20Kauffmann.pdf.jpg
2022-04-28 15:37:57.437
https://sapientia.pucsp.br/
https://sapientia.pucsp.br/oai/request
opendoar:2022-04-28T18:37:57
dc.title.por.fl_str_mv Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis
dc.title.alternative.eng.fl_str_mv Corpus linguistics and style: multi-dimensional and canonical analyses in Machado de Assis's fiction
title Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis
spellingShingle Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis
Kauffmann, Carlos Henrique
Estilo literário
Variação linguística
Assis, Machado de [1839-1908] - Crítica e interpretação
Literary style
Linguistic variation
Assis, Machado de [1839-1908] - Criticism and interpretation
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
title_short Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis
title_full Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis
title_fullStr Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis
title_full_unstemmed Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis
title_sort Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis
author Kauffmann, Carlos Henrique
author_facet Kauffmann, Carlos Henrique
author_role author
dc.contributor.advisor1.fl_str_mv Sardinha, Antonio Paulo Berber
dc.contributor.authorLattes.fl_str_mv http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4756048Z4
dc.contributor.author.fl_str_mv Kauffmann, Carlos Henrique
contributor_str_mv Sardinha, Antonio Paulo Berber
dc.subject.por.fl_str_mv Estilo literário
Variação linguística
Assis, Machado de [1839-1908] - Crítica e interpretação
topic Estilo literário
Variação linguística
Assis, Machado de [1839-1908] - Crítica e interpretação
Literary style
Linguistic variation
Assis, Machado de [1839-1908] - Criticism and interpretation
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
dc.subject.eng.fl_str_mv Literary style
Linguistic variation
Assis, Machado de [1839-1908] - Criticism and interpretation
dc.subject.cnpq.fl_str_mv CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
description The goal of this study is to analyze linguistic variation across the fictional prose of Brazilian writer Machado de Assis from the perspective of style, grounded theoretically and methodologically in Corpus Linguistics (Berber Sardinha, 2004), in order to highlight the author's main communicative and thematic aesthetic dimensions. To this end, two corpora were collected: Machado de Assis's Literary Corpus (CLIMA), which comprises 9 novels and 76 short stories written by him (859,521 words), and the Comparable Literary Corpus (CLIC), a reference corpus consisting of 92 works by 23 writers from the 1850-1910 period (3,386,037 words). Both were tagged for part of speech, and their feature counts were normalized per 1,000 words. A list of the most frequent words and lemmas in the CLIMA corpus was also compiled. Several multidimensional analyses (MDAs) were then performed to extract co-occurring groups of linguistic features, or of words and lemmas, at work in textual variation, the so-called dimensions of functional or lexical variation (Biber, 1988; Berber Sardinha & Veirano Pinto, 2014, 2019). Standardized dimension scores were computed for each text, enabling comparative and contrastive stylistic analyses. Two additive MDAs are presented first, which compare the work of Machado de Assis with that of writers of the period and with the functional dimensions of current Brazilian Portuguese (Berber Sardinha, Kauffmann & Acunzo, 2014a). Next, three complete MDAs were performed: two functional ones, to verify linguistic variation across the reference corpus and within the individual work of Machado de Assis, and one lexical MDA of the CLIMA corpus, focused on identifying its main lexical dimensions. Additionally, a canonical correlation analysis was carried out to verify associations between the functional and lexical dimensions identified in Machado de Assis's work, leading to the emergence of three aesthetic dimensions, named "Formal introspective romanticism", "Sentimental oralized narrative", and "Dramatic representation".
publishDate 2020
dc.date.accessioned.fl_str_mv 2020-08-04T11:21:22Z
dc.date.issued.fl_str_mv 2020-04-30
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv Kauffmann, Carlos Henrique. Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis. 2020. 277 f. Tese (Doutorado em Linguística Aplicada e Estudos da Linguagem) - Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem, Pontifícia Universidade Católica de São Paulo, São Paulo, 2020.
dc.identifier.uri.fl_str_mv https://tede2.pucsp.br/handle/handle/23128
identifier_str_mv Kauffmann, Carlos Henrique. Linguística de corpus e estilo: análises multidimensional e canônica na ficção de Machado de Assis. 2020. 277 f. Tese (Doutorado em Linguística Aplicada e Estudos da Linguagem) - Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem, Pontifícia Universidade Católica de São Paulo, São Paulo, 2020.
url https://tede2.pucsp.br/handle/handle/23128
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Pontifícia Universidade Católica de São Paulo
dc.publisher.program.fl_str_mv Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem
dc.publisher.initials.fl_str_mv PUC-SP
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv Faculdade de Filosofia, Comunicação, Letras e Artes
publisher.none.fl_str_mv Pontifícia Universidade Católica de São Paulo
dc.source.none.fl_str_mv reponame:Biblioteca Digital de Teses e Dissertações da PUC_SP
instname:Pontifícia Universidade Católica de São Paulo (PUC-SP)
instacron:PUC_SP
instname_str Pontifícia Universidade Católica de São Paulo (PUC-SP)
instacron_str PUC_SP
institution PUC_SP
reponame_str Biblioteca Digital de Teses e Dissertações da PUC_SP
collection Biblioteca Digital de Teses e Dissertações da PUC_SP
bitstream.url.fl_str_mv https://repositorio.pucsp.br/xmlui/bitstream/handle/23128/4/Carlos%20Henrique%20Kauffmann.pdf.txt
https://repositorio.pucsp.br/xmlui/bitstream/handle/23128/1/license.txt
https://repositorio.pucsp.br/xmlui/bitstream/handle/23128/2/Carlos%20Henrique%20Kauffmann.pdf
https://repositorio.pucsp.br/xmlui/bitstream/handle/23128/3/Carlos%20Henrique%20Kauffmann.pdf.jpg
bitstream.checksum.fl_str_mv 913fdbe2598edc0e23ef89ee03bae1ae
bd3efa91386c1718a7f26a329fdcb468
2bc4106ca99d086f19a41a555957fddb
8a6eba9a393273669dc038aec0098808
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP)
repository.mail.fl_str_mv bngkatende@pucsp.br||rapassi@pucsp.br
_version_ 1799796165875597312