A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional

Bibliographic details
Main author: Pinto, Marcia Veirano
Publication date: 2013
Document type: Thesis
Language: por
Source title: Biblioteca Digital de Teses e Dissertações da PUC_SP
Full text: https://tede2.pucsp.br/handle/handle/13634
Abstract: This study will investigate variation in the language of American movies over an 80-year time period. The corpus is a collection of 640 movie subtitles that includes comedies, dramas, action-adventure, and horror-suspense movies from 1930 to 2010, totaling approximately 7,750,000 tokens and 325,000 types. The sampling frame for the corpus comprised the film guide 1001 Movies You Must See Before You Die, in which sixty-seven renowned film critics selected the best movies of each decade, the website www.filmsite.org, as well as nominations for the Oscar, BAFTA, and the Academy of Science Fiction, Fantasy and Horror Films awards. The corpus was tagged for part of speech with the Biber tagger and post-processed with the TagCount programs, which provide further syntactic and semantic analyses. This study will provide two different MD analyses: one that uses the existing 1988 dimensions to characterize the movies, and another that extracts new dimensions of variation based solely on movies. For the first, scores were computed for each movie on each of the first five 1988 dimensions, for the corpus as a whole as well as for each individual movie genre. For the second, a new factor analysis was run to determine the factors that best represent the data, which were then interpreted in terms of underlying dimensions of variation. For both analyses, the effects of a set of independent variables were considered, including year of first screening, award nomination/win, length, genre, director, ratings, original/adapted script, and production studio. Cluster analyses were also carried out to determine the optimal groups of movies within each dimension, such that they cut across the existing genre labels and provided characterizations grounded in linguistic characteristics rather than market-driven concerns. The chapter will present each analysis together with text samples that illustrate the salient features of movies on each dimension. It will also present the results of a cluster analysis that dissects time periods, as well as statistical tests on these clusters that determine whether there are significant differences across movie genres over time.
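The abstract says that scores were computed for each movie on each dimension. A minimal sketch of how such a score is commonly obtained in a multidimensional analysis is given below: each feature rate is standardized, then the standardized values of the features with positive loadings are summed and those with negative loadings are subtracted. The feature names, the rates, and the assignment of features to the positive and negative sides are invented for illustration and are not taken from the thesis. The sketch also standardizes within its own tiny sample, whereas a mapping onto the existing 1988 dimensions would normally standardize against the reference corpus.

```python
from statistics import mean, stdev


def z_scores(values):
    """Standardize a list of feature rates to mean 0 and standard deviation 1."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]


def dimension_scores(rates, positive, negative):
    """Return one dimension score per text.

    rates: dict mapping feature name -> list of rates, one per text.
    positive / negative: feature names loading on each pole (hypothetical here).
    """
    z = {feature: z_scores(values) for feature, values in rates.items()}
    n_texts = len(next(iter(rates.values())))
    return [
        sum(z[f][i] for f in positive) - sum(z[f][i] for f in negative)
        for i in range(n_texts)
    ]


# Hypothetical rates per 1,000 words for three invented movies.
rates = {
    "contractions": [30.0, 20.0, 10.0],
    "second_person_pronouns": [50.0, 40.0, 30.0],
    "nouns": [150.0, 200.0, 250.0],
}
scores = dimension_scores(
    rates,
    positive=["contractions", "second_person_pronouns"],
    negative=["nouns"],
)
print(scores)
```

With these invented numbers the first movie lands on the positive pole, the third on the negative pole, and the second in the middle, which is the kind of per-movie placement the abstract describes.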
id PUC_SP-1_aed392fd03bae3cb7d5ae26ba084a24c
oai_identifier_str oai:repositorio.pucsp.br:handle/13634
network_acronym_str PUC_SP-1
network_name_str Biblioteca Digital de Teses e Dissertações da PUC_SP
repository_id_str
spelling The aim of this research is to investigate dimensions of variation in the language of American movies over 80 years of sound cinema. To that end, theoretical support was sought in the view of language held by Corpus Linguistics and in the methodological approach of Multidimensional Analysis, more specifically in the two modes this methodology offers: one that maps the movie register onto the 5 dimensions of variation in English identified by Biber (1988), and one that proposes new dimensions of variation based solely on the texts of the movies themselves. Both modes of analysis took the following extralinguistic variables into account: year in which the movies were first shown to the public, nominations/awards, length, genre, directors, original versus adapted script, production studio, and critical ratings. The corpus used in the research consists of 640 subtitle files from movies of various genres, organized under four broad categories: comedy, drama, action-adventure, and horror-suspense-mystery. The movies were selected on the basis of the film guide 1001 Movies You Must See Before You Die, compiled by sixty-seven renowned film critics who selected the best movies of each decade, the website www.filmesite.org, and the nominations and awards granted by the following academies: Oscar, BAFTA, and The Academy of Science Fiction, Fantasy and Horror Films. So that the work could conclude by proposing a new categorization of movies based on the dimensions of variation found, cluster analysis was used, which allowed the optimal grouping of the movies according to their morphosyntactic and semantic characteristics.
dc.title.por.fl_str_mv A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional
title A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional
author Pinto, Marcia Veirano
author_facet Pinto, Marcia Veirano
author_role author
dc.contributor.advisor1.fl_str_mv Sardinha, Antonio Paulo Berber
dc.contributor.authorLattes.fl_str_mv http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4233055U9
dc.contributor.author.fl_str_mv Pinto, Marcia Veirano
contributor_str_mv Sardinha, Antonio Paulo Berber
dc.subject.por.fl_str_mv Linguagem dos filmes americanos
topic Linguagem dos filmes americanos
Language of American movies
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
dc.subject.eng.fl_str_mv Language of American movies
dc.subject.cnpq.fl_str_mv CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
publishDate 2013
dc.date.available.fl_str_mv 2013-09-05
dc.date.issued.fl_str_mv 2013-08-05
dc.date.accessioned.fl_str_mv 2016-04-28T18:22:42Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv Pinto, Marcia Veirano. A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional. 2013. 489 f. Tese (Doutorado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2013.
dc.identifier.uri.fl_str_mv https://tede2.pucsp.br/handle/handle/13634
identifier_str_mv Pinto, Marcia Veirano. A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional. 2013. 489 f. Tese (Doutorado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2013.
url https://tede2.pucsp.br/handle/handle/13634
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Pontifícia Universidade Católica de São Paulo
dc.publisher.program.fl_str_mv Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem
dc.publisher.initials.fl_str_mv PUC-SP
dc.publisher.country.fl_str_mv BR
dc.publisher.department.fl_str_mv Lingüística
publisher.none.fl_str_mv Pontifícia Universidade Católica de São Paulo
dc.source.none.fl_str_mv reponame:Biblioteca Digital de Teses e Dissertações da PUC_SP
instname:Pontifícia Universidade Católica de São Paulo (PUC-SP)
instacron:PUC_SP
instname_str Pontifícia Universidade Católica de São Paulo (PUC-SP)
instacron_str PUC_SP
institution PUC_SP
reponame_str Biblioteca Digital de Teses e Dissertações da PUC_SP
collection Biblioteca Digital de Teses e Dissertações da PUC_SP
bitstream.url.fl_str_mv https://repositorio.pucsp.br/xmlui/bitstream/handle/13634/3/Marcia%20Veirano%20Pinto.pdf.txt
https://repositorio.pucsp.br/xmlui/bitstream/handle/13634/1/Marcia%20Veirano%20Pinto.pdf
https://repositorio.pucsp.br/xmlui/bitstream/handle/13634/2/Marcia%20Veirano%20Pinto.pdf.jpg
bitstream.checksum.fl_str_mv 76e69be017877c1a4f077acc1ae9347f
9f381764386ef7108e63b97937f3f77e
cc73c4c239a4c332d642ba1e7c7a9fb2
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP)
repository.mail.fl_str_mv bngkatende@pucsp.br||rapassi@pucsp.br
_version_ 1809277952132644864