A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional
Autor(a) principal: | |
---|---|
Data de Publicação: | 2013 |
Tipo de documento: | Tese |
Idioma: | por |
Título da fonte: | Biblioteca Digital de Teses e Dissertações da PUC_SP |
Texto Completo: | https://tede2.pucsp.br/handle/handle/13634 |
Resumo: | This study will investigate variation in the language of American movies over an 80- year time period. The corpus is a collection of 640 movie subtitles that includes comedies, dramas, action-adventure, and horror-suspense movies from 1930 to 2010 totaling, approximately, 7,750,000 tokens and 325,000 types. The sampling frame for the corpus comprised the film guide 1001 Movies you Must See before you Die , in which sixty-seven renowned film critics elected the best movies of each decade, the website www.filmsite.org, as well as nominations for the Oscar, BAFTA, and the Academy of Science Fiction, Fantasy and Horror Films awards. The corpus was tagged for part of speech with the Biber tagger, and post-processed with the TagCount programs, which provides further syntactic and semantic analyses. This study will provide two different MD analyses, namely one that uses the existing 1988 dimensions to characterize the movies, and another that extracts new dimensions of variation based solely on movies. For the first one, scores were computed for each movie on each of the first five 1988 dimensions, for the corpus as a whole as well as for each individual movie genre. And for the second analysis, a new factor analysis was run to determine the factors that best represent the data, which were then interpreted in terms of underlying dimensions of variation. For both analyses, the effects of a set of independent variables were considered, including year of first screening, award nomination/win, length, genre, director, ratings, original/adapted script and production studio. Cluster analyses were also carried out to determine the optimal groups of movies within each dimension, such that they cut across the existing genre labels and provided characterizations that were grounded on linguistic characteristics rather than marketdriven concerns. The chapter will present each analysis together with text samples that illustrate the salient features of movies on each dimension. It will also present the results for cluster analysis that dissects time periods as well as statistical tests on these clusters that determine whether there are significant differences across movie genres over time. |
id |
PUC_SP-1_aed392fd03bae3cb7d5ae26ba084a24c |
---|---|
oai_identifier_str |
oai:repositorio.pucsp.br:handle/13634 |
network_acronym_str |
PUC_SP-1 |
network_name_str |
Biblioteca Digital de Teses e Dissertações da PUC_SP |
repository_id_str |
|
spelling |
Sardinha, Antonio Paulo Berberhttp://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4233055U9Pinto, Marcia Veirano2016-04-28T18:22:42Z2013-09-052013-08-05Pinto, Marcia Veirano. A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional. 2013. 489 f. Tese (Doutorado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2013.https://tede2.pucsp.br/handle/handle/13634This study will investigate variation in the language of American movies over an 80- year time period. The corpus is a collection of 640 movie subtitles that includes comedies, dramas, action-adventure, and horror-suspense movies from 1930 to 2010 totaling, approximately, 7,750,000 tokens and 325,000 types. The sampling frame for the corpus comprised the film guide 1001 Movies you Must See before you Die , in which sixty-seven renowned film critics elected the best movies of each decade, the website www.filmsite.org, as well as nominations for the Oscar, BAFTA, and the Academy of Science Fiction, Fantasy and Horror Films awards. The corpus was tagged for part of speech with the Biber tagger, and post-processed with the TagCount programs, which provides further syntactic and semantic analyses. This study will provide two different MD analyses, namely one that uses the existing 1988 dimensions to characterize the movies, and another that extracts new dimensions of variation based solely on movies. For the first one, scores were computed for each movie on each of the first five 1988 dimensions, for the corpus as a whole as well as for each individual movie genre. And for the second analysis, a new factor analysis was run to determine the factors that best represent the data, which were then interpreted in terms of underlying dimensions of variation. For both analyses, the effects of a set of independent variables were considered, including year of first screening, award nomination/win, length, genre, director, ratings, original/adapted script and production studio. Cluster analyses were also carried out to determine the optimal groups of movies within each dimension, such that they cut across the existing genre labels and provided characterizations that were grounded on linguistic characteristics rather than marketdriven concerns. The chapter will present each analysis together with text samples that illustrate the salient features of movies on each dimension. It will also present the results for cluster analysis that dissects time periods as well as statistical tests on these clusters that determine whether there are significant differences across movie genres over time.O objetivo desta pesquisa é investigar dimensões de variação na linguagem dos filmes americanos ao longo de 80 anos de cinema falado. Para tanto, buscou-se suporte teórico na visão de linguagem da Linguística de Corpus e na abordagem metodológica da Análise Multidimensional. Mais especificamente, nas duas modalidades propostas por essa metodologia: a que mapeia o registro filme nas 5 dimensões de variação do inglês apontadas por Biber (1988), e a que propõe novas dimensões de variação baseadas apenas nos textos retratados dos filmes. Ambas as modalidades de análise levaram em conta as variáveis extralinguísticas: ano em que os filmes foram apresentados ao público, indicações/ premiações, duração, gênero, diretores, roteiro original versus adaptado, estúdio em que foram produzidos e avaliação da crítica. O corpus utilizado na pesquisa é composto por 640 legendas de filmes de gêneros diversos organizados sob quatro grandes categorias: comédia, drama, ação-aventura e horror-suspense-mistério. Os filmes foram selecionados com base no guia de filmes 1001 filmes par a ver antes de morrer , elaborado por sessenta e sete críticos de cinema renomados, que elegeram os melhores filmes de cada década, do sítio www.filmesite.org, e das nomeações e premiações recebidas pelas seguintes Academias: Oscar, BAFTA e Filmes de Horror, Fantasia e Ficção Científica (The Academy of Science Fiction, Fantasy and Horror Films). Para que se pudesse concluir o trabalho com a proposição de uma nova categorização para os filmes, baseada nas dimensões de variação encontradas recorreuse à análise estatística de Clusters, que permitiu o agrupamento ideal dos filmes de acordo com suas características morfossintáticas e semânticasapplication/pdfhttp://tede2.pucsp.br/tede/retrieve/30354/Marcia%20Veirano%20Pinto.pdf.jpgporPontifícia Universidade Católica de São PauloPrograma de Estudos Pós-Graduados em Linguística Aplicada e Estudos da LinguagemPUC-SPBRLingüísticaLinguagem dos filmes americanosLanguage of American moviesCNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADAA linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensionalinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisinfo:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da PUC_SPinstname:Pontifícia Universidade Católica de São Paulo (PUC-SP)instacron:PUC_SPTEXTMarcia Veirano Pinto.pdf.txtMarcia Veirano Pinto.pdf.txtExtracted texttext/plain1042134https://repositorio.pucsp.br/xmlui/bitstream/handle/13634/3/Marcia%20Veirano%20Pinto.pdf.txt76e69be017877c1a4f077acc1ae9347fMD53ORIGINALMarcia Veirano Pinto.pdfapplication/pdf5387766https://repositorio.pucsp.br/xmlui/bitstream/handle/13634/1/Marcia%20Veirano%20Pinto.pdf9f381764386ef7108e63b97937f3f77eMD51THUMBNAILMarcia Veirano Pinto.pdf.jpgMarcia Veirano Pinto.pdf.jpgGenerated Thumbnailimage/jpeg1943https://repositorio.pucsp.br/xmlui/bitstream/handle/13634/2/Marcia%20Veirano%20Pinto.pdf.jpgcc73c4c239a4c332d642ba1e7c7a9fb2MD52handle/136342022-04-27 22:18:37.298oai:repositorio.pucsp.br:handle/13634Biblioteca Digital de Teses e Dissertaçõeshttps://sapientia.pucsp.br/https://sapientia.pucsp.br/oai/requestbngkatende@pucsp.br||rapassi@pucsp.bropendoar:2022-04-28T01:18:37Biblioteca Digital de Teses e Dissertações da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP)false |
dc.title.por.fl_str_mv |
A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional |
title |
A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional |
spellingShingle |
A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional Pinto, Marcia Veirano Linguagem dos filmes americanos Language of American movies CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
title_short |
A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional |
title_full |
A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional |
title_fullStr |
A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional |
title_full_unstemmed |
A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional |
title_sort |
A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional |
author |
Pinto, Marcia Veirano |
author_facet |
Pinto, Marcia Veirano |
author_role |
author |
dc.contributor.advisor1.fl_str_mv |
Sardinha, Antonio Paulo Berber |
dc.contributor.authorLattes.fl_str_mv |
http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4233055U9 |
dc.contributor.author.fl_str_mv |
Pinto, Marcia Veirano |
contributor_str_mv |
Sardinha, Antonio Paulo Berber |
dc.subject.por.fl_str_mv |
Linguagem dos filmes americanos |
topic |
Linguagem dos filmes americanos Language of American movies CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
dc.subject.eng.fl_str_mv |
Language of American movies |
dc.subject.cnpq.fl_str_mv |
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
description |
This study will investigate variation in the language of American movies over an 80- year time period. The corpus is a collection of 640 movie subtitles that includes comedies, dramas, action-adventure, and horror-suspense movies from 1930 to 2010 totaling, approximately, 7,750,000 tokens and 325,000 types. The sampling frame for the corpus comprised the film guide 1001 Movies you Must See before you Die , in which sixty-seven renowned film critics elected the best movies of each decade, the website www.filmsite.org, as well as nominations for the Oscar, BAFTA, and the Academy of Science Fiction, Fantasy and Horror Films awards. The corpus was tagged for part of speech with the Biber tagger, and post-processed with the TagCount programs, which provides further syntactic and semantic analyses. This study will provide two different MD analyses, namely one that uses the existing 1988 dimensions to characterize the movies, and another that extracts new dimensions of variation based solely on movies. For the first one, scores were computed for each movie on each of the first five 1988 dimensions, for the corpus as a whole as well as for each individual movie genre. And for the second analysis, a new factor analysis was run to determine the factors that best represent the data, which were then interpreted in terms of underlying dimensions of variation. For both analyses, the effects of a set of independent variables were considered, including year of first screening, award nomination/win, length, genre, director, ratings, original/adapted script and production studio. Cluster analyses were also carried out to determine the optimal groups of movies within each dimension, such that they cut across the existing genre labels and provided characterizations that were grounded on linguistic characteristics rather than marketdriven concerns. The chapter will present each analysis together with text samples that illustrate the salient features of movies on each dimension. It will also present the results for cluster analysis that dissects time periods as well as statistical tests on these clusters that determine whether there are significant differences across movie genres over time. |
publishDate |
2013 |
dc.date.available.fl_str_mv |
2013-09-05 |
dc.date.issued.fl_str_mv |
2013-08-05 |
dc.date.accessioned.fl_str_mv |
2016-04-28T18:22:42Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/doctoralThesis |
format |
doctoralThesis |
status_str |
publishedVersion |
dc.identifier.citation.fl_str_mv |
Pinto, Marcia Veirano. A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional. 2013. 489 f. Tese (Doutorado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2013. |
dc.identifier.uri.fl_str_mv |
https://tede2.pucsp.br/handle/handle/13634 |
identifier_str_mv |
Pinto, Marcia Veirano. A linguagem dos filmes norte-americanos ao longo dos anos: uma abordagem multidimensional. 2013. 489 f. Tese (Doutorado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2013. |
url |
https://tede2.pucsp.br/handle/handle/13634 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Pontifícia Universidade Católica de São Paulo |
dc.publisher.program.fl_str_mv |
Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem |
dc.publisher.initials.fl_str_mv |
PUC-SP |
dc.publisher.country.fl_str_mv |
BR |
dc.publisher.department.fl_str_mv |
Lingüística |
publisher.none.fl_str_mv |
Pontifícia Universidade Católica de São Paulo |
dc.source.none.fl_str_mv |
reponame:Biblioteca Digital de Teses e Dissertações da PUC_SP instname:Pontifícia Universidade Católica de São Paulo (PUC-SP) instacron:PUC_SP |
instname_str |
Pontifícia Universidade Católica de São Paulo (PUC-SP) |
instacron_str |
PUC_SP |
institution |
PUC_SP |
reponame_str |
Biblioteca Digital de Teses e Dissertações da PUC_SP |
collection |
Biblioteca Digital de Teses e Dissertações da PUC_SP |
bitstream.url.fl_str_mv |
https://repositorio.pucsp.br/xmlui/bitstream/handle/13634/3/Marcia%20Veirano%20Pinto.pdf.txt https://repositorio.pucsp.br/xmlui/bitstream/handle/13634/1/Marcia%20Veirano%20Pinto.pdf https://repositorio.pucsp.br/xmlui/bitstream/handle/13634/2/Marcia%20Veirano%20Pinto.pdf.jpg |
bitstream.checksum.fl_str_mv |
76e69be017877c1a4f077acc1ae9347f 9f381764386ef7108e63b97937f3f77e cc73c4c239a4c332d642ba1e7c7a9fb2 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 |
repository.name.fl_str_mv |
Biblioteca Digital de Teses e Dissertações da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP) |
repository.mail.fl_str_mv |
bngkatende@pucsp.br||rapassi@pucsp.br |
_version_ |
1809277952132644864 |