Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico

Detalhes bibliográficos
Autor(a) principal: Mazza, Luciene Novais
Data de Publicação: 2009
Tipo de documento: Dissertação
Idioma: por
Título da fonte: Biblioteca Digital de Teses e Dissertações da PUC_SP
Texto Completo: https://tede2.pucsp.br/handle/handle/14101
Resumo: The present study explored a specific document of the pharmaceutical segment called Site Master File through the investigation of words combinations defined as lexical bundles (Biber et al.,1999). The aim of the study was to draw out the bundles so that to verify the degree of conformity of the linguistic features the use of lexical bundles may achieve, as being part of a document organized in a similar way, produced by different authors at different locations around the world. The theoretical-methodological approach was developed on the principles of Corpus Linguistics (Stubbs, 1996; Scott and Tribble, 2006; Berber Sardinha and Barbara, 2008; amongst others), an approach that makes use of a vast variety of authentic texts of language in use supported by computational tools. We compiled for this study fifteen samples of the Site Master File document stored in machine-readable form that belong to the same multinational pharmaceutical company based in Europe, which has more than a hundred of plants situated across the world. The Site Master File is a document prepared by pharmaceutical manufacturers that contains specific information about the quality assurance, the production and quality control of pharmaceutical manufacturing operations carried out at a named site/plant in order to be submitted to a regulatory authority. In addition, all documents must be officially certified in English. The analysis of the corpus data was performed to extract three-word bundles by using scripting languages such as Perl and Cygwin. Besides, a computer application was also designed to provide the cross-reference of data. The results of data analysis showed that although the samples of Site Master File bring a large range of similarity in its organization, we have not found regularity on the use of recurrent lexical bundles across the Site Master File documents. Thus, considering the absences of common lexical bundles across documents, we observed that, in each operating area of the pharmaceutical business unit there are some typical characteristics in relation to the type of product manufactured in the site, the processes engaged in the unit pharmaceutical operations as well as the geographic nearness relationships to the linguistic choices made by the different authors. Therefore, this study offers a contribution to the knowledge of variation in English use in preparing the Site Master File by authors allocated in a specific site. Moreover, the present study involves further research into the field of English for Specific Purposes based on corpora and into the studies of terminology
id PUC_SP-1_afa05e5d97241629bae83246e8a9ca46
oai_identifier_str oai:repositorio.pucsp.br:handle/14101
network_acronym_str PUC_SP-1
network_name_str Biblioteca Digital de Teses e Dissertações da PUC_SP
repository_id_str
spelling Ramos, Rosinda de Castro Guerrahttp://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4228809E3Mazza, Luciene Novais2016-04-28T18:24:09Z2015-03-232009-07-29Mazza, Luciene Novais. Lexical bundles searching similarities in a document of pharmaceutical sector. 2009. 149 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2009.https://tede2.pucsp.br/handle/handle/14101The present study explored a specific document of the pharmaceutical segment called Site Master File through the investigation of words combinations defined as lexical bundles (Biber et al.,1999). The aim of the study was to draw out the bundles so that to verify the degree of conformity of the linguistic features the use of lexical bundles may achieve, as being part of a document organized in a similar way, produced by different authors at different locations around the world. The theoretical-methodological approach was developed on the principles of Corpus Linguistics (Stubbs, 1996; Scott and Tribble, 2006; Berber Sardinha and Barbara, 2008; amongst others), an approach that makes use of a vast variety of authentic texts of language in use supported by computational tools. We compiled for this study fifteen samples of the Site Master File document stored in machine-readable form that belong to the same multinational pharmaceutical company based in Europe, which has more than a hundred of plants situated across the world. The Site Master File is a document prepared by pharmaceutical manufacturers that contains specific information about the quality assurance, the production and quality control of pharmaceutical manufacturing operations carried out at a named site/plant in order to be submitted to a regulatory authority. In addition, all documents must be officially certified in English. The analysis of the corpus data was performed to extract three-word bundles by using scripting languages such as Perl and Cygwin. Besides, a computer application was also designed to provide the cross-reference of data. The results of data analysis showed that although the samples of Site Master File bring a large range of similarity in its organization, we have not found regularity on the use of recurrent lexical bundles across the Site Master File documents. Thus, considering the absences of common lexical bundles across documents, we observed that, in each operating area of the pharmaceutical business unit there are some typical characteristics in relation to the type of product manufactured in the site, the processes engaged in the unit pharmaceutical operations as well as the geographic nearness relationships to the linguistic choices made by the different authors. Therefore, this study offers a contribution to the knowledge of variation in English use in preparing the Site Master File by authors allocated in a specific site. Moreover, the present study involves further research into the field of English for Specific Purposes based on corpora and into the studies of terminologyO objetivo deste trabalho foi examinar o documento Site Master File do setor farmacêutico a partir da investigação de uma combinação de palavras denominada lexical bundles (Biber et al. 1999) com o propósito de verificar o grau de conformidade com elementos lingüísticos que um documento com a mesma organização estrutural, escrita por diferentes autores em diferentes partes do mundo pode atingir. A presente pesquisa teve como principal suporte teórico e metodológico a Lingüística de Corpus (Stubbs, 1996; Scott e Tribble, 2006; Berber Sardinha e Barbara, 2008; entre outros), uma abordagem que permite investigar como a língua ocorre naturalmente no discurso por meio de ferramentas computacionais. Para esta investigação foram compilados quinze exemplares do documento Site Master File pertencente a um mesmo grupo farmacêutico multinacional com sede na Europa e com unidades de negócios espalhadas em mais de 100 países. O documento Site Master File é um conjunto de textos produzidos pelas indústrias farmacêuticas para atender as exigências de garantia e controle da qualidade dos medicamentos, a fim de se obter certificação internacional junto aos órgãos de vigilância sanitária. Ademais, todos os documentos devem ser oficialmente produzidos em língua inglesa. Para a análise dos dados foram utilizadas as linguagens de programação Perl e Cygwin, como também foi desenvolvido um aplicativo para gerar a extração dos lexical bundles de três palavras. Os resultados da análise dos dados indicaram, que embora o documento Site Master File apresente semelhanças em sua organização, não há uma regularidade de lexical bundles recorrentes entre as amostras dos quinze exemplares. Assim, dessa ausência de bundles semelhantes, foi possível observar traços característicos do tipo de negócio que cada unidade da empresa está envolvida, dos processos e produtos fabricados e, ainda, a relação da proximidade geográfica com as escolhas lingüísticas feitas pelos autores. Portanto, este estudo além de contribuir para o conhecimento das variações de uso da língua inglesa por autores de diferentes localidades na elaboração do documento Site Master File, também implica em futuras pesquisas no ensino de línguas para fins específicos baseado em corpora e nos estudos sobre terminologiaConselho Nacional de Desenvolvimento Científico e Tecnológicoapplication/pdfhttp://tede2.pucsp.br/tede/retrieve/29377/Luciene%20Novais%20Mazza.pdf.jpgporPontifícia Universidade Católica de São PauloPrograma de Estudos Pós-Graduados em Linguística Aplicada e Estudos da LinguagemPUC-SPBRLingüísticaLingüística de corpusSite Master FileCorpus (Linguistica)Lingua inglesa -- Analise do discursoLexical bundlesCorpus linguisticsCNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADAOs lexical bundles na busca por semelhanças em um documento do setor farmacêuticoLexical bundles searching similarities in a document of pharmaceutical sectorinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da PUC_SPinstname:Pontifícia Universidade Católica de São Paulo (PUC-SP)instacron:PUC_SPTEXTLuciene Novais Mazza.pdf.txtLuciene Novais Mazza.pdf.txtExtracted texttext/plain314007https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/3/Luciene%20Novais%20Mazza.pdf.txt464bf7a983901563b6b9ce333cd6eac0MD53ORIGINALLuciene Novais Mazza.pdfapplication/pdf2077236https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/1/Luciene%20Novais%20Mazza.pdfab60b489f57494f9b4bd86ae30c618c1MD51THUMBNAILLuciene Novais Mazza.pdf.jpgLuciene Novais Mazza.pdf.jpgGenerated Thumbnailimage/jpeg1943https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/2/Luciene%20Novais%20Mazza.pdf.jpgcc73c4c239a4c332d642ba1e7c7a9fb2MD52handle/141012022-04-28 02:07:23.478oai:repositorio.pucsp.br:handle/14101Biblioteca Digital de Teses e Dissertaçõeshttps://sapientia.pucsp.br/https://sapientia.pucsp.br/oai/requestbngkatende@pucsp.br||rapassi@pucsp.bropendoar:2022-04-28T05:07:23Biblioteca Digital de Teses e Dissertações da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP)false
dc.title.por.fl_str_mv Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico
dc.title.alternative.eng.fl_str_mv Lexical bundles searching similarities in a document of pharmaceutical sector
title Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico
spellingShingle Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico
Mazza, Luciene Novais
Lingüística de corpus
Site Master File
Corpus (Linguistica)
Lingua inglesa -- Analise do discurso
Lexical bundles
Corpus linguistics
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
title_short Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico
title_full Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico
title_fullStr Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico
title_full_unstemmed Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico
title_sort Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico
author Mazza, Luciene Novais
author_facet Mazza, Luciene Novais
author_role author
dc.contributor.advisor1.fl_str_mv Ramos, Rosinda de Castro Guerra
dc.contributor.authorLattes.fl_str_mv http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4228809E3
dc.contributor.author.fl_str_mv Mazza, Luciene Novais
contributor_str_mv Ramos, Rosinda de Castro Guerra
dc.subject.por.fl_str_mv Lingüística de corpus
Site Master File
Corpus (Linguistica)
Lingua inglesa -- Analise do discurso
topic Lingüística de corpus
Site Master File
Corpus (Linguistica)
Lingua inglesa -- Analise do discurso
Lexical bundles
Corpus linguistics
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
dc.subject.eng.fl_str_mv Lexical bundles
Corpus linguistics
dc.subject.cnpq.fl_str_mv CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
description The present study explored a specific document of the pharmaceutical segment called Site Master File through the investigation of words combinations defined as lexical bundles (Biber et al.,1999). The aim of the study was to draw out the bundles so that to verify the degree of conformity of the linguistic features the use of lexical bundles may achieve, as being part of a document organized in a similar way, produced by different authors at different locations around the world. The theoretical-methodological approach was developed on the principles of Corpus Linguistics (Stubbs, 1996; Scott and Tribble, 2006; Berber Sardinha and Barbara, 2008; amongst others), an approach that makes use of a vast variety of authentic texts of language in use supported by computational tools. We compiled for this study fifteen samples of the Site Master File document stored in machine-readable form that belong to the same multinational pharmaceutical company based in Europe, which has more than a hundred of plants situated across the world. The Site Master File is a document prepared by pharmaceutical manufacturers that contains specific information about the quality assurance, the production and quality control of pharmaceutical manufacturing operations carried out at a named site/plant in order to be submitted to a regulatory authority. In addition, all documents must be officially certified in English. The analysis of the corpus data was performed to extract three-word bundles by using scripting languages such as Perl and Cygwin. Besides, a computer application was also designed to provide the cross-reference of data. The results of data analysis showed that although the samples of Site Master File bring a large range of similarity in its organization, we have not found regularity on the use of recurrent lexical bundles across the Site Master File documents. Thus, considering the absences of common lexical bundles across documents, we observed that, in each operating area of the pharmaceutical business unit there are some typical characteristics in relation to the type of product manufactured in the site, the processes engaged in the unit pharmaceutical operations as well as the geographic nearness relationships to the linguistic choices made by the different authors. Therefore, this study offers a contribution to the knowledge of variation in English use in preparing the Site Master File by authors allocated in a specific site. Moreover, the present study involves further research into the field of English for Specific Purposes based on corpora and into the studies of terminology
publishDate 2009
dc.date.issued.fl_str_mv 2009-07-29
dc.date.available.fl_str_mv 2015-03-23
dc.date.accessioned.fl_str_mv 2016-04-28T18:24:09Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv Mazza, Luciene Novais. Lexical bundles searching similarities in a document of pharmaceutical sector. 2009. 149 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2009.
dc.identifier.uri.fl_str_mv https://tede2.pucsp.br/handle/handle/14101
identifier_str_mv Mazza, Luciene Novais. Lexical bundles searching similarities in a document of pharmaceutical sector. 2009. 149 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2009.
url https://tede2.pucsp.br/handle/handle/14101
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Pontifícia Universidade Católica de São Paulo
dc.publisher.program.fl_str_mv Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem
dc.publisher.initials.fl_str_mv PUC-SP
dc.publisher.country.fl_str_mv BR
dc.publisher.department.fl_str_mv Lingüística
publisher.none.fl_str_mv Pontifícia Universidade Católica de São Paulo
dc.source.none.fl_str_mv reponame:Biblioteca Digital de Teses e Dissertações da PUC_SP
instname:Pontifícia Universidade Católica de São Paulo (PUC-SP)
instacron:PUC_SP
instname_str Pontifícia Universidade Católica de São Paulo (PUC-SP)
instacron_str PUC_SP
institution PUC_SP
reponame_str Biblioteca Digital de Teses e Dissertações da PUC_SP
collection Biblioteca Digital de Teses e Dissertações da PUC_SP
bitstream.url.fl_str_mv https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/3/Luciene%20Novais%20Mazza.pdf.txt
https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/1/Luciene%20Novais%20Mazza.pdf
https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/2/Luciene%20Novais%20Mazza.pdf.jpg
bitstream.checksum.fl_str_mv 464bf7a983901563b6b9ce333cd6eac0
ab60b489f57494f9b4bd86ae30c618c1
cc73c4c239a4c332d642ba1e7c7a9fb2
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP)
repository.mail.fl_str_mv bngkatende@pucsp.br||rapassi@pucsp.br
_version_ 1809277814399041536