Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico
Autor(a) principal: | |
---|---|
Data de Publicação: | 2009 |
Tipo de documento: | Dissertação |
Idioma: | por |
Título da fonte: | Biblioteca Digital de Teses e Dissertações da PUC_SP |
Texto Completo: | https://tede2.pucsp.br/handle/handle/14101 |
Resumo: | The present study explored a specific document of the pharmaceutical segment called Site Master File through the investigation of words combinations defined as lexical bundles (Biber et al.,1999). The aim of the study was to draw out the bundles so that to verify the degree of conformity of the linguistic features the use of lexical bundles may achieve, as being part of a document organized in a similar way, produced by different authors at different locations around the world. The theoretical-methodological approach was developed on the principles of Corpus Linguistics (Stubbs, 1996; Scott and Tribble, 2006; Berber Sardinha and Barbara, 2008; amongst others), an approach that makes use of a vast variety of authentic texts of language in use supported by computational tools. We compiled for this study fifteen samples of the Site Master File document stored in machine-readable form that belong to the same multinational pharmaceutical company based in Europe, which has more than a hundred of plants situated across the world. The Site Master File is a document prepared by pharmaceutical manufacturers that contains specific information about the quality assurance, the production and quality control of pharmaceutical manufacturing operations carried out at a named site/plant in order to be submitted to a regulatory authority. In addition, all documents must be officially certified in English. The analysis of the corpus data was performed to extract three-word bundles by using scripting languages such as Perl and Cygwin. Besides, a computer application was also designed to provide the cross-reference of data. The results of data analysis showed that although the samples of Site Master File bring a large range of similarity in its organization, we have not found regularity on the use of recurrent lexical bundles across the Site Master File documents. Thus, considering the absences of common lexical bundles across documents, we observed that, in each operating area of the pharmaceutical business unit there are some typical characteristics in relation to the type of product manufactured in the site, the processes engaged in the unit pharmaceutical operations as well as the geographic nearness relationships to the linguistic choices made by the different authors. Therefore, this study offers a contribution to the knowledge of variation in English use in preparing the Site Master File by authors allocated in a specific site. Moreover, the present study involves further research into the field of English for Specific Purposes based on corpora and into the studies of terminology |
id |
PUC_SP-1_afa05e5d97241629bae83246e8a9ca46 |
---|---|
oai_identifier_str |
oai:repositorio.pucsp.br:handle/14101 |
network_acronym_str |
PUC_SP-1 |
network_name_str |
Biblioteca Digital de Teses e Dissertações da PUC_SP |
repository_id_str |
|
spelling |
Ramos, Rosinda de Castro Guerrahttp://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4228809E3Mazza, Luciene Novais2016-04-28T18:24:09Z2015-03-232009-07-29Mazza, Luciene Novais. Lexical bundles searching similarities in a document of pharmaceutical sector. 2009. 149 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2009.https://tede2.pucsp.br/handle/handle/14101The present study explored a specific document of the pharmaceutical segment called Site Master File through the investigation of words combinations defined as lexical bundles (Biber et al.,1999). The aim of the study was to draw out the bundles so that to verify the degree of conformity of the linguistic features the use of lexical bundles may achieve, as being part of a document organized in a similar way, produced by different authors at different locations around the world. The theoretical-methodological approach was developed on the principles of Corpus Linguistics (Stubbs, 1996; Scott and Tribble, 2006; Berber Sardinha and Barbara, 2008; amongst others), an approach that makes use of a vast variety of authentic texts of language in use supported by computational tools. We compiled for this study fifteen samples of the Site Master File document stored in machine-readable form that belong to the same multinational pharmaceutical company based in Europe, which has more than a hundred of plants situated across the world. The Site Master File is a document prepared by pharmaceutical manufacturers that contains specific information about the quality assurance, the production and quality control of pharmaceutical manufacturing operations carried out at a named site/plant in order to be submitted to a regulatory authority. In addition, all documents must be officially certified in English. The analysis of the corpus data was performed to extract three-word bundles by using scripting languages such as Perl and Cygwin. Besides, a computer application was also designed to provide the cross-reference of data. The results of data analysis showed that although the samples of Site Master File bring a large range of similarity in its organization, we have not found regularity on the use of recurrent lexical bundles across the Site Master File documents. Thus, considering the absences of common lexical bundles across documents, we observed that, in each operating area of the pharmaceutical business unit there are some typical characteristics in relation to the type of product manufactured in the site, the processes engaged in the unit pharmaceutical operations as well as the geographic nearness relationships to the linguistic choices made by the different authors. Therefore, this study offers a contribution to the knowledge of variation in English use in preparing the Site Master File by authors allocated in a specific site. Moreover, the present study involves further research into the field of English for Specific Purposes based on corpora and into the studies of terminologyO objetivo deste trabalho foi examinar o documento Site Master File do setor farmacêutico a partir da investigação de uma combinação de palavras denominada lexical bundles (Biber et al. 1999) com o propósito de verificar o grau de conformidade com elementos lingüísticos que um documento com a mesma organização estrutural, escrita por diferentes autores em diferentes partes do mundo pode atingir. A presente pesquisa teve como principal suporte teórico e metodológico a Lingüística de Corpus (Stubbs, 1996; Scott e Tribble, 2006; Berber Sardinha e Barbara, 2008; entre outros), uma abordagem que permite investigar como a língua ocorre naturalmente no discurso por meio de ferramentas computacionais. Para esta investigação foram compilados quinze exemplares do documento Site Master File pertencente a um mesmo grupo farmacêutico multinacional com sede na Europa e com unidades de negócios espalhadas em mais de 100 países. O documento Site Master File é um conjunto de textos produzidos pelas indústrias farmacêuticas para atender as exigências de garantia e controle da qualidade dos medicamentos, a fim de se obter certificação internacional junto aos órgãos de vigilância sanitária. Ademais, todos os documentos devem ser oficialmente produzidos em língua inglesa. Para a análise dos dados foram utilizadas as linguagens de programação Perl e Cygwin, como também foi desenvolvido um aplicativo para gerar a extração dos lexical bundles de três palavras. Os resultados da análise dos dados indicaram, que embora o documento Site Master File apresente semelhanças em sua organização, não há uma regularidade de lexical bundles recorrentes entre as amostras dos quinze exemplares. Assim, dessa ausência de bundles semelhantes, foi possível observar traços característicos do tipo de negócio que cada unidade da empresa está envolvida, dos processos e produtos fabricados e, ainda, a relação da proximidade geográfica com as escolhas lingüísticas feitas pelos autores. Portanto, este estudo além de contribuir para o conhecimento das variações de uso da língua inglesa por autores de diferentes localidades na elaboração do documento Site Master File, também implica em futuras pesquisas no ensino de línguas para fins específicos baseado em corpora e nos estudos sobre terminologiaConselho Nacional de Desenvolvimento Científico e Tecnológicoapplication/pdfhttp://tede2.pucsp.br/tede/retrieve/29377/Luciene%20Novais%20Mazza.pdf.jpgporPontifícia Universidade Católica de São PauloPrograma de Estudos Pós-Graduados em Linguística Aplicada e Estudos da LinguagemPUC-SPBRLingüísticaLingüística de corpusSite Master FileCorpus (Linguistica)Lingua inglesa -- Analise do discursoLexical bundlesCorpus linguisticsCNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADAOs lexical bundles na busca por semelhanças em um documento do setor farmacêuticoLexical bundles searching similarities in a document of pharmaceutical sectorinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da PUC_SPinstname:Pontifícia Universidade Católica de São Paulo (PUC-SP)instacron:PUC_SPTEXTLuciene Novais Mazza.pdf.txtLuciene Novais Mazza.pdf.txtExtracted texttext/plain314007https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/3/Luciene%20Novais%20Mazza.pdf.txt464bf7a983901563b6b9ce333cd6eac0MD53ORIGINALLuciene Novais Mazza.pdfapplication/pdf2077236https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/1/Luciene%20Novais%20Mazza.pdfab60b489f57494f9b4bd86ae30c618c1MD51THUMBNAILLuciene Novais Mazza.pdf.jpgLuciene Novais Mazza.pdf.jpgGenerated Thumbnailimage/jpeg1943https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/2/Luciene%20Novais%20Mazza.pdf.jpgcc73c4c239a4c332d642ba1e7c7a9fb2MD52handle/141012022-04-28 02:07:23.478oai:repositorio.pucsp.br:handle/14101Biblioteca Digital de Teses e Dissertaçõeshttps://sapientia.pucsp.br/https://sapientia.pucsp.br/oai/requestbngkatende@pucsp.br||rapassi@pucsp.bropendoar:2022-04-28T05:07:23Biblioteca Digital de Teses e Dissertações da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP)false |
dc.title.por.fl_str_mv |
Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico |
dc.title.alternative.eng.fl_str_mv |
Lexical bundles searching similarities in a document of pharmaceutical sector |
title |
Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico |
spellingShingle |
Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico Mazza, Luciene Novais Lingüística de corpus Site Master File Corpus (Linguistica) Lingua inglesa -- Analise do discurso Lexical bundles Corpus linguistics CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
title_short |
Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico |
title_full |
Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico |
title_fullStr |
Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico |
title_full_unstemmed |
Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico |
title_sort |
Os lexical bundles na busca por semelhanças em um documento do setor farmacêutico |
author |
Mazza, Luciene Novais |
author_facet |
Mazza, Luciene Novais |
author_role |
author |
dc.contributor.advisor1.fl_str_mv |
Ramos, Rosinda de Castro Guerra |
dc.contributor.authorLattes.fl_str_mv |
http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4228809E3 |
dc.contributor.author.fl_str_mv |
Mazza, Luciene Novais |
contributor_str_mv |
Ramos, Rosinda de Castro Guerra |
dc.subject.por.fl_str_mv |
Lingüística de corpus Site Master File Corpus (Linguistica) Lingua inglesa -- Analise do discurso |
topic |
Lingüística de corpus Site Master File Corpus (Linguistica) Lingua inglesa -- Analise do discurso Lexical bundles Corpus linguistics CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
dc.subject.eng.fl_str_mv |
Lexical bundles Corpus linguistics |
dc.subject.cnpq.fl_str_mv |
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
description |
The present study explored a specific document of the pharmaceutical segment called Site Master File through the investigation of words combinations defined as lexical bundles (Biber et al.,1999). The aim of the study was to draw out the bundles so that to verify the degree of conformity of the linguistic features the use of lexical bundles may achieve, as being part of a document organized in a similar way, produced by different authors at different locations around the world. The theoretical-methodological approach was developed on the principles of Corpus Linguistics (Stubbs, 1996; Scott and Tribble, 2006; Berber Sardinha and Barbara, 2008; amongst others), an approach that makes use of a vast variety of authentic texts of language in use supported by computational tools. We compiled for this study fifteen samples of the Site Master File document stored in machine-readable form that belong to the same multinational pharmaceutical company based in Europe, which has more than a hundred of plants situated across the world. The Site Master File is a document prepared by pharmaceutical manufacturers that contains specific information about the quality assurance, the production and quality control of pharmaceutical manufacturing operations carried out at a named site/plant in order to be submitted to a regulatory authority. In addition, all documents must be officially certified in English. The analysis of the corpus data was performed to extract three-word bundles by using scripting languages such as Perl and Cygwin. Besides, a computer application was also designed to provide the cross-reference of data. The results of data analysis showed that although the samples of Site Master File bring a large range of similarity in its organization, we have not found regularity on the use of recurrent lexical bundles across the Site Master File documents. Thus, considering the absences of common lexical bundles across documents, we observed that, in each operating area of the pharmaceutical business unit there are some typical characteristics in relation to the type of product manufactured in the site, the processes engaged in the unit pharmaceutical operations as well as the geographic nearness relationships to the linguistic choices made by the different authors. Therefore, this study offers a contribution to the knowledge of variation in English use in preparing the Site Master File by authors allocated in a specific site. Moreover, the present study involves further research into the field of English for Specific Purposes based on corpora and into the studies of terminology |
publishDate |
2009 |
dc.date.issued.fl_str_mv |
2009-07-29 |
dc.date.available.fl_str_mv |
2015-03-23 |
dc.date.accessioned.fl_str_mv |
2016-04-28T18:24:09Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.citation.fl_str_mv |
Mazza, Luciene Novais. Lexical bundles searching similarities in a document of pharmaceutical sector. 2009. 149 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2009. |
dc.identifier.uri.fl_str_mv |
https://tede2.pucsp.br/handle/handle/14101 |
identifier_str_mv |
Mazza, Luciene Novais. Lexical bundles searching similarities in a document of pharmaceutical sector. 2009. 149 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2009. |
url |
https://tede2.pucsp.br/handle/handle/14101 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Pontifícia Universidade Católica de São Paulo |
dc.publisher.program.fl_str_mv |
Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem |
dc.publisher.initials.fl_str_mv |
PUC-SP |
dc.publisher.country.fl_str_mv |
BR |
dc.publisher.department.fl_str_mv |
Lingüística |
publisher.none.fl_str_mv |
Pontifícia Universidade Católica de São Paulo |
dc.source.none.fl_str_mv |
reponame:Biblioteca Digital de Teses e Dissertações da PUC_SP instname:Pontifícia Universidade Católica de São Paulo (PUC-SP) instacron:PUC_SP |
instname_str |
Pontifícia Universidade Católica de São Paulo (PUC-SP) |
instacron_str |
PUC_SP |
institution |
PUC_SP |
reponame_str |
Biblioteca Digital de Teses e Dissertações da PUC_SP |
collection |
Biblioteca Digital de Teses e Dissertações da PUC_SP |
bitstream.url.fl_str_mv |
https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/3/Luciene%20Novais%20Mazza.pdf.txt https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/1/Luciene%20Novais%20Mazza.pdf https://repositorio.pucsp.br/xmlui/bitstream/handle/14101/2/Luciene%20Novais%20Mazza.pdf.jpg |
bitstream.checksum.fl_str_mv |
464bf7a983901563b6b9ce333cd6eac0 ab60b489f57494f9b4bd86ae30c618c1 cc73c4c239a4c332d642ba1e7c7a9fb2 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 |
repository.name.fl_str_mv |
Biblioteca Digital de Teses e Dissertações da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP) |
repository.mail.fl_str_mv |
bngkatende@pucsp.br||rapassi@pucsp.br |
_version_ |
1809277814399041536 |