Probability distribution in a quantitative linguistic problem
Autor(a) principal: | |
---|---|
Data de Publicação: | 2009 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Brazilian Journal of Physics |
Texto Completo: | http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-97332009000400028 |
Resumo: | In the present contribution, we propose a possible way to discuss the distributions of words in a given text. We have devoted our study to discuss some relevant properties observed in Spanish texts of Latin-American writers. We start analyzing the appearance of distributions of the frequency of occurrence in the Zipf perspective. We identify two regions of behavior separated by a special point. In order to correctly define such a point, we work beyond the Zipf law, defining other probability distribution that takes the frequency of repetition of a particular word among other different words into account. At this point, we take the linguistic problem to a statistical level. We make an effort to characterize the point of separation between two regions, via the Binder cumulant of fourth order, as it is made in the characterization of critical points in phase transitions of physical systems. |
id |
SBF-2_f38efe3a67eebaed702f0a7449ed4531 |
---|---|
oai_identifier_str |
oai:scielo:S0103-97332009000400028 |
network_acronym_str |
SBF-2 |
network_name_str |
Brazilian Journal of Physics |
repository_id_str |
|
spelling |
Probability distribution in a quantitative linguistic problemProbability distributionZipf lawBinder cumulantsIn the present contribution, we propose a possible way to discuss the distributions of words in a given text. We have devoted our study to discuss some relevant properties observed in Spanish texts of Latin-American writers. We start analyzing the appearance of distributions of the frequency of occurrence in the Zipf perspective. We identify two regions of behavior separated by a special point. In order to correctly define such a point, we work beyond the Zipf law, defining other probability distribution that takes the frequency of repetition of a particular word among other different words into account. At this point, we take the linguistic problem to a statistical level. We make an effort to characterize the point of separation between two regions, via the Binder cumulant of fourth order, as it is made in the characterization of critical points in phase transitions of physical systems.Sociedade Brasileira de Física2009-08-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-97332009000400028Brazilian Journal of Physics v.39 n.2a 2009reponame:Brazilian Journal of Physicsinstname:Sociedade Brasileira de Física (SBF)instacron:SBF10.1590/S0103-97332009000400028info:eu-repo/semantics/openAccessCalderón,F.Curilef,S.Ladrón de Guevara,M. L.eng2009-09-10T00:00:00Zoai:scielo:S0103-97332009000400028Revistahttp://www.sbfisica.org.br/v1/home/index.php/pt/ONGhttps://old.scielo.br/oai/scielo-oai.phpsbfisica@sbfisica.org.br||sbfisica@sbfisica.org.br1678-44480103-9733opendoar:2009-09-10T00:00Brazilian Journal of Physics - Sociedade Brasileira de Física (SBF)false |
dc.title.none.fl_str_mv |
Probability distribution in a quantitative linguistic problem |
title |
Probability distribution in a quantitative linguistic problem |
spellingShingle |
Probability distribution in a quantitative linguistic problem Calderón,F. Probability distribution Zipf law Binder cumulants |
title_short |
Probability distribution in a quantitative linguistic problem |
title_full |
Probability distribution in a quantitative linguistic problem |
title_fullStr |
Probability distribution in a quantitative linguistic problem |
title_full_unstemmed |
Probability distribution in a quantitative linguistic problem |
title_sort |
Probability distribution in a quantitative linguistic problem |
author |
Calderón,F. |
author_facet |
Calderón,F. Curilef,S. Ladrón de Guevara,M. L. |
author_role |
author |
author2 |
Curilef,S. Ladrón de Guevara,M. L. |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Calderón,F. Curilef,S. Ladrón de Guevara,M. L. |
dc.subject.por.fl_str_mv |
Probability distribution Zipf law Binder cumulants |
topic |
Probability distribution Zipf law Binder cumulants |
description |
In the present contribution, we propose a possible way to discuss the distributions of words in a given text. We have devoted our study to discuss some relevant properties observed in Spanish texts of Latin-American writers. We start analyzing the appearance of distributions of the frequency of occurrence in the Zipf perspective. We identify two regions of behavior separated by a special point. In order to correctly define such a point, we work beyond the Zipf law, defining other probability distribution that takes the frequency of repetition of a particular word among other different words into account. At this point, we take the linguistic problem to a statistical level. We make an effort to characterize the point of separation between two regions, via the Binder cumulant of fourth order, as it is made in the characterization of critical points in phase transitions of physical systems. |
publishDate |
2009 |
dc.date.none.fl_str_mv |
2009-08-01 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-97332009000400028 |
url |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-97332009000400028 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1590/S0103-97332009000400028 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
text/html |
dc.publisher.none.fl_str_mv |
Sociedade Brasileira de Física |
publisher.none.fl_str_mv |
Sociedade Brasileira de Física |
dc.source.none.fl_str_mv |
Brazilian Journal of Physics v.39 n.2a 2009 reponame:Brazilian Journal of Physics instname:Sociedade Brasileira de Física (SBF) instacron:SBF |
instname_str |
Sociedade Brasileira de Física (SBF) |
instacron_str |
SBF |
institution |
SBF |
reponame_str |
Brazilian Journal of Physics |
collection |
Brazilian Journal of Physics |
repository.name.fl_str_mv |
Brazilian Journal of Physics - Sociedade Brasileira de Física (SBF) |
repository.mail.fl_str_mv |
sbfisica@sbfisica.org.br||sbfisica@sbfisica.org.br |
_version_ |
1754734865189699584 |