Classifying literary genres: a methodological synergy of computational modelling and lexical semantics
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Texto livre |
Texto Completo: | https://periodicos.ufmg.br/index.php/textolivre/article/view/24396 |
Resumo: | Classifying literary genres has always been methodologically confined to philological methods and what is commonly known as Vector Space Clustering (VSC). The problem has been exasperated with the widening gap between computational theory and traditional analysis of literary texts. Towards finding a solution to this problem, the current study utilizes a synergetic approach that brings together two established methods. First, a computational model of genre classification is drawn upon for identifying concept-based, rather than word-bound, topics, where the representation of texts is secured via the ‘bag of concepts’ (BOC) model as well as the sense-restricted knowledge and meaningful links holding between and among concepts; relatedly, the two model strands of explicit semantic analysis (ESA) and ConceptNet have enacted text classification. Second, a contextual lexical semantic approach (CRUSE, 1986, 2000) is employed so that the contextual variability of word meanings and concepts can be tackled within the confines of the target literary genres classified. The findings of present study have shown that the current composite approach of computational and semantic models has resulted in improved performance in classifying literary genres, especially with respect to delineating the links between each cluster’s document-members and generalizing about their unifying genre. Further implications have emerged from the present study, namely, the benefits reserved for digital libraries and the process of archiving, where literary-text classification has proven problematic to both users and readers in many cases. |
id |
UFMG-9_0a4a8a6891824b56ec6f8accf72bf539 |
---|---|
oai_identifier_str |
oai:periodicos.ufmg.br:article/24396 |
network_acronym_str |
UFMG-9 |
network_name_str |
Texto livre |
repository_id_str |
|
spelling |
Classifying literary genres: a methodological synergy of computational modelling and lexical semanticsClassificação de gêneros literários: uma sinergia metodológica de modelagem computacional e semântica lexicalBolsa de conceitos (COB)ConceptNetAnálise Semântica Explícita (ASE)Classificação de gêneroConceitos de tópicosVector Space Clustering (VSC)Bag of concepts (BOC)ConceptNetExplicit Semantic Analysis (ESA)Genre classificationTopic conceptsVector Space clustering (VSC)Classifying literary genres has always been methodologically confined to philological methods and what is commonly known as Vector Space Clustering (VSC). The problem has been exasperated with the widening gap between computational theory and traditional analysis of literary texts. Towards finding a solution to this problem, the current study utilizes a synergetic approach that brings together two established methods. First, a computational model of genre classification is drawn upon for identifying concept-based, rather than word-bound, topics, where the representation of texts is secured via the ‘bag of concepts’ (BOC) model as well as the sense-restricted knowledge and meaningful links holding between and among concepts; relatedly, the two model strands of explicit semantic analysis (ESA) and ConceptNet have enacted text classification. Second, a contextual lexical semantic approach (CRUSE, 1986, 2000) is employed so that the contextual variability of word meanings and concepts can be tackled within the confines of the target literary genres classified. The findings of present study have shown that the current composite approach of computational and semantic models has resulted in improved performance in classifying literary genres, especially with respect to delineating the links between each cluster’s document-members and generalizing about their unifying genre. Further implications have emerged from the present study, namely, the benefits reserved for digital libraries and the process of archiving, where literary-text classification has proven problematic to both users and readers in many cases.A classificação de gêneros literários sempre se restringiu metodologicamente aos métodos filológicos e ao que é comumente conhecido como Vector Space Clustering (VSC). O problema foi exasperado com a crescente lacuna entre a teoria computacional e a análise tradicional de textos literários. Para encontrar uma solução para esse problema, o presente estudo utiliza uma abordagem sinérgica que reúne dois métodos estabelecidos. Primeiro, um modelo computacional de classificação de gênero é utilizado para identificar tópicos baseados em conceito, em vez de vinculados a palavras, em que a representação de textos é protegida por meio do modelo “bolsa de conceitos” (BOC), bem como o conhecimento restrito aos sentidos e os vínculos significativos entre os conceitos; De maneira semelhante, os dois modelos de análise semântica explícita (ASE) e ConceptNet promulgaram a classificação do texto. Segundo, uma abordagem semântica lexical contextual (CRUSE, 1986, 2000) é empregada para que a variabilidade contextual dos significados e conceitos das palavras possa ser abordada dentro dos limites dos gêneros literários alvo classificados. As descobertas do presente estudo mostraram que a atual abordagem composta de modelos computacionais e semânticos resultou em melhor desempenho na classificação de gêneros literários, especialmente no que diz respeito a delinear os vínculos entre os membros do documento de cada grupo e generalizar sobre seu gênero unificador. Outras implicações emergiram do presente estudo, a saber, os benefícios reservados para as bibliotecas digitais e o processo de arquivamento, em que a classificação de textos literários se mostrou problemática para usuários e leitores em muitos casos.Universidade Federal de Minas Gerais2020-07-24info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfapplication/ziphttps://periodicos.ufmg.br/index.php/textolivre/article/view/2439610.35699/1983-3652.2020.24396Texto Livre; Vol. 13 No. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101Texto Livre; Vol. 13 Núm. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101Texto Livre; Vol. 13 No 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101Texto Livre; v. 13 n. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-1011983-3652reponame:Texto livreinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGenghttps://periodicos.ufmg.br/index.php/textolivre/article/view/24396/19506https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27681https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27682https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27683https://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessOmar, Abdulfattah 2021-07-07T23:11:34Zoai:periodicos.ufmg.br:article/24396Revistahttp://www.periodicos.letras.ufmg.br/index.php/textolivrePUBhttps://periodicos.ufmg.br/index.php/textolivre/oairevistatextolivre@letras.ufmg.br1983-36521983-3652opendoar:2021-07-07T23:11:34Texto livre - Universidade Federal de Minas Gerais (UFMG)false |
dc.title.none.fl_str_mv |
Classifying literary genres: a methodological synergy of computational modelling and lexical semantics Classificação de gêneros literários: uma sinergia metodológica de modelagem computacional e semântica lexical |
title |
Classifying literary genres: a methodological synergy of computational modelling and lexical semantics |
spellingShingle |
Classifying literary genres: a methodological synergy of computational modelling and lexical semantics Omar, Abdulfattah Bolsa de conceitos (COB) ConceptNet Análise Semântica Explícita (ASE) Classificação de gênero Conceitos de tópicos Vector Space Clustering (VSC) Bag of concepts (BOC) ConceptNet Explicit Semantic Analysis (ESA) Genre classification Topic concepts Vector Space clustering (VSC) |
title_short |
Classifying literary genres: a methodological synergy of computational modelling and lexical semantics |
title_full |
Classifying literary genres: a methodological synergy of computational modelling and lexical semantics |
title_fullStr |
Classifying literary genres: a methodological synergy of computational modelling and lexical semantics |
title_full_unstemmed |
Classifying literary genres: a methodological synergy of computational modelling and lexical semantics |
title_sort |
Classifying literary genres: a methodological synergy of computational modelling and lexical semantics |
author |
Omar, Abdulfattah |
author_facet |
Omar, Abdulfattah |
author_role |
author |
dc.contributor.author.fl_str_mv |
Omar, Abdulfattah |
dc.subject.por.fl_str_mv |
Bolsa de conceitos (COB) ConceptNet Análise Semântica Explícita (ASE) Classificação de gênero Conceitos de tópicos Vector Space Clustering (VSC) Bag of concepts (BOC) ConceptNet Explicit Semantic Analysis (ESA) Genre classification Topic concepts Vector Space clustering (VSC) |
topic |
Bolsa de conceitos (COB) ConceptNet Análise Semântica Explícita (ASE) Classificação de gênero Conceitos de tópicos Vector Space Clustering (VSC) Bag of concepts (BOC) ConceptNet Explicit Semantic Analysis (ESA) Genre classification Topic concepts Vector Space clustering (VSC) |
description |
Classifying literary genres has always been methodologically confined to philological methods and what is commonly known as Vector Space Clustering (VSC). The problem has been exasperated with the widening gap between computational theory and traditional analysis of literary texts. Towards finding a solution to this problem, the current study utilizes a synergetic approach that brings together two established methods. First, a computational model of genre classification is drawn upon for identifying concept-based, rather than word-bound, topics, where the representation of texts is secured via the ‘bag of concepts’ (BOC) model as well as the sense-restricted knowledge and meaningful links holding between and among concepts; relatedly, the two model strands of explicit semantic analysis (ESA) and ConceptNet have enacted text classification. Second, a contextual lexical semantic approach (CRUSE, 1986, 2000) is employed so that the contextual variability of word meanings and concepts can be tackled within the confines of the target literary genres classified. The findings of present study have shown that the current composite approach of computational and semantic models has resulted in improved performance in classifying literary genres, especially with respect to delineating the links between each cluster’s document-members and generalizing about their unifying genre. Further implications have emerged from the present study, namely, the benefits reserved for digital libraries and the process of archiving, where literary-text classification has proven problematic to both users and readers in many cases. |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020-07-24 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://periodicos.ufmg.br/index.php/textolivre/article/view/24396 10.35699/1983-3652.2020.24396 |
url |
https://periodicos.ufmg.br/index.php/textolivre/article/view/24396 |
identifier_str_mv |
10.35699/1983-3652.2020.24396 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/19506 https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27681 https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27682 https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27683 |
dc.rights.driver.fl_str_mv |
https://creativecommons.org/licenses/by/4.0 info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
https://creativecommons.org/licenses/by/4.0 |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf application/zip |
dc.publisher.none.fl_str_mv |
Universidade Federal de Minas Gerais |
publisher.none.fl_str_mv |
Universidade Federal de Minas Gerais |
dc.source.none.fl_str_mv |
Texto Livre; Vol. 13 No. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101 Texto Livre; Vol. 13 Núm. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101 Texto Livre; Vol. 13 No 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101 Texto Livre; v. 13 n. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101 1983-3652 reponame:Texto livre instname:Universidade Federal de Minas Gerais (UFMG) instacron:UFMG |
instname_str |
Universidade Federal de Minas Gerais (UFMG) |
instacron_str |
UFMG |
institution |
UFMG |
reponame_str |
Texto livre |
collection |
Texto livre |
repository.name.fl_str_mv |
Texto livre - Universidade Federal de Minas Gerais (UFMG) |
repository.mail.fl_str_mv |
revistatextolivre@letras.ufmg.br |
_version_ |
1799711142941032448 |