Classifying literary genres: a methodological synergy of computational modelling and lexical semantics

Detalhes bibliográficos
Autor(a) principal: Omar, Abdulfattah
Data de Publicação: 2020
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Texto livre
Texto Completo: https://periodicos.ufmg.br/index.php/textolivre/article/view/24396
Resumo: Classifying literary genres has always been methodologically confined to philological methods and what is commonly known as Vector Space Clustering (VSC). The problem has been exasperated with the widening gap between computational theory and traditional analysis of literary texts. Towards finding a solution to this problem, the current study utilizes a synergetic approach that brings together two established methods. First, a computational model of genre classification is drawn upon for identifying concept-based, rather than word-bound, topics, where the representation of texts is secured via the ‘bag of concepts’ (BOC) model as well as the sense-restricted knowledge and meaningful links holding between and among concepts; relatedly, the two model strands of explicit semantic analysis (ESA) and ConceptNet have enacted text classification. Second, a contextual lexical semantic approach (CRUSE, 1986, 2000) is employed so that the contextual variability of word meanings and concepts can be tackled within the confines of the target literary genres classified. The findings of present study have shown that the current composite approach of computational and semantic models has resulted in improved performance in classifying literary genres, especially with respect to delineating the links between each cluster’s document-members and generalizing about their unifying genre. Further implications have emerged from the present study, namely, the benefits reserved for digital libraries and the process of archiving, where literary-text classification has proven problematic to both users and readers in many cases.
id UFMG-9_0a4a8a6891824b56ec6f8accf72bf539
oai_identifier_str oai:periodicos.ufmg.br:article/24396
network_acronym_str UFMG-9
network_name_str Texto livre
repository_id_str
spelling Classifying literary genres: a methodological synergy of computational modelling and lexical semanticsClassificação de gêneros literários: uma sinergia metodológica de modelagem computacional e semântica lexicalBolsa de conceitos (COB)ConceptNetAnálise Semântica Explícita (ASE)Classificação de gêneroConceitos de tópicosVector Space Clustering (VSC)Bag of concepts (BOC)ConceptNetExplicit Semantic Analysis (ESA)Genre classificationTopic conceptsVector Space clustering (VSC)Classifying literary genres has always been methodologically confined to philological methods and what is commonly known as Vector Space Clustering (VSC). The problem has been exasperated with the widening gap between computational theory and traditional analysis of literary texts. Towards finding a solution to this problem, the current study utilizes a synergetic approach that brings together two established methods. First, a computational model of genre classification is drawn upon for identifying concept-based, rather than word-bound, topics, where the representation of texts is secured via the ‘bag of concepts’ (BOC) model as well as the sense-restricted knowledge and meaningful links holding between and among concepts; relatedly, the two model strands of explicit semantic analysis (ESA) and ConceptNet have enacted text classification. Second, a contextual lexical semantic approach (CRUSE, 1986, 2000) is employed so that the contextual variability of word meanings and concepts can be tackled within the confines of the target literary genres classified. The findings of present study have shown that the current composite approach of computational and semantic models has resulted in improved performance in classifying literary genres, especially with respect to delineating the links between each cluster’s document-members and generalizing about their unifying genre. Further implications have emerged from the present study, namely, the benefits reserved for digital libraries and the process of archiving, where literary-text classification has proven problematic to both users and readers in many cases.A classificação de gêneros literários sempre se restringiu metodologicamente aos métodos filológicos e ao que é comumente conhecido como Vector Space Clustering (VSC). O problema foi exasperado com a crescente lacuna entre a teoria computacional e a análise tradicional de textos literários. Para encontrar uma solução para esse problema, o presente estudo utiliza uma abordagem sinérgica que reúne dois métodos estabelecidos. Primeiro, um modelo computacional de classificação de gênero é utilizado para identificar tópicos baseados em conceito, em vez de vinculados a palavras, em que a representação de textos é protegida por meio do modelo “bolsa de conceitos” (BOC), bem como o conhecimento restrito aos sentidos e os vínculos significativos entre os conceitos; De maneira semelhante, os dois modelos de análise semântica explícita (ASE) e ConceptNet promulgaram a classificação do texto. Segundo, uma abordagem semântica lexical contextual (CRUSE, 1986, 2000) é empregada para que a variabilidade contextual dos significados e conceitos das palavras possa ser abordada dentro dos limites dos gêneros literários alvo classificados. As descobertas do presente estudo mostraram que a atual abordagem composta de modelos computacionais e semânticos resultou em melhor desempenho na classificação de gêneros literários, especialmente no que diz respeito a delinear os vínculos entre os membros do documento de cada grupo e generalizar sobre seu gênero unificador. Outras implicações emergiram do presente estudo, a saber, os benefícios reservados para as bibliotecas digitais e o processo de arquivamento, em que a classificação de textos literários se mostrou problemática para usuários e leitores em muitos casos.Universidade Federal de Minas Gerais2020-07-24info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfapplication/ziphttps://periodicos.ufmg.br/index.php/textolivre/article/view/2439610.35699/1983-3652.2020.24396Texto Livre; Vol. 13 No. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101Texto Livre; Vol. 13 Núm. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101Texto Livre; Vol. 13 No 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101Texto Livre; v. 13 n. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-1011983-3652reponame:Texto livreinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGenghttps://periodicos.ufmg.br/index.php/textolivre/article/view/24396/19506https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27681https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27682https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27683https://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessOmar, Abdulfattah 2021-07-07T23:11:34Zoai:periodicos.ufmg.br:article/24396Revistahttp://www.periodicos.letras.ufmg.br/index.php/textolivrePUBhttps://periodicos.ufmg.br/index.php/textolivre/oairevistatextolivre@letras.ufmg.br1983-36521983-3652opendoar:2021-07-07T23:11:34Texto livre - Universidade Federal de Minas Gerais (UFMG)false
dc.title.none.fl_str_mv Classifying literary genres: a methodological synergy of computational modelling and lexical semantics
Classificação de gêneros literários: uma sinergia metodológica de modelagem computacional e semântica lexical
title Classifying literary genres: a methodological synergy of computational modelling and lexical semantics
spellingShingle Classifying literary genres: a methodological synergy of computational modelling and lexical semantics
Omar, Abdulfattah
Bolsa de conceitos (COB)
ConceptNet
Análise Semântica Explícita (ASE)
Classificação de gênero
Conceitos de tópicos
Vector Space Clustering (VSC)
Bag of concepts (BOC)
ConceptNet
Explicit Semantic Analysis (ESA)
Genre classification
Topic concepts
Vector Space clustering (VSC)
title_short Classifying literary genres: a methodological synergy of computational modelling and lexical semantics
title_full Classifying literary genres: a methodological synergy of computational modelling and lexical semantics
title_fullStr Classifying literary genres: a methodological synergy of computational modelling and lexical semantics
title_full_unstemmed Classifying literary genres: a methodological synergy of computational modelling and lexical semantics
title_sort Classifying literary genres: a methodological synergy of computational modelling and lexical semantics
author Omar, Abdulfattah
author_facet Omar, Abdulfattah
author_role author
dc.contributor.author.fl_str_mv Omar, Abdulfattah
dc.subject.por.fl_str_mv Bolsa de conceitos (COB)
ConceptNet
Análise Semântica Explícita (ASE)
Classificação de gênero
Conceitos de tópicos
Vector Space Clustering (VSC)
Bag of concepts (BOC)
ConceptNet
Explicit Semantic Analysis (ESA)
Genre classification
Topic concepts
Vector Space clustering (VSC)
topic Bolsa de conceitos (COB)
ConceptNet
Análise Semântica Explícita (ASE)
Classificação de gênero
Conceitos de tópicos
Vector Space Clustering (VSC)
Bag of concepts (BOC)
ConceptNet
Explicit Semantic Analysis (ESA)
Genre classification
Topic concepts
Vector Space clustering (VSC)
description Classifying literary genres has always been methodologically confined to philological methods and what is commonly known as Vector Space Clustering (VSC). The problem has been exasperated with the widening gap between computational theory and traditional analysis of literary texts. Towards finding a solution to this problem, the current study utilizes a synergetic approach that brings together two established methods. First, a computational model of genre classification is drawn upon for identifying concept-based, rather than word-bound, topics, where the representation of texts is secured via the ‘bag of concepts’ (BOC) model as well as the sense-restricted knowledge and meaningful links holding between and among concepts; relatedly, the two model strands of explicit semantic analysis (ESA) and ConceptNet have enacted text classification. Second, a contextual lexical semantic approach (CRUSE, 1986, 2000) is employed so that the contextual variability of word meanings and concepts can be tackled within the confines of the target literary genres classified. The findings of present study have shown that the current composite approach of computational and semantic models has resulted in improved performance in classifying literary genres, especially with respect to delineating the links between each cluster’s document-members and generalizing about their unifying genre. Further implications have emerged from the present study, namely, the benefits reserved for digital libraries and the process of archiving, where literary-text classification has proven problematic to both users and readers in many cases.
publishDate 2020
dc.date.none.fl_str_mv 2020-07-24
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://periodicos.ufmg.br/index.php/textolivre/article/view/24396
10.35699/1983-3652.2020.24396
url https://periodicos.ufmg.br/index.php/textolivre/article/view/24396
identifier_str_mv 10.35699/1983-3652.2020.24396
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/19506
https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27681
https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27682
https://periodicos.ufmg.br/index.php/textolivre/article/view/24396/27683
dc.rights.driver.fl_str_mv https://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by/4.0
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
application/zip
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv Texto Livre; Vol. 13 No. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101
Texto Livre; Vol. 13 Núm. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101
Texto Livre; Vol. 13 No 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101
Texto Livre; v. 13 n. 2 (2020): Texto Livre: Linguagem e Tecnologia; 83-101
1983-3652
reponame:Texto livre
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Texto livre
collection Texto livre
repository.name.fl_str_mv Texto livre - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv revistatextolivre@letras.ufmg.br
_version_ 1799711142941032448