Evaluation of performance of a software of automatic sumarization of texts

Detalhes bibliográficos
Autor(a) principal: Tabosa, Hamilton Rodrigues
Data de Publicação: 2020
Outros Autores: Souza, Osvaldo de, Cândido, José Carlos dos Santos, Melo, Ana Cristina Azevedo Ursulino, Reis, Keila Giullianna Braga
Tipo de documento: Artigo
Idioma: por
Título da fonte: Informação & Informação
Texto Completo: https://ojs.uel.br/revistas/uel/index.php/informacao/article/view/35928
Resumo: Intrudoction: Since 2014 we have developed a research to produce a software (prototype) that would be able to elaborate automatic summaries of texts based on techniques of Natural Language Processing and frequency statistics of words. The first empirical tests of the tool generated results that indicated a significant reduction of the dimensionality of the texts, with considerable preservation of their semantic value. Objective: In this article, we present the results of the continuity of our investigative work, based on a human evaluation of the quality of these abstracts from blind tests. Metodology: A group of three librarians received a mixed and unidentified block of abstracts - produced by humans and the automatic abstracts made by the software - and carried out an evaluation, according to the criteria of grammatical correctness, preservation of central ideas, coherence and readability, extension of abstract, whether there was paraphrase or copy of fragments, and if there was introduction of ideas not contained in the original text. Results: The results showed that in four of the five evaluation criteria adopted, there was a qualitative equivalence between the abstracts produced by humans and those produced by the software, which seems to represent a relative success since the prototype could replace a person in the resume activity texts without leaving anything to be desired, except in the fifth evaluation center, referring to the dimension of the abstract, in which the text produced by the software was pointed out as extensive beyond what was necessary. Conclusions: Despite the good results of the prototype, we realized the need for improvements in its performance, as well as to evaluate it by more comprehensive methods, from more representative samples and by a larger group of evaluators.
id UEL-8_647bdf4a9e1f3befb5d10b8a51ca2380
oai_identifier_str oai:ojs.pkp.sfu.ca:article/35928
network_acronym_str UEL-8
network_name_str Informação & Informação
repository_id_str
spelling Evaluation of performance of a software of automatic sumarization of textsEvaluación del desempeño de un software de resumen de texto automáticoAvaliação do desempenho de um software de sumarização automática de textosAutomatic Summarization of TextsAccess to InformationNatural Language ProcessingMediation (Practice)Sumarización Automática de TextosAcceso a la InformaciónProcesamiento del Lenguaje NaturalMediación (Práctica)Sumarização Automática de TextosAcesso à InformaçãoProcessamento da Linguagem NaturalMediação (Prática)Intrudoction: Since 2014 we have developed a research to produce a software (prototype) that would be able to elaborate automatic summaries of texts based on techniques of Natural Language Processing and frequency statistics of words. The first empirical tests of the tool generated results that indicated a significant reduction of the dimensionality of the texts, with considerable preservation of their semantic value. Objective: In this article, we present the results of the continuity of our investigative work, based on a human evaluation of the quality of these abstracts from blind tests. Metodology: A group of three librarians received a mixed and unidentified block of abstracts - produced by humans and the automatic abstracts made by the software - and carried out an evaluation, according to the criteria of grammatical correctness, preservation of central ideas, coherence and readability, extension of abstract, whether there was paraphrase or copy of fragments, and if there was introduction of ideas not contained in the original text. Results: The results showed that in four of the five evaluation criteria adopted, there was a qualitative equivalence between the abstracts produced by humans and those produced by the software, which seems to represent a relative success since the prototype could replace a person in the resume activity texts without leaving anything to be desired, except in the fifth evaluation center, referring to the dimension of the abstract, in which the text produced by the software was pointed out as extensive beyond what was necessary. Conclusions: Despite the good results of the prototype, we realized the need for improvements in its performance, as well as to evaluate it by more comprehensive methods, from more representative samples and by a larger group of evaluators.Introduccion: Desde 2014 desarrollamos una investigación con el fin de producir un software (prototipo) que sería capaz de elaborar resúmenes automáticos de textos basados en técnicas de Procesamiento de Lenguaje Natural y estadísticas de frecuencia de palabras. Las primeras pruebas empíricas de la herramienta generaron resultados que indicaron una significativa reducción de la dimensionalidad de los textos, con considerable preservación de su valor semántico. Objetivos: En este artículo, presentamos los resultados de la continuidad de nuestro trabajo investigativo, a partir de una evaluación humana de la calidad de esos resúmenes a partir de la realización de pruebas ciegos. Metodología: Un grupo de tres bibliotecarios recibió un bloque mixto y no identificado de resúmenes - producidos por humanos y los resúmenes automáticos hechos por el software - y procedió a una evaluación, según los criterios de corrección gramatical, preservación de las ideas centrales, coherencia y legibilidad, en resumen, si hubo paráfrasis o copia de fragmentos y, si hubo introducción de ideas no contenidas en el texto original. Resultados: Los resultados mostraron que en cuatro de los cinco criterios de evaluación adoptados, hubo equivalencia cualitativa entre los resúmenes producidos por humanos y los producidos por el software, lo que parece representar un relativo éxito, ya que el prototipo podría sustituir a una persona en la actividad de resumir los textos sin dejar a desear, a no ser en el quinto creatorio de evaluación, referente al tamaño del resumen, en que el texto producido por el software fue señalado como extenso más allá de lo necesario. Conclusiones: a pesar de los buenos resultados del prototipo, nos dimos cuenta de la necesidad de mejorar su rendimiento, además de evaluarlo con métodos más completos, de muestras más representativas y de un grupo más grande de evaluadores.Introdução: Desde 2014 desenvolvemos uma pesquisa com o intuito de produzir um software (protótipo) que seria capaz de elaborar resumos automáticos de textos baseado em técnicas de Processamento de Linguagem Natural e estatísticas de frequência de palavras. Os primeiros testes da ferramenta geraram resultados que indicaram uma significativa redução da dimensionalidade dos textos, com considerável preservação do seu valor semântico. Objetivo: Neste artigo, apresentamos os resultados da continuidade do nosso trabalho investigativo, a partir de uma avaliação humana da qualidade desses resumos baseada na realização de testes cegos. Metodologia: Um grupo de três bibliotecárias recebeu um bloco misto e não identificado de resumos - produzidos por humanos e os resumos automáticos feitos pelo software - e procedeu a uma avaliação, segundo os critérios de corretude gramatical, preservação das ideias centrais, coerência e legibilidade, extensão do resumo, se houve paráfrase ou cópia de fragmentos e, se houve introdução de ideias não contidas no texto original. Resultados: Os resultados mostraram que em quatro, dos cinco critérios de avaliação adotados, houve equivalência qualitativa entre os resumos produzidos por humanos e os produzidos pelo software, o que parece representar um relativo sucesso, uma vez que o protótipo poderia substituir uma pessoa na atividade de resumir textos sem deixar a desejar, a não ser no quinto critério de avaliação, referente à dimensão do resumo, em que o texto produzido pelo software foi apontado como extenso além do necessário. Conclusões: Apesar dos bons resultados do protótipo, percebemos a necessidade de melhorias em seu desempenho, além de avaliá-lo por métodos mais abrangentes, a partir de amostras mais representativas e por um grupo maior de avaliadores.Universidade Estadual de Londrina2020-04-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://ojs.uel.br/revistas/uel/index.php/informacao/article/view/3592810.5433/1981-8920.2020v25n1p189Informação & Informação; v. 25 n. 1 (2020); 189-2101981-8920reponame:Informação & Informaçãoinstname:Universidade Estadual de Londrina (UEL)instacron:UELporhttps://ojs.uel.br/revistas/uel/index.php/informacao/article/view/35928/pdfCopyright (c) 2021 Informação & Informaçãoinfo:eu-repo/semantics/openAccessTabosa, Hamilton RodriguesSouza, Osvaldo deCândido, José Carlos dos SantosMelo, Ana Cristina Azevedo UrsulinoReis, Keila Giullianna Braga2021-04-05T19:00:37Zoai:ojs.pkp.sfu.ca:article/35928Revistahttps://www.uel.br/revistas/uel/index.php/informacao/indexPUBhttps://www.uel.br/revistas/uel/index.php/informacao/oai||infoeinfo@uel.br10.5433/1981-89201981-89201414-2139opendoar:2021-04-05T19:00:37Informação & Informação - Universidade Estadual de Londrina (UEL)false
dc.title.none.fl_str_mv Evaluation of performance of a software of automatic sumarization of texts
Evaluación del desempeño de un software de resumen de texto automático
Avaliação do desempenho de um software de sumarização automática de textos
title Evaluation of performance of a software of automatic sumarization of texts
spellingShingle Evaluation of performance of a software of automatic sumarization of texts
Tabosa, Hamilton Rodrigues
Automatic Summarization of Texts
Access to Information
Natural Language Processing
Mediation (Practice)
Sumarización Automática de Textos
Acceso a la Información
Procesamiento del Lenguaje Natural
Mediación (Práctica)
Sumarização Automática de Textos
Acesso à Informação
Processamento da Linguagem Natural
Mediação (Prática)
title_short Evaluation of performance of a software of automatic sumarization of texts
title_full Evaluation of performance of a software of automatic sumarization of texts
title_fullStr Evaluation of performance of a software of automatic sumarization of texts
title_full_unstemmed Evaluation of performance of a software of automatic sumarization of texts
title_sort Evaluation of performance of a software of automatic sumarization of texts
author Tabosa, Hamilton Rodrigues
author_facet Tabosa, Hamilton Rodrigues
Souza, Osvaldo de
Cândido, José Carlos dos Santos
Melo, Ana Cristina Azevedo Ursulino
Reis, Keila Giullianna Braga
author_role author
author2 Souza, Osvaldo de
Cândido, José Carlos dos Santos
Melo, Ana Cristina Azevedo Ursulino
Reis, Keila Giullianna Braga
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Tabosa, Hamilton Rodrigues
Souza, Osvaldo de
Cândido, José Carlos dos Santos
Melo, Ana Cristina Azevedo Ursulino
Reis, Keila Giullianna Braga
dc.subject.por.fl_str_mv Automatic Summarization of Texts
Access to Information
Natural Language Processing
Mediation (Practice)
Sumarización Automática de Textos
Acceso a la Información
Procesamiento del Lenguaje Natural
Mediación (Práctica)
Sumarização Automática de Textos
Acesso à Informação
Processamento da Linguagem Natural
Mediação (Prática)
topic Automatic Summarization of Texts
Access to Information
Natural Language Processing
Mediation (Practice)
Sumarización Automática de Textos
Acceso a la Información
Procesamiento del Lenguaje Natural
Mediación (Práctica)
Sumarização Automática de Textos
Acesso à Informação
Processamento da Linguagem Natural
Mediação (Prática)
description Intrudoction: Since 2014 we have developed a research to produce a software (prototype) that would be able to elaborate automatic summaries of texts based on techniques of Natural Language Processing and frequency statistics of words. The first empirical tests of the tool generated results that indicated a significant reduction of the dimensionality of the texts, with considerable preservation of their semantic value. Objective: In this article, we present the results of the continuity of our investigative work, based on a human evaluation of the quality of these abstracts from blind tests. Metodology: A group of three librarians received a mixed and unidentified block of abstracts - produced by humans and the automatic abstracts made by the software - and carried out an evaluation, according to the criteria of grammatical correctness, preservation of central ideas, coherence and readability, extension of abstract, whether there was paraphrase or copy of fragments, and if there was introduction of ideas not contained in the original text. Results: The results showed that in four of the five evaluation criteria adopted, there was a qualitative equivalence between the abstracts produced by humans and those produced by the software, which seems to represent a relative success since the prototype could replace a person in the resume activity texts without leaving anything to be desired, except in the fifth evaluation center, referring to the dimension of the abstract, in which the text produced by the software was pointed out as extensive beyond what was necessary. Conclusions: Despite the good results of the prototype, we realized the need for improvements in its performance, as well as to evaluate it by more comprehensive methods, from more representative samples and by a larger group of evaluators.
publishDate 2020
dc.date.none.fl_str_mv 2020-04-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://ojs.uel.br/revistas/uel/index.php/informacao/article/view/35928
10.5433/1981-8920.2020v25n1p189
url https://ojs.uel.br/revistas/uel/index.php/informacao/article/view/35928
identifier_str_mv 10.5433/1981-8920.2020v25n1p189
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://ojs.uel.br/revistas/uel/index.php/informacao/article/view/35928/pdf
dc.rights.driver.fl_str_mv Copyright (c) 2021 Informação & Informação
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2021 Informação & Informação
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Estadual de Londrina
publisher.none.fl_str_mv Universidade Estadual de Londrina
dc.source.none.fl_str_mv Informação & Informação; v. 25 n. 1 (2020); 189-210
1981-8920
reponame:Informação & Informação
instname:Universidade Estadual de Londrina (UEL)
instacron:UEL
instname_str Universidade Estadual de Londrina (UEL)
instacron_str UEL
institution UEL
reponame_str Informação & Informação
collection Informação & Informação
repository.name.fl_str_mv Informação & Informação - Universidade Estadual de Londrina (UEL)
repository.mail.fl_str_mv ||infoeinfo@uel.br
_version_ 1799305985622278144