Reconhecimento de entidades nomeadas em textos de boletins de ocorrências

Araújo, Natanael da Silva

Reconhecimento de entidades nomeadas em textos de boletins de ocorrências

Detalhes bibliográficos
Autor(a) principal:	Araújo, Natanael da Silva
Data de Publicação:	2019
Tipo de documento:	Trabalho de conclusão de curso
Idioma:	por
Título da fonte:	Repositório Institucional da Universidade Federal do Ceará (UFC)
Texto Completo:	http://www.repositorio.ufc.br/handle/riufc/49711
Resumo:	Classifying entities in a given text is a challenging problem in Natural Language Processing (NLP). A common technique that deals with this problem is Named Entity Recognition (NER). Deep Learning techniques have been widely applied to NER tasks because they require little resource engineering and are free of language-specific resources, learning important resources from combinations of words or characters trained in large amounts of data. However, these techniques require a huge amount of data for training. However, these techniques require a huge amount of data for training. This work proposes Human NERD (stands for Human Named Entity Recognition with Deep learning) which addresses this problem by including humans in the loop. Human NERD is an interactive framework to assist the user in NER classification tasks from creating a massive dataset to building/maintaining a deep learning NER model. Human NERD framework allows the rapid verification of automatic named entity recognition and the correction of errors. It takes into account user corrections, and the deep learning model learns and builds upon these actions. The interface allows for rapid correction using drag and drop user actions. The present work also proposes two NER models, one using the Spacy framework and the other the Keras library, both complement each other.

Metadados do item

id	UFC-7_c17f6c54e7687e4fdf3d0400ff9bc10d
oai_identifier_str	oai:repositorio.ufc.br:riufc/49711
network_acronym_str	UFC-7
network_name_str	Repositório Institucional da Universidade Federal do Ceará (UFC)
repository_id_str
spelling	Reconhecimento de entidades nomeadas em textos de boletins de ocorrênciasProcessamento de linguagem naturalReconhecimento de entidade nomeadaAprendizagem profundaClassifying entities in a given text is a challenging problem in Natural Language Processing (NLP). A common technique that deals with this problem is Named Entity Recognition (NER). Deep Learning techniques have been widely applied to NER tasks because they require little resource engineering and are free of language-specific resources, learning important resources from combinations of words or characters trained in large amounts of data. However, these techniques require a huge amount of data for training. However, these techniques require a huge amount of data for training. This work proposes Human NERD (stands for Human Named Entity Recognition with Deep learning) which addresses this problem by including humans in the loop. Human NERD is an interactive framework to assist the user in NER classification tasks from creating a massive dataset to building/maintaining a deep learning NER model. Human NERD framework allows the rapid verification of automatic named entity recognition and the correction of errors. It takes into account user corrections, and the deep learning model learns and builds upon these actions. The interface allows for rapid correction using drag and drop user actions. The present work also proposes two NER models, one using the Spacy framework and the other the Keras library, both complement each other.Classificar entidades em um determinado texto é um problema desafiador no Processamento de Linguagem Natural. Uma técnica comum que lida com esse problema é o Reconhecimento de Entidades Nomeadas, em inglês é chamado de Named Entity Recognition (NER). As técnicas de Deep Learning são amplamente aplicadas nas tarefas NER, porque requerem pouca engenharia de recursos e estão livres de recursos específicos do idioma, aprendendo recursos importantes a partir de combinações de palavras ou caracteres treinadas em grandes quantidades de dados. No entanto, essas técnicas exigem uma enorme quantidade de dados para treinamento. Este trabalho propõe o Human Named Entity Recognition with Deep learning (Human NERD), que aborda esse problema incluindo o humano no ciclo. O Human NERD é uma ferramenta interativa para ajudar o usuário nas tarefas de classificação NER, desde a criação de um conjunto de dados massivo até a criação / manutenção de um modelo NER de Deep Learning. A estrutura Human NERD permite a verificação rápida do reconhecimento automático de entidades nomeadas e a correção de erros. Ele leva em consideração as correções do usuário, e o modelo de Deep Learning aprende e desenvolve essas ações. Sua interface gráfica (GUI) permite uma correção rápida usando ações de arrastar e soltar do usuário. O presente trabalho também propõe dois modelos NER, um utilizando o framework Spacy e o outro a biblioteca Keras, em que as predições realizadas por ambos se complementam.Silva, Ticiana Linhares Coelho daAraújo, Natanael da Silva2020-01-29T19:15:01Z2020-01-29T19:15:01Z2019info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/bachelorThesisapplication/pdfARAÚJO, Natanael da Silva. Reconhecimento de entidades nomeadas em textos de boletins de ocorrências 2019. 41 f. Trabalho de Conclusão de Curso (Graduação em Ciência da Computação)- Universidade Federal do Ceará, Campus de Quixadá, Quixadá, 2019.http://www.repositorio.ufc.br/handle/riufc/49711porreponame:Repositório Institucional da Universidade Federal do Ceará (UFC)instname:Universidade Federal do Ceará (UFC)instacron:UFCinfo:eu-repo/semantics/openAccess2020-01-29T19:15:01Zoai:repositorio.ufc.br:riufc/49711Repositório InstitucionalPUBhttp://www.repositorio.ufc.br/ri-oai/requestbu@ufc.br \|\| repositorio@ufc.bropendoar:2024-09-11T18:20:22.999283Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC)false
dc.title.none.fl_str_mv	Reconhecimento de entidades nomeadas em textos de boletins de ocorrências
title	Reconhecimento de entidades nomeadas em textos de boletins de ocorrências
spellingShingle	Reconhecimento de entidades nomeadas em textos de boletins de ocorrências Araújo, Natanael da Silva Processamento de linguagem natural Reconhecimento de entidade nomeada Aprendizagem profunda
title_short	Reconhecimento de entidades nomeadas em textos de boletins de ocorrências
title_full	Reconhecimento de entidades nomeadas em textos de boletins de ocorrências
title_fullStr	Reconhecimento de entidades nomeadas em textos de boletins de ocorrências
title_full_unstemmed	Reconhecimento de entidades nomeadas em textos de boletins de ocorrências
title_sort	Reconhecimento de entidades nomeadas em textos de boletins de ocorrências
author	Araújo, Natanael da Silva
author_facet	Araújo, Natanael da Silva
author_role	author
dc.contributor.none.fl_str_mv	Silva, Ticiana Linhares Coelho da
dc.contributor.author.fl_str_mv	Araújo, Natanael da Silva
dc.subject.por.fl_str_mv	Processamento de linguagem natural Reconhecimento de entidade nomeada Aprendizagem profunda
topic	Processamento de linguagem natural Reconhecimento de entidade nomeada Aprendizagem profunda
description	Classifying entities in a given text is a challenging problem in Natural Language Processing (NLP). A common technique that deals with this problem is Named Entity Recognition (NER). Deep Learning techniques have been widely applied to NER tasks because they require little resource engineering and are free of language-specific resources, learning important resources from combinations of words or characters trained in large amounts of data. However, these techniques require a huge amount of data for training. However, these techniques require a huge amount of data for training. This work proposes Human NERD (stands for Human Named Entity Recognition with Deep learning) which addresses this problem by including humans in the loop. Human NERD is an interactive framework to assist the user in NER classification tasks from creating a massive dataset to building/maintaining a deep learning NER model. Human NERD framework allows the rapid verification of automatic named entity recognition and the correction of errors. It takes into account user corrections, and the deep learning model learns and builds upon these actions. The interface allows for rapid correction using drag and drop user actions. The present work also proposes two NER models, one using the Spacy framework and the other the Keras library, both complement each other.
publishDate	2019
dc.date.none.fl_str_mv	2019 2020-01-29T19:15:01Z 2020-01-29T19:15:01Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/bachelorThesis
format	bachelorThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	ARAÚJO, Natanael da Silva. Reconhecimento de entidades nomeadas em textos de boletins de ocorrências 2019. 41 f. Trabalho de Conclusão de Curso (Graduação em Ciência da Computação)- Universidade Federal do Ceará, Campus de Quixadá, Quixadá, 2019. http://www.repositorio.ufc.br/handle/riufc/49711
identifier_str_mv	ARAÚJO, Natanael da Silva. Reconhecimento de entidades nomeadas em textos de boletins de ocorrências 2019. 41 f. Trabalho de Conclusão de Curso (Graduação em Ciência da Computação)- Universidade Federal do Ceará, Campus de Quixadá, Quixadá, 2019.
url	http://www.repositorio.ufc.br/handle/riufc/49711
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.source.none.fl_str_mv	reponame:Repositório Institucional da Universidade Federal do Ceará (UFC) instname:Universidade Federal do Ceará (UFC) instacron:UFC
instname_str	Universidade Federal do Ceará (UFC)
instacron_str	UFC
institution	UFC
reponame_str	Repositório Institucional da Universidade Federal do Ceará (UFC)
collection	Repositório Institucional da Universidade Federal do Ceará (UFC)
repository.name.fl_str_mv	Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC)
repository.mail.fl_str_mv	bu@ufc.br \|\| repositorio@ufc.br
_version_	1813028761899106304

Reconhecimento de entidades nomeadas em textos de boletins de ocorrências

Registros relacionados