Dados linguísticos e iniciativa Linking Open Data

Botácio, Andrieli Cristina

Dados linguísticos e iniciativa Linking Open Data

Detalhes bibliográficos
Autor(a) principal:	Botácio, Andrieli Cristina
Data de Publicação:	2020
Tipo de documento:	Trabalho de conclusão de curso
Idioma:	por
Título da fonte:	Repositório Institucional da UFSCAR
Texto Completo:	https://repositorio.ufscar.br/handle/ufscar/14181
Resumo:	Tim Berners-Lee proposed the Semantic Web for better information retrieval. In this context, there are the Linked Data principles through which the data connection on the Web is established which the main objective is to generate meaning to the Web pages. And, by utilizing, this causes with which the software agents and people could cooperate with each other to reach their goals in an efficient manner. The project consists of working with an initiative that is an example of the application of Linked Open Data, which is the Linking Open Data (LOD), which brings data published in linked data format. The research will focus on describing and analyzing the linguistic datasets present in this initiative. From this point, it is incited as a research problem: what is identified in the links and linguistic datasets in the Linking Open Data initiative? Focusing on this problem, the objective is to map the datasets corresponding to the ‘Linguistics’ category inserted in the Linking Open Data initiative. It is an exploratory and qualitative research and of a theoretical-applied nature, addressing as main theme the mapping of linguistic datasets in Linking Open Data. First, a theoretical and practical investigation was carried out on the identification of the linguistic data sets and the technologies used in the connection of these data; after the investigation was directed to the analysis of the Linguistics category of the Linking Open Data initiative. The results obtained show that types of datasets and technologies of the Semantic Web are found in each of the seven categories of linguistic data: Corpora; Lexicons and Dictionaries; Terminologies, Thesauri and Knowledge Bases; Linguistic Resource Metadata; Linguistic Data Categories; Typological Databases; Other. It is concluded that the Linking Open Data initiative satisfactorily fulfills its function, showing the viability of the open data connection, through the prescribed technologies. As for the linguistic data of such an initiative, it is noted that they are extremely relevant and employs the technologies according required in each category.

Metadados do item

id	SCAR_0e9c23aa5218cc01bcb86f9aa77b4e24
oai_identifier_str	oai:repositorio.ufscar.br:ufscar/14181
network_acronym_str	SCAR
network_name_str	Repositório Institucional da UFSCAR
repository_id_str	4322
spelling	Botácio, Andrieli CristinaArakaki, Ana Carolina Simionatohttp://lattes.cnpq.br/9896600626524397http://lattes.cnpq.br/1531142132127667cc4f37ed-dacd-41fd-b3a9-37c1fdfe305d2021-04-27T13:47:53Z2021-04-27T13:47:53Z2020-12-09BOTÁCIO, Andrieli Cristina. Dados linguísticos e iniciativa Linking Open Data. 2020. Trabalho de Conclusão de Curso (Graduação em Biblioteconomia e Ciência da Informação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/14181.https://repositorio.ufscar.br/handle/ufscar/14181Tim Berners-Lee proposed the Semantic Web for better information retrieval. In this context, there are the Linked Data principles through which the data connection on the Web is established which the main objective is to generate meaning to the Web pages. And, by utilizing, this causes with which the software agents and people could cooperate with each other to reach their goals in an efficient manner. The project consists of working with an initiative that is an example of the application of Linked Open Data, which is the Linking Open Data (LOD), which brings data published in linked data format. The research will focus on describing and analyzing the linguistic datasets present in this initiative. From this point, it is incited as a research problem: what is identified in the links and linguistic datasets in the Linking Open Data initiative? Focusing on this problem, the objective is to map the datasets corresponding to the ‘Linguistics’ category inserted in the Linking Open Data initiative. It is an exploratory and qualitative research and of a theoretical-applied nature, addressing as main theme the mapping of linguistic datasets in Linking Open Data. First, a theoretical and practical investigation was carried out on the identification of the linguistic data sets and the technologies used in the connection of these data; after the investigation was directed to the analysis of the Linguistics category of the Linking Open Data initiative. The results obtained show that types of datasets and technologies of the Semantic Web are found in each of the seven categories of linguistic data: Corpora; Lexicons and Dictionaries; Terminologies, Thesauri and Knowledge Bases; Linguistic Resource Metadata; Linguistic Data Categories; Typological Databases; Other. It is concluded that the Linking Open Data initiative satisfactorily fulfills its function, showing the viability of the open data connection, through the prescribed technologies. As for the linguistic data of such an initiative, it is noted that they are extremely relevant and employs the technologies according required in each category.Tim Berners-Lee propôs a Web Semântica para uma melhor recuperação da informação. Nesse contexto encontram-se os princípios Linked Data por meio dos quais se estabelece a conexão de dados na Web, fazendo com que os agentes de software e as pessoas possam trabalhar de maneira cooperativa, alcançando os objetivos de modo eficiente. Pensando nisso, foi analisada uma iniciativa que é um exemplo da aplicação do Linked Open Data, que é o Linking Open Data (LOD), que traz dados publicados em formato de dados ligados. A pesquisa se dedica a descrever e analisar os datasets linguísticos presentes nessa iniciativa. Desse ponto, incita-se como problemática de pesquisa: o que é identificado dentro das ligações e nos datasets linguísticos na iniciativa Linking Open Data? Tendo como foco tal problemática, o objetivo é mapear os datasets correspondentes à categoria ‘Linguística’ inseridos na iniciativa Linking Open Data. Trata-se de uma pesquisa exploratória e qualitativa e de natureza teórico-aplicada, abordando como tema principal o mapeamento dos datasets linguísticos no Linking Open Data. Primeiramente, fez-se uma investigação teórica e prática acerca da identificação dos datasets linguísticos e as tecnologias empregadas na ligação desses dados; depois a investigação foi direcionada para a análise da categoria Linguística da iniciativa Linking Open Data. Os resultados obtidos mostram quais tipos de datasets e tecnologias da Web Semântica encontram-se em cada uma das sete categorias de dados linguísticos: Corpora; Lexicons and Dictionaries; Terminologies, Thesauri and Knowledge Bases; Linguistic Resource Metadata; Linguistic Data Categories; Typological Databases; Other. Conclui-se que a iniciativa Linking Open Data cumpre de modo satisfatório a sua função, mostrando a viabilidade da conexão de dados abertos, por meio das tecnologias prescritas. Quanto aos dados linguísticos de tal iniciativa, nota-se que são de extrema relevância e empregam as tecnologias conforme o que se exige em cada categoria.Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)18/05101-3porUniversidade Federal de São CarlosCâmpus São CarlosBiblioteconomia e Ciência da Informação - BCIUFSCarAttribution-NonCommercial-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessWeb SemânticaLinked Open DataDados linguísticosLinked dataCIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAODados linguísticos e iniciativa Linking Open DataLinguistic data and Linking Open Data initiativeinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/bachelorThesis60060022bff02c-fafd-42b1-8acf-ec52a84bd73creponame:Repositório Institucional da UFSCARinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCARORIGINALTCC_Andrieli corrigido.pdfTCC_Andrieli corrigido.pdfTCCapplication/pdf1333377https://repositorio.ufscar.br/bitstream/ufscar/14181/1/TCC_Andrieli%20corrigido.pdfebeea5e850c842d7c1f43fd3f408b1feMD51CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8811https://repositorio.ufscar.br/bitstream/ufscar/14181/2/license_rdfe39d27027a6cc9cb039ad269a5db8e34MD52TEXTTCC_Andrieli corrigido.pdf.txtTCC_Andrieli corrigido.pdf.txtExtracted texttext/plain136836https://repositorio.ufscar.br/bitstream/ufscar/14181/3/TCC_Andrieli%20corrigido.pdf.txt78ed4bf0a02990ac5fc64d0c470e24c1MD53THUMBNAILTCC_Andrieli corrigido.pdf.jpgTCC_Andrieli corrigido.pdf.jpgIM Thumbnailimage/jpeg4673https://repositorio.ufscar.br/bitstream/ufscar/14181/4/TCC_Andrieli%20corrigido.pdf.jpg809fe5399982b6e959c7bb9b5a05a566MD54ufscar/141812023-09-18 18:32:09.775oai:repositorio.ufscar.br:ufscar/14181Repositório InstitucionalPUBhttps://repositorio.ufscar.br/oai/requestopendoar:43222023-09-18T18:32:09Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)false
dc.title.por.fl_str_mv	Dados linguísticos e iniciativa Linking Open Data
dc.title.alternative.por.fl_str_mv	Linguistic data and Linking Open Data initiative
title	Dados linguísticos e iniciativa Linking Open Data
spellingShingle	Dados linguísticos e iniciativa Linking Open Data Botácio, Andrieli Cristina Web Semântica Linked Open Data Dados linguísticos Linked data CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO
title_short	Dados linguísticos e iniciativa Linking Open Data
title_full	Dados linguísticos e iniciativa Linking Open Data
title_fullStr	Dados linguísticos e iniciativa Linking Open Data
title_full_unstemmed	Dados linguísticos e iniciativa Linking Open Data
title_sort	Dados linguísticos e iniciativa Linking Open Data
author	Botácio, Andrieli Cristina
author_facet	Botácio, Andrieli Cristina
author_role	author
dc.contributor.authorlattes.por.fl_str_mv	http://lattes.cnpq.br/1531142132127667
dc.contributor.author.fl_str_mv	Botácio, Andrieli Cristina
dc.contributor.advisor1.fl_str_mv	Arakaki, Ana Carolina Simionato
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/9896600626524397
dc.contributor.authorID.fl_str_mv	cc4f37ed-dacd-41fd-b3a9-37c1fdfe305d
contributor_str_mv	Arakaki, Ana Carolina Simionato
dc.subject.por.fl_str_mv	Web Semântica Linked Open Data Dados linguísticos Linked data
topic	Web Semântica Linked Open Data Dados linguísticos Linked data CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO
dc.subject.cnpq.fl_str_mv	CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO
description	Tim Berners-Lee proposed the Semantic Web for better information retrieval. In this context, there are the Linked Data principles through which the data connection on the Web is established which the main objective is to generate meaning to the Web pages. And, by utilizing, this causes with which the software agents and people could cooperate with each other to reach their goals in an efficient manner. The project consists of working with an initiative that is an example of the application of Linked Open Data, which is the Linking Open Data (LOD), which brings data published in linked data format. The research will focus on describing and analyzing the linguistic datasets present in this initiative. From this point, it is incited as a research problem: what is identified in the links and linguistic datasets in the Linking Open Data initiative? Focusing on this problem, the objective is to map the datasets corresponding to the ‘Linguistics’ category inserted in the Linking Open Data initiative. It is an exploratory and qualitative research and of a theoretical-applied nature, addressing as main theme the mapping of linguistic datasets in Linking Open Data. First, a theoretical and practical investigation was carried out on the identification of the linguistic data sets and the technologies used in the connection of these data; after the investigation was directed to the analysis of the Linguistics category of the Linking Open Data initiative. The results obtained show that types of datasets and technologies of the Semantic Web are found in each of the seven categories of linguistic data: Corpora; Lexicons and Dictionaries; Terminologies, Thesauri and Knowledge Bases; Linguistic Resource Metadata; Linguistic Data Categories; Typological Databases; Other. It is concluded that the Linking Open Data initiative satisfactorily fulfills its function, showing the viability of the open data connection, through the prescribed technologies. As for the linguistic data of such an initiative, it is noted that they are extremely relevant and employs the technologies according required in each category.
publishDate	2020
dc.date.issued.fl_str_mv	2020-12-09
dc.date.accessioned.fl_str_mv	2021-04-27T13:47:53Z
dc.date.available.fl_str_mv	2021-04-27T13:47:53Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/bachelorThesis
format	bachelorThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	BOTÁCIO, Andrieli Cristina. Dados linguísticos e iniciativa Linking Open Data. 2020. Trabalho de Conclusão de Curso (Graduação em Biblioteconomia e Ciência da Informação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/14181.
dc.identifier.uri.fl_str_mv	https://repositorio.ufscar.br/handle/ufscar/14181
identifier_str_mv	BOTÁCIO, Andrieli Cristina. Dados linguísticos e iniciativa Linking Open Data. 2020. Trabalho de Conclusão de Curso (Graduação em Biblioteconomia e Ciência da Informação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/14181.
url	https://repositorio.ufscar.br/handle/ufscar/14181
dc.language.iso.fl_str_mv	por
language	por
dc.relation.confidence.fl_str_mv	600 600
dc.relation.authority.fl_str_mv	22bff02c-fafd-42b1-8acf-ec52a84bd73c
dc.rights.driver.fl_str_mv	Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/ info:eu-repo/semantics/openAccess
rights_invalid_str_mv	Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Universidade Federal de São Carlos Câmpus São Carlos Biblioteconomia e Ciência da Informação - BCI
dc.publisher.initials.fl_str_mv	UFSCar
publisher.none.fl_str_mv	Universidade Federal de São Carlos Câmpus São Carlos Biblioteconomia e Ciência da Informação - BCI
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFSCAR instname:Universidade Federal de São Carlos (UFSCAR) instacron:UFSCAR
instname_str	Universidade Federal de São Carlos (UFSCAR)
instacron_str	UFSCAR
institution	UFSCAR
reponame_str	Repositório Institucional da UFSCAR
collection	Repositório Institucional da UFSCAR
bitstream.url.fl_str_mv	https://repositorio.ufscar.br/bitstream/ufscar/14181/1/TCC_Andrieli%20corrigido.pdf https://repositorio.ufscar.br/bitstream/ufscar/14181/2/license_rdf https://repositorio.ufscar.br/bitstream/ufscar/14181/3/TCC_Andrieli%20corrigido.pdf.txt https://repositorio.ufscar.br/bitstream/ufscar/14181/4/TCC_Andrieli%20corrigido.pdf.jpg
bitstream.checksum.fl_str_mv	ebeea5e850c842d7c1f43fd3f408b1fe e39d27027a6cc9cb039ad269a5db8e34 78ed4bf0a02990ac5fc64d0c470e24c1 809fe5399982b6e959c7bb9b5a05a566
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5 MD5
repository.name.fl_str_mv	Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)
repository.mail.fl_str_mv
_version_	1813715629921796096

Dados linguísticos e iniciativa Linking Open Data

Registros relacionados