Dados linguísticos e iniciativa Linking Open Data

Bibliographic details
Main author: Botácio, Andrieli Cristina
Publication date: 2020
Document type: Undergraduate thesis (Trabalho de conclusão de curso)
Language: por
Source title: Repositório Institucional da UFSCAR
Full text: https://repositorio.ufscar.br/handle/ufscar/14181
Abstract: Tim Berners-Lee proposed the Semantic Web to improve information retrieval. In this context, the Linked Data principles establish connections between data on the Web, with the main objective of giving meaning to Web pages, so that software agents and people can cooperate to reach their goals efficiently. This project examines the Linking Open Data (LOD) initiative, an example of the application of Linked Open Data that brings together data published in linked data format. The research describes and analyzes the linguistic datasets present in this initiative. The research problem is therefore: what can be identified in the links and the linguistic datasets of the Linking Open Data initiative? With this problem in focus, the objective is to map the datasets in the ‘Linguistics’ category of the Linking Open Data initiative. This is exploratory, qualitative research of a theoretical-applied nature, whose main theme is the mapping of linguistic datasets in Linking Open Data. First, a theoretical and practical investigation was carried out to identify the linguistic datasets and the technologies used to link these data; the investigation then turned to the analysis of the Linguistics category of the Linking Open Data initiative. The results show which types of datasets and Semantic Web technologies are found in each of the seven categories of linguistic data: Corpora; Lexicons and Dictionaries; Terminologies, Thesauri and Knowledge Bases; Linguistic Resource Metadata; Linguistic Data Categories; Typological Databases; Other. The study concludes that the Linking Open Data initiative satisfactorily fulfills its function, showing the viability of connecting open data through the prescribed technologies. As for the linguistic data in the initiative, they are extremely relevant and employ the technologies required in each category.
id SCAR_0e9c23aa5218cc01bcb86f9aa77b4e24
oai_identifier_str oai:repositorio.ufscar.br:ufscar/14181
network_acronym_str SCAR
network_name_str Repositório Institucional da UFSCAR
repository_id_str 4322
spelling Botácio, Andrieli Cristina
Arakaki, Ana Carolina Simionato
http://lattes.cnpq.br/9896600626524397
http://lattes.cnpq.br/1531142132127667
cc4f37ed-dacd-41fd-b3a9-37c1fdfe305d
2021-04-27T13:47:53Z
2021-04-27T13:47:53Z
2020-12-09
BOTÁCIO, Andrieli Cristina. Dados linguísticos e iniciativa Linking Open Data. 2020. Trabalho de Conclusão de Curso (Graduação em Biblioteconomia e Ciência da Informação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/14181.
https://repositorio.ufscar.br/handle/ufscar/14181
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
18/05101-3
por
Universidade Federal de São Carlos
Câmpus São Carlos
Biblioteconomia e Ciência da Informação - BCI
UFSCar
Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
info:eu-repo/semantics/openAccess
Web Semântica
Linked Open Data
Dados linguísticos
Linked data
CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO
Dados linguísticos e iniciativa Linking Open Data
Linguistic data and Linking Open Data initiative
info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/bachelorThesis
600
600
22bff02c-fafd-42b1-8acf-ec52a84bd73c
reponame:Repositório Institucional da UFSCAR
instname:Universidade Federal de São Carlos (UFSCAR)
instacron:UFSCAR
ORIGINAL | TCC_Andrieli corrigido.pdf | TCC | application/pdf | 1333377 | https://repositorio.ufscar.br/bitstream/ufscar/14181/1/TCC_Andrieli%20corrigido.pdf | ebeea5e850c842d7c1f43fd3f408b1fe | MD5 | 1
CC-LICENSE | license_rdf | application/rdf+xml; charset=utf-8 | 811 | https://repositorio.ufscar.br/bitstream/ufscar/14181/2/license_rdf | e39d27027a6cc9cb039ad269a5db8e34 | MD5 | 2
TEXT | TCC_Andrieli corrigido.pdf.txt | Extracted text | text/plain | 136836 | https://repositorio.ufscar.br/bitstream/ufscar/14181/3/TCC_Andrieli%20corrigido.pdf.txt | 78ed4bf0a02990ac5fc64d0c470e24c1 | MD5 | 3
THUMBNAIL | TCC_Andrieli corrigido.pdf.jpg | IM Thumbnail | image/jpeg | 4673 | https://repositorio.ufscar.br/bitstream/ufscar/14181/4/TCC_Andrieli%20corrigido.pdf.jpg | 809fe5399982b6e959c7bb9b5a05a566 | MD5 | 4
ufscar/14181
2023-09-18 18:32:09.775
oai:repositorio.ufscar.br:ufscar/14181
Repositório Institucional
PUB
https://repositorio.ufscar.br/oai/request
opendoar:4322
2023-09-18T18:32:09
Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)
false
dc.title.por.fl_str_mv Dados linguísticos e iniciativa Linking Open Data
dc.title.alternative.por.fl_str_mv Linguistic data and Linking Open Data initiative
title Dados linguísticos e iniciativa Linking Open Data
spellingShingle Dados linguísticos e iniciativa Linking Open Data
Botácio, Andrieli Cristina
Web Semântica
Linked Open Data
Dados linguísticos
Linked data
CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO
title_short Dados linguísticos e iniciativa Linking Open Data
title_full Dados linguísticos e iniciativa Linking Open Data
title_fullStr Dados linguísticos e iniciativa Linking Open Data
title_full_unstemmed Dados linguísticos e iniciativa Linking Open Data
title_sort Dados linguísticos e iniciativa Linking Open Data
author Botácio, Andrieli Cristina
author_facet Botácio, Andrieli Cristina
author_role author
dc.contributor.authorlattes.por.fl_str_mv http://lattes.cnpq.br/1531142132127667
dc.contributor.author.fl_str_mv Botácio, Andrieli Cristina
dc.contributor.advisor1.fl_str_mv Arakaki, Ana Carolina Simionato
dc.contributor.advisor1Lattes.fl_str_mv http://lattes.cnpq.br/9896600626524397
dc.contributor.authorID.fl_str_mv cc4f37ed-dacd-41fd-b3a9-37c1fdfe305d
contributor_str_mv Arakaki, Ana Carolina Simionato
dc.subject.por.fl_str_mv Web Semântica
Linked Open Data
Dados linguísticos
Linked data
topic Web Semântica
Linked Open Data
Dados linguísticos
Linked data
CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO
dc.subject.cnpq.fl_str_mv CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO
publishDate 2020
dc.date.issued.fl_str_mv 2020-12-09
dc.date.accessioned.fl_str_mv 2021-04-27T13:47:53Z
dc.date.available.fl_str_mv 2021-04-27T13:47:53Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/bachelorThesis
format bachelorThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv BOTÁCIO, Andrieli Cristina. Dados linguísticos e iniciativa Linking Open Data. 2020. Trabalho de Conclusão de Curso (Graduação em Biblioteconomia e Ciência da Informação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/14181.
dc.identifier.uri.fl_str_mv https://repositorio.ufscar.br/handle/ufscar/14181
identifier_str_mv BOTÁCIO, Andrieli Cristina. Dados linguísticos e iniciativa Linking Open Data. 2020. Trabalho de Conclusão de Curso (Graduação em Biblioteconomia e Ciência da Informação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/14181.
url https://repositorio.ufscar.br/handle/ufscar/14181
dc.language.iso.fl_str_mv por
language por
dc.relation.confidence.fl_str_mv 600
600
dc.relation.authority.fl_str_mv 22bff02c-fafd-42b1-8acf-ec52a84bd73c
dc.rights.driver.fl_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Universidade Federal de São Carlos
Câmpus São Carlos
Biblioteconomia e Ciência da Informação - BCI
dc.publisher.initials.fl_str_mv UFSCar
publisher.none.fl_str_mv Universidade Federal de São Carlos
Câmpus São Carlos
Biblioteconomia e Ciência da Informação - BCI
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFSCAR
instname:Universidade Federal de São Carlos (UFSCAR)
instacron:UFSCAR
instname_str Universidade Federal de São Carlos (UFSCAR)
instacron_str UFSCAR
institution UFSCAR
reponame_str Repositório Institucional da UFSCAR
collection Repositório Institucional da UFSCAR
bitstream.url.fl_str_mv https://repositorio.ufscar.br/bitstream/ufscar/14181/1/TCC_Andrieli%20corrigido.pdf
https://repositorio.ufscar.br/bitstream/ufscar/14181/2/license_rdf
https://repositorio.ufscar.br/bitstream/ufscar/14181/3/TCC_Andrieli%20corrigido.pdf.txt
https://repositorio.ufscar.br/bitstream/ufscar/14181/4/TCC_Andrieli%20corrigido.pdf.jpg
bitstream.checksum.fl_str_mv ebeea5e850c842d7c1f43fd3f408b1fe
e39d27027a6cc9cb039ad269a5db8e34
78ed4bf0a02990ac5fc64d0c470e24c1
809fe5399982b6e959c7bb9b5a05a566
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)
repository.mail.fl_str_mv
_version_ 1813715629921796096