Dados linguísticos e iniciativa Linking Open Data
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Tipo de documento: | Trabalho de conclusão de curso |
Idioma: | por |
Título da fonte: | Repositório Institucional da UFSCAR |
Texto Completo: | https://repositorio.ufscar.br/handle/ufscar/14181 |
Resumo: | Tim Berners-Lee proposed the Semantic Web for better information retrieval. In this context, there are the Linked Data principles through which the data connection on the Web is established which the main objective is to generate meaning to the Web pages. And, by utilizing, this causes with which the software agents and people could cooperate with each other to reach their goals in an efficient manner. The project consists of working with an initiative that is an example of the application of Linked Open Data, which is the Linking Open Data (LOD), which brings data published in linked data format. The research will focus on describing and analyzing the linguistic datasets present in this initiative. From this point, it is incited as a research problem: what is identified in the links and linguistic datasets in the Linking Open Data initiative? Focusing on this problem, the objective is to map the datasets corresponding to the ‘Linguistics’ category inserted in the Linking Open Data initiative. It is an exploratory and qualitative research and of a theoretical-applied nature, addressing as main theme the mapping of linguistic datasets in Linking Open Data. First, a theoretical and practical investigation was carried out on the identification of the linguistic data sets and the technologies used in the connection of these data; after the investigation was directed to the analysis of the Linguistics category of the Linking Open Data initiative. The results obtained show that types of datasets and technologies of the Semantic Web are found in each of the seven categories of linguistic data: Corpora; Lexicons and Dictionaries; Terminologies, Thesauri and Knowledge Bases; Linguistic Resource Metadata; Linguistic Data Categories; Typological Databases; Other. It is concluded that the Linking Open Data initiative satisfactorily fulfills its function, showing the viability of the open data connection, through the prescribed technologies. As for the linguistic data of such an initiative, it is noted that they are extremely relevant and employs the technologies according required in each category. |
id |
SCAR_0e9c23aa5218cc01bcb86f9aa77b4e24 |
---|---|
oai_identifier_str |
oai:repositorio.ufscar.br:ufscar/14181 |
network_acronym_str |
SCAR |
network_name_str |
Repositório Institucional da UFSCAR |
repository_id_str |
4322 |
spelling |
Botácio, Andrieli CristinaArakaki, Ana Carolina Simionatohttp://lattes.cnpq.br/9896600626524397http://lattes.cnpq.br/1531142132127667cc4f37ed-dacd-41fd-b3a9-37c1fdfe305d2021-04-27T13:47:53Z2021-04-27T13:47:53Z2020-12-09BOTÁCIO, Andrieli Cristina. Dados linguísticos e iniciativa Linking Open Data. 2020. Trabalho de Conclusão de Curso (Graduação em Biblioteconomia e Ciência da Informação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/14181.https://repositorio.ufscar.br/handle/ufscar/14181Tim Berners-Lee proposed the Semantic Web for better information retrieval. In this context, there are the Linked Data principles through which the data connection on the Web is established which the main objective is to generate meaning to the Web pages. And, by utilizing, this causes with which the software agents and people could cooperate with each other to reach their goals in an efficient manner. The project consists of working with an initiative that is an example of the application of Linked Open Data, which is the Linking Open Data (LOD), which brings data published in linked data format. The research will focus on describing and analyzing the linguistic datasets present in this initiative. From this point, it is incited as a research problem: what is identified in the links and linguistic datasets in the Linking Open Data initiative? Focusing on this problem, the objective is to map the datasets corresponding to the ‘Linguistics’ category inserted in the Linking Open Data initiative. It is an exploratory and qualitative research and of a theoretical-applied nature, addressing as main theme the mapping of linguistic datasets in Linking Open Data. First, a theoretical and practical investigation was carried out on the identification of the linguistic data sets and the technologies used in the connection of these data; after the investigation was directed to the analysis of the Linguistics category of the Linking Open Data initiative. The results obtained show that types of datasets and technologies of the Semantic Web are found in each of the seven categories of linguistic data: Corpora; Lexicons and Dictionaries; Terminologies, Thesauri and Knowledge Bases; Linguistic Resource Metadata; Linguistic Data Categories; Typological Databases; Other. It is concluded that the Linking Open Data initiative satisfactorily fulfills its function, showing the viability of the open data connection, through the prescribed technologies. As for the linguistic data of such an initiative, it is noted that they are extremely relevant and employs the technologies according required in each category.Tim Berners-Lee propôs a Web Semântica para uma melhor recuperação da informação. Nesse contexto encontram-se os princípios Linked Data por meio dos quais se estabelece a conexão de dados na Web, fazendo com que os agentes de software e as pessoas possam trabalhar de maneira cooperativa, alcançando os objetivos de modo eficiente. Pensando nisso, foi analisada uma iniciativa que é um exemplo da aplicação do Linked Open Data, que é o Linking Open Data (LOD), que traz dados publicados em formato de dados ligados. A pesquisa se dedica a descrever e analisar os datasets linguísticos presentes nessa iniciativa. Desse ponto, incita-se como problemática de pesquisa: o que é identificado dentro das ligações e nos datasets linguísticos na iniciativa Linking Open Data? Tendo como foco tal problemática, o objetivo é mapear os datasets correspondentes à categoria ‘Linguística’ inseridos na iniciativa Linking Open Data. Trata-se de uma pesquisa exploratória e qualitativa e de natureza teórico-aplicada, abordando como tema principal o mapeamento dos datasets linguísticos no Linking Open Data. Primeiramente, fez-se uma investigação teórica e prática acerca da identificação dos datasets linguísticos e as tecnologias empregadas na ligação desses dados; depois a investigação foi direcionada para a análise da categoria Linguística da iniciativa Linking Open Data. Os resultados obtidos mostram quais tipos de datasets e tecnologias da Web Semântica encontram-se em cada uma das sete categorias de dados linguísticos: Corpora; Lexicons and Dictionaries; Terminologies, Thesauri and Knowledge Bases; Linguistic Resource Metadata; Linguistic Data Categories; Typological Databases; Other. Conclui-se que a iniciativa Linking Open Data cumpre de modo satisfatório a sua função, mostrando a viabilidade da conexão de dados abertos, por meio das tecnologias prescritas. Quanto aos dados linguísticos de tal iniciativa, nota-se que são de extrema relevância e empregam as tecnologias conforme o que se exige em cada categoria.Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)18/05101-3porUniversidade Federal de São CarlosCâmpus São CarlosBiblioteconomia e Ciência da Informação - BCIUFSCarAttribution-NonCommercial-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessWeb SemânticaLinked Open DataDados linguísticosLinked dataCIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAODados linguísticos e iniciativa Linking Open DataLinguistic data and Linking Open Data initiativeinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/bachelorThesis60060022bff02c-fafd-42b1-8acf-ec52a84bd73creponame:Repositório Institucional da UFSCARinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCARORIGINALTCC_Andrieli corrigido.pdfTCC_Andrieli corrigido.pdfTCCapplication/pdf1333377https://repositorio.ufscar.br/bitstream/ufscar/14181/1/TCC_Andrieli%20corrigido.pdfebeea5e850c842d7c1f43fd3f408b1feMD51CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8811https://repositorio.ufscar.br/bitstream/ufscar/14181/2/license_rdfe39d27027a6cc9cb039ad269a5db8e34MD52TEXTTCC_Andrieli corrigido.pdf.txtTCC_Andrieli corrigido.pdf.txtExtracted texttext/plain136836https://repositorio.ufscar.br/bitstream/ufscar/14181/3/TCC_Andrieli%20corrigido.pdf.txt78ed4bf0a02990ac5fc64d0c470e24c1MD53THUMBNAILTCC_Andrieli corrigido.pdf.jpgTCC_Andrieli corrigido.pdf.jpgIM Thumbnailimage/jpeg4673https://repositorio.ufscar.br/bitstream/ufscar/14181/4/TCC_Andrieli%20corrigido.pdf.jpg809fe5399982b6e959c7bb9b5a05a566MD54ufscar/141812023-09-18 18:32:09.775oai:repositorio.ufscar.br:ufscar/14181Repositório InstitucionalPUBhttps://repositorio.ufscar.br/oai/requestopendoar:43222023-09-18T18:32:09Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)false |
dc.title.por.fl_str_mv |
Dados linguísticos e iniciativa Linking Open Data |
dc.title.alternative.por.fl_str_mv |
Linguistic data and Linking Open Data initiative |
title |
Dados linguísticos e iniciativa Linking Open Data |
spellingShingle |
Dados linguísticos e iniciativa Linking Open Data Botácio, Andrieli Cristina Web Semântica Linked Open Data Dados linguísticos Linked data CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO |
title_short |
Dados linguísticos e iniciativa Linking Open Data |
title_full |
Dados linguísticos e iniciativa Linking Open Data |
title_fullStr |
Dados linguísticos e iniciativa Linking Open Data |
title_full_unstemmed |
Dados linguísticos e iniciativa Linking Open Data |
title_sort |
Dados linguísticos e iniciativa Linking Open Data |
author |
Botácio, Andrieli Cristina |
author_facet |
Botácio, Andrieli Cristina |
author_role |
author |
dc.contributor.authorlattes.por.fl_str_mv |
http://lattes.cnpq.br/1531142132127667 |
dc.contributor.author.fl_str_mv |
Botácio, Andrieli Cristina |
dc.contributor.advisor1.fl_str_mv |
Arakaki, Ana Carolina Simionato |
dc.contributor.advisor1Lattes.fl_str_mv |
http://lattes.cnpq.br/9896600626524397 |
dc.contributor.authorID.fl_str_mv |
cc4f37ed-dacd-41fd-b3a9-37c1fdfe305d |
contributor_str_mv |
Arakaki, Ana Carolina Simionato |
dc.subject.por.fl_str_mv |
Web Semântica Linked Open Data Dados linguísticos Linked data |
topic |
Web Semântica Linked Open Data Dados linguísticos Linked data CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO |
dc.subject.cnpq.fl_str_mv |
CIENCIAS SOCIAIS APLICADAS::CIENCIA DA INFORMACAO::BIBLIOTECONOMIA::TECNICAS DE RECUPERACAO DE INFORMACAO |
description |
Tim Berners-Lee proposed the Semantic Web for better information retrieval. In this context, there are the Linked Data principles through which the data connection on the Web is established which the main objective is to generate meaning to the Web pages. And, by utilizing, this causes with which the software agents and people could cooperate with each other to reach their goals in an efficient manner. The project consists of working with an initiative that is an example of the application of Linked Open Data, which is the Linking Open Data (LOD), which brings data published in linked data format. The research will focus on describing and analyzing the linguistic datasets present in this initiative. From this point, it is incited as a research problem: what is identified in the links and linguistic datasets in the Linking Open Data initiative? Focusing on this problem, the objective is to map the datasets corresponding to the ‘Linguistics’ category inserted in the Linking Open Data initiative. It is an exploratory and qualitative research and of a theoretical-applied nature, addressing as main theme the mapping of linguistic datasets in Linking Open Data. First, a theoretical and practical investigation was carried out on the identification of the linguistic data sets and the technologies used in the connection of these data; after the investigation was directed to the analysis of the Linguistics category of the Linking Open Data initiative. The results obtained show that types of datasets and technologies of the Semantic Web are found in each of the seven categories of linguistic data: Corpora; Lexicons and Dictionaries; Terminologies, Thesauri and Knowledge Bases; Linguistic Resource Metadata; Linguistic Data Categories; Typological Databases; Other. It is concluded that the Linking Open Data initiative satisfactorily fulfills its function, showing the viability of the open data connection, through the prescribed technologies. As for the linguistic data of such an initiative, it is noted that they are extremely relevant and employs the technologies according required in each category. |
publishDate |
2020 |
dc.date.issued.fl_str_mv |
2020-12-09 |
dc.date.accessioned.fl_str_mv |
2021-04-27T13:47:53Z |
dc.date.available.fl_str_mv |
2021-04-27T13:47:53Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/bachelorThesis |
format |
bachelorThesis |
status_str |
publishedVersion |
dc.identifier.citation.fl_str_mv |
BOTÁCIO, Andrieli Cristina. Dados linguísticos e iniciativa Linking Open Data. 2020. Trabalho de Conclusão de Curso (Graduação em Biblioteconomia e Ciência da Informação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/14181. |
dc.identifier.uri.fl_str_mv |
https://repositorio.ufscar.br/handle/ufscar/14181 |
identifier_str_mv |
BOTÁCIO, Andrieli Cristina. Dados linguísticos e iniciativa Linking Open Data. 2020. Trabalho de Conclusão de Curso (Graduação em Biblioteconomia e Ciência da Informação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/14181. |
url |
https://repositorio.ufscar.br/handle/ufscar/14181 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.confidence.fl_str_mv |
600 600 |
dc.relation.authority.fl_str_mv |
22bff02c-fafd-42b1-8acf-ec52a84bd73c |
dc.rights.driver.fl_str_mv |
Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/ info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/ |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
Universidade Federal de São Carlos Câmpus São Carlos Biblioteconomia e Ciência da Informação - BCI |
dc.publisher.initials.fl_str_mv |
UFSCar |
publisher.none.fl_str_mv |
Universidade Federal de São Carlos Câmpus São Carlos Biblioteconomia e Ciência da Informação - BCI |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFSCAR instname:Universidade Federal de São Carlos (UFSCAR) instacron:UFSCAR |
instname_str |
Universidade Federal de São Carlos (UFSCAR) |
instacron_str |
UFSCAR |
institution |
UFSCAR |
reponame_str |
Repositório Institucional da UFSCAR |
collection |
Repositório Institucional da UFSCAR |
bitstream.url.fl_str_mv |
https://repositorio.ufscar.br/bitstream/ufscar/14181/1/TCC_Andrieli%20corrigido.pdf https://repositorio.ufscar.br/bitstream/ufscar/14181/2/license_rdf https://repositorio.ufscar.br/bitstream/ufscar/14181/3/TCC_Andrieli%20corrigido.pdf.txt https://repositorio.ufscar.br/bitstream/ufscar/14181/4/TCC_Andrieli%20corrigido.pdf.jpg |
bitstream.checksum.fl_str_mv |
ebeea5e850c842d7c1f43fd3f408b1fe e39d27027a6cc9cb039ad269a5db8e34 78ed4bf0a02990ac5fc64d0c470e24c1 809fe5399982b6e959c7bb9b5a05a566 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 |
repository.name.fl_str_mv |
Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR) |
repository.mail.fl_str_mv |
|
_version_ |
1813715629921796096 |