Automatic identification of academic profiles using author name disambiguation

Detalhes bibliográficos
Autor(a) principal: Digiampietri, Luciano Antonio
Data de Publicação: 2018
Outros Autores: Ferreira, João Eduardo
Tipo de documento: Artigo
Idioma: por
Título da fonte: Em Questão (Online)
Texto Completo: https://seer.ufrgs.br/index.php/EmQuestao/article/view/74064
Resumo: The author name disambiguation is a fundamental activity in bibliometric studies, in particular in those that use different sources of information. The objective of this paper is to propose and test an author name disambiguation strategy in order to allow the automatic identification of the Google Academic profile of researchers. The proposed strategy is based on the search for the profiles in Google Scholar, followed by a name matching process. Additionally, the academic publications that are registered in the researcher’s Lattes curriculum and Google Scholar profile are compared. Lastly, the name resolution is carried out by verifying among the compatible profiles the one with the highest evidence of belonging to the respective researcher. A case study involving researchers from the University of São Paulo was conducted, and the automated system was able to correctly identify 4,283 Google Scholar profiles. A coverage analysis showed that the system was able to find about 95% of the profiles of the researchers who have this information, and no false-positive was identified.
id UFRGS-8_0e90c8ef8d6260ee028ab9fa88c877a1
oai_identifier_str oai:seer.ufrgs.br:article/74064
network_acronym_str UFRGS-8
network_name_str Em Questão (Online)
repository_id_str
spelling Automatic identification of academic profiles using author name disambiguationDesambiguação de nomes de autores para a identificação automática de perfis acadêmicosDesambiguação de nomesResolução de entidadesBibliometriaAuthor name disambiguationEntity resolutionBibliometrics.The author name disambiguation is a fundamental activity in bibliometric studies, in particular in those that use different sources of information. The objective of this paper is to propose and test an author name disambiguation strategy in order to allow the automatic identification of the Google Academic profile of researchers. The proposed strategy is based on the search for the profiles in Google Scholar, followed by a name matching process. Additionally, the academic publications that are registered in the researcher’s Lattes curriculum and Google Scholar profile are compared. Lastly, the name resolution is carried out by verifying among the compatible profiles the one with the highest evidence of belonging to the respective researcher. A case study involving researchers from the University of São Paulo was conducted, and the automated system was able to correctly identify 4,283 Google Scholar profiles. A coverage analysis showed that the system was able to find about 95% of the profiles of the researchers who have this information, and no false-positive was identified.A desambiguação de nomes é uma atividade fundamental em estudos bibliométricos, em particular naqueles que utilizam diferentes fontes de informação. O objetivo deste trabalho é propor e testar uma estratégia de desambiguação de nomes de autores de forma a possibilitar a identificação automática do perfil do Google Acadêmico de docentes. A estratégia proposta é baseada na busca pelos perfis dos docentes no Google Acadêmico, seguida por um processo de casamento de nomes. Adicionalmente são comparadas as publicações acadêmicas que estão cadastradas no currículo Lattes do docente e no perfil do Google Acadêmico. Por fim, a resolução de nomes ocorre, verificando-se entre os perfis compatíveis aquele que apresenta maiores evidências de pertencer ao respectivo docente. Um estudo de caso envolvendo os docentes da Universidade de São Paulo foi realizado, e o sistema automático foi capaz de identificar, de maneira correta, 4.283 perfis do Google Acadêmico. Uma análise de cobertura mostrou que o sistema foi capaz de encontrar cerca de 95% dos perfis dos docentes que possuem essa informação, e nenhum falso-positivo foi identificado.Universidade Federal do Rio Grande do Sul, Faculdade de Biblioteconomia e Comunicação, Programa de Pós-Graduação em Ciência da Informação (Porto Alegre/RS)2018-04-19info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionAvaliado por Paresapplication/pdfhttps://seer.ufrgs.br/index.php/EmQuestao/article/view/7406410.19132/1808-5245242.37-54Em Questão; v. 24, n. 2, maio/ago. 2018; 37-54Em Questão; v. 24, n. 2, maio/ago. 2018; 37-54Em Questão; v. 24, n. 2, maio/ago. 2018; 37-541808-52451807-8893reponame:Em Questão (Online)instname:Universidade Federal do Rio Grande do Sul (UFRGS)instacron:UFRGSporhttps://seer.ufrgs.br/index.php/EmQuestao/article/view/74064/45895Copyright (c) 2017 Luciano Antonio Digiampietri, João Eduardo Ferreirahttps://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessDigiampietri, Luciano AntonioFerreira, João Eduardo2024-04-01T11:37:11Zoai:seer.ufrgs.br:article/74064Revistahttps://seer.ufrgs.br/emquestao/PUBhttps://seer.ufrgs.br/EmQuestao/oaiemquestao@ufrgs.br||emquestao@ufrgs.br1808-52451807-8893opendoar:2024-04-01T11:37:11Em Questão (Online) - Universidade Federal do Rio Grande do Sul (UFRGS)false
dc.title.none.fl_str_mv Automatic identification of academic profiles using author name disambiguation
Desambiguação de nomes de autores para a identificação automática de perfis acadêmicos
title Automatic identification of academic profiles using author name disambiguation
spellingShingle Automatic identification of academic profiles using author name disambiguation
Digiampietri, Luciano Antonio
Desambiguação de nomes
Resolução de entidades
Bibliometria
Author name disambiguation
Entity resolution
Bibliometrics.
title_short Automatic identification of academic profiles using author name disambiguation
title_full Automatic identification of academic profiles using author name disambiguation
title_fullStr Automatic identification of academic profiles using author name disambiguation
title_full_unstemmed Automatic identification of academic profiles using author name disambiguation
title_sort Automatic identification of academic profiles using author name disambiguation
author Digiampietri, Luciano Antonio
author_facet Digiampietri, Luciano Antonio
Ferreira, João Eduardo
author_role author
author2 Ferreira, João Eduardo
author2_role author
dc.contributor.author.fl_str_mv Digiampietri, Luciano Antonio
Ferreira, João Eduardo
dc.subject.por.fl_str_mv Desambiguação de nomes
Resolução de entidades
Bibliometria
Author name disambiguation
Entity resolution
Bibliometrics.
topic Desambiguação de nomes
Resolução de entidades
Bibliometria
Author name disambiguation
Entity resolution
Bibliometrics.
description The author name disambiguation is a fundamental activity in bibliometric studies, in particular in those that use different sources of information. The objective of this paper is to propose and test an author name disambiguation strategy in order to allow the automatic identification of the Google Academic profile of researchers. The proposed strategy is based on the search for the profiles in Google Scholar, followed by a name matching process. Additionally, the academic publications that are registered in the researcher’s Lattes curriculum and Google Scholar profile are compared. Lastly, the name resolution is carried out by verifying among the compatible profiles the one with the highest evidence of belonging to the respective researcher. A case study involving researchers from the University of São Paulo was conducted, and the automated system was able to correctly identify 4,283 Google Scholar profiles. A coverage analysis showed that the system was able to find about 95% of the profiles of the researchers who have this information, and no false-positive was identified.
publishDate 2018
dc.date.none.fl_str_mv 2018-04-19
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Avaliado por Pares
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://seer.ufrgs.br/index.php/EmQuestao/article/view/74064
10.19132/1808-5245242.37-54
url https://seer.ufrgs.br/index.php/EmQuestao/article/view/74064
identifier_str_mv 10.19132/1808-5245242.37-54
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://seer.ufrgs.br/index.php/EmQuestao/article/view/74064/45895
dc.rights.driver.fl_str_mv Copyright (c) 2017 Luciano Antonio Digiampietri, João Eduardo Ferreira
https://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2017 Luciano Antonio Digiampietri, João Eduardo Ferreira
https://creativecommons.org/licenses/by/4.0
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal do Rio Grande do Sul, Faculdade de Biblioteconomia e Comunicação, Programa de Pós-Graduação em Ciência da Informação (Porto Alegre/RS)
publisher.none.fl_str_mv Universidade Federal do Rio Grande do Sul, Faculdade de Biblioteconomia e Comunicação, Programa de Pós-Graduação em Ciência da Informação (Porto Alegre/RS)
dc.source.none.fl_str_mv Em Questão; v. 24, n. 2, maio/ago. 2018; 37-54
Em Questão; v. 24, n. 2, maio/ago. 2018; 37-54
Em Questão; v. 24, n. 2, maio/ago. 2018; 37-54
1808-5245
1807-8893
reponame:Em Questão (Online)
instname:Universidade Federal do Rio Grande do Sul (UFRGS)
instacron:UFRGS
instname_str Universidade Federal do Rio Grande do Sul (UFRGS)
instacron_str UFRGS
institution UFRGS
reponame_str Em Questão (Online)
collection Em Questão (Online)
repository.name.fl_str_mv Em Questão (Online) - Universidade Federal do Rio Grande do Sul (UFRGS)
repository.mail.fl_str_mv emquestao@ufrgs.br||emquestao@ufrgs.br
_version_ 1799766160920543232