Reducing fragmentation in incremental author name disambiguation.
Main author: | Espiridião, Luciano Vilas Boas |
---|---|
Publication date: | 2014 |
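Other authors: | Ferreira, Anderson Almeida; Laender, Alberto Henrique Frade; Gonçalves, Marcos André; Gomes, David Menotti; Tavares, Andréa Iabrudi; Assis, Guilherme Tavares de |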
Document type: | Article |
Language: | eng |
Source title: | Repositório Institucional da UFOP |
Full text: | http://www.repositorio.ufop.br/handle/123456789/4305 |
Abstract: | Author name ambiguity is a hard problem that occurs when several authors publish articles under the same name or when the same author publishes articles under different names. Traditionally, automatic disambiguation methods process the author names of all citation records in a repository. Aiming at efficiency, incremental methods disambiguate author names only when new citation records are inserted into the repository. As a side effect, several citation records of the same author may be associated with different authors, a problem known as fragmentation. To mitigate this problem, we propose a new merge-oriented incremental method that reduces this side effect without the need to apply a traditional disambiguation method to the whole repository. Our experimental evaluation shows that our method produces significant improvements when compared to an incremental baseline and is very competitive with batch-mode methods. |
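
The fragmentation effect described in the abstract can be illustrated with a toy sketch. This is not the method proposed in the article: the coauthor-overlap rule, the `Cluster` class, and the functions `insert_record` and `merge_fragments` are all hypothetical names and simplifications introduced here only to show how incremental insertion can split one author's records and how a later merge step can reunite them.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    """A group of citation records believed to belong to one author."""
    records: list = field(default_factory=list)
    coauthors: set = field(default_factory=set)

    def add(self, record_id, coauthors):
        self.records.append(record_id)
        self.coauthors |= set(coauthors)


def insert_record(clusters, record_id, coauthors):
    """Incremental step (assumed rule): attach the record to the first cluster
    that shares a coauthor with it, otherwise open a new cluster."""
    for cluster in clusters:
        if cluster.coauthors & set(coauthors):
            cluster.add(record_id, coauthors)
            return
    new_cluster = Cluster()
    new_cluster.add(record_id, coauthors)
    clusters.append(new_cluster)


def merge_fragments(clusters):
    """Merge step (assumed rule): fuse any two clusters that share a coauthor,
    repeating until no such pair remains."""
    merged = True
    while merged:
        merged = False
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                if clusters[i].coauthors & clusters[j].coauthors:
                    clusters[i].records += clusters[j].records
                    clusters[i].coauthors |= clusters[j].coauthors
                    del clusters[j]
                    merged = True
                    break
            if merged:
                break
    return clusters


# Three records by the same (ambiguous) author name arrive one at a time.
clusters = []
insert_record(clusters, "r1", {"B. Lee"})
insert_record(clusters, "r2", {"C. Diaz"})            # no overlap: new cluster
insert_record(clusters, "r3", {"B. Lee", "C. Diaz"})  # joins the first cluster only
fragments_before = len(clusters)
merge_fragments(clusters)
fragments_after = len(clusters)
print(fragments_before, fragments_after)
```

In this toy run the second record opens its own cluster because nothing links it to the first one yet, so the author is fragmented into two clusters; the merge step then joins them once the third record has supplied the missing evidence. The merge only inspects existing clusters, which loosely mirrors the abstract's point that a full batch re-run over the whole repository is avoided.
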
id | UFOP_864c9e94b7cfcbea08c0e6cabf20aa2b |
---|---|
oai_identifier_str | oai:localhost:123456789/4305 |
network_acronym_str | UFOP |
network_name_str | Repositório Institucional da UFOP |
repository_id_str | 3233 |
spelling | Permission to copy without fee all or part of the material printed in JIDM is granted provided that the copies are not made or distributed for commercial advantage, and that notice is given that copying is by permission of the Sociedade Brasileira de Computação. Source: information contained in the article. LICENSE: license.txt (text/plain; charset=utf-8, 2636); ORIGINAL: ARTIGO_ReducingFragmentationIncremental.pdf (application/pdf, 509822). 2019-06-11 12:42:23.408; 2019-06-11T16:42:23. http://www.repositorio.ufop.br/oai/request; opendoar:3233 |
dc.title.pt_BR.fl_str_mv | Reducing fragmentation in incremental author name disambiguation. |
title | Reducing fragmentation in incremental author name disambiguation. |
spellingShingle | Reducing fragmentation in incremental author name disambiguation.; Espiridião, Luciano Vilas Boas; Author name ambiguity; Bibliographic citation; Incremental disambiguation |
title_short | Reducing fragmentation in incremental author name disambiguation. |
title_full | Reducing fragmentation in incremental author name disambiguation. |
title_fullStr | Reducing fragmentation in incremental author name disambiguation. |
title_full_unstemmed | Reducing fragmentation in incremental author name disambiguation. |
title_sort | Reducing fragmentation in incremental author name disambiguation. |
author | Espiridião, Luciano Vilas Boas |
author_facet | Espiridião, Luciano Vilas Boas; Ferreira, Anderson Almeida; Laender, Alberto Henrique Frade; Gonçalves, Marcos André; Gomes, David Menotti; Tavares, Andréa Iabrudi; Assis, Guilherme Tavares de |
author_role | author |
author2 | Ferreira, Anderson Almeida; Laender, Alberto Henrique Frade; Gonçalves, Marcos André; Gomes, David Menotti; Tavares, Andréa Iabrudi; Assis, Guilherme Tavares de |
author2_role | author author author author author author |
dc.contributor.author.fl_str_mv | Espiridião, Luciano Vilas Boas; Ferreira, Anderson Almeida; Laender, Alberto Henrique Frade; Gonçalves, Marcos André; Gomes, David Menotti; Tavares, Andréa Iabrudi; Assis, Guilherme Tavares de |
dc.subject.por.fl_str_mv | Author name ambiguity; Bibliographic citation; Incremental disambiguation |
topic | Author name ambiguity; Bibliographic citation; Incremental disambiguation |
description | Author name ambiguity is a hard problem that occurs when several authors publish articles under the same name or when the same author publishes articles under different names. Traditionally, automatic disambiguation methods process the author names of all citation records in a repository. Aiming at efficiency, incremental methods disambiguate author names only when new citation records are inserted into the repository. As a side effect, several citation records of the same author may be associated with different authors, a problem known as fragmentation. To mitigate this problem, we propose a new merge-oriented incremental method that reduces this side effect without the need to apply a traditional disambiguation method to the whole repository. Our experimental evaluation shows that our method produces significant improvements when compared to an incremental baseline and is very competitive with batch-mode methods. |
publishDate | 2014 |
dc.date.issued.fl_str_mv | 2014 |
dc.date.accessioned.fl_str_mv | 2015-01-21T18:05:11Z |
dc.date.available.fl_str_mv | 2015-01-21T18:05:11Z |
dc.type.status.fl_str_mv | info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv | info:eu-repo/semantics/article |
format | article |
status_str | publishedVersion |
dc.identifier.citation.fl_str_mv | ESPERIDIÃO, L. V. B. et al. Reducing fragmentation in incremental author name disambiguation. Journal of Information and Data Management - JIDM, São Paulo, v. 5, n. 3, p. 293-307, Oct. 2014. Available at: <https://periodicos.ufmg.br/index.php/jidm/article/view/286>. Accessed: 21 Jan. 2015. |
dc.identifier.uri.fl_str_mv | http://www.repositorio.ufop.br/handle/123456789/4305 |
dc.identifier.issn.none.fl_str_mv | 2178-7107 |
identifier_str_mv | ESPERIDIÃO, L. V. B. et al. Reducing fragmentation in incremental author name disambiguation. Journal of Information and Data Management - JIDM, São Paulo, v. 5, n. 3, p. 293-307, Oct. 2014. Available at: <https://periodicos.ufmg.br/index.php/jidm/article/view/286>. Accessed: 21 Jan. 2015. 2178-7107 |
url | http://www.repositorio.ufop.br/handle/123456789/4305 |
dc.language.iso.fl_str_mv | eng |
language | eng |
dc.rights.driver.fl_str_mv | info:eu-repo/semantics/openAccess |
eu_rights_str_mv | openAccess |
dc.source.none.fl_str_mv | reponame:Repositório Institucional da UFOP instname:Universidade Federal de Ouro Preto (UFOP) instacron:UFOP |
instname_str | Universidade Federal de Ouro Preto (UFOP) |
instacron_str | UFOP |
institution | UFOP |
reponame_str | Repositório Institucional da UFOP |
collection | Repositório Institucional da UFOP |
bitstream.url.fl_str_mv | http://www.repositorio.ufop.br/bitstream/123456789/4305/2/license.txt http://www.repositorio.ufop.br/bitstream/123456789/4305/1/ARTIGO_ReducingFragmentationIncremental.pdf |
bitstream.checksum.fl_str_mv | c2ffdd99e58acf69202dff00d361f23a 9b986b90a83850f6b4009d22d520e62f |
bitstream.checksumAlgorithm.fl_str_mv | MD5 MD5 |
repository.name.fl_str_mv | Repositório Institucional da UFOP - Universidade Federal de Ouro Preto (UFOP) |
repository.mail.fl_str_mv | repositorio@ufop.edu.br |
_version_ | 1801685707012440064 |
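
The record publishes an MD5 checksum for each bitstream, so a downloaded copy can be checked against it. The following is a minimal sketch using Python's standard `hashlib`; the function names `md5_of_file` and `matches_checksum` and the local file path in the usage comment are assumptions made for illustration, and the pairing of the second checksum with the PDF assumes the checksums are listed in the same order as the bitstream URLs.

```python
import hashlib


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_hex):
    """Compare a file's MD5 digest with a checksum copied from the record."""
    return md5_of_file(path) == expected_hex.strip().lower()


# Usage with a hypothetical local copy of the PDF bitstream:
# matches_checksum("ARTIGO_ReducingFragmentationIncremental.pdf",
#                  "9b986b90a83850f6b4009d22d520e62f")
```

MD5 is adequate here only as an integrity check against accidental corruption, which is how the record uses it; it is not a safeguard against deliberate tampering.
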