Design and Implementation of a Novel Peer-To-Peer Information Retrieval Framework

Detalhes bibliográficos
Autor(a) principal: Sadasivam, G. Sudha
Data de Publicação: 2009
Outros Autores: Kartthikeyan, V., Raja, P.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: INFOCOMP: Jornal de Ciência da Computação
Texto Completo: https://infocomp.dcc.ufla.br/index.php/infocomp/article/view/253
Resumo: Peer-to-Peer (P2P) Information Retrieval framework consists of a peer to peer network of nodes, which voluntarily agree to share their resources by joining the network. While joining these nodes construct the active peer list. Each peer maintains a B+ tree containing IP hash values. The files are distributed over the peer to peer network based on the keywords. The files are initially uploaded into the target node based on the closest match between the hash values of the IP address of the node and the keywords used to index the file. While searching, the target node is identified by finding the closest match between the hash value of the keyword and the IP address hash from the B+ tree stored in the peers. After identifying the target node, the references to desired document is retrieved by searching a B+ tree indexed using keywords. The proposed framework uses Hadoop cluster to extract keywords from the files to be uploaded in the desired target node. Hadoop’s MapReduce programming paradigm reduces the time for keyword extraction. As the framework maintains a B+ tree in the peers, it further reduces the search time and improves network bandwidth.
id UFLA-5_ff4a3b3ecc5eeab2cc0a827772c9b633
oai_identifier_str oai:infocomp.dcc.ufla.br:article/253
network_acronym_str UFLA-5
network_name_str INFOCOMP: Jornal de Ciência da Computação
repository_id_str
spelling Design and Implementation of a Novel Peer-To-Peer Information Retrieval FrameworkP2P networksinformation retrievalHadoopB tree.B tree.Peer-to-Peer (P2P) Information Retrieval framework consists of a peer to peer network of nodes, which voluntarily agree to share their resources by joining the network. While joining these nodes construct the active peer list. Each peer maintains a B+ tree containing IP hash values. The files are distributed over the peer to peer network based on the keywords. The files are initially uploaded into the target node based on the closest match between the hash values of the IP address of the node and the keywords used to index the file. While searching, the target node is identified by finding the closest match between the hash value of the keyword and the IP address hash from the B+ tree stored in the peers. After identifying the target node, the references to desired document is retrieved by searching a B+ tree indexed using keywords. The proposed framework uses Hadoop cluster to extract keywords from the files to be uploaded in the desired target node. Hadoop’s MapReduce programming paradigm reduces the time for keyword extraction. As the framework maintains a B+ tree in the peers, it further reduces the search time and improves network bandwidth.Editora da UFLA2009-03-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://infocomp.dcc.ufla.br/index.php/infocomp/article/view/253INFOCOMP Journal of Computer Science; Vol. 8 No. 1 (2009): March, 2009; 75-831982-33631807-4545reponame:INFOCOMP: Jornal de Ciência da Computaçãoinstname:Universidade Federal de Lavras (UFLA)instacron:UFLAenghttps://infocomp.dcc.ufla.br/index.php/infocomp/article/view/253/238Copyright (c) 2016 INFOCOMP Journal of Computer Scienceinfo:eu-repo/semantics/openAccessSadasivam, G. SudhaKartthikeyan, V.Raja, P.2015-07-01T12:46:30Zoai:infocomp.dcc.ufla.br:article/253Revistahttps://infocomp.dcc.ufla.br/index.php/infocompPUBhttps://infocomp.dcc.ufla.br/index.php/infocomp/oaiinfocomp@dcc.ufla.br||apfreire@dcc.ufla.br1982-33631807-4545opendoar:2024-05-21T19:54:27.573854INFOCOMP: Jornal de Ciência da Computação - Universidade Federal de Lavras (UFLA)true
dc.title.none.fl_str_mv Design and Implementation of a Novel Peer-To-Peer Information Retrieval Framework
title Design and Implementation of a Novel Peer-To-Peer Information Retrieval Framework
spellingShingle Design and Implementation of a Novel Peer-To-Peer Information Retrieval Framework
Sadasivam, G. Sudha
P2P networks
information retrieval
Hadoop
B tree.
B tree.
title_short Design and Implementation of a Novel Peer-To-Peer Information Retrieval Framework
title_full Design and Implementation of a Novel Peer-To-Peer Information Retrieval Framework
title_fullStr Design and Implementation of a Novel Peer-To-Peer Information Retrieval Framework
title_full_unstemmed Design and Implementation of a Novel Peer-To-Peer Information Retrieval Framework
title_sort Design and Implementation of a Novel Peer-To-Peer Information Retrieval Framework
author Sadasivam, G. Sudha
author_facet Sadasivam, G. Sudha
Kartthikeyan, V.
Raja, P.
author_role author
author2 Kartthikeyan, V.
Raja, P.
author2_role author
author
dc.contributor.author.fl_str_mv Sadasivam, G. Sudha
Kartthikeyan, V.
Raja, P.
dc.subject.por.fl_str_mv P2P networks
information retrieval
Hadoop
B tree.
B tree.
topic P2P networks
information retrieval
Hadoop
B tree.
B tree.
description Peer-to-Peer (P2P) Information Retrieval framework consists of a peer to peer network of nodes, which voluntarily agree to share their resources by joining the network. While joining these nodes construct the active peer list. Each peer maintains a B+ tree containing IP hash values. The files are distributed over the peer to peer network based on the keywords. The files are initially uploaded into the target node based on the closest match between the hash values of the IP address of the node and the keywords used to index the file. While searching, the target node is identified by finding the closest match between the hash value of the keyword and the IP address hash from the B+ tree stored in the peers. After identifying the target node, the references to desired document is retrieved by searching a B+ tree indexed using keywords. The proposed framework uses Hadoop cluster to extract keywords from the files to be uploaded in the desired target node. Hadoop’s MapReduce programming paradigm reduces the time for keyword extraction. As the framework maintains a B+ tree in the peers, it further reduces the search time and improves network bandwidth.
publishDate 2009
dc.date.none.fl_str_mv 2009-03-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://infocomp.dcc.ufla.br/index.php/infocomp/article/view/253
url https://infocomp.dcc.ufla.br/index.php/infocomp/article/view/253
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://infocomp.dcc.ufla.br/index.php/infocomp/article/view/253/238
dc.rights.driver.fl_str_mv Copyright (c) 2016 INFOCOMP Journal of Computer Science
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2016 INFOCOMP Journal of Computer Science
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Editora da UFLA
publisher.none.fl_str_mv Editora da UFLA
dc.source.none.fl_str_mv INFOCOMP Journal of Computer Science; Vol. 8 No. 1 (2009): March, 2009; 75-83
1982-3363
1807-4545
reponame:INFOCOMP: Jornal de Ciência da Computação
instname:Universidade Federal de Lavras (UFLA)
instacron:UFLA
instname_str Universidade Federal de Lavras (UFLA)
instacron_str UFLA
institution UFLA
reponame_str INFOCOMP: Jornal de Ciência da Computação
collection INFOCOMP: Jornal de Ciência da Computação
repository.name.fl_str_mv INFOCOMP: Jornal de Ciência da Computação - Universidade Federal de Lavras (UFLA)
repository.mail.fl_str_mv infocomp@dcc.ufla.br||apfreire@dcc.ufla.br
_version_ 1799874740869922816