Versus: a Web Data Repository with Time Support
Main author: | Campos, João P. |
---|---|
Publication date: | 2003 |
Document type: | Dissertation |
Language: | por |
Source title: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
Full text: | http://hdl.handle.net/10451/14009 |
Abstract: | Web repositories are large-scale warehouses of data downloaded from the Web, needed by applications that summarize that data to produce results that help people use information. Time is a central dimension of Web data, because the Web changes continuously and it is impossible to take an instantaneous snapshot of a large portion of it. Developers of applications that manage Web data usually distribute their operations over several processing nodes in order to scale up to the volume of data to be processed. Versus is a model for a repository that provides time-oriented, distributed management of Web data. Time is managed by versioning the objects saved in the repository. Distribution is managed through a hierarchy of workspaces: distributed threads work on data stored in the lower-level workspaces and save it by checking that data in to the workspace at the next level up. Versus applications can specify the granularity of the distribution and the conflict resolution policies they want to implement. This gives applications fine-grained control over the repository and broadens the number and types of applications it can support. The Versus model was embodied in a prototype that is being used to build applications for managing data collected from the Web. |
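
The abstract describes versioned objects, a hierarchy of workspaces, check-in to the next level up, and application-defined conflict resolution. The following is a minimal sketch of how those ideas could fit together, not the thesis's actual API: the names `Version`, `Workspace`, `keep_newest`, `put`, `as_of` and `check_in` are hypothetical, and integer logical timestamps are assumed for simplicity.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class Version:
    """One timestamped state of a stored object (hypothetical type)."""
    timestamp: int  # assumed: integer logical time of the observation
    content: str


# A conflict policy receives the parent's latest version and the incoming
# one, and returns whichever should be the parent's latest afterwards.
ConflictPolicy = Callable[[Version, Version], Version]


def keep_newest(existing: Version, incoming: Version) -> Version:
    """Example policy: prefer the version observed most recently."""
    return incoming if incoming.timestamp >= existing.timestamp else existing


class Workspace:
    """A node in the workspace hierarchy (illustrative only)."""

    def __init__(self, name: str, parent: Optional["Workspace"] = None):
        self.name = name
        self.parent = parent
        self.history: Dict[str, List[Version]] = {}

    def put(self, key: str, content: str, timestamp: int) -> None:
        """Store a new version of an object in this workspace."""
        self.history.setdefault(key, []).append(Version(timestamp, content))

    def latest(self, key: str) -> Optional[Version]:
        versions = self.history.get(key)
        return versions[-1] if versions else None

    def as_of(self, key: str, timestamp: int) -> Optional[Version]:
        """Return the newest version observed at or before `timestamp`."""
        older = [v for v in self.history.get(key, []) if v.timestamp <= timestamp]
        return max(older, key=lambda v: v.timestamp) if older else None

    def check_in(self, policy: ConflictPolicy = keep_newest) -> None:
        """Push this workspace's latest versions to the parent workspace."""
        if self.parent is None:
            raise ValueError("the root workspace has no parent to check in to")
        for key, versions in self.history.items():
            incoming = versions[-1]
            existing = self.parent.latest(key)
            chosen = incoming if existing is None else policy(existing, incoming)
            if chosen is not existing:
                self.parent.history.setdefault(key, []).append(chosen)
        self.history.clear()  # local work has been handed to the parent
```

In this sketch two worker workspaces can crawl the same URL independently and check in to a shared root; the policy passed to `check_in` decides which observation becomes the root's latest version, while `as_of` answers time-based queries over the versions the root has accepted.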
id |
RCAP_7c2beb6404030b055035e6282fbbd0a5 |
---|---|
oai_identifier_str |
oai:repositorio.ul.pt:10451/14009 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
repository_id_str |
7160 |
title |
Versus: a Web Data Repository with Time Support |
author |
Campos, João P. |
author_role |
author |
dc.contributor.none.fl_str_mv |
Silva, Mário Jorge Costa Gaspar da; Repositório da Universidade de Lisboa |
dc.subject.por.fl_str_mv |
Information repository; Version management; Web warehousing; Distributed computing |
publishDate |
2003 |
dc.date.none.fl_str_mv |
2003-05; 2003-05-01T00:00:00Z; 2009-02-10T13:12:56Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10451/14009 |
url |
http://hdl.handle.net/10451/14009 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Department of Informatics, University of Lisbon |
publisher.none.fl_str_mv |
Department of Informatics, University of Lisbon |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos); instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação; instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |