Query expansion strategies for laypeople-centred health information retrieval

Bibliographic Details
Main Author: Ricardo Daniel Soares da Silva
Publication Date: 2016
Format: Master thesis
Language: eng
Source: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Download full: https://repositorio-aberto.up.pt/handle/10216/85736
Summary: One of the most common activities on the web is searching for health information. This activity has been gaining popularity among users, but most of them have no training in health care, which leads to difficulties in understanding the terminology and contents of documents. In the field of health information retrieval, various investigations have been carried out, resulting in methodologies that improve the quality of the retrieved documents. One of the most studied techniques in this area is query expansion, which addresses one of the biggest difficulties users face when searching for health information: limited knowledge of medical terminology. This lack of knowledge influences the formulation of queries and the expectations about the retrieved documents. Query expansion complements the original query with additional terms, making it more reliable. These new terms can be obtained from a thesaurus containing several terms associated with a medical concept. Far less research has been conducted on the readability of documents; the most developed subject is relevance, but if a document is relevant and the user does not comprehend its contents, it ceases to be useful. This thesis proposes a methodology to improve the quality of the retrieved documents, using methods that improve users' queries, such as query expansion, and readability formulas to determine the level of education required to understand a document. Several tests will be conducted to determine whether the source used in query expansion and the readability have an effect on the retrieval process. These tests will be evaluated with precision and NDCG in the case of relevance, and with uRBP in the case of readability.
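The thesaurus-based query expansion described in the summary can be illustrated with a minimal sketch. It is not the method used in the thesis: the thesaurus entries, the `expand_query` function, and the simple substring matching are all assumptions made for illustration only.

```python
# Toy thesaurus mapping lay phrases to related medical terms.
# These entries are illustrative assumptions, not taken from the thesis.
TOY_THESAURUS = {
    "heart attack": ["myocardial infarction"],
    "high blood pressure": ["hypertension"],
    "nosebleed": ["epistaxis"],
}


def expand_query(query, thesaurus):
    """Return the query followed by thesaurus terms for each phrase it contains."""
    normalized = query.lower()
    expansions = []
    for phrase, related_terms in thesaurus.items():
        # Naive substring match; a real system would tokenize and map concepts.
        if phrase in normalized:
            for term in related_terms:
                if term not in normalized and term not in expansions:
                    expansions.append(term)
    # The original query is kept intact and the new terms are appended.
    return " ".join([query] + expansions)


print(expand_query("heart attack symptoms", TOY_THESAURUS))
# prints: heart attack symptoms myocardial infarction
```

The original query terms are preserved, so a document written in lay language can still match, while the appended medical term lets the query also reach documents that use professional terminology.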
id RCAP_1ad9da6a9704828eaebffee4ecbd17d4
oai_identifier_str oai:repositorio-aberto.up.pt:10216/85736
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
dc.title.none.fl_str_mv Query expansion strategies for laypeople-centred health information retrieval
title Query expansion strategies for laypeople-centred health information retrieval
author Ricardo Daniel Soares da Silva
author_facet Ricardo Daniel Soares da Silva
author_role author
dc.contributor.author.fl_str_mv Ricardo Daniel Soares da Silva
dc.subject.por.fl_str_mv Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
topic Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
publishDate 2016
dc.date.none.fl_str_mv 2016-07-20
2016-07-20T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://repositorio-aberto.up.pt/handle/10216/85736
TID:201317974
url https://repositorio-aberto.up.pt/handle/10216/85736
identifier_str_mv TID:201317974
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
_version_ 1799136296279474176