Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde
Autor(a) principal: | |
---|---|
Data de Publicação: | 2021 |
Tipo de documento: | Dissertação |
Idioma: | por |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://hdl.handle.net/10216/135954 |
Resumo: | The Wikipedia is an online, free, multi language and collaborative encyclopaedia, currently one of the biggest data sources in the web. There, it is possible to find information from different areas, from technology to philosophy, including health. As an health related data source, it is not only used by general public, but also by professionals as well. One of the reasons for such, is that, apart from the content of the articles, it includes external links for additional data sources as well. The open nature of Wikipedia contributions, specifically in health context, raises safety concerns, as such data is being used to make decisions. Thus, and considering possible consequences, it is very relevant to evaluate the quality of the information herein. This subject has been previously addressed by several studies, considering different metrics. In this work, a set of predefined metrics will be used to evaluate que quality of the information, such as autorithy, completeness, complexity, informativeness, consistency, currency and volatility. An additional set of first level measurements, and metrics based on them will be used, focusing in the health area. The definition of these metrics resulted from the analysis of other previously defined ones, which have already been applied to Wikipedia. This set of measures and metrics was then evaluated based on a proposed dataset, consisting of health and medical, english, articles, previously evaluated by the WikiProject Medicine. In the last stage, with the objective of evaluating differences in the quality of the information, the proposed methodology was applied to other languages. Specifically, it was applied, when possible, to articles available in languages with over than one hundred million native speakers, and also in Greek, Italian, Korean, Turkish, Perse and Hebrew, for its historical tradition. As a result, this work contributes to the clarification of the role of Wikipedia in the access to health, specifically the access to health information in different languages. The proposal consists of a set of measurements and metrics to infer the quality of Wikipedia health related articles, plus an analysis regarding the differences in the quality between different languages available in Wikipedia. |
id |
RCAP_3180fc52d5c0d0fbea507ab84d2bb692 |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/135954 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúdeEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringThe Wikipedia is an online, free, multi language and collaborative encyclopaedia, currently one of the biggest data sources in the web. There, it is possible to find information from different areas, from technology to philosophy, including health. As an health related data source, it is not only used by general public, but also by professionals as well. One of the reasons for such, is that, apart from the content of the articles, it includes external links for additional data sources as well. The open nature of Wikipedia contributions, specifically in health context, raises safety concerns, as such data is being used to make decisions. Thus, and considering possible consequences, it is very relevant to evaluate the quality of the information herein. This subject has been previously addressed by several studies, considering different metrics. In this work, a set of predefined metrics will be used to evaluate que quality of the information, such as autorithy, completeness, complexity, informativeness, consistency, currency and volatility. An additional set of first level measurements, and metrics based on them will be used, focusing in the health area. The definition of these metrics resulted from the analysis of other previously defined ones, which have already been applied to Wikipedia. This set of measures and metrics was then evaluated based on a proposed dataset, consisting of health and medical, english, articles, previously evaluated by the WikiProject Medicine. In the last stage, with the objective of evaluating differences in the quality of the information, the proposed methodology was applied to other languages. Specifically, it was applied, when possible, to articles available in languages with over than one hundred million native speakers, and also in Greek, Italian, Korean, Turkish, Perse and Hebrew, for its historical tradition. As a result, this work contributes to the clarification of the role of Wikipedia in the access to health, specifically the access to health information in different languages. The proposal consists of a set of measurements and metrics to infer the quality of Wikipedia health related articles, plus an analysis regarding the differences in the quality between different languages available in Wikipedia.2021-07-202021-07-20T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/135954TID:202900908porLuís Pedro da Silva Coutoinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T13:15:12Zoai:repositorio-aberto.up.pt:10216/135954Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:36:41.972508Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde |
title |
Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde |
spellingShingle |
Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde Luís Pedro da Silva Couto Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
title_short |
Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde |
title_full |
Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde |
title_fullStr |
Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde |
title_full_unstemmed |
Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde |
title_sort |
Avaliação da qualidade da Wikipédia enquanto fonte de informação em saúde |
author |
Luís Pedro da Silva Couto |
author_facet |
Luís Pedro da Silva Couto |
author_role |
author |
dc.contributor.author.fl_str_mv |
Luís Pedro da Silva Couto |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
description |
The Wikipedia is an online, free, multi language and collaborative encyclopaedia, currently one of the biggest data sources in the web. There, it is possible to find information from different areas, from technology to philosophy, including health. As an health related data source, it is not only used by general public, but also by professionals as well. One of the reasons for such, is that, apart from the content of the articles, it includes external links for additional data sources as well. The open nature of Wikipedia contributions, specifically in health context, raises safety concerns, as such data is being used to make decisions. Thus, and considering possible consequences, it is very relevant to evaluate the quality of the information herein. This subject has been previously addressed by several studies, considering different metrics. In this work, a set of predefined metrics will be used to evaluate que quality of the information, such as autorithy, completeness, complexity, informativeness, consistency, currency and volatility. An additional set of first level measurements, and metrics based on them will be used, focusing in the health area. The definition of these metrics resulted from the analysis of other previously defined ones, which have already been applied to Wikipedia. This set of measures and metrics was then evaluated based on a proposed dataset, consisting of health and medical, english, articles, previously evaluated by the WikiProject Medicine. In the last stage, with the objective of evaluating differences in the quality of the information, the proposed methodology was applied to other languages. Specifically, it was applied, when possible, to articles available in languages with over than one hundred million native speakers, and also in Greek, Italian, Korean, Turkish, Perse and Hebrew, for its historical tradition. As a result, this work contributes to the clarification of the role of Wikipedia in the access to health, specifically the access to health information in different languages. The proposal consists of a set of measurements and metrics to infer the quality of Wikipedia health related articles, plus an analysis regarding the differences in the quality between different languages available in Wikipedia. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021-07-20 2021-07-20T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/135954 TID:202900908 |
url |
https://hdl.handle.net/10216/135954 |
identifier_str_mv |
TID:202900908 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799135679486099457 |