Data Mining para análise dos resultados de Gene Expression

Detalhes bibliográficos
Autor(a) principal: Luís Miguel Barroso Natividade
Data de Publicação: 2017
Tipo de documento: Dissertação
Idioma: por
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: https://repositorio-aberto.up.pt/handle/10216/106389
Resumo: We currently live in an age when technology is involved in all areas and technological evolution has direct consequences in the study of different scientific areas.In the area of biology, genome sequencing has undergone tremendous advances in recent years. It has become more accurate, faster and less costly financially.These developments lead to increased use of this technology in carrying out deeper and more complex studies in genomics, in particular in research studies on the genomic origin of different types of cancer.One of the characteristics of these new sequencing technology is that it requires considerable computational resources and generates an enormous amount of data, which makes it impossible to manually analyze these data to obtain conclusions from the experts.Derived from the enormous amount of data generated and the amount of information available on the Internet these days, there are already several databases accessible on the WEB with this type of information. Although it is quite positive that there is a lot of information on different websites, it is arduous and complex to find all the necessary information about a gene. In addition, it gets more difficult because often each database has its own identifier for each gene.The final objective of this dissertation is the elaboration of a platform for the use of biological research specialists, which will facilitate their work, thus allowing the development of progress in the investigation of various diseases of genomic origin, such as cancers or tumors.In order to acomplish this we have developed a WEB Platform that allows the use of different data mining techniques, classification and clustering techniques in order to allow the experts to draw conclusions in the analysis of results of the genetic expression. In addition, and in order to simplify the work of the specialists, the platform also allows the collection of gene information from different databases, being possible to extract this information for several file formats, for later use. Targeting a wide range of users the platform has a simple and intuitive interface, allowing it to be usable by users without great experience in computing.The evaluation of the platform was done through an objective evaluation, own of the tools of data mining, and subjective, resorting to specialists of I3S.
id RCAP_ecb3836f1ebc96b63712068c2c03cf41
oai_identifier_str oai:repositorio-aberto.up.pt:10216/106389
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Data Mining para análise dos resultados de Gene ExpressionEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringWe currently live in an age when technology is involved in all areas and technological evolution has direct consequences in the study of different scientific areas.In the area of biology, genome sequencing has undergone tremendous advances in recent years. It has become more accurate, faster and less costly financially.These developments lead to increased use of this technology in carrying out deeper and more complex studies in genomics, in particular in research studies on the genomic origin of different types of cancer.One of the characteristics of these new sequencing technology is that it requires considerable computational resources and generates an enormous amount of data, which makes it impossible to manually analyze these data to obtain conclusions from the experts.Derived from the enormous amount of data generated and the amount of information available on the Internet these days, there are already several databases accessible on the WEB with this type of information. Although it is quite positive that there is a lot of information on different websites, it is arduous and complex to find all the necessary information about a gene. In addition, it gets more difficult because often each database has its own identifier for each gene.The final objective of this dissertation is the elaboration of a platform for the use of biological research specialists, which will facilitate their work, thus allowing the development of progress in the investigation of various diseases of genomic origin, such as cancers or tumors.In order to acomplish this we have developed a WEB Platform that allows the use of different data mining techniques, classification and clustering techniques in order to allow the experts to draw conclusions in the analysis of results of the genetic expression. In addition, and in order to simplify the work of the specialists, the platform also allows the collection of gene information from different databases, being possible to extract this information for several file formats, for later use. Targeting a wide range of users the platform has a simple and intuitive interface, allowing it to be usable by users without great experience in computing.The evaluation of the platform was done through an objective evaluation, own of the tools of data mining, and subjective, resorting to specialists of I3S.2017-07-142017-07-14T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://repositorio-aberto.up.pt/handle/10216/106389TID:201804727porLuís Miguel Barroso Natividadeinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T12:28:34Zoai:repositorio-aberto.up.pt:10216/106389Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:20:59.796721Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Data Mining para análise dos resultados de Gene Expression
title Data Mining para análise dos resultados de Gene Expression
spellingShingle Data Mining para análise dos resultados de Gene Expression
Luís Miguel Barroso Natividade
Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
title_short Data Mining para análise dos resultados de Gene Expression
title_full Data Mining para análise dos resultados de Gene Expression
title_fullStr Data Mining para análise dos resultados de Gene Expression
title_full_unstemmed Data Mining para análise dos resultados de Gene Expression
title_sort Data Mining para análise dos resultados de Gene Expression
author Luís Miguel Barroso Natividade
author_facet Luís Miguel Barroso Natividade
author_role author
dc.contributor.author.fl_str_mv Luís Miguel Barroso Natividade
dc.subject.por.fl_str_mv Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
topic Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
description We currently live in an age when technology is involved in all areas and technological evolution has direct consequences in the study of different scientific areas.In the area of biology, genome sequencing has undergone tremendous advances in recent years. It has become more accurate, faster and less costly financially.These developments lead to increased use of this technology in carrying out deeper and more complex studies in genomics, in particular in research studies on the genomic origin of different types of cancer.One of the characteristics of these new sequencing technology is that it requires considerable computational resources and generates an enormous amount of data, which makes it impossible to manually analyze these data to obtain conclusions from the experts.Derived from the enormous amount of data generated and the amount of information available on the Internet these days, there are already several databases accessible on the WEB with this type of information. Although it is quite positive that there is a lot of information on different websites, it is arduous and complex to find all the necessary information about a gene. In addition, it gets more difficult because often each database has its own identifier for each gene.The final objective of this dissertation is the elaboration of a platform for the use of biological research specialists, which will facilitate their work, thus allowing the development of progress in the investigation of various diseases of genomic origin, such as cancers or tumors.In order to acomplish this we have developed a WEB Platform that allows the use of different data mining techniques, classification and clustering techniques in order to allow the experts to draw conclusions in the analysis of results of the genetic expression. In addition, and in order to simplify the work of the specialists, the platform also allows the collection of gene information from different databases, being possible to extract this information for several file formats, for later use. Targeting a wide range of users the platform has a simple and intuitive interface, allowing it to be usable by users without great experience in computing.The evaluation of the platform was done through an objective evaluation, own of the tools of data mining, and subjective, resorting to specialists of I3S.
publishDate 2017
dc.date.none.fl_str_mv 2017-07-14
2017-07-14T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://repositorio-aberto.up.pt/handle/10216/106389
TID:201804727
url https://repositorio-aberto.up.pt/handle/10216/106389
identifier_str_mv TID:201804727
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799135509255028737