Data Mining para análise dos resultados de Gene Expression
Autor(a) principal: | |
---|---|
Data de Publicação: | 2017 |
Tipo de documento: | Dissertação |
Idioma: | por |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://repositorio-aberto.up.pt/handle/10216/106389 |
Resumo: | We currently live in an age when technology is involved in all areas and technological evolution has direct consequences in the study of different scientific areas.In the area of biology, genome sequencing has undergone tremendous advances in recent years. It has become more accurate, faster and less costly financially.These developments lead to increased use of this technology in carrying out deeper and more complex studies in genomics, in particular in research studies on the genomic origin of different types of cancer.One of the characteristics of these new sequencing technology is that it requires considerable computational resources and generates an enormous amount of data, which makes it impossible to manually analyze these data to obtain conclusions from the experts.Derived from the enormous amount of data generated and the amount of information available on the Internet these days, there are already several databases accessible on the WEB with this type of information. Although it is quite positive that there is a lot of information on different websites, it is arduous and complex to find all the necessary information about a gene. In addition, it gets more difficult because often each database has its own identifier for each gene.The final objective of this dissertation is the elaboration of a platform for the use of biological research specialists, which will facilitate their work, thus allowing the development of progress in the investigation of various diseases of genomic origin, such as cancers or tumors.In order to acomplish this we have developed a WEB Platform that allows the use of different data mining techniques, classification and clustering techniques in order to allow the experts to draw conclusions in the analysis of results of the genetic expression. In addition, and in order to simplify the work of the specialists, the platform also allows the collection of gene information from different databases, being possible to extract this information for several file formats, for later use. Targeting a wide range of users the platform has a simple and intuitive interface, allowing it to be usable by users without great experience in computing.The evaluation of the platform was done through an objective evaluation, own of the tools of data mining, and subjective, resorting to specialists of I3S. |
id |
RCAP_ecb3836f1ebc96b63712068c2c03cf41 |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/106389 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Data Mining para análise dos resultados de Gene ExpressionEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringWe currently live in an age when technology is involved in all areas and technological evolution has direct consequences in the study of different scientific areas.In the area of biology, genome sequencing has undergone tremendous advances in recent years. It has become more accurate, faster and less costly financially.These developments lead to increased use of this technology in carrying out deeper and more complex studies in genomics, in particular in research studies on the genomic origin of different types of cancer.One of the characteristics of these new sequencing technology is that it requires considerable computational resources and generates an enormous amount of data, which makes it impossible to manually analyze these data to obtain conclusions from the experts.Derived from the enormous amount of data generated and the amount of information available on the Internet these days, there are already several databases accessible on the WEB with this type of information. Although it is quite positive that there is a lot of information on different websites, it is arduous and complex to find all the necessary information about a gene. In addition, it gets more difficult because often each database has its own identifier for each gene.The final objective of this dissertation is the elaboration of a platform for the use of biological research specialists, which will facilitate their work, thus allowing the development of progress in the investigation of various diseases of genomic origin, such as cancers or tumors.In order to acomplish this we have developed a WEB Platform that allows the use of different data mining techniques, classification and clustering techniques in order to allow the experts to draw conclusions in the analysis of results of the genetic expression. In addition, and in order to simplify the work of the specialists, the platform also allows the collection of gene information from different databases, being possible to extract this information for several file formats, for later use. Targeting a wide range of users the platform has a simple and intuitive interface, allowing it to be usable by users without great experience in computing.The evaluation of the platform was done through an objective evaluation, own of the tools of data mining, and subjective, resorting to specialists of I3S.2017-07-142017-07-14T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://repositorio-aberto.up.pt/handle/10216/106389TID:201804727porLuís Miguel Barroso Natividadeinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T12:28:34Zoai:repositorio-aberto.up.pt:10216/106389Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:20:59.796721Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Data Mining para análise dos resultados de Gene Expression |
title |
Data Mining para análise dos resultados de Gene Expression |
spellingShingle |
Data Mining para análise dos resultados de Gene Expression Luís Miguel Barroso Natividade Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
title_short |
Data Mining para análise dos resultados de Gene Expression |
title_full |
Data Mining para análise dos resultados de Gene Expression |
title_fullStr |
Data Mining para análise dos resultados de Gene Expression |
title_full_unstemmed |
Data Mining para análise dos resultados de Gene Expression |
title_sort |
Data Mining para análise dos resultados de Gene Expression |
author |
Luís Miguel Barroso Natividade |
author_facet |
Luís Miguel Barroso Natividade |
author_role |
author |
dc.contributor.author.fl_str_mv |
Luís Miguel Barroso Natividade |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
description |
We currently live in an age when technology is involved in all areas and technological evolution has direct consequences in the study of different scientific areas.In the area of biology, genome sequencing has undergone tremendous advances in recent years. It has become more accurate, faster and less costly financially.These developments lead to increased use of this technology in carrying out deeper and more complex studies in genomics, in particular in research studies on the genomic origin of different types of cancer.One of the characteristics of these new sequencing technology is that it requires considerable computational resources and generates an enormous amount of data, which makes it impossible to manually analyze these data to obtain conclusions from the experts.Derived from the enormous amount of data generated and the amount of information available on the Internet these days, there are already several databases accessible on the WEB with this type of information. Although it is quite positive that there is a lot of information on different websites, it is arduous and complex to find all the necessary information about a gene. In addition, it gets more difficult because often each database has its own identifier for each gene.The final objective of this dissertation is the elaboration of a platform for the use of biological research specialists, which will facilitate their work, thus allowing the development of progress in the investigation of various diseases of genomic origin, such as cancers or tumors.In order to acomplish this we have developed a WEB Platform that allows the use of different data mining techniques, classification and clustering techniques in order to allow the experts to draw conclusions in the analysis of results of the genetic expression. In addition, and in order to simplify the work of the specialists, the platform also allows the collection of gene information from different databases, being possible to extract this information for several file formats, for later use. Targeting a wide range of users the platform has a simple and intuitive interface, allowing it to be usable by users without great experience in computing.The evaluation of the platform was done through an objective evaluation, own of the tools of data mining, and subjective, resorting to specialists of I3S. |
publishDate |
2017 |
dc.date.none.fl_str_mv |
2017-07-14 2017-07-14T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://repositorio-aberto.up.pt/handle/10216/106389 TID:201804727 |
url |
https://repositorio-aberto.up.pt/handle/10216/106389 |
identifier_str_mv |
TID:201804727 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799135509255028737 |