Looking for exceptions on knowledge rules induced from HIV cleavage data set

Detalhes bibliográficos
Autor(a) principal: Prati,Ronaldo Cristiano
Data de Publicação: 2004
Outros Autores: Monard,Maria Carolina, Carvalho,André C. P. L. F. de
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Genetics and Molecular Biology
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400027
Resumo: The aim of data mining is to find useful knowledge inout of databases. In order to extract such knowledge, several methods can be used, among them machine learning (ML) algorithms. In this work we focus on ML algorithms that express the extracted knowledge in a symbolic form, such as rules. This representation may allow us to ''explain'' the data. Rule learning algorithms are mainly designed to induce classification rules that can predict new cases with high accuracy. However, these sorts of rules generally express common sense knowledge, resulting in many interesting and useful rules not being discovered. Furthermore, the domain independent biases, especially those related to the language used to express the induced knowledge, could induce rules that are difficult to understand. Exceptions might be used in order to overcome these drawbacks. Exceptions are defined as rules that contradict common believebeliefs. This kind of rules can play an important role in the process of understanding the underlying data as well as in making critical decisions. By contradicting the user's common beliefves, exceptions are bound to be interesting. This work proposes a method to find exceptions. In order to illustrate the potential of our approach, we apply the method in a real world data set to discover rules and exceptions in the HIV virus protein cleavage process. A good understanding of the process that generates this data plays an important role oin the research of cleavage inhibitors. We consider believe that the proposed approach may help the domain expert to further understand this process.
id SBG-1_1a6a815bbb8d146360d653e2fae500d1
oai_identifier_str oai:scielo:S1415-47572004000400027
network_acronym_str SBG-1
network_name_str Genetics and Molecular Biology
repository_id_str
spelling Looking for exceptions on knowledge rules induced from HIV cleavage data setdata miningexceptionsHIVproteaseThe aim of data mining is to find useful knowledge inout of databases. In order to extract such knowledge, several methods can be used, among them machine learning (ML) algorithms. In this work we focus on ML algorithms that express the extracted knowledge in a symbolic form, such as rules. This representation may allow us to ''explain'' the data. Rule learning algorithms are mainly designed to induce classification rules that can predict new cases with high accuracy. However, these sorts of rules generally express common sense knowledge, resulting in many interesting and useful rules not being discovered. Furthermore, the domain independent biases, especially those related to the language used to express the induced knowledge, could induce rules that are difficult to understand. Exceptions might be used in order to overcome these drawbacks. Exceptions are defined as rules that contradict common believebeliefs. This kind of rules can play an important role in the process of understanding the underlying data as well as in making critical decisions. By contradicting the user's common beliefves, exceptions are bound to be interesting. This work proposes a method to find exceptions. In order to illustrate the potential of our approach, we apply the method in a real world data set to discover rules and exceptions in the HIV virus protein cleavage process. A good understanding of the process that generates this data plays an important role oin the research of cleavage inhibitors. We consider believe that the proposed approach may help the domain expert to further understand this process.Sociedade Brasileira de Genética2004-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400027Genetics and Molecular Biology v.27 n.4 2004reponame:Genetics and Molecular Biologyinstname:Sociedade Brasileira de Genética (SBG)instacron:SBG10.1590/S1415-47572004000400027info:eu-repo/semantics/openAccessPrati,Ronaldo CristianoMonard,Maria CarolinaCarvalho,André C. P. L. F. deeng2005-01-14T00:00:00Zoai:scielo:S1415-47572004000400027Revistahttp://www.gmb.org.br/ONGhttps://old.scielo.br/oai/scielo-oai.php||editor@gmb.org.br1678-46851415-4757opendoar:2005-01-14T00:00Genetics and Molecular Biology - Sociedade Brasileira de Genética (SBG)false
dc.title.none.fl_str_mv Looking for exceptions on knowledge rules induced from HIV cleavage data set
title Looking for exceptions on knowledge rules induced from HIV cleavage data set
spellingShingle Looking for exceptions on knowledge rules induced from HIV cleavage data set
Prati,Ronaldo Cristiano
data mining
exceptions
HIV
protease
title_short Looking for exceptions on knowledge rules induced from HIV cleavage data set
title_full Looking for exceptions on knowledge rules induced from HIV cleavage data set
title_fullStr Looking for exceptions on knowledge rules induced from HIV cleavage data set
title_full_unstemmed Looking for exceptions on knowledge rules induced from HIV cleavage data set
title_sort Looking for exceptions on knowledge rules induced from HIV cleavage data set
author Prati,Ronaldo Cristiano
author_facet Prati,Ronaldo Cristiano
Monard,Maria Carolina
Carvalho,André C. P. L. F. de
author_role author
author2 Monard,Maria Carolina
Carvalho,André C. P. L. F. de
author2_role author
author
dc.contributor.author.fl_str_mv Prati,Ronaldo Cristiano
Monard,Maria Carolina
Carvalho,André C. P. L. F. de
dc.subject.por.fl_str_mv data mining
exceptions
HIV
protease
topic data mining
exceptions
HIV
protease
description The aim of data mining is to find useful knowledge inout of databases. In order to extract such knowledge, several methods can be used, among them machine learning (ML) algorithms. In this work we focus on ML algorithms that express the extracted knowledge in a symbolic form, such as rules. This representation may allow us to ''explain'' the data. Rule learning algorithms are mainly designed to induce classification rules that can predict new cases with high accuracy. However, these sorts of rules generally express common sense knowledge, resulting in many interesting and useful rules not being discovered. Furthermore, the domain independent biases, especially those related to the language used to express the induced knowledge, could induce rules that are difficult to understand. Exceptions might be used in order to overcome these drawbacks. Exceptions are defined as rules that contradict common believebeliefs. This kind of rules can play an important role in the process of understanding the underlying data as well as in making critical decisions. By contradicting the user's common beliefves, exceptions are bound to be interesting. This work proposes a method to find exceptions. In order to illustrate the potential of our approach, we apply the method in a real world data set to discover rules and exceptions in the HIV virus protein cleavage process. A good understanding of the process that generates this data plays an important role oin the research of cleavage inhibitors. We consider believe that the proposed approach may help the domain expert to further understand this process.
publishDate 2004
dc.date.none.fl_str_mv 2004-01-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400027
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400027
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/S1415-47572004000400027
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Sociedade Brasileira de Genética
publisher.none.fl_str_mv Sociedade Brasileira de Genética
dc.source.none.fl_str_mv Genetics and Molecular Biology v.27 n.4 2004
reponame:Genetics and Molecular Biology
instname:Sociedade Brasileira de Genética (SBG)
instacron:SBG
instname_str Sociedade Brasileira de Genética (SBG)
instacron_str SBG
institution SBG
reponame_str Genetics and Molecular Biology
collection Genetics and Molecular Biology
repository.name.fl_str_mv Genetics and Molecular Biology - Sociedade Brasileira de Genética (SBG)
repository.mail.fl_str_mv ||editor@gmb.org.br
_version_ 1752122379407982592