Looking for exceptions on knowledge rules induced from HIV cleavage data set
Autor(a) principal: | |
---|---|
Data de Publicação: | 2004 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Genetics and Molecular Biology |
Texto Completo: | http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400027 |
Resumo: | The aim of data mining is to find useful knowledge inout of databases. In order to extract such knowledge, several methods can be used, among them machine learning (ML) algorithms. In this work we focus on ML algorithms that express the extracted knowledge in a symbolic form, such as rules. This representation may allow us to ''explain'' the data. Rule learning algorithms are mainly designed to induce classification rules that can predict new cases with high accuracy. However, these sorts of rules generally express common sense knowledge, resulting in many interesting and useful rules not being discovered. Furthermore, the domain independent biases, especially those related to the language used to express the induced knowledge, could induce rules that are difficult to understand. Exceptions might be used in order to overcome these drawbacks. Exceptions are defined as rules that contradict common believebeliefs. This kind of rules can play an important role in the process of understanding the underlying data as well as in making critical decisions. By contradicting the user's common beliefves, exceptions are bound to be interesting. This work proposes a method to find exceptions. In order to illustrate the potential of our approach, we apply the method in a real world data set to discover rules and exceptions in the HIV virus protein cleavage process. A good understanding of the process that generates this data plays an important role oin the research of cleavage inhibitors. We consider believe that the proposed approach may help the domain expert to further understand this process. |
id |
SBG-1_1a6a815bbb8d146360d653e2fae500d1 |
---|---|
oai_identifier_str |
oai:scielo:S1415-47572004000400027 |
network_acronym_str |
SBG-1 |
network_name_str |
Genetics and Molecular Biology |
repository_id_str |
|
spelling |
Looking for exceptions on knowledge rules induced from HIV cleavage data setdata miningexceptionsHIVproteaseThe aim of data mining is to find useful knowledge inout of databases. In order to extract such knowledge, several methods can be used, among them machine learning (ML) algorithms. In this work we focus on ML algorithms that express the extracted knowledge in a symbolic form, such as rules. This representation may allow us to ''explain'' the data. Rule learning algorithms are mainly designed to induce classification rules that can predict new cases with high accuracy. However, these sorts of rules generally express common sense knowledge, resulting in many interesting and useful rules not being discovered. Furthermore, the domain independent biases, especially those related to the language used to express the induced knowledge, could induce rules that are difficult to understand. Exceptions might be used in order to overcome these drawbacks. Exceptions are defined as rules that contradict common believebeliefs. This kind of rules can play an important role in the process of understanding the underlying data as well as in making critical decisions. By contradicting the user's common beliefves, exceptions are bound to be interesting. This work proposes a method to find exceptions. In order to illustrate the potential of our approach, we apply the method in a real world data set to discover rules and exceptions in the HIV virus protein cleavage process. A good understanding of the process that generates this data plays an important role oin the research of cleavage inhibitors. We consider believe that the proposed approach may help the domain expert to further understand this process.Sociedade Brasileira de Genética2004-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400027Genetics and Molecular Biology v.27 n.4 2004reponame:Genetics and Molecular Biologyinstname:Sociedade Brasileira de Genética (SBG)instacron:SBG10.1590/S1415-47572004000400027info:eu-repo/semantics/openAccessPrati,Ronaldo CristianoMonard,Maria CarolinaCarvalho,André C. P. L. F. deeng2005-01-14T00:00:00Zoai:scielo:S1415-47572004000400027Revistahttp://www.gmb.org.br/ONGhttps://old.scielo.br/oai/scielo-oai.php||editor@gmb.org.br1678-46851415-4757opendoar:2005-01-14T00:00Genetics and Molecular Biology - Sociedade Brasileira de Genética (SBG)false |
dc.title.none.fl_str_mv |
Looking for exceptions on knowledge rules induced from HIV cleavage data set |
title |
Looking for exceptions on knowledge rules induced from HIV cleavage data set |
spellingShingle |
Looking for exceptions on knowledge rules induced from HIV cleavage data set Prati,Ronaldo Cristiano data mining exceptions HIV protease |
title_short |
Looking for exceptions on knowledge rules induced from HIV cleavage data set |
title_full |
Looking for exceptions on knowledge rules induced from HIV cleavage data set |
title_fullStr |
Looking for exceptions on knowledge rules induced from HIV cleavage data set |
title_full_unstemmed |
Looking for exceptions on knowledge rules induced from HIV cleavage data set |
title_sort |
Looking for exceptions on knowledge rules induced from HIV cleavage data set |
author |
Prati,Ronaldo Cristiano |
author_facet |
Prati,Ronaldo Cristiano Monard,Maria Carolina Carvalho,André C. P. L. F. de |
author_role |
author |
author2 |
Monard,Maria Carolina Carvalho,André C. P. L. F. de |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Prati,Ronaldo Cristiano Monard,Maria Carolina Carvalho,André C. P. L. F. de |
dc.subject.por.fl_str_mv |
data mining exceptions HIV protease |
topic |
data mining exceptions HIV protease |
description |
The aim of data mining is to find useful knowledge inout of databases. In order to extract such knowledge, several methods can be used, among them machine learning (ML) algorithms. In this work we focus on ML algorithms that express the extracted knowledge in a symbolic form, such as rules. This representation may allow us to ''explain'' the data. Rule learning algorithms are mainly designed to induce classification rules that can predict new cases with high accuracy. However, these sorts of rules generally express common sense knowledge, resulting in many interesting and useful rules not being discovered. Furthermore, the domain independent biases, especially those related to the language used to express the induced knowledge, could induce rules that are difficult to understand. Exceptions might be used in order to overcome these drawbacks. Exceptions are defined as rules that contradict common believebeliefs. This kind of rules can play an important role in the process of understanding the underlying data as well as in making critical decisions. By contradicting the user's common beliefves, exceptions are bound to be interesting. This work proposes a method to find exceptions. In order to illustrate the potential of our approach, we apply the method in a real world data set to discover rules and exceptions in the HIV virus protein cleavage process. A good understanding of the process that generates this data plays an important role oin the research of cleavage inhibitors. We consider believe that the proposed approach may help the domain expert to further understand this process. |
publishDate |
2004 |
dc.date.none.fl_str_mv |
2004-01-01 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400027 |
url |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1415-47572004000400027 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1590/S1415-47572004000400027 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
text/html |
dc.publisher.none.fl_str_mv |
Sociedade Brasileira de Genética |
publisher.none.fl_str_mv |
Sociedade Brasileira de Genética |
dc.source.none.fl_str_mv |
Genetics and Molecular Biology v.27 n.4 2004 reponame:Genetics and Molecular Biology instname:Sociedade Brasileira de Genética (SBG) instacron:SBG |
instname_str |
Sociedade Brasileira de Genética (SBG) |
instacron_str |
SBG |
institution |
SBG |
reponame_str |
Genetics and Molecular Biology |
collection |
Genetics and Molecular Biology |
repository.name.fl_str_mv |
Genetics and Molecular Biology - Sociedade Brasileira de Genética (SBG) |
repository.mail.fl_str_mv |
||editor@gmb.org.br |
_version_ |
1752122379407982592 |