Diversity oriented Deep Reinforcement Learning for targeted molecule generation

Detalhes bibliográficos
Autor(a) principal: Pereira, Tiago
Data de Publicação: 2021
Outros Autores: Abbasi, Maryam, Ribeiro, Bernardete, Arrais, Joel P.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10316/104852
https://doi.org/10.1186/s13321-021-00498-z
Resumo: In this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES strings notation, and the Predictor which evaluates the newly generated compounds by predicting their affinity for the desired target. Then, the Generator is optimized through Reinforcement Learning to produce molecules with bespoken properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process that seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model that will remain fixed and a copy of the previous one that will be updated during the training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules, with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized coefficient of partition and also high inhibitory power against the Adenosine [Formula: see text] and [Formula: see text] opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.
id RCAP_02d36ecb3b23a6dccb90b91f025ad8cd
oai_identifier_str oai:estudogeral.uc.pt:10316/104852
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Diversity oriented Deep Reinforcement Learning for targeted molecule generationDrug DesignSMILESReinforcement LearninRNNIn this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES strings notation, and the Predictor which evaluates the newly generated compounds by predicting their affinity for the desired target. Then, the Generator is optimized through Reinforcement Learning to produce molecules with bespoken properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process that seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model that will remain fixed and a copy of the previous one that will be updated during the training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules, with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized coefficient of partition and also high inhibitory power against the Adenosine [Formula: see text] and [Formula: see text] opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.Chemistry Central2021-03-09info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10316/104852http://hdl.handle.net/10316/104852https://doi.org/10.1186/s13321-021-00498-zeng1758-2946Pereira, TiagoAbbasi, MaryamRibeiro, BernardeteArrais, Joel P.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-01-26T21:55:11Zoai:estudogeral.uc.pt:10316/104852Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:21:29.396636Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Diversity oriented Deep Reinforcement Learning for targeted molecule generation
title Diversity oriented Deep Reinforcement Learning for targeted molecule generation
spellingShingle Diversity oriented Deep Reinforcement Learning for targeted molecule generation
Pereira, Tiago
Drug Design
SMILES
Reinforcement Learnin
RNN
title_short Diversity oriented Deep Reinforcement Learning for targeted molecule generation
title_full Diversity oriented Deep Reinforcement Learning for targeted molecule generation
title_fullStr Diversity oriented Deep Reinforcement Learning for targeted molecule generation
title_full_unstemmed Diversity oriented Deep Reinforcement Learning for targeted molecule generation
title_sort Diversity oriented Deep Reinforcement Learning for targeted molecule generation
author Pereira, Tiago
author_facet Pereira, Tiago
Abbasi, Maryam
Ribeiro, Bernardete
Arrais, Joel P.
author_role author
author2 Abbasi, Maryam
Ribeiro, Bernardete
Arrais, Joel P.
author2_role author
author
author
dc.contributor.author.fl_str_mv Pereira, Tiago
Abbasi, Maryam
Ribeiro, Bernardete
Arrais, Joel P.
dc.subject.por.fl_str_mv Drug Design
SMILES
Reinforcement Learnin
RNN
topic Drug Design
SMILES
Reinforcement Learnin
RNN
description In this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES strings notation, and the Predictor which evaluates the newly generated compounds by predicting their affinity for the desired target. Then, the Generator is optimized through Reinforcement Learning to produce molecules with bespoken properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process that seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model that will remain fixed and a copy of the previous one that will be updated during the training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules, with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized coefficient of partition and also high inhibitory power against the Adenosine [Formula: see text] and [Formula: see text] opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.
publishDate 2021
dc.date.none.fl_str_mv 2021-03-09
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10316/104852
http://hdl.handle.net/10316/104852
https://doi.org/10.1186/s13321-021-00498-z
url http://hdl.handle.net/10316/104852
https://doi.org/10.1186/s13321-021-00498-z
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 1758-2946
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Chemistry Central
publisher.none.fl_str_mv Chemistry Central
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799134105187647488