Deepmol: an automated machine and deep learning framework for computational chemistry

Detalhes bibliográficos
Autor(a) principal: Correia, João
Data de Publicação: 2024
Outros Autores: Capela, João, Rocha, Miguel
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Texto Completo: https://hdl.handle.net/1822/94167
Resumo: The domain of computational chemistry has experienced a significant evolution due to the introduction of Machine Learning (ML) technologies. Despite its potential to revolutionize the field, researchers are often encumbered by obstacles, such as the complexity of selecting optimal algorithms, the automation of data pre-processing steps, the necessity for adaptive feature engineering, and the assurance of model performance consistency across different datasets. Addressing these issues head-on, DeepMol stands out as an Automated ML (AutoML) tool by automating critical steps of the ML pipeline. DeepMol rapidly and automatically identifies the most effective data representation, pre-processing methods and model configurations for a specific molecular property/activity prediction problem. On 22 benchmark datasets, DeepMol obtained competitive pipelines compared with those requiring time-consuming feature engineering, model design and selection processes. As one of the first AutoML tools specifically developed for the computational chemistry domain, DeepMol stands out with its open-source code, in-depth tutorials, detailed documentation, and examples of real-world applications, all available at https://github.com/BioSystemsUM/DeepMoland https://deepmol.readthedocs.io/en/latest/. By introducing AutoML as a groundbreaking feature in computational chemistry, DeepMol establishes itself as the pioneering state-of-the-art tool in the field.
id RCAP_ef7278e7ef9d54d2c39c519455fe003c
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/94167
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Deepmol: an automated machine and deep learning framework for computational chemistryAutoMLCheminformaticsQSARDeep learningThe domain of computational chemistry has experienced a significant evolution due to the introduction of Machine Learning (ML) technologies. Despite its potential to revolutionize the field, researchers are often encumbered by obstacles, such as the complexity of selecting optimal algorithms, the automation of data pre-processing steps, the necessity for adaptive feature engineering, and the assurance of model performance consistency across different datasets. Addressing these issues head-on, DeepMol stands out as an Automated ML (AutoML) tool by automating critical steps of the ML pipeline. DeepMol rapidly and automatically identifies the most effective data representation, pre-processing methods and model configurations for a specific molecular property/activity prediction problem. On 22 benchmark datasets, DeepMol obtained competitive pipelines compared with those requiring time-consuming feature engineering, model design and selection processes. As one of the first AutoML tools specifically developed for the computational chemistry domain, DeepMol stands out with its open-source code, in-depth tutorials, detailed documentation, and examples of real-world applications, all available at https://github.com/BioSystemsUM/DeepMoland https://deepmol.readthedocs.io/en/latest/. By introducing AutoML as a groundbreaking feature in computational chemistry, DeepMol establishes itself as the pioneering state-of-the-art tool in the field.FCT -Fundação para a Ciência e a Tecnologia(LA/P/0029/2020)info:eu-repo/semantics/publishedVersionBioMed Central (BMC)Universidade do MinhoCorreia, JoãoCapela, JoãoRocha, Miguel2024-12-052024-12-05T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttps://hdl.handle.net/1822/94167engCorreia, João; Capela, João; Rocha, Miguel, Deepmol: an automated machine and deep learning framework for computational chemistry. Journal of Cheminformatics, 16(136), 20241758-294610.1186/s13321-024-00937-7https://jcheminf.biomedcentral.com/info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-29T01:49:48Zoai:repositorium.sdum.uminho.pt:1822/94167Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T19:38:20.303833Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Deepmol: an automated machine and deep learning framework for computational chemistry
title Deepmol: an automated machine and deep learning framework for computational chemistry
spellingShingle Deepmol: an automated machine and deep learning framework for computational chemistry
Correia, João
AutoML
Cheminformatics
QSAR
Deep learning
title_short Deepmol: an automated machine and deep learning framework for computational chemistry
title_full Deepmol: an automated machine and deep learning framework for computational chemistry
title_fullStr Deepmol: an automated machine and deep learning framework for computational chemistry
title_full_unstemmed Deepmol: an automated machine and deep learning framework for computational chemistry
title_sort Deepmol: an automated machine and deep learning framework for computational chemistry
author Correia, João
author_facet Correia, João
Capela, João
Rocha, Miguel
author_role author
author2 Capela, João
Rocha, Miguel
author2_role author
author
dc.contributor.none.fl_str_mv Universidade do Minho
dc.contributor.author.fl_str_mv Correia, João
Capela, João
Rocha, Miguel
dc.subject.por.fl_str_mv AutoML
Cheminformatics
QSAR
Deep learning
topic AutoML
Cheminformatics
QSAR
Deep learning
description The domain of computational chemistry has experienced a significant evolution due to the introduction of Machine Learning (ML) technologies. Despite its potential to revolutionize the field, researchers are often encumbered by obstacles, such as the complexity of selecting optimal algorithms, the automation of data pre-processing steps, the necessity for adaptive feature engineering, and the assurance of model performance consistency across different datasets. Addressing these issues head-on, DeepMol stands out as an Automated ML (AutoML) tool by automating critical steps of the ML pipeline. DeepMol rapidly and automatically identifies the most effective data representation, pre-processing methods and model configurations for a specific molecular property/activity prediction problem. On 22 benchmark datasets, DeepMol obtained competitive pipelines compared with those requiring time-consuming feature engineering, model design and selection processes. As one of the first AutoML tools specifically developed for the computational chemistry domain, DeepMol stands out with its open-source code, in-depth tutorials, detailed documentation, and examples of real-world applications, all available at https://github.com/BioSystemsUM/DeepMoland https://deepmol.readthedocs.io/en/latest/. By introducing AutoML as a groundbreaking feature in computational chemistry, DeepMol establishes itself as the pioneering state-of-the-art tool in the field.
publishDate 2024
dc.date.none.fl_str_mv 2024-12-05
2024-12-05T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/1822/94167
url https://hdl.handle.net/1822/94167
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Correia, João; Capela, João; Rocha, Miguel, Deepmol: an automated machine and deep learning framework for computational chemistry. Journal of Cheminformatics, 16(136), 2024
1758-2946
10.1186/s13321-024-00937-7
https://jcheminf.biomedcentral.com/
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv BioMed Central (BMC)
publisher.none.fl_str_mv BioMed Central (BMC)
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833598226820759552