An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targets

Detalhes bibliográficos
Autor(a) principal: Rocha, I.
Data de Publicação: 2006
Outros Autores: Rocha, Miguel, Ferreira, Eugénio C., Patil, Kiran Raosaheb, Nielsen, Jens
Idioma: eng
Título da fonte: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Texto Completo: https://hdl.handle.net/1822/5916
Resumo: In metabolic engineering problems, due to the complexity of metabolic networks, it is often difficult to identify a priori which genetic manipulations will originate a given desired phenotype. Genome-scale metabolic models, available for several microorganisms, can be used to simulate the metabolic phenotype and therefore help the tasks of metabolic engineering. This simulation can be performed by calculating the fluxes through all metabolic reactions using techniques like the Flux Balance Analysis (FBA) or the MOMA approaches, among others. Several algorithms have been developed that use genome-scale metabolic models to enable the identification of gene knockout strategies for obtaining improved phenotypes. However, the problem of finding optimal gene deletion strategy is combinatorial and consequently the computational time increases exponentially with the size of the problem. In a previous study we reported that evolutionary algorithms (EAs) enable solving large gene knockout problems in relatively short computational time. The proposed algorithm – OptGene - also allows the optimization of non-linear objective functions and additionally provides a family of close to optimal solutions. Given the promising results obtained, this algorithm was modified with two main objectives: improve the predictions obtained and increase the flexibility. For these purposes, a new program was built by the authors using the Java programming language. Regarding the optimization algorithms, in OptGene two distinct encoding schemes had been taken into account, binary and integer representations. The latter is more compact but potentially reduces the search space to a limited number of knockouts. In order to overcome this limitation, in this work a new feature was implemented by allowing the evolution of solutions with variable size. This allows maintaining the potential solutions with a relatively small number of genes while not defining a priori the exact number of knockouts. Furthermore, the EA’s performance was boosted by the introduction of local search operators that look for improved solutions in the neighbourhood of the individual under consideration. The quality of the solutions obtained by the EAs was compared to the ones obtained using a simpler algorithm, the well-known hill-climbing algorithm adapted for the present situation. The local search operator, in this case, considers all neighbours that imply the addition of a single knockout to the present solution and selects the best. The wild-type is considered as the starting solution and the local search operator is applied, until no improvement is possible. Finally, a graphical user interface was developed that allows an easy utilization of any genome-scale metabolic model in SBML or other format, the manual modification of flux bound values, the selection of the appropriate simulation technique (FBA or MOMA) and the corresponding flux to be optimized for FBA. Additionally, the program allows the utilization of any of the optimization algorithms described above and the selection of a suitable (linear or non-linear) objective function, like yield or biomass coupled yield. A tool for the visualization of the flux distribution in the metabolic network is being developed. This modified algorithm was validated using succinate production in both Escherichia coli and Saccharomyces cerevisiae as case studies. Potential metabolic engineering targets were identified and the results suggest that non-intuitive genetic modifications spanning several different pathways may be necessary for solving challenging metabolic engineering problems.
id RCAP_63c31adb0b8927bf1aa15687e5f67bce
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/5916
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targetsIn metabolic engineering problems, due to the complexity of metabolic networks, it is often difficult to identify a priori which genetic manipulations will originate a given desired phenotype. Genome-scale metabolic models, available for several microorganisms, can be used to simulate the metabolic phenotype and therefore help the tasks of metabolic engineering. This simulation can be performed by calculating the fluxes through all metabolic reactions using techniques like the Flux Balance Analysis (FBA) or the MOMA approaches, among others. Several algorithms have been developed that use genome-scale metabolic models to enable the identification of gene knockout strategies for obtaining improved phenotypes. However, the problem of finding optimal gene deletion strategy is combinatorial and consequently the computational time increases exponentially with the size of the problem. In a previous study we reported that evolutionary algorithms (EAs) enable solving large gene knockout problems in relatively short computational time. The proposed algorithm – OptGene - also allows the optimization of non-linear objective functions and additionally provides a family of close to optimal solutions. Given the promising results obtained, this algorithm was modified with two main objectives: improve the predictions obtained and increase the flexibility. For these purposes, a new program was built by the authors using the Java programming language. Regarding the optimization algorithms, in OptGene two distinct encoding schemes had been taken into account, binary and integer representations. The latter is more compact but potentially reduces the search space to a limited number of knockouts. In order to overcome this limitation, in this work a new feature was implemented by allowing the evolution of solutions with variable size. This allows maintaining the potential solutions with a relatively small number of genes while not defining a priori the exact number of knockouts. Furthermore, the EA’s performance was boosted by the introduction of local search operators that look for improved solutions in the neighbourhood of the individual under consideration. The quality of the solutions obtained by the EAs was compared to the ones obtained using a simpler algorithm, the well-known hill-climbing algorithm adapted for the present situation. The local search operator, in this case, considers all neighbours that imply the addition of a single knockout to the present solution and selects the best. The wild-type is considered as the starting solution and the local search operator is applied, until no improvement is possible. Finally, a graphical user interface was developed that allows an easy utilization of any genome-scale metabolic model in SBML or other format, the manual modification of flux bound values, the selection of the appropriate simulation technique (FBA or MOMA) and the corresponding flux to be optimized for FBA. Additionally, the program allows the utilization of any of the optimization algorithms described above and the selection of a suitable (linear or non-linear) objective function, like yield or biomass coupled yield. A tool for the visualization of the flux distribution in the metabolic network is being developed. This modified algorithm was validated using succinate production in both Escherichia coli and Saccharomyces cerevisiae as case studies. Potential metabolic engineering targets were identified and the results suggest that non-intuitive genetic modifications spanning several different pathways may be necessary for solving challenging metabolic engineering problems.Universidade do MinhoRocha, I.Rocha, MiguelFerreira, Eugénio C.Patil, Kiran RaosahebNielsen, Jens2006-10-012006-10-01T00:00:00Zconference objectinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://hdl.handle.net/1822/5916engMETABOLIC ENGINEERING, 6, Leeuwenhorst, Netherlands, 2006 - " Metabolic Engineering VI : from recDNA towards Engineering Biological Systems. [S.l. : s.n., 2006]. p.185.info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-05-11T07:26:10Zoai:repositorium.sdum.uminho.pt:1822/5916Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T16:27:09.872279Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targets
title An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targets
spellingShingle An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targets
Rocha, I.
title_short An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targets
title_full An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targets
title_fullStr An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targets
title_full_unstemmed An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targets
title_sort An improved evolutionary algorithm-based framework for identifying in silico metabolic engineering targets
author Rocha, I.
author_facet Rocha, I.
Rocha, Miguel
Ferreira, Eugénio C.
Patil, Kiran Raosaheb
Nielsen, Jens
author_role author
author2 Rocha, Miguel
Ferreira, Eugénio C.
Patil, Kiran Raosaheb
Nielsen, Jens
author2_role author
author
author
author
dc.contributor.none.fl_str_mv Universidade do Minho
dc.contributor.author.fl_str_mv Rocha, I.
Rocha, Miguel
Ferreira, Eugénio C.
Patil, Kiran Raosaheb
Nielsen, Jens
description In metabolic engineering problems, due to the complexity of metabolic networks, it is often difficult to identify a priori which genetic manipulations will originate a given desired phenotype. Genome-scale metabolic models, available for several microorganisms, can be used to simulate the metabolic phenotype and therefore help the tasks of metabolic engineering. This simulation can be performed by calculating the fluxes through all metabolic reactions using techniques like the Flux Balance Analysis (FBA) or the MOMA approaches, among others. Several algorithms have been developed that use genome-scale metabolic models to enable the identification of gene knockout strategies for obtaining improved phenotypes. However, the problem of finding optimal gene deletion strategy is combinatorial and consequently the computational time increases exponentially with the size of the problem. In a previous study we reported that evolutionary algorithms (EAs) enable solving large gene knockout problems in relatively short computational time. The proposed algorithm – OptGene - also allows the optimization of non-linear objective functions and additionally provides a family of close to optimal solutions. Given the promising results obtained, this algorithm was modified with two main objectives: improve the predictions obtained and increase the flexibility. For these purposes, a new program was built by the authors using the Java programming language. Regarding the optimization algorithms, in OptGene two distinct encoding schemes had been taken into account, binary and integer representations. The latter is more compact but potentially reduces the search space to a limited number of knockouts. In order to overcome this limitation, in this work a new feature was implemented by allowing the evolution of solutions with variable size. This allows maintaining the potential solutions with a relatively small number of genes while not defining a priori the exact number of knockouts. Furthermore, the EA’s performance was boosted by the introduction of local search operators that look for improved solutions in the neighbourhood of the individual under consideration. The quality of the solutions obtained by the EAs was compared to the ones obtained using a simpler algorithm, the well-known hill-climbing algorithm adapted for the present situation. The local search operator, in this case, considers all neighbours that imply the addition of a single knockout to the present solution and selects the best. The wild-type is considered as the starting solution and the local search operator is applied, until no improvement is possible. Finally, a graphical user interface was developed that allows an easy utilization of any genome-scale metabolic model in SBML or other format, the manual modification of flux bound values, the selection of the appropriate simulation technique (FBA or MOMA) and the corresponding flux to be optimized for FBA. Additionally, the program allows the utilization of any of the optimization algorithms described above and the selection of a suitable (linear or non-linear) objective function, like yield or biomass coupled yield. A tool for the visualization of the flux distribution in the metabolic network is being developed. This modified algorithm was validated using succinate production in both Escherichia coli and Saccharomyces cerevisiae as case studies. Potential metabolic engineering targets were identified and the results suggest that non-intuitive genetic modifications spanning several different pathways may be necessary for solving challenging metabolic engineering problems.
publishDate 2006
dc.date.none.fl_str_mv 2006-10-01
2006-10-01T00:00:00Z
dc.type.driver.fl_str_mv conference object
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/1822/5916
url https://hdl.handle.net/1822/5916
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv METABOLIC ENGINEERING, 6, Leeuwenhorst, Netherlands, 2006 - " Metabolic Engineering VI : from recDNA towards Engineering Biological Systems. [S.l. : s.n., 2006]. p.185.
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833595950226997248