Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge base

Bibliographic Details
Main Author: Correia, Marta
Publication Date: 2022
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/10451/54984
Summary: FH, the most common monogenic dyslipidaemia, is characterised by increased circulating LDL-C levels leading to premature cardiovascular disease when undiagnosed or untreated. Current guidelines support genetic testing in patients fulfilling clinical diagnostic criteria and cascade screening of their family members. However, about half of clinical FH patients do not present pathogenic variants in the known disease genes (LDLR, APOB, PCSK9), and these most likely suffer from polygenic hypercholesterolaemia, which translates into a relatively low yield of genetic screening programs. This project aimed to identify new biomarkers able to improve the distinction between monogenic and polygenic profiles. Using a machine-learning approach in a paediatric dataset, tested for disease causative genes and investigated with an extended lipid profile, we developed new models that classify FH patients with higher specificity than currently used methods. The best performing models incorporated parameters absent from the common FH clinical criteria, which rely only on TC and LDL-C. A hierarchical clustering analysis of the same dataset showed that the study population can be clearly divided in three groups of dyslipidaemic individuals, showing the complexity of the dyslipidaemic biological context and the need of an integrative and multidisciplinary approach for biomarker selection. Both clustering and modelling analysis have revealed that the extended lipid profile contains important biomarkers. The exploration of lipid metabolic pathways associated with the identified biomarkers allowed us to identify a set of related genes. Using additional information from public databases, including gene expression data, associated GWAS and GO terms, we defined a universe of lipid-related genes and molecular interactions relevant for the dyslipidaemic context and future genetic studies. All this information was used to establish a new lipid knowledge base available online. The obtained results can be applied to improve the yield of genetic screening programs and decrease the associated costs, and also provide novel contributions to our understanding of dyslipidaemias.
id RCAP_cde10c026824cecb8551a3d94d2cdb89
oai_identifier_str oai:repositorio.ulisboa.pt:10451/54984
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge basehipercolesterolemia familiarbiomarcadoresperfil lipídico estendidométodos baseados em machine learningbase de dados de conhecimento lipídicofamilial hypercholesterolaemiabiomarkersextended lipid profilemachine-learning based methodslipid knowledge baseDomínio/Área Científica::Ciências Naturais::Ciências BiológicasFH, the most common monogenic dyslipidaemia, is characterised by increased circulating LDL-C levels leading to premature cardiovascular disease when undiagnosed or untreated. Current guidelines support genetic testing in patients fulfilling clinical diagnostic criteria and cascade screening of their family members. However, about half of clinical FH patients do not present pathogenic variants in the known disease genes (LDLR, APOB, PCSK9), and these most likely suffer from polygenic hypercholesterolaemia, which translates into a relatively low yield of genetic screening programs. This project aimed to identify new biomarkers able to improve the distinction between monogenic and polygenic profiles. Using a machine-learning approach in a paediatric dataset, tested for disease causative genes and investigated with an extended lipid profile, we developed new models that classify FH patients with higher specificity than currently used methods. The best performing models incorporated parameters absent from the common FH clinical criteria, which rely only on TC and LDL-C. A hierarchical clustering analysis of the same dataset showed that the study population can be clearly divided in three groups of dyslipidaemic individuals, showing the complexity of the dyslipidaemic biological context and the need of an integrative and multidisciplinary approach for biomarker selection. Both clustering and modelling analysis have revealed that the extended lipid profile contains important biomarkers. The exploration of lipid metabolic pathways associated with the identified biomarkers allowed us to identify a set of related genes. Using additional information from public databases, including gene expression data, associated GWAS and GO terms, we defined a universe of lipid-related genes and molecular interactions relevant for the dyslipidaemic context and future genetic studies. All this information was used to establish a new lipid knowledge base available online. The obtained results can be applied to improve the yield of genetic screening programs and decrease the associated costs, and also provide novel contributions to our understanding of dyslipidaemias.Carvalho, Margarida GamaBourbon, MafaldaRepositório da Universidade de LisboaCorreia, Marta2022-11-07T12:28:58Z2022-072022-012022-07-01T00:00:00Zdoctoral thesisinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10451/54984TID:101579420enginfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-17T14:50:43Zoai:repositorio.ulisboa.pt:10451/54984Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T03:26:34.923302Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge base
title Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge base
spellingShingle Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge base
Correia, Marta
hipercolesterolemia familiar
biomarcadores
perfil lipídico estendido
métodos baseados em machine learning
base de dados de conhecimento lipídico
familial hypercholesterolaemia
biomarkers
extended lipid profile
machine-learning based methods
lipid knowledge base
Domínio/Área Científica::Ciências Naturais::Ciências Biológicas
title_short Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge base
title_full Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge base
title_fullStr Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge base
title_full_unstemmed Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge base
title_sort Identification of novel biomarkers and candidate genes associated to lipid traits : improving the lipid metabolism knowledge base
author Correia, Marta
author_facet Correia, Marta
author_role author
dc.contributor.none.fl_str_mv Carvalho, Margarida Gama
Bourbon, Mafalda
Repositório da Universidade de Lisboa
dc.contributor.author.fl_str_mv Correia, Marta
dc.subject.por.fl_str_mv hipercolesterolemia familiar
biomarcadores
perfil lipídico estendido
métodos baseados em machine learning
base de dados de conhecimento lipídico
familial hypercholesterolaemia
biomarkers
extended lipid profile
machine-learning based methods
lipid knowledge base
Domínio/Área Científica::Ciências Naturais::Ciências Biológicas
topic hipercolesterolemia familiar
biomarcadores
perfil lipídico estendido
métodos baseados em machine learning
base de dados de conhecimento lipídico
familial hypercholesterolaemia
biomarkers
extended lipid profile
machine-learning based methods
lipid knowledge base
Domínio/Área Científica::Ciências Naturais::Ciências Biológicas
description FH, the most common monogenic dyslipidaemia, is characterised by increased circulating LDL-C levels leading to premature cardiovascular disease when undiagnosed or untreated. Current guidelines support genetic testing in patients fulfilling clinical diagnostic criteria and cascade screening of their family members. However, about half of clinical FH patients do not present pathogenic variants in the known disease genes (LDLR, APOB, PCSK9), and these most likely suffer from polygenic hypercholesterolaemia, which translates into a relatively low yield of genetic screening programs. This project aimed to identify new biomarkers able to improve the distinction between monogenic and polygenic profiles. Using a machine-learning approach in a paediatric dataset, tested for disease causative genes and investigated with an extended lipid profile, we developed new models that classify FH patients with higher specificity than currently used methods. The best performing models incorporated parameters absent from the common FH clinical criteria, which rely only on TC and LDL-C. A hierarchical clustering analysis of the same dataset showed that the study population can be clearly divided in three groups of dyslipidaemic individuals, showing the complexity of the dyslipidaemic biological context and the need of an integrative and multidisciplinary approach for biomarker selection. Both clustering and modelling analysis have revealed that the extended lipid profile contains important biomarkers. The exploration of lipid metabolic pathways associated with the identified biomarkers allowed us to identify a set of related genes. Using additional information from public databases, including gene expression data, associated GWAS and GO terms, we defined a universe of lipid-related genes and molecular interactions relevant for the dyslipidaemic context and future genetic studies. All this information was used to establish a new lipid knowledge base available online. The obtained results can be applied to improve the yield of genetic screening programs and decrease the associated costs, and also provide novel contributions to our understanding of dyslipidaemias.
publishDate 2022
dc.date.none.fl_str_mv 2022-11-07T12:28:58Z
2022-07
2022-01
2022-07-01T00:00:00Z
dc.type.driver.fl_str_mv doctoral thesis
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10451/54984
TID:101579420
url http://hdl.handle.net/10451/54984
identifier_str_mv TID:101579420
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833601699897409536