Employing compact intra-genomic language models to predict genomic sequences and characterize their entropy

Bibliographic details
Main author: Deusdado, Sérgio
Publication date: 2010
Other authors: Carvalho, Paulo
Language: eng
Source title: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Full text: http://hdl.handle.net/1822/17403
Summary: http://iwpacbb2010.di.uminho.pt/
id RCAP_925992b1fe4cc1e82f3a52a89c3e4ca4
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/17403
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Employing compact intra-genomic language models to predict genomic sequences and characterize their entropy
DNA entropy estimation
Genomic sequence modeling
Language models
genomic sequences modeling
Science & Technology
http://iwpacbb2010.di.uminho.pt/
Probabilistic models of languages are fundamental to understanding and learning the profile of the underlying code in order to estimate its entropy, enabling the verification and prediction of “natural” emanations of the language. Language models are devoted to capturing salient statistical characteristics of the distribution of sequences of words, which, transposed to the genomic language, allow modeling a predictive system of the peculiarities and regularities of genomic code in different inter- and intra-genomic conditions. In this paper, we propose the application of compact intra-genomic language models to predict the composition of genomic sequences, aiming to obtain valuable resources for data compression and to contribute to broadening the perspectives of similarity analysis in genomic sequences. The obtained results encourage further investigation and validate the use of language models in biological sequence analysis.
Springer
Universidade do Minho
Deusdado, Sérgio
Carvalho, Paulo
2010-06
2010-06-01T00:00:00Z
conference paper
info:eu-repo/semantics/publishedVersion
application/pdf
http://hdl.handle.net/1822/17403
eng
9783642132131
1867-5662
info:eu-repo/semantics/openAccess
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
2024-05-11T05:59:40Z
oai:repositorium.sdum.uminho.pt:1822/17403
Portal Agregador
ONG
https://www.rcaap.pt/oai/openaire
info@rcaap.pt
opendoar:https://opendoar.ac.uk/repository/7160
2025-05-28T15:37:10.315656
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
false
dc.title.none.fl_str_mv Employing compact intra-genomic language models to predict genomic sequences and characterize their entropy
title Employing compact intra-genomic language models to predict genomic sequences and characterize their entropy
spellingShingle Employing compact intra-genomic language models to predict genomic sequences and characterize their entropy
Deusdado, Sérgio
DNA entropy estimation
Genomic sequence modeling
Language models
genomic sequences modeling
Science & Technology
author Deusdado, Sérgio
author_facet Deusdado, Sérgio
Carvalho, Paulo
author_role author
author2 Carvalho, Paulo
author2_role author
dc.contributor.none.fl_str_mv Universidade do Minho
dc.contributor.author.fl_str_mv Deusdado, Sérgio
Carvalho, Paulo
dc.subject.por.fl_str_mv DNA entropy estimation
Genomic sequence modeling
Language models
genomic sequences modeling
Science & Technology
topic DNA entropy estimation
Genomic sequence modeling
Language models
genomic sequences modeling
Science & Technology
description http://iwpacbb2010.di.uminho.pt/
publishDate 2010
dc.date.none.fl_str_mv 2010-06
2010-06-01T00:00:00Z
dc.type.driver.fl_str_mv conference paper
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/1822/17403
url http://hdl.handle.net/1822/17403
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 9783642132131
1867-5662
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Springer
publisher.none.fl_str_mv Springer
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833595430973210624
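
Illustrative note: the abstract in this record describes using compact intra-genomic language models to predict genomic sequences and estimate their entropy. The record does not give the authors' actual model, so the following is only a minimal sketch of the general idea, assuming a fixed-order k-mer (Markov) model with additive smoothing. The names train_kmer_model and cross_entropy_bits, the order k, and the smoothing constant alpha are all hypothetical choices made for this example and are not taken from the paper.

```python
import math

ALPHABET = "ACGT"


def train_kmer_model(seq, k):
    """Count how often each base follows each length-k context in seq.

    Assumption: a plain order-k Markov model, not the paper's method.
    """
    counts = {}
    for i in range(k, len(seq)):
        context, base = seq[i - k:i], seq[i]
        if base not in ALPHABET:
            continue  # skip ambiguous symbols such as 'N'
        ctx_counts = counts.setdefault(context, dict.fromkeys(ALPHABET, 0))
        ctx_counts[base] += 1
    return counts


def cross_entropy_bits(counts, seq, k, alpha=1.0):
    """Average number of bits per base needed to predict seq with the model.

    Uses additive (Laplace) smoothing with pseudo-count alpha, so that
    unseen contexts fall back to a uniform distribution over ALPHABET.
    """
    total_bits = 0.0
    n_predicted = 0
    for i in range(k, len(seq)):
        context, base = seq[i - k:i], seq[i]
        if base not in ALPHABET:
            continue
        ctx_counts = counts.get(context, {})
        numerator = ctx_counts.get(base, 0) + alpha
        denominator = sum(ctx_counts.values()) + alpha * len(ALPHABET)
        total_bits -= math.log2(numerator / denominator)
        n_predicted += 1
    return total_bits / n_predicted if n_predicted else float("nan")


if __name__ == "__main__":
    # Toy "intra-genomic" setup: train on one part of a sequence and
    # measure how predictable another part of the same sequence is.
    genome = "ACGT" * 60
    train_part, test_part = genome[:200], genome[200:]
    model = train_kmer_model(train_part, k=2)
    print(cross_entropy_bits(model, test_part, k=2))
```

An untrained model yields 2 bits per base (the uniform bound for a four-letter alphabet); values below that indicate regularities the model has captured, which is the quantity a compressor could exploit.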