Uncovering pseudogenes and intergenic protein-coding sequences in TriTryps’ genomes

Bibliographic Details
Main Author: Abrahim, Mayla
Publication Date: 2022
Other Authors: Machado, Edson; Alvarez-Valín, Fernando; Miranda, Antonio Basílio de; Catanho, Marcos
Format: Article
Language: eng
Source: Repositório Institucional da FIOCRUZ (ARCA)
DOI: 10.1093/gbe/evac142
Download full: https://arca.fiocruz.br/handle/icict/56027
Author affiliation: Fundação Oswaldo Cruz. Vice-Diretoria de Desenvolvimento Tecnológico, Bio-Manguinhos. Instituto de Tecnologia em Imunobiológicos. Laboratório de Tecnologia Imunológica. Rio de Janeiro, RJ, Brasil.
id CRUZ_0e37b601e9e79e0097eb67bd2f64c8c9
oai_identifier_str oai:arca.fiocruz.br:icict/56027
network_acronym_str CRUZ
network_name_str Repositório Institucional da FIOCRUZ (ARCA)
repository_id_str 2135
affiliations Fundação Oswaldo Cruz. Vice-Diretoria de Desenvolvimento Tecnológico, Bio-Manguinhos. Instituto de Tecnologia em Imunobiológicos. Laboratório de Tecnologia Imunológica. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Laboratório de Biologia Molecular Aplicada a Micobactérias. Rio de Janeiro, RJ, Brasil.
Universidad de la República del Uruguay. Unidad de Genómica Evolutiva, Sección Biomatemática. Montevideo, Uruguay.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Laboratório de Genética Molecular de Microrganismos. Rio de Janeiro, RJ, Brasil.
Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Laboratório de Genética Molecular de Microrganismos. Rio de Janeiro, RJ, Brasil.
abstract Trypanosomatids belong to a remarkable group of unicellular, parasitic organisms of the order Kinetoplastida, an early diverging branch of the phylogenetic tree of eukaryotes, exhibiting intriguing biological characteristics affecting gene expression (intronless polycistronic transcription, trans-splicing, and RNA editing), metabolism, surface molecules, and organelles (compartmentalization of glycolysis, variation of the surface molecules, and unique mitochondrial DNA), cell biology and life cycle (phagocytic vacuoles evasion and intricate patterns of cell morphogenesis).
With numerous genomic-scale data of several trypanosomatids becoming available since 2005 (genomes, transcriptomes, and proteomes), the scientific community can further investigate the mechanisms underlying these unusual features and address other unexplored phenomena possibly revealing biological aspects of the early evolution of eukaryotes. One fundamental aspect comprises the processes and mechanisms involved in the acquisition and loss of genes throughout the evolutionary history of these primitive microorganisms. Here, we present a comprehensive in silico analysis of pseudogenes in three major representatives of this group: Leishmania major, Trypanosoma brucei, and Trypanosoma cruzi. Pseudogenes, DNA segments originating from altered genes that lost their original function, are genomic relics that can offer an essential record of the evolutionary history of functional genes, as well as clues about the dynamics and evolution of hosting genomes. Scanning these genomes with functional proteins as proxies to reveal intergenic regions with protein-coding features, relying on a customized threshold to distinguish statistically and biologically significant sequence similarities, and reassembling remnant sequences from their debris, we found thousands of pseudogenes and hundreds of open reading frames, with particular characteristics in each trypanosomatid: mutation profile, number, content, density, codon bias, average size, single- or multi-copy gene origin, number and type of mutations, putative primitive function, and transcriptional activity.
These features suggest a common process of pseudogene formation, different patterns of pseudogene evolution and extant biological functions, and/or distinct genome organization undertaken by those parasites during evolution, as well as different evolutionary and/or selective pressures acting on distinct lineages.
dc.title.none.fl_str_mv Uncovering pseudogenes and intergenic protein-coding sequences in TriTryps’ genomes
title Uncovering pseudogenes and intergenic protein-coding sequences in TriTryps’ genomes
author Abrahim, Mayla
author_facet Abrahim, Mayla
Machado, Edson
Alvarez-Valín, Fernando
Miranda, Antonio Basílio de
Catanho, Marcos
author_role author
author2 Machado, Edson
Alvarez-Valín, Fernando
Miranda, Antonio Basílio de
Catanho, Marcos
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Abrahim, Mayla
Machado, Edson
Alvarez-Valín, Fernando
Miranda, Antonio Basílio de
Catanho, Marcos
dc.subject.other.none.fl_str_mv Genômica
Tripanosomatídeos
Pesquisa de homologia
Alinhamento de sequência
topic Genômica
Tripanosomatídeos
Pesquisa de homologia
Alinhamento de sequência
Genomics
Trypanosomatids
Homology search
Sequence alignment
dc.subject.en.none.fl_str_mv Genomics
Trypanosomatids
Homology search
Sequence alignment
description Fundação Oswaldo Cruz. Vice-Diretoria de Desenvolvimento Tecnológico, Bio-Manguinhos. Instituto de Tecnologia em Imunobiológicos. Laboratório de Tecnologia Imunológica. Rio de Janeiro, RJ, Brasil.
publishDate 2022
dc.date.accessioned.fl_str_mv 2022-12-17T19:33:28Z
dc.date.available.fl_str_mv 2022-12-17T19:33:28Z
dc.date.issued.fl_str_mv 2022
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.citation.fl_str_mv ABRAHIM, Mayla et al. Uncovering Pseudogenes and Intergenic Protein-coding Sequences in TriTryps’ Genomes. Genome Biol. Evol., v. 14, n. 10, p. 1-14, Oct. 2022.
dc.identifier.uri.fl_str_mv https://arca.fiocruz.br/handle/icict/56027
dc.identifier.issn.none.fl_str_mv 2482-500X
dc.identifier.doi.none.fl_str_mv 10.1093/gbe/evac142
identifier_str_mv ABRAHIM, Mayla et al. Uncovering Pseudogenes and Intergenic Protein-coding Sequences in TriTryps’ Genomes. Genome Biol. Evol., v. 14, n. 10, p. 1-14, Oct. 2022.
2482-500X
10.1093/gbe/evac142
url https://arca.fiocruz.br/handle/icict/56027
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Oxford University Press
publisher.none.fl_str_mv Oxford University Press
dc.source.none.fl_str_mv reponame:Repositório Institucional da FIOCRUZ (ARCA)
instname:Fundação Oswaldo Cruz (FIOCRUZ)
instacron:FIOCRUZ
instname_str Fundação Oswaldo Cruz (FIOCRUZ)
instacron_str FIOCRUZ
institution FIOCRUZ
reponame_str Repositório Institucional da FIOCRUZ (ARCA)
collection Repositório Institucional da FIOCRUZ (ARCA)
bitstream.url.fl_str_mv https://arca.fiocruz.br/bitstreams/1cda0da9-b777-44e2-b9b8-7f3132d43add/download
https://arca.fiocruz.br/bitstreams/bc7e92e5-8726-4d03-8486-861d32806cad/download
https://arca.fiocruz.br/bitstreams/371606bb-0e5d-4c18-aeab-cc1f657f7d24/download
https://arca.fiocruz.br/bitstreams/528f8f83-1049-4702-8268-b7c6325cce33/download
bitstream.checksum.fl_str_mv 5a560609d32a3863062d77ff32785d58
da4ec2f605ff2b7931d0b1980a90f130
47ef8af7350a7183a194fb19ee2c8602
9f27dc43ac8ce09b6ff370bb2c689079
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da FIOCRUZ (ARCA) - Fundação Oswaldo Cruz (FIOCRUZ)
repository.mail.fl_str_mv repositorio.arca@fiocruz.br
_version_ 1839715868723904512