Montagem e anotação funcional de sequências Gênicas de handroanthus impetiginosus (mart. ex DC) Mattos

Detalhes bibliográficos
Ano de defesa: 2015
Autor(a) principal: Fernandes, Lanusse Andrade lattes
Orientador(a): Novaes, Evandro lattes
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Goiás
Programa de Pós-Graduação: Programa de Pós-graduação em Genetica e Biologia Molecular
Departamento: Instituto de Ciências Biológicas - ICB (RG)
País: Brasil
Palavras-chave em Português:
Palavras-chave em Inglês:
Área do conhecimento CNPq:
Link de acesso: http://repositorio.bc.ufg.br/tede/handle/tede/5519
Resumo: Handroanthus impetiginosus, known as ipê roxo, is a species present in several Brazilian biomes, with ecological and economic importance. The species has suffered strong exploration thanks to the quality of its wood. Despite of its ecological importance, economic value and medicinal potential, there is virtually no genomic study published with this species. This work aimed to take advantage of the large sequencing capacity of next generation platforms to generate, assemble and functional annotate the sequences of the entire transcriptome of H. impetiginosus. Seeds were collected from two trees on the Campus II of the Universidade Federal de Goiás. The seeds were germinated, their aerial tissues collected and the RNA extracted and sent to the University of Illinois to be sequenced on the HiSeq 2500 platform (Illumina). A total of 148,359,530 sequencing reads were generated. The adapters and low-quality sequences were removed. Contaminating sequences were detected and eliminated, and the remaining sequences were normalized. Sequences were then assembled, annotated and polymorphisms were detected. A total of 190,885 scaffolds were assembled from 23,002 genes. Mimulus guttatus was the species with the most Blast best hit against the sequences of H. impetiginosus. Of all the Gene Ontology (GO) annotated sequences, the majority had categories for Biological Process (50.94%), followed by Cellular Component (28.34%) and Molecular Function (20.72%). The most frequent GO categories related to Biological Process were metabolic and cellular processes, while in Cell Components the most frequent were cell and organelles. Catalytic activity and binding of ions and molecules were the most frequent annotated categories of Molecular Function. As the sequences were generated from a pool of RNA from four plants, 25,169 InDels and 278,121 SNPs were identified. As expected, InDels are depleted in coding regions (CDS) when compared to SNPs. These results will be important for future studies with an interest in gene expression of the species. Furthermore, the possible SNPs identified can facilitate genetic studies and genomic H. impetiginosus populations.