An R script for quality control in genome-wide association studies.

Bibliographic Details
Main Author: HIGA, R.
Publication Date: 2011
Other Authors: BARRICHELLO, F., NICIURA, S., MEIRELLES, S., REGITANO, L.
Language: eng
Source: Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Download full: http://www.alice.cnptia.embrapa.br/alice/handle/doc/908682
Summary: Recent advances in large-scale genotyping technology based on single nucleotide polymorphism (SNP) markers are pushing animal breeding research into a new era in which the entire genome is screened for genes that affect traits of economic interest, an approach known as genome-wide association studies (GWAS). The first step in setting up a GWAS is a quality control (QC) analysis of the genotype data, which filters out samples and SNPs that fail a previously defined set of criteria [1]. The most common criteria are sample call rate, SNP call rate, minor allele frequency (MAF) and Hardy-Weinberg equilibrium (HWE). It is also recommended to examine heterozygosity, population structure and outlier samples. However, the appropriate criteria and thresholds may differ from one dataset to another. We wrote an R script [2] that implements a number of QC criteria as functions, allowing users to apply the criteria best suited to their dataset. In this work, we describe the functions implemented in our R script and illustrate their use in a GWAS of meat quality in a population of the Canchim cattle breed.
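The QC criteria named in the summary can be illustrated with a minimal sketch. It is written in Python, not R, and is not the authors' script: the function names, the genotype coding (0, 1 or 2 copies of one allele, with None for a missing call) and all thresholds are assumptions chosen for illustration only. The HWE check here is a plain chi-square test with one degree of freedom; the record does not say which test the script uses.

```python
import math

def call_rate(calls):
    """Fraction of non-missing genotype calls (missing is None)."""
    return sum(g is not None for g in calls) / len(calls)

def minor_allele_freq(calls):
    """MAF for genotypes coded as 0, 1 or 2 copies of one allele."""
    called = [g for g in calls if g is not None]
    if not called:
        return 0.0
    p = sum(called) / (2 * len(called))
    return min(p, 1 - p)

def hwe_pvalue(calls):
    """P-value of a 1-d.f. chi-square test of Hardy-Weinberg proportions."""
    called = [g for g in calls if g is not None]
    n = len(called)
    if n == 0:
        return 1.0
    p = sum(called) / (2 * n)
    q = 1 - p
    if p == 0 or q == 0:
        return 1.0  # monomorphic SNP: the test is undefined, so do not flag it here
    observed = [called.count(k) for k in (0, 1, 2)]
    expected = [n * q * q, 2 * n * p * q, n * p * p]
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of the chi-square distribution with 1 d.f.
    return math.erfc(math.sqrt(chi2 / 2))

def qc_filter(genotypes, min_sample_cr=0.9, min_snp_cr=0.9,
              min_maf=0.05, min_hwe_p=1e-6):
    """Return (kept sample indices, kept SNP indices).

    genotypes is a list of rows, one row per sample, one column per SNP.
    Samples are filtered first; SNP statistics use the kept samples only.
    The default thresholds are illustrative assumptions.
    """
    kept_samples = [i for i, row in enumerate(genotypes)
                    if call_rate(row) >= min_sample_cr]
    kept_snps = []
    for j in range(len(genotypes[0])):
        col = [genotypes[i][j] for i in kept_samples]
        if (col and call_rate(col) >= min_snp_cr
                and minor_allele_freq(col) >= min_maf
                and hwe_pvalue(col) >= min_hwe_p):
            kept_snps.append(j)
    return kept_samples, kept_snps

# Toy data: 5 samples x 4 SNPs (made up for this example).
genotypes = [
    [0, 1, 0, 2],
    [1, 1, 0, None],
    [2, 1, 0, 2],
    [1, 1, 0, 0],
    [None, None, None, None],
]
result = qc_filter(genotypes, min_sample_cr=0.7, min_hwe_p=0.05)
print(result)  # ([0, 1, 2, 3], [0])
```

In the toy data, the last sample is dropped for its call rate, and the SNPs in columns 1, 2 and 3 are dropped for HWE deviation, zero MAF and low call rate respectively. Keeping each criterion as a separate function reflects the point made in the summary that different datasets may call for different criteria and thresholds.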
id EMBR_2ac03f8445d9fe820b53e2323fbb0783
oai_identifier_str oai:www.alice.cnptia.embrapa.br:doc/908682
network_acronym_str EMBR
network_name_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str 2154
spelling X-MEETING 2011. 2017-08-15T23:10:13Z https://www.alice.cnptia.embrapa.br/oai/request
dc.title.none.fl_str_mv An R script for quality control in genome-wide association studies.
title An R script for quality control in genome-wide association studies.
dc.contributor.none.fl_str_mv ROBERTO HIROSHI HIGA, CNPTIA; FABIANA BARRICHELLO, UFSCar; SIMONE CRISTINA MEO NICIURA, CPPSE; SARAH MEIRELLES, UFLA; LUCIANA CORREIA DE ALMEIDA REGITANO, CPPSE.
dc.contributor.author.fl_str_mv HIGA, R.
BARRICHELLO, F.
NICIURA, S.
MEIRELLES, S.
REGITANO, L.
dc.subject.por.fl_str_mv SNP markers
Single nucleotide polymorphism
Genome
Beef cattle
Cattle
publishDate 2011
dc.date.none.fl_str_mv 2011-12-06T11:11:11Z
2011-12-06
2011
2020-01-24T11:11:11Z
dc.type.driver.fl_str_mv Abstract in annals and proceedings
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv In: INTERNATIONAL CONFERENCE OF THE BRAZILIAN ASSOCIATION FOR BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 7.; INTERNATIONAL CONFERENCE OF THE IBEROAMERICAN SOCIETY FOR BIOINFORMATICS, 3., 2011, Florianópolis. Proceedings... Florianópolis: Associação Brasileira de Bioinformática e Biologia Computacional, 2011.
http://www.alice.cnptia.embrapa.br/alice/handle/doc/908682
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv Not paginated.
dc.source.none.fl_str_mv reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv cg-riaa@embrapa.br
_version_ 1822720843018403840