Cargando…

PopAlu: population-scale detection of Alu polymorphisms

Alu elements are sequences of approximately 300 basepairs that together comprise more than 10% of the human genome. Due to their recent origin in primate evolution some Alu elements are polymorphic in humans, present in some individuals while absent in others. We present PopAlu, a tool to detect pol...

Descripción completa

Detalles Bibliográficos
Autores principales: Qian, Yu, Kehr, Birte, Halldórsson, Bjarni V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4582951/
https://www.ncbi.nlm.nih.gov/pubmed/26417547
http://dx.doi.org/10.7717/peerj.1269
_version_ 1782391779981524992
author Qian, Yu
Kehr, Birte
Halldórsson, Bjarni V.
author_facet Qian, Yu
Kehr, Birte
Halldórsson, Bjarni V.
author_sort Qian, Yu
collection PubMed
description Alu elements are sequences of approximately 300 basepairs that together comprise more than 10% of the human genome. Due to their recent origin in primate evolution some Alu elements are polymorphic in humans, present in some individuals while absent in others. We present PopAlu, a tool to detect polymorphic Alu elements on a population scale from paired-end sequencing data. PopAlu uses read pair distance and orientation as well as split reads to identify the location and precise breakpoints of polymorphic Alus. Genotype calling enables us to differentiate between homozygous and heterozygous carriers, making the output of PopAlu suitable for use in downstream analyses such as genome-wide association studies (GWAS). We show on a simulated dataset that PopAlu calls Alu elements inserted and deleted with respect to a reference genome with high accuracy and high precision. Our analysis of real data of a human trio from the 1000 Genomes Project confirms that PopAlu is able to produce highly accurate genotype calls. To our knowledge, PopAlu is the first tool that identifies polymorphic Alu elements from multiple individuals simultaneously, pinpoints the precise breakpoints and calls genotypes with high accuracy.
format Online
Article
Text
id pubmed-4582951
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-45829512015-09-28 PopAlu: population-scale detection of Alu polymorphisms Qian, Yu Kehr, Birte Halldórsson, Bjarni V. PeerJ Bioinformatics Alu elements are sequences of approximately 300 basepairs that together comprise more than 10% of the human genome. Due to their recent origin in primate evolution some Alu elements are polymorphic in humans, present in some individuals while absent in others. We present PopAlu, a tool to detect polymorphic Alu elements on a population scale from paired-end sequencing data. PopAlu uses read pair distance and orientation as well as split reads to identify the location and precise breakpoints of polymorphic Alus. Genotype calling enables us to differentiate between homozygous and heterozygous carriers, making the output of PopAlu suitable for use in downstream analyses such as genome-wide association studies (GWAS). We show on a simulated dataset that PopAlu calls Alu elements inserted and deleted with respect to a reference genome with high accuracy and high precision. Our analysis of real data of a human trio from the 1000 Genomes Project confirms that PopAlu is able to produce highly accurate genotype calls. To our knowledge, PopAlu is the first tool that identifies polymorphic Alu elements from multiple individuals simultaneously, pinpoints the precise breakpoints and calls genotypes with high accuracy. PeerJ Inc. 2015-09-22 /pmc/articles/PMC4582951/ /pubmed/26417547 http://dx.doi.org/10.7717/peerj.1269 Text en © 2015 Qian et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Qian, Yu
Kehr, Birte
Halldórsson, Bjarni V.
PopAlu: population-scale detection of Alu polymorphisms
title PopAlu: population-scale detection of Alu polymorphisms
title_full PopAlu: population-scale detection of Alu polymorphisms
title_fullStr PopAlu: population-scale detection of Alu polymorphisms
title_full_unstemmed PopAlu: population-scale detection of Alu polymorphisms
title_short PopAlu: population-scale detection of Alu polymorphisms
title_sort popalu: population-scale detection of alu polymorphisms
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4582951/
https://www.ncbi.nlm.nih.gov/pubmed/26417547
http://dx.doi.org/10.7717/peerj.1269
work_keys_str_mv AT qianyu popalupopulationscaledetectionofalupolymorphisms
AT kehrbirte popalupopulationscaledetectionofalupolymorphisms
AT halldorssonbjarniv popalupopulationscaledetectionofalupolymorphisms