Cargando…

Identifying and Classifying Trait Linked Polymorphisms in Non-Reference Species by Walking Coloured de Bruijn Graphs

Single Nucleotide Polymorphisms are invaluable markers for tracing the genetic basis of inheritable traits and the ability to create marker libraries quickly is vital for timely identification of target genes. Next-generation sequencing makes it possible to sample a genome rapidly, but polymorphism...

Descripción completa

Detalles Bibliográficos
Autores principales: Leggett, Richard M., Ramirez-Gonzalez, Ricardo H., Verweij, Walter, Kawashima, Cintia G., Iqbal, Zamin, Jones, Jonathan D. G., Caccamo, Mario, MacLean, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3607606/
https://www.ncbi.nlm.nih.gov/pubmed/23536903
http://dx.doi.org/10.1371/journal.pone.0060058
_version_ 1782264121665781760
author Leggett, Richard M.
Ramirez-Gonzalez, Ricardo H.
Verweij, Walter
Kawashima, Cintia G.
Iqbal, Zamin
Jones, Jonathan D. G.
Caccamo, Mario
MacLean, Daniel
author_facet Leggett, Richard M.
Ramirez-Gonzalez, Ricardo H.
Verweij, Walter
Kawashima, Cintia G.
Iqbal, Zamin
Jones, Jonathan D. G.
Caccamo, Mario
MacLean, Daniel
author_sort Leggett, Richard M.
collection PubMed
description Single Nucleotide Polymorphisms are invaluable markers for tracing the genetic basis of inheritable traits and the ability to create marker libraries quickly is vital for timely identification of target genes. Next-generation sequencing makes it possible to sample a genome rapidly, but polymorphism detection relies on having a reference genome to which reads can be aligned and variants detected. We present Bubbleparse, a method for detecting variants directly from next-generation reads without a reference sequence. Bubbleparse uses the de Bruijn graph implementation in the Cortex framework as a basis and allows the user to identify bubbles in these graphs that represent polymorphisms, quickly, easily and sensitively. We show that the Bubbleparse algorithm is sensitive and can detect many polymorphisms quickly and that it performs well when compared with polymorphism detection methods based on alignment to a reference in Arabidopsis thaliana. We show that the heuristic can be used to maximise the number of true polymorphisms returned, and with a proof-of-principle experiment show that Bubbleparse is very effective on data from unsequenced wild relatives of potato and enabled us to identify disease resistance linked genes quickly and easily.
format Online
Article
Text
id pubmed-3607606
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36076062013-03-27 Identifying and Classifying Trait Linked Polymorphisms in Non-Reference Species by Walking Coloured de Bruijn Graphs Leggett, Richard M. Ramirez-Gonzalez, Ricardo H. Verweij, Walter Kawashima, Cintia G. Iqbal, Zamin Jones, Jonathan D. G. Caccamo, Mario MacLean, Daniel PLoS One Research Article Single Nucleotide Polymorphisms are invaluable markers for tracing the genetic basis of inheritable traits and the ability to create marker libraries quickly is vital for timely identification of target genes. Next-generation sequencing makes it possible to sample a genome rapidly, but polymorphism detection relies on having a reference genome to which reads can be aligned and variants detected. We present Bubbleparse, a method for detecting variants directly from next-generation reads without a reference sequence. Bubbleparse uses the de Bruijn graph implementation in the Cortex framework as a basis and allows the user to identify bubbles in these graphs that represent polymorphisms, quickly, easily and sensitively. We show that the Bubbleparse algorithm is sensitive and can detect many polymorphisms quickly and that it performs well when compared with polymorphism detection methods based on alignment to a reference in Arabidopsis thaliana. We show that the heuristic can be used to maximise the number of true polymorphisms returned, and with a proof-of-principle experiment show that Bubbleparse is very effective on data from unsequenced wild relatives of potato and enabled us to identify disease resistance linked genes quickly and easily. Public Library of Science 2013-03-25 /pmc/articles/PMC3607606/ /pubmed/23536903 http://dx.doi.org/10.1371/journal.pone.0060058 Text en © 2013 Leggett et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Leggett, Richard M.
Ramirez-Gonzalez, Ricardo H.
Verweij, Walter
Kawashima, Cintia G.
Iqbal, Zamin
Jones, Jonathan D. G.
Caccamo, Mario
MacLean, Daniel
Identifying and Classifying Trait Linked Polymorphisms in Non-Reference Species by Walking Coloured de Bruijn Graphs
title Identifying and Classifying Trait Linked Polymorphisms in Non-Reference Species by Walking Coloured de Bruijn Graphs
title_full Identifying and Classifying Trait Linked Polymorphisms in Non-Reference Species by Walking Coloured de Bruijn Graphs
title_fullStr Identifying and Classifying Trait Linked Polymorphisms in Non-Reference Species by Walking Coloured de Bruijn Graphs
title_full_unstemmed Identifying and Classifying Trait Linked Polymorphisms in Non-Reference Species by Walking Coloured de Bruijn Graphs
title_short Identifying and Classifying Trait Linked Polymorphisms in Non-Reference Species by Walking Coloured de Bruijn Graphs
title_sort identifying and classifying trait linked polymorphisms in non-reference species by walking coloured de bruijn graphs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3607606/
https://www.ncbi.nlm.nih.gov/pubmed/23536903
http://dx.doi.org/10.1371/journal.pone.0060058
work_keys_str_mv AT leggettrichardm identifyingandclassifyingtraitlinkedpolymorphismsinnonreferencespeciesbywalkingcoloureddebruijngraphs
AT ramirezgonzalezricardoh identifyingandclassifyingtraitlinkedpolymorphismsinnonreferencespeciesbywalkingcoloureddebruijngraphs
AT verweijwalter identifyingandclassifyingtraitlinkedpolymorphismsinnonreferencespeciesbywalkingcoloureddebruijngraphs
AT kawashimacintiag identifyingandclassifyingtraitlinkedpolymorphismsinnonreferencespeciesbywalkingcoloureddebruijngraphs
AT iqbalzamin identifyingandclassifyingtraitlinkedpolymorphismsinnonreferencespeciesbywalkingcoloureddebruijngraphs
AT jonesjonathandg identifyingandclassifyingtraitlinkedpolymorphismsinnonreferencespeciesbywalkingcoloureddebruijngraphs
AT caccamomario identifyingandclassifyingtraitlinkedpolymorphismsinnonreferencespeciesbywalkingcoloureddebruijngraphs
AT macleandaniel identifyingandclassifyingtraitlinkedpolymorphismsinnonreferencespeciesbywalkingcoloureddebruijngraphs