Nanopore sequencing of PCR products enables multicopy gene family reconstruction

The importance of gene amplifications in evolution is more and more recognized. Yet, tools to study multi-copy gene families are still scarce, and many such families are overlooked using common sequencing methods. Haplotype reconstruction is even harder for polymorphic multi-copy gene families. Here...

Bibliographic Details
Main authors: Namias, Alice; Sahlin, Kristoffer; Makoundou, Patrick; Bonnici, Iago; Sicard, Mathieu; Belkhir, Khalid; Weill, Mylène
Format: Online Article Text
Language: English
Published: Research Network of Computational and Structural Biotechnology 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10393513/
https://www.ncbi.nlm.nih.gov/pubmed/37533804
http://dx.doi.org/10.1016/j.csbj.2023.07.012
author Namias, Alice
Sahlin, Kristoffer
Makoundou, Patrick
Bonnici, Iago
Sicard, Mathieu
Belkhir, Khalid
Weill, Mylène
collection PubMed
description The importance of gene amplifications in evolution is increasingly recognized. Yet tools to study multi-copy gene families are still scarce, and many such families are overlooked by common sequencing methods. Haplotype reconstruction is even harder for polymorphic multi-copy gene families. Here, we show that all variants (or haplotypes) of a multi-copy gene family present in a single genome can be obtained using Oxford Nanopore Technologies sequencing of PCR products, followed by mapping, SNP calling and haplotyping. As a proof of concept, we acquired the sequences of highly similar variants of the cidA and cidB genes present in the genome of Wolbachia wPip, a bacterium infecting Culex pipiens mosquitoes. Our method relies on a large database of cid genes previously acquired by cloning and Sanger sequencing. We addressed problems commonly faced when using mapping approaches for multi-copy gene families with highly similar variants. In addition, we confirmed that PCR amplification causes frequent chimeras, which have to be carefully considered when working on families of recombinant genes. We tested the robustness of the method using a combination of bioinformatics (read simulations) and molecular biology approaches (sequence acquisition through cloning and Sanger sequencing, specific PCRs and digital droplet PCR). When the different haplotypes present within a single genome cannot be reconstructed from short-read sequencing, this pipeline provides high-throughput acquisition and reliable results, as well as insights into the relative copy numbers of the different variants.
format Online
Article
Text
id pubmed-10393513
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-10393513 2023-08-02 Nanopore sequencing of PCR products enables multicopy gene family reconstruction. Namias, Alice; Sahlin, Kristoffer; Makoundou, Patrick; Bonnici, Iago; Sicard, Mathieu; Belkhir, Khalid; Weill, Mylène. Comput Struct Biotechnol J. Research Article.
Research Network of Computational and Structural Biotechnology 2023-07-16 /pmc/articles/PMC10393513/ /pubmed/37533804 http://dx.doi.org/10.1016/j.csbj.2023.07.012 Text en © 2023 The Authors. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
title Nanopore sequencing of PCR products enables multicopy gene family reconstruction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10393513/
https://www.ncbi.nlm.nih.gov/pubmed/37533804
http://dx.doi.org/10.1016/j.csbj.2023.07.012