Cargando…
Nanopore sequencing of PCR products enables multicopy gene family reconstruction
The importance of gene amplifications in evolution is more and more recognized. Yet, tools to study multi-copy gene families are still scarce, and many such families are overlooked using common sequencing methods. Haplotype reconstruction is even harder for polymorphic multi-copy gene families. Here...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10393513/ https://www.ncbi.nlm.nih.gov/pubmed/37533804 http://dx.doi.org/10.1016/j.csbj.2023.07.012 |
_version_ | 1785083174683934720 |
---|---|
author | Namias, Alice Sahlin, Kristoffer Makoundou, Patrick Bonnici, Iago Sicard, Mathieu Belkhir, Khalid Weill, Mylène |
author_facet | Namias, Alice Sahlin, Kristoffer Makoundou, Patrick Bonnici, Iago Sicard, Mathieu Belkhir, Khalid Weill, Mylène |
author_sort | Namias, Alice |
collection | PubMed |
description | The importance of gene amplifications in evolution is more and more recognized. Yet, tools to study multi-copy gene families are still scarce, and many such families are overlooked using common sequencing methods. Haplotype reconstruction is even harder for polymorphic multi-copy gene families. Here, we show that all variants (or haplotypes) of a multi-copy gene family present in a single genome, can be obtained using Oxford Nanopore Technologies sequencing of PCR products, followed by steps of mapping, SNP calling and haplotyping. As a proof of concept, we acquired the sequences of highly similar variants of the cidA and cidB genes present in the genome of the Wolbachia wPip, a bacterium infecting Culex pipiens mosquitoes. Our method relies on a wide database of cid genes, previously acquired by cloning and Sanger sequencing. We addressed problems commonly faced when using mapping approaches for multi-copy gene families with highly similar variants. In addition, we confirmed that PCR amplification causes frequent chimeras which have to be carefully considered when working on families of recombinant genes. We tested the robustness of the method using a combination of bioinformatics (read simulations) and molecular biology approaches (sequence acquisitions through cloning and Sanger sequencing, specific PCRs and digital droplet PCR). When different haplotypes present within a single genome cannot be reconstructed from short reads sequencing, this pipeline confers a high throughput acquisition, gives reliable results as well as insights of the relative copy numbers of the different variants. |
format | Online Article Text |
id | pubmed-10393513 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-103935132023-08-02 Nanopore sequencing of PCR products enables multicopy gene family reconstruction Namias, Alice Sahlin, Kristoffer Makoundou, Patrick Bonnici, Iago Sicard, Mathieu Belkhir, Khalid Weill, Mylène Comput Struct Biotechnol J Research Article The importance of gene amplifications in evolution is more and more recognized. Yet, tools to study multi-copy gene families are still scarce, and many such families are overlooked using common sequencing methods. Haplotype reconstruction is even harder for polymorphic multi-copy gene families. Here, we show that all variants (or haplotypes) of a multi-copy gene family present in a single genome, can be obtained using Oxford Nanopore Technologies sequencing of PCR products, followed by steps of mapping, SNP calling and haplotyping. As a proof of concept, we acquired the sequences of highly similar variants of the cidA and cidB genes present in the genome of the Wolbachia wPip, a bacterium infecting Culex pipiens mosquitoes. Our method relies on a wide database of cid genes, previously acquired by cloning and Sanger sequencing. We addressed problems commonly faced when using mapping approaches for multi-copy gene families with highly similar variants. In addition, we confirmed that PCR amplification causes frequent chimeras which have to be carefully considered when working on families of recombinant genes. We tested the robustness of the method using a combination of bioinformatics (read simulations) and molecular biology approaches (sequence acquisitions through cloning and Sanger sequencing, specific PCRs and digital droplet PCR). When different haplotypes present within a single genome cannot be reconstructed from short reads sequencing, this pipeline confers a high throughput acquisition, gives reliable results as well as insights of the relative copy numbers of the different variants. Research Network of Computational and Structural Biotechnology 2023-07-16 /pmc/articles/PMC10393513/ /pubmed/37533804 http://dx.doi.org/10.1016/j.csbj.2023.07.012 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Namias, Alice Sahlin, Kristoffer Makoundou, Patrick Bonnici, Iago Sicard, Mathieu Belkhir, Khalid Weill, Mylène Nanopore sequencing of PCR products enables multicopy gene family reconstruction |
title | Nanopore sequencing of PCR products enables multicopy gene family reconstruction |
title_full | Nanopore sequencing of PCR products enables multicopy gene family reconstruction |
title_fullStr | Nanopore sequencing of PCR products enables multicopy gene family reconstruction |
title_full_unstemmed | Nanopore sequencing of PCR products enables multicopy gene family reconstruction |
title_short | Nanopore sequencing of PCR products enables multicopy gene family reconstruction |
title_sort | nanopore sequencing of pcr products enables multicopy gene family reconstruction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10393513/ https://www.ncbi.nlm.nih.gov/pubmed/37533804 http://dx.doi.org/10.1016/j.csbj.2023.07.012 |
work_keys_str_mv | AT namiasalice nanoporesequencingofpcrproductsenablesmulticopygenefamilyreconstruction AT sahlinkristoffer nanoporesequencingofpcrproductsenablesmulticopygenefamilyreconstruction AT makoundoupatrick nanoporesequencingofpcrproductsenablesmulticopygenefamilyreconstruction AT bonniciiago nanoporesequencingofpcrproductsenablesmulticopygenefamilyreconstruction AT sicardmathieu nanoporesequencingofpcrproductsenablesmulticopygenefamilyreconstruction AT belkhirkhalid nanoporesequencingofpcrproductsenablesmulticopygenefamilyreconstruction AT weillmylene nanoporesequencingofpcrproductsenablesmulticopygenefamilyreconstruction |