Cargando…

Comprehensive evaluation of SNP identification with the Restriction Enzyme-based Reduced Representation Library (RRL) method

BACKGROUND: Restriction Enzyme-based Reduced Representation Library (RRL) method represents a relatively feasible and flexible strategy used for Single Nucleotide Polymorphism (SNP) identification in different species. It has remarkable advantage of reducing the complexity of the genome by orders of...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Ye, Jiang, Hui, Chen, Ying, Li, Cong, Zhao, Meiru, Wu, Jinghua, Qiu, Yong, Li, Qibin, Zhang, Xiuqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3305556/
https://www.ncbi.nlm.nih.gov/pubmed/22340203
http://dx.doi.org/10.1186/1471-2164-13-77
_version_ 1782227095805493248
author Du, Ye
Jiang, Hui
Chen, Ying
Li, Cong
Zhao, Meiru
Wu, Jinghua
Qiu, Yong
Li, Qibin
Zhang, Xiuqing
author_facet Du, Ye
Jiang, Hui
Chen, Ying
Li, Cong
Zhao, Meiru
Wu, Jinghua
Qiu, Yong
Li, Qibin
Zhang, Xiuqing
author_sort Du, Ye
collection PubMed
description BACKGROUND: Restriction Enzyme-based Reduced Representation Library (RRL) method represents a relatively feasible and flexible strategy used for Single Nucleotide Polymorphism (SNP) identification in different species. It has remarkable advantage of reducing the complexity of the genome by orders of magnitude. However, comprehensive evaluation for actual efficacy of SNP identification by this method is still unavailable. RESULTS: In order to evaluate the efficacy of Restriction Enzyme-based RRL method, we selected Tsp 45I enzyme which covers 266 Mb flanking region of the enzyme recognition site according to in silico simulation on human reference genome, then we sequenced YH RRL after Tsp 45I treatment and obtained reads of which 80.8% were mapped to target region with an 20-fold average coverage, about 96.8% of target region was covered by at least one read and 257 K SNPs were identified in the region using SOAPsnp software. Compared with whole genome resequencing data, we observed false discovery rate (FDR) of 13.95% and false negative rate (FNR) of 25.90%. The concordance rate of homozygote loci was over 99.8%, but that of heterozygote were only 92.56%. Repeat sequences and bases quality were proved to have a great effect on the accuracy of SNP calling, SNPs in recognition sites contributed evidently to the high FNR and the low concordance rate of heterozygote. Our results indicated that repeat masking and high stringent filter criteria could significantly decrease both FDR and FNR. CONCLUSIONS: This study demonstrates that Restriction Enzyme-based RRL method was effective for SNP identification. The results highlight the important role of bias and the method-derived defects represented in this method and emphasize the special attentions noteworthy.
format Online
Article
Text
id pubmed-3305556
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33055562012-03-16 Comprehensive evaluation of SNP identification with the Restriction Enzyme-based Reduced Representation Library (RRL) method Du, Ye Jiang, Hui Chen, Ying Li, Cong Zhao, Meiru Wu, Jinghua Qiu, Yong Li, Qibin Zhang, Xiuqing BMC Genomics Research Article BACKGROUND: Restriction Enzyme-based Reduced Representation Library (RRL) method represents a relatively feasible and flexible strategy used for Single Nucleotide Polymorphism (SNP) identification in different species. It has remarkable advantage of reducing the complexity of the genome by orders of magnitude. However, comprehensive evaluation for actual efficacy of SNP identification by this method is still unavailable. RESULTS: In order to evaluate the efficacy of Restriction Enzyme-based RRL method, we selected Tsp 45I enzyme which covers 266 Mb flanking region of the enzyme recognition site according to in silico simulation on human reference genome, then we sequenced YH RRL after Tsp 45I treatment and obtained reads of which 80.8% were mapped to target region with an 20-fold average coverage, about 96.8% of target region was covered by at least one read and 257 K SNPs were identified in the region using SOAPsnp software. Compared with whole genome resequencing data, we observed false discovery rate (FDR) of 13.95% and false negative rate (FNR) of 25.90%. The concordance rate of homozygote loci was over 99.8%, but that of heterozygote were only 92.56%. Repeat sequences and bases quality were proved to have a great effect on the accuracy of SNP calling, SNPs in recognition sites contributed evidently to the high FNR and the low concordance rate of heterozygote. Our results indicated that repeat masking and high stringent filter criteria could significantly decrease both FDR and FNR. CONCLUSIONS: This study demonstrates that Restriction Enzyme-based RRL method was effective for SNP identification. The results highlight the important role of bias and the method-derived defects represented in this method and emphasize the special attentions noteworthy. BioMed Central 2012-02-16 /pmc/articles/PMC3305556/ /pubmed/22340203 http://dx.doi.org/10.1186/1471-2164-13-77 Text en Copyright ©2012 Du et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Du, Ye
Jiang, Hui
Chen, Ying
Li, Cong
Zhao, Meiru
Wu, Jinghua
Qiu, Yong
Li, Qibin
Zhang, Xiuqing
Comprehensive evaluation of SNP identification with the Restriction Enzyme-based Reduced Representation Library (RRL) method
title Comprehensive evaluation of SNP identification with the Restriction Enzyme-based Reduced Representation Library (RRL) method
title_full Comprehensive evaluation of SNP identification with the Restriction Enzyme-based Reduced Representation Library (RRL) method
title_fullStr Comprehensive evaluation of SNP identification with the Restriction Enzyme-based Reduced Representation Library (RRL) method
title_full_unstemmed Comprehensive evaluation of SNP identification with the Restriction Enzyme-based Reduced Representation Library (RRL) method
title_short Comprehensive evaluation of SNP identification with the Restriction Enzyme-based Reduced Representation Library (RRL) method
title_sort comprehensive evaluation of snp identification with the restriction enzyme-based reduced representation library (rrl) method
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3305556/
https://www.ncbi.nlm.nih.gov/pubmed/22340203
http://dx.doi.org/10.1186/1471-2164-13-77
work_keys_str_mv AT duye comprehensiveevaluationofsnpidentificationwiththerestrictionenzymebasedreducedrepresentationlibraryrrlmethod
AT jianghui comprehensiveevaluationofsnpidentificationwiththerestrictionenzymebasedreducedrepresentationlibraryrrlmethod
AT chenying comprehensiveevaluationofsnpidentificationwiththerestrictionenzymebasedreducedrepresentationlibraryrrlmethod
AT licong comprehensiveevaluationofsnpidentificationwiththerestrictionenzymebasedreducedrepresentationlibraryrrlmethod
AT zhaomeiru comprehensiveevaluationofsnpidentificationwiththerestrictionenzymebasedreducedrepresentationlibraryrrlmethod
AT wujinghua comprehensiveevaluationofsnpidentificationwiththerestrictionenzymebasedreducedrepresentationlibraryrrlmethod
AT qiuyong comprehensiveevaluationofsnpidentificationwiththerestrictionenzymebasedreducedrepresentationlibraryrrlmethod
AT liqibin comprehensiveevaluationofsnpidentificationwiththerestrictionenzymebasedreducedrepresentationlibraryrrlmethod
AT zhangxiuqing comprehensiveevaluationofsnpidentificationwiththerestrictionenzymebasedreducedrepresentationlibraryrrlmethod