Cargando…
Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle
BACKGROUND: Optimization of an RNA-Sequencing (RNA-Seq) pipeline is critical to maximize power and accuracy to identify genetic variants, including SNPs, which may serve as genetic markers to select for feed efficiency, leading to economic benefits for beef production. This study used RNA-Seq data (...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7545862/ https://www.ncbi.nlm.nih.gov/pubmed/33032519 http://dx.doi.org/10.1186/s12864-020-07107-7 |
_version_ | 1783592118326919168 |
---|---|
author | Lam, S. Zeidan, J. Miglior, F. Suárez-Vega, A. Gómez-Redondo, I. Fonseca, P. A. S. Guan, L. L. Waters, S. Cánovas, A. |
author_facet | Lam, S. Zeidan, J. Miglior, F. Suárez-Vega, A. Gómez-Redondo, I. Fonseca, P. A. S. Guan, L. L. Waters, S. Cánovas, A. |
author_sort | Lam, S. |
collection | PubMed |
description | BACKGROUND: Optimization of an RNA-Sequencing (RNA-Seq) pipeline is critical to maximize power and accuracy to identify genetic variants, including SNPs, which may serve as genetic markers to select for feed efficiency, leading to economic benefits for beef production. This study used RNA-Seq data (GEO Accession ID: PRJEB7696 and PRJEB15314) from muscle and liver tissue, respectively, from 12 Nellore beef steers selected from 585 steers with residual feed intake measures (RFI; n = 6 low-RFI, n = 6 high-RFI). Three RNA-Seq pipelines were compared including multi-sample calling from i) non-merged samples; ii) merged samples by RFI group, iii) merged samples by RFI and tissue group. The RNA-Seq reads were aligned against the UMD3.1 bovine reference genome (release 94) assembly using STAR aligner. Variants were called using BCFtools and variant effect prediction (VeP) and functional annotation (ToppGene) analyses were performed. RESULTS: On average, total reads detected for Approach i) non-merged samples for liver and muscle, were 18,362,086.3 and 35,645,898.7, respectively. For Approach ii), merging samples by RFI group, total reads detected for each merged group was 162,030,705, and for Approach iii), merging samples by RFI group and tissues, was 324,061,410, revealing the highest read depth for Approach iii). Additionally, Approach iii) merging samples by RFI group and tissues, revealed the highest read depth per variant coverage (572.59 ± 3993.11) and encompassed the majority of localized positional genes detected by each approach. This suggests Approach iii) had optimized detection power, read depth, and accuracy of SNP calling, therefore increasing confidence of variant detection and reducing false positive detection. Approach iii) was then used to detect unique SNPs fixed within low- (12,145) and high-RFI (14,663) groups. Functional annotation of SNPs revealed positional candidate genes, for each RFI group (2886 for low-RFI, 3075 for high-RFI), which were significantly (P < 0.05) associated with immune and metabolic pathways. CONCLUSION: The most optimized RNA-Seq pipeline allowed for more accurate identification of SNPs, associated positional candidate genes, and significantly associated metabolic pathways in muscle and liver tissues, providing insight on the underlying genetic architecture of feed efficiency in beef cattle. |
format | Online Article Text |
id | pubmed-7545862 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-75458622020-10-13 Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle Lam, S. Zeidan, J. Miglior, F. Suárez-Vega, A. Gómez-Redondo, I. Fonseca, P. A. S. Guan, L. L. Waters, S. Cánovas, A. BMC Genomics Research Article BACKGROUND: Optimization of an RNA-Sequencing (RNA-Seq) pipeline is critical to maximize power and accuracy to identify genetic variants, including SNPs, which may serve as genetic markers to select for feed efficiency, leading to economic benefits for beef production. This study used RNA-Seq data (GEO Accession ID: PRJEB7696 and PRJEB15314) from muscle and liver tissue, respectively, from 12 Nellore beef steers selected from 585 steers with residual feed intake measures (RFI; n = 6 low-RFI, n = 6 high-RFI). Three RNA-Seq pipelines were compared including multi-sample calling from i) non-merged samples; ii) merged samples by RFI group, iii) merged samples by RFI and tissue group. The RNA-Seq reads were aligned against the UMD3.1 bovine reference genome (release 94) assembly using STAR aligner. Variants were called using BCFtools and variant effect prediction (VeP) and functional annotation (ToppGene) analyses were performed. RESULTS: On average, total reads detected for Approach i) non-merged samples for liver and muscle, were 18,362,086.3 and 35,645,898.7, respectively. For Approach ii), merging samples by RFI group, total reads detected for each merged group was 162,030,705, and for Approach iii), merging samples by RFI group and tissues, was 324,061,410, revealing the highest read depth for Approach iii). Additionally, Approach iii) merging samples by RFI group and tissues, revealed the highest read depth per variant coverage (572.59 ± 3993.11) and encompassed the majority of localized positional genes detected by each approach. This suggests Approach iii) had optimized detection power, read depth, and accuracy of SNP calling, therefore increasing confidence of variant detection and reducing false positive detection. Approach iii) was then used to detect unique SNPs fixed within low- (12,145) and high-RFI (14,663) groups. Functional annotation of SNPs revealed positional candidate genes, for each RFI group (2886 for low-RFI, 3075 for high-RFI), which were significantly (P < 0.05) associated with immune and metabolic pathways. CONCLUSION: The most optimized RNA-Seq pipeline allowed for more accurate identification of SNPs, associated positional candidate genes, and significantly associated metabolic pathways in muscle and liver tissues, providing insight on the underlying genetic architecture of feed efficiency in beef cattle. BioMed Central 2020-10-08 /pmc/articles/PMC7545862/ /pubmed/33032519 http://dx.doi.org/10.1186/s12864-020-07107-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Lam, S. Zeidan, J. Miglior, F. Suárez-Vega, A. Gómez-Redondo, I. Fonseca, P. A. S. Guan, L. L. Waters, S. Cánovas, A. Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle |
title | Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle |
title_full | Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle |
title_fullStr | Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle |
title_full_unstemmed | Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle |
title_short | Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle |
title_sort | development and comparison of rna-sequencing pipelines for more accurate snp identification: practical example of functional snp detection associated with feed efficiency in nellore beef cattle |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7545862/ https://www.ncbi.nlm.nih.gov/pubmed/33032519 http://dx.doi.org/10.1186/s12864-020-07107-7 |
work_keys_str_mv | AT lams developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle AT zeidanj developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle AT migliorf developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle AT suarezvegaa developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle AT gomezredondoi developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle AT fonsecapas developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle AT guanll developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle AT waterss developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle AT canovasa developmentandcomparisonofrnasequencingpipelinesformoreaccuratesnpidentificationpracticalexampleoffunctionalsnpdetectionassociatedwithfeedefficiencyinnellorebeefcattle |