A Random Forest-Based Genome-Wide Scan Reveals Fertility-Related Candidate Genes and Potential Inter-Chromosomal Epistatic Regions Associated With Age at First Calving in Nellore Cattle
This study aimed to perform a genome-wide association analysis (GWAS) using the Random Forest (RF) approach for scanning candidate genes for age at first calving (AFC) in Nellore cattle. Additionally, potential epistatic effects were investigated using linear mixed models with pairwise interactions...
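Main authors: | Alves, Anderson Antonio Carvalho; da Costa, Rebeka Magalhães; Fonseca, Larissa Fernanda Simielli; Carvalheiro, Roberto; Ventura, Ricardo Vieira; Rosa, Guilherme Jordão de Magalhães; Albuquerque, Lucia Galvão |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2022 |
Subjects: | Genetics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9178659/ https://www.ncbi.nlm.nih.gov/pubmed/35692843 http://dx.doi.org/10.3389/fgene.2022.834724 |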
_version_ | 1784723103714115584 |
---|---|
author | Alves, Anderson Antonio Carvalho da Costa, Rebeka Magalhães Fonseca, Larissa Fernanda Simielli Carvalheiro, Roberto Ventura, Ricardo Vieira Rosa, Guilherme Jordão de Magalhães Albuquerque, Lucia Galvão |
author_sort | Alves, Anderson Antonio Carvalho |
collection | PubMed |
description | This study aimed to perform a genome-wide association analysis (GWAS) using the Random Forest (RF) approach for scanning candidate genes for age at first calving (AFC) in Nellore cattle. Additionally, potential epistatic effects were investigated using linear mixed models with pairwise interactions between all markers with high importance scores within the tree ensemble non-linear structure. Data from Nellore cattle were used, including records of animals born between 1984 and 2015 and raised in commercial herds located in different regions of Brazil. The estimated breeding values (EBV) were computed and used as the response variable in the genomic analyses. After quality control, the remaining number of animals and SNPs considered were 3,174 and 360,130, respectively. Five independent RF analyses were carried out, considering different initialization seeds. The importance score of each SNP was averaged across the independent RF analyses to rank the markers according to their predictive relevance. A total of 117 SNPs associated with AFC were identified, which spanned 10 autosomes (2, 3, 5, 10, 11, 17, 18, 21, 24, and 25). In total, 23 non-overlapping genomic regions embedded 262 candidate genes for AFC. Enrichment analysis and previous evidence in the literature revealed that many candidate genes annotated close to the lead SNPs have key roles in fertility, including embryo pre-implantation and development, embryonic viability, male germinal cell maturation, and pheromone recognition. Furthermore, some genomic regions previously associated with fertility and growth traits in Nellore cattle were also detected in the present study, reinforcing the effectiveness of RF for pre-screening candidate regions associated with complex traits. 
Complementary analyses revealed that many SNPs top-ranked in the RF-based GWAS did not show a strong marginal linear effect but are potentially involved in epistatic hotspots between genomic regions on different autosomes, notably on BTA 3, 5, 11, and 21. The reported results are expected to enhance the understanding of the genetic mechanisms involved in the biological regulation of AFC in this cattle breed. |
format | Online Article Text |
id | pubmed-9178659 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-9178659 2022-06-10 Front Genet Genetics Frontiers Media S.A. 2022-05-18 /pmc/articles/PMC9178659/ /pubmed/35692843 http://dx.doi.org/10.3389/fgene.2022.834724 Text en Copyright © 2022 Alves, da Costa, Fonseca, Carvalheiro, Ventura, Rosa and Albuquerque. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
title | A Random Forest-Based Genome-Wide Scan Reveals Fertility-Related Candidate Genes and Potential Inter-Chromosomal Epistatic Regions Associated With Age at First Calving in Nellore Cattle |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9178659/ https://www.ncbi.nlm.nih.gov/pubmed/35692843 http://dx.doi.org/10.3389/fgene.2022.834724 |
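The description above states that five independent RF analyses were run with different initialization seeds and that each SNP's importance score was averaged across runs to rank the markers. A minimal sketch of that averaging step follows, assuming scikit-learn's `RandomForestRegressor` and its impurity-based `feature_importances_` as the importance measure. The record does not say which software, importance measure, or hyperparameters the authors used, so the function names, the tree count, `max_features`, and the simulated genotypes and response are all hypothetical and chosen only for illustration.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor


def averaged_rf_importance(genotypes, response, seeds=(1, 2, 3, 4, 5), n_trees=100):
    """Fit one random forest per seed and average the per-SNP importance scores."""
    scores = []
    for seed in seeds:
        forest = RandomForestRegressor(
            n_estimators=n_trees,   # assumed value, not taken from the record
            max_features="sqrt",    # assumed value, not taken from the record
            random_state=seed,      # a different initialization seed per run
        )
        forest.fit(genotypes, response)
        scores.append(forest.feature_importances_)
    # Mean importance of each SNP across the independent runs
    return np.mean(scores, axis=0)


def simulate(n_animals=300, n_snps=50, seed=0):
    """Toy data: genotypes coded 0/1/2 and a response standing in for the EBV."""
    rng = np.random.default_rng(seed)
    genotypes = rng.integers(0, 3, size=(n_animals, n_snps)).astype(float)
    additive = 1.0 * genotypes[:, 3]                                  # marginal effect at SNP 3
    epistatic = 0.8 * (genotypes[:, 7] - 1) * (genotypes[:, 12] - 1)  # interaction of SNPs 7 and 12
    noise = rng.normal(0.0, 0.5, size=n_animals)
    return genotypes, additive + epistatic + noise


genotypes, response = simulate()
importance = averaged_rf_importance(genotypes, response)
ranking = np.argsort(importance)[::-1]  # SNP indices, most important first
print("Top-ranked SNP indices:", ranking[:5])
```

Averaging over seeds reduces the run-to-run noise in the ranking that comes from bootstrap sampling and random feature selection. In this toy setup SNPs 7 and 12 act only through their interaction, which is the kind of signal a tree ensemble can pick up even when the marginal linear effect is weak.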