
A Random Forest-Based Genome-Wide Scan Reveals Fertility-Related Candidate Genes and Potential Inter-Chromosomal Epistatic Regions Associated With Age at First Calving in Nellore Cattle


Bibliographic Details
Main authors: Alves, Anderson Antonio Carvalho, da Costa, Rebeka Magalhães, Fonseca, Larissa Fernanda Simielli, Carvalheiro, Roberto, Ventura, Ricardo Vieira, Rosa, Guilherme Jordão de Magalhães, Albuquerque, Lucia Galvão
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2022
Subjects: Genetics
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9178659/
https://www.ncbi.nlm.nih.gov/pubmed/35692843
http://dx.doi.org/10.3389/fgene.2022.834724
collection PubMed
description This study aimed to perform a genome-wide association analysis (GWAS) using the Random Forest (RF) approach for scanning candidate genes for age at first calving (AFC) in Nellore cattle. Additionally, potential epistatic effects were investigated using linear mixed models with pairwise interactions between all markers with high importance scores within the tree ensemble non-linear structure. Data from Nellore cattle were used, including records of animals born between 1984 and 2015 and raised in commercial herds located in different regions of Brazil. The estimated breeding values (EBV) were computed and used as the response variable in the genomic analyses. After quality control, the remaining number of animals and SNPs considered were 3,174 and 360,130, respectively. Five independent RF analyses were carried out, considering different initialization seeds. The importance score of each SNP was averaged across the independent RF analyses to rank the markers according to their predictive relevance. A total of 117 SNPs associated with AFC were identified, which spanned 10 autosomes (2, 3, 5, 10, 11, 17, 18, 21, 24, and 25). In total, 23 non-overlapping genomic regions embedded 262 candidate genes for AFC. Enrichment analysis and previous evidence in the literature revealed that many candidate genes annotated close to the lead SNPs have key roles in fertility, including embryo pre-implantation and development, embryonic viability, male germinal cell maturation, and pheromone recognition. Furthermore, some genomic regions previously associated with fertility and growth traits in Nellore cattle were also detected in the present study, reinforcing the effectiveness of RF for pre-screening candidate regions associated with complex traits. 
Complementary analyses revealed that many SNPs top-ranked in the RF-based GWAS did not show a strong marginal linear effect but are potentially involved in epistatic hotspots between genomic regions on different autosomes, notably BTA 3, 5, 11, and 21. The reported results are expected to improve the understanding of the genetic mechanisms involved in the biological regulation of AFC in this cattle breed.
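The ranking step described in the abstract (importance scores averaged over five independently seeded RF runs, then used to order the markers) can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name, the SNP labels, and the scores below are hypothetical, and each run is assumed to be available as a mapping from SNP identifier to importance score.

```python
from statistics import mean


def rank_snps_by_mean_importance(runs):
    """Average per-SNP importance over several runs and rank in descending order.

    `runs` is a list of dicts (one per RF run, hypothetical format) mapping
    SNP identifier -> importance score.
    """
    snp_ids = runs[0].keys()
    # Every run must score the same set of markers, otherwise the mean is ill-defined.
    for run in runs:
        if run.keys() != snp_ids:
            raise ValueError("all runs must contain the same SNP identifiers")
    averaged = {snp: mean(run[snp] for run in runs) for snp in snp_ids}
    # Highest mean importance first.
    return sorted(averaged.items(), key=lambda item: item[1], reverse=True)


# Made-up scores for three markers across five runs (one run per seed).
example_runs = [
    {"snp_a": 0.1, "snp_b": 0.4, "snp_c": 0.2},
    {"snp_a": 0.2, "snp_b": 0.3, "snp_c": 0.2},
    {"snp_a": 0.1, "snp_b": 0.5, "snp_c": 0.3},
    {"snp_a": 0.2, "snp_b": 0.4, "snp_c": 0.2},
    {"snp_a": 0.1, "snp_b": 0.4, "snp_c": 0.1},
]

ranked = rank_snps_by_mean_importance(example_runs)
for snp, score in ranked:
    print(f"{snp}\t{score:.3f}")
```

Averaging over seeds is meant to damp the run-to-run variability of tree-ensemble importance scores, so that a marker is ranked highly only if it is consistently informative.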
format Online
Article
Text
id pubmed-9178659
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-9178659 2022-06-10 Front Genet Genetics Frontiers Media S.A. 2022-05-18 Text en
Copyright © 2022 Alves, da Costa, Fonseca, Carvalheiro, Ventura, Rosa and Albuquerque. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
topic Genetics