
Structural variants in 3000 rice genomes

Investigation of large structural variants (SVs) is a challenging yet important task in understanding trait differences in highly repetitive genomes. Combining different bioinformatic approaches for SV detection, we analyzed whole-genome sequencing data from 3000 rice genomes and identified 63 milli...


Bibliographic Details
Main Authors: Fuentes, Roven Rommel, Chebotarov, Dmytro, Duitama, Jorge, Smith, Sean, De la Hoz, Juan Fernando, Mohiyuddin, Marghoob, Wing, Rod A., McNally, Kenneth L., Tatarinova, Tatiana, Grigoriev, Andrey, Mauleon, Ramil, Alexandrov, Nickolai
Format: Online Article Text
Language: English
Published: Cold Spring Harbor Laboratory Press 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6499320/
https://www.ncbi.nlm.nih.gov/pubmed/30992303
http://dx.doi.org/10.1101/gr.241240.118
author Fuentes, Roven Rommel
Chebotarov, Dmytro
Duitama, Jorge
Smith, Sean
De la Hoz, Juan Fernando
Mohiyuddin, Marghoob
Wing, Rod A.
McNally, Kenneth L.
Tatarinova, Tatiana
Grigoriev, Andrey
Mauleon, Ramil
Alexandrov, Nickolai
collection PubMed
description Investigation of large structural variants (SVs) is a challenging yet important task in understanding trait differences in highly repetitive genomes. Combining different bioinformatic approaches for SV detection, we analyzed whole-genome sequencing data from 3000 rice genomes and identified 63 million individual SV calls that grouped into 1.5 million allelic variants. We found enrichment of long SVs in promoters and an excess of shorter variants in 5′ UTRs. Across the rice genomes, we identified regions of high SV frequency enriched in stress response genes. We demonstrated how SVs may help in finding causative variants in genome-wide association analysis. These new insights into rice genome biology are valuable for understanding the effects SVs have on gene function, with the prospect of identifying novel agronomically important alleles that can be utilized to improve cultivated rice.
format Online
Article
Text
id pubmed-6499320
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-64993202019-05-17 Structural variants in 3000 rice genomes Fuentes, Roven Rommel Chebotarov, Dmytro Duitama, Jorge Smith, Sean De la Hoz, Juan Fernando Mohiyuddin, Marghoob Wing, Rod A. McNally, Kenneth L. Tatarinova, Tatiana Grigoriev, Andrey Mauleon, Ramil Alexandrov, Nickolai Genome Res Resource Investigation of large structural variants (SVs) is a challenging yet important task in understanding trait differences in highly repetitive genomes. Combining different bioinformatic approaches for SV detection, we analyzed whole-genome sequencing data from 3000 rice genomes and identified 63 million individual SV calls that grouped into 1.5 million allelic variants. We found enrichment of long SVs in promoters and an excess of shorter variants in 5′ UTRs. Across the rice genomes, we identified regions of high SV frequency enriched in stress response genes. We demonstrated how SVs may help in finding causative variants in genome-wide association analysis. These new insights into rice genome biology are valuable for understanding the effects SVs have on gene function, with the prospect of identifying novel agronomically important alleles that can be utilized to improve cultivated rice. Cold Spring Harbor Laboratory Press 2019-05 /pmc/articles/PMC6499320/ /pubmed/30992303 http://dx.doi.org/10.1101/gr.241240.118 Text en © 2019 Fuentes et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.
title Structural variants in 3000 rice genomes
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6499320/
https://www.ncbi.nlm.nih.gov/pubmed/30992303
http://dx.doi.org/10.1101/gr.241240.118