Collinearity analysis of Brassica A and C genomes based on an updated inferred unigene order
This data article includes SNP scoring across lines of the Brassica napus TNDH population based on Illumina sequencing of mRNA, expanded to 75 lines. The 21,323 mapped markers defined 887 recombination bins, representing an updated genetic linkage map for the species. Based on this new map, 5 genom...
Main authors: | Bancroft, Ian; Fraser, Fiona; Morgan, Colin; Trick, Martin |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2015 |
Subjects: | Data Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4510050/ https://www.ncbi.nlm.nih.gov/pubmed/26217717 http://dx.doi.org/10.1016/j.dib.2015.01.004 |
_version_ | 1782382110075518976 |
---|---|
author | Bancroft, Ian Fraser, Fiona Morgan, Colin Trick, Martin |
author_facet | Bancroft, Ian Fraser, Fiona Morgan, Colin Trick, Martin |
author_sort | Bancroft, Ian |
collection | PubMed |
description | This data article includes SNP scoring across lines of the Brassica napus TNDH population based on Illumina sequencing of mRNA, expanded to 75 lines. The 21,323 mapped markers defined 887 recombination bins, representing an updated genetic linkage map for the species. Based on this new map, 5 genome sequence scaffolds were split and the order and orientation of scaffolds updated to establish a new pseudomolecule specification. The order of unigenes and SNP array probes within these pseudomolecules was determined. Unigenes were assessed for sequence similarity to the A and C genomes. The 57,246 that mapped to both enabled the collinearity of the A and C genomes to be illustrated graphically. Although the great majority were in collinear positions, some were not. Analyses of 60 such instances are presented, suggesting that the breakdown in collinearity was largely due to either the absence of the homoeologue on one genome (resulting in a sequence match to a paralogue) or multiple similar sequences being present. The mRNAseq datasets for the TNDH lines are available from the SRA repository (ERA283648); the remaining datasets are supplied with this article. |
format | Online Article Text |
id | pubmed-4510050 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-4510050 2015-07-27 Collinearity analysis of Brassica A and C genomes based on an updated inferred unigene order Bancroft, Ian Fraser, Fiona Morgan, Colin Trick, Martin Data Brief Data Article This data article includes SNP scoring across lines of the Brassica napus TNDH population based on Illumina sequencing of mRNA, expanded to 75 lines. The 21,323 mapped markers defined 887 recombination bins, representing an updated genetic linkage map for the species. Based on this new map, 5 genome sequence scaffolds were split and the order and orientation of scaffolds updated to establish a new pseudomolecule specification. The order of unigenes and SNP array probes within these pseudomolecules was determined. Unigenes were assessed for sequence similarity to the A and C genomes. The 57,246 that mapped to both enabled the collinearity of the A and C genomes to be illustrated graphically. Although the great majority was in collinear positions, some were not. Analyses of 60 such instances are presented, suggesting that the breakdown in collinearity was largely due to either the absence of the homoeologue on one genome (resulting in sequence match to a paralogue) or multiple similar sequences being present. The mRNAseq datasets for the TNDH lines are available from the SRA repository (ERA283648); the remaining datasets are supplied with this article. Elsevier 2015-02-10 /pmc/articles/PMC4510050/ /pubmed/26217717 http://dx.doi.org/10.1016/j.dib.2015.01.004 Text en © 2015 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Data Article Bancroft, Ian Fraser, Fiona Morgan, Colin Trick, Martin Collinearity analysis of Brassica A and C genomes based on an updated inferred unigene order |
title | Collinearity analysis of Brassica A and C genomes based on an updated inferred unigene order |
title_sort | collinearity analysis of brassica a and c genomes based on an updated inferred unigene order |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4510050/ https://www.ncbi.nlm.nih.gov/pubmed/26217717 http://dx.doi.org/10.1016/j.dib.2015.01.004 |