Cargando…

The reference human nuclear mitochondrial sequences compilation validated and implemented on the UCSC genome browser

BACKGROUND: Eukaryotic nuclear genomes contain fragments of mitochondrial DNA called NumtS (Nuclear mitochondrial Sequences), whose mode and time of insertion, as well as their functional/structural role within the genome are debated issues. Insertion sites match with chromosomal breaks, revealing t...

Descripción completa

Detalles Bibliográficos
Autores principales: Simone, Domenico, Calabrese, Francesco Maria, Lang, Martin, Gasparre, Giuseppe, Attimonelli, Marcella
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228558/
https://www.ncbi.nlm.nih.gov/pubmed/22013967
http://dx.doi.org/10.1186/1471-2164-12-517
_version_ 1782217833482027008
author Simone, Domenico
Calabrese, Francesco Maria
Lang, Martin
Gasparre, Giuseppe
Attimonelli, Marcella
author_facet Simone, Domenico
Calabrese, Francesco Maria
Lang, Martin
Gasparre, Giuseppe
Attimonelli, Marcella
author_sort Simone, Domenico
collection PubMed
description BACKGROUND: Eukaryotic nuclear genomes contain fragments of mitochondrial DNA called NumtS (Nuclear mitochondrial Sequences), whose mode and time of insertion, as well as their functional/structural role within the genome are debated issues. Insertion sites match with chromosomal breaks, revealing that micro-deletions usually occurring at non-homologous end joining loci become reduced in presence of NumtS. Some NumtS are involved in recombination events leading to fragment duplication. Moreover, NumtS are polymorphic, a feature that renders them candidates as population markers. Finally, they are a cause of contamination during human mtDNA sequencing, leading to the generation of false heteroplasmies. RESULTS: Here we present RHNumtS.2, the most exhaustive human NumtSome catalogue annotating 585 NumtS, 97% of which were here validated in a European individual and in HapMap samples. The NumtS complete dataset and related features have been made available at the UCSC Genome Browser. The produced sequences have been submitted to INSDC databases. The implementation of the RHNumtS.2 tracks within the UCSC Genome Browser has been carried out with the aim to facilitate browsing of the NumtS tracks to be exploited in a wide range of research applications. CONCLUSIONS: We aimed at providing the scientific community with the most exhaustive overview on the human NumtSome, a resource whose aim is to support several research applications, such as studies concerning human structural variation, diversity, and disease, as well as the detection of false heteroplasmic mtDNA variants. Upon implementation of the NumtS tracks, the application of the BLAT program on the UCSC Genome Browser has now become an additional tool to check for heteroplasmic artefacts, supported by data available through the NumtS tracks.
format Online
Article
Text
id pubmed-3228558
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32285582011-12-02 The reference human nuclear mitochondrial sequences compilation validated and implemented on the UCSC genome browser Simone, Domenico Calabrese, Francesco Maria Lang, Martin Gasparre, Giuseppe Attimonelli, Marcella BMC Genomics Research Article BACKGROUND: Eukaryotic nuclear genomes contain fragments of mitochondrial DNA called NumtS (Nuclear mitochondrial Sequences), whose mode and time of insertion, as well as their functional/structural role within the genome are debated issues. Insertion sites match with chromosomal breaks, revealing that micro-deletions usually occurring at non-homologous end joining loci become reduced in presence of NumtS. Some NumtS are involved in recombination events leading to fragment duplication. Moreover, NumtS are polymorphic, a feature that renders them candidates as population markers. Finally, they are a cause of contamination during human mtDNA sequencing, leading to the generation of false heteroplasmies. RESULTS: Here we present RHNumtS.2, the most exhaustive human NumtSome catalogue annotating 585 NumtS, 97% of which were here validated in a European individual and in HapMap samples. The NumtS complete dataset and related features have been made available at the UCSC Genome Browser. The produced sequences have been submitted to INSDC databases. The implementation of the RHNumtS.2 tracks within the UCSC Genome Browser has been carried out with the aim to facilitate browsing of the NumtS tracks to be exploited in a wide range of research applications. CONCLUSIONS: We aimed at providing the scientific community with the most exhaustive overview on the human NumtSome, a resource whose aim is to support several research applications, such as studies concerning human structural variation, diversity, and disease, as well as the detection of false heteroplasmic mtDNA variants. Upon implementation of the NumtS tracks, the application of the BLAT program on the UCSC Genome Browser has now become an additional tool to check for heteroplasmic artefacts, supported by data available through the NumtS tracks. BioMed Central 2011-10-20 /pmc/articles/PMC3228558/ /pubmed/22013967 http://dx.doi.org/10.1186/1471-2164-12-517 Text en Copyright ©2011 Simone et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Simone, Domenico
Calabrese, Francesco Maria
Lang, Martin
Gasparre, Giuseppe
Attimonelli, Marcella
The reference human nuclear mitochondrial sequences compilation validated and implemented on the UCSC genome browser
title The reference human nuclear mitochondrial sequences compilation validated and implemented on the UCSC genome browser
title_full The reference human nuclear mitochondrial sequences compilation validated and implemented on the UCSC genome browser
title_fullStr The reference human nuclear mitochondrial sequences compilation validated and implemented on the UCSC genome browser
title_full_unstemmed The reference human nuclear mitochondrial sequences compilation validated and implemented on the UCSC genome browser
title_short The reference human nuclear mitochondrial sequences compilation validated and implemented on the UCSC genome browser
title_sort reference human nuclear mitochondrial sequences compilation validated and implemented on the ucsc genome browser
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228558/
https://www.ncbi.nlm.nih.gov/pubmed/22013967
http://dx.doi.org/10.1186/1471-2164-12-517
work_keys_str_mv AT simonedomenico thereferencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser
AT calabresefrancescomaria thereferencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser
AT langmartin thereferencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser
AT gasparregiuseppe thereferencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser
AT attimonellimarcella thereferencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser
AT simonedomenico referencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser
AT calabresefrancescomaria referencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser
AT langmartin referencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser
AT gasparregiuseppe referencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser
AT attimonellimarcella referencehumannuclearmitochondrialsequencescompilationvalidatedandimplementedontheucscgenomebrowser