
On the length, weight and GC content of the human genome

OBJECTIVE: Basic parameters commonly used to describe genomes including length, weight and relative guanine-cytosine (GC) content are widely cited in the absence of a primary source. By using updated data and original software we determined these values to the best of our knowledge as standard reference...


Bibliographic Details
Main Authors: Piovesan, Allison, Pelleri, Maria Chiara, Antonaros, Francesca, Strippoli, Pierluigi, Caracausi, Maria, Vitale, Lorenza
Format: Online Article Text
Language: English
Published: BioMed Central 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6391780/
https://www.ncbi.nlm.nih.gov/pubmed/30813969
http://dx.doi.org/10.1186/s13104-019-4137-z
author Piovesan, Allison
Pelleri, Maria Chiara
Antonaros, Francesca
Strippoli, Pierluigi
Caracausi, Maria
Vitale, Lorenza
author_sort Piovesan, Allison
collection PubMed
description OBJECTIVE: Basic parameters commonly used to describe genomes including length, weight and relative guanine-cytosine (GC) content are widely cited in the absence of a primary source. By using updated data and original software we determined these values to the best of our knowledge as standard reference for the whole human nuclear genome, for each chromosome and for mitochondrial DNA. We also devised a method to calculate the relative GC content in the whole messenger RNA sequence set and in transcriptomes by multiplying the GC content of each gene by its mean expression level. RESULTS: The male nuclear diploid genome extends for 6.27 Gigabase pairs (Gbp), is 205.00 centimeters (cm) long and weighs 6.41 picograms (pg). Female values are 6.37 Gbp, 208.23 cm, 6.51 pg. The individual variability and the implication for the DNA informational density in terms of bits/volume were discussed. The genomic GC content is 40.9%. Following analysis in different transcriptomes and species, we showed that the greatest deviation was observed in the pathological condition analysed (trisomy 21 leukaemic cells) and in Caenorhabditis elegans. Our results may represent a solid basis for further investigation on human structural and functional genomics while also providing a framework for other genome comparative analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13104-019-4137-z) contains supplementary material, which is available to authorized users.
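The arithmetic behind the figures in the abstract can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' software. The conversion factor of about 978 Mbp per picogram is a commonly used value that happens to reproduce the reported weights to two decimals; the record does not say which factor the authors applied. The per-base-pair length is simply back-calculated from the rounded values quoted above. The expression-weighted GC function normalizes by total expression, which is an assumption; the abstract only states that each gene's GC content is multiplied by its mean expression level. All function names are hypothetical.

```python
from typing import Iterable, Tuple

# Assumption: commonly used conversion, 1 pg of double-stranded DNA ~ 978 Mbp.
# The catalog record does not state the factor used by the authors.
BP_PER_PG = 0.978e9


def weight_pg(length_bp: float) -> float:
    """Estimate DNA mass in picograms from a length in base pairs."""
    return length_bp / BP_PER_PG


def implied_rise_nm(length_cm: float, length_bp: float) -> float:
    """Back-calculate the length per base pair (nm) implied by two reported values."""
    nm_per_cm = 1e7
    return length_cm * nm_per_cm / length_bp


def expression_weighted_gc(genes: Iterable[Tuple[float, float]]) -> float:
    """Mean GC content (%) over genes, weighted by mean expression level.

    `genes` holds (gc_percent, mean_expression) pairs. Dividing by the total
    expression is an assumption made here to obtain a percentage.
    """
    pairs = list(genes)
    total_expression = sum(expr for _, expr in pairs)
    if total_expression <= 0:
        raise ValueError("total expression must be positive")
    return sum(gc * expr for gc, expr in pairs) / total_expression


if __name__ == "__main__":
    # Rounded values quoted in the abstract (male and female diploid genomes).
    for label, gbp, cm in (("male", 6.27, 205.00), ("female", 6.37, 208.23)):
        bp = gbp * 1e9
        print(label, round(weight_pg(bp), 2), "pg,",
              round(implied_rise_nm(cm, bp), 3), "nm/bp")
    # Toy transcriptome with two hypothetical genes.
    print(expression_weighted_gc([(40.0, 1.0), (60.0, 3.0)]))
```

Because the inputs are the rounded values from the abstract, the results are only a consistency check on the reported numbers, not a recomputation from sequence data.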
format Online
Article
Text
id pubmed-6391780
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-6391780 2019-03-11 On the length, weight and GC content of the human genome Piovesan, Allison Pelleri, Maria Chiara Antonaros, Francesca Strippoli, Pierluigi Caracausi, Maria Vitale, Lorenza BMC Res Notes Research Note OBJECTIVE: Basic parameters commonly used to describe genomes including length, weight and relative guanine-cytosine (GC) content are widely cited in the absence of a primary source. By using updated data and original software we determined these values to the best of our knowledge as standard reference for the whole human nuclear genome, for each chromosome and for mitochondrial DNA. We also devised a method to calculate the relative GC content in the whole messenger RNA sequence set and in transcriptomes by multiplying the GC content of each gene by its mean expression level. RESULTS: The male nuclear diploid genome extends for 6.27 Gigabase pairs (Gbp), is 205.00 centimeters (cm) long and weighs 6.41 picograms (pg). Female values are 6.37 Gbp, 208.23 cm, 6.51 pg. The individual variability and the implication for the DNA informational density in terms of bits/volume were discussed. The genomic GC content is 40.9%. Following analysis in different transcriptomes and species, we showed that the greatest deviation was observed in the pathological condition analysed (trisomy 21 leukaemic cells) and in Caenorhabditis elegans. Our results may represent a solid basis for further investigation on human structural and functional genomics while also providing a framework for other genome comparative analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13104-019-4137-z) contains supplementary material, which is available to authorized users.
BioMed Central 2019-02-27 /pmc/articles/PMC6391780/ /pubmed/30813969 http://dx.doi.org/10.1186/s13104-019-4137-z Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title On the length, weight and GC content of the human genome
topic Research Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6391780/
https://www.ncbi.nlm.nih.gov/pubmed/30813969
http://dx.doi.org/10.1186/s13104-019-4137-z