Modeling one thousand intron length distributions with fitild

MOTIVATION: Intron length distribution (ILD) is a specific feature of a genome that exhibits extensive species-specific variation. Whereas ILD contributes to up to 30% of the total information content for intron recognition in some species, rendering it an important component of computational gene p...

Bibliographic Details
Main author: Gotoh, Osamu
Format: Online Article Text
Language: English
Published: Oxford University Press 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157073/
https://www.ncbi.nlm.nih.gov/pubmed/29722882
http://dx.doi.org/10.1093/bioinformatics/bty353
_version_ 1783358204434972672
author Gotoh, Osamu
collection PubMed
description MOTIVATION: Intron length distribution (ILD) is a specific feature of a genome that exhibits extensive species-specific variation. Whereas ILD contributes to up to 30% of the total information content for intron recognition in some species, rendering it an important component of computational gene prediction, very few studies have been conducted to quantitatively characterize ILDs of various species. RESULTS: We developed a set of computer programs (fitild, compild, etc.) to build statistical models of ILDs and compare them with one another. Each ILD of more than 1000 genomes was fitted with fitild to a statistical model consisting of one, two, or three components of Frechet distributions. Several measures of distances between ILDs were calculated by compild. A theoretical model was presented to better understand the origin of the observed shape of an ILD. AVAILABILITY AND IMPLEMENTATION: The C++ source codes are available at https://github.com/ogotoh/fitild.git/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6157073
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-6157073 2018-10-01 Modeling one thousand intron length distributions with fitild Gotoh, Osamu Bioinformatics Original Papers Oxford University Press 2018-10-01 2018-05-02 /pmc/articles/PMC6157073/ /pubmed/29722882 http://dx.doi.org/10.1093/bioinformatics/bty353 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title Modeling one thousand intron length distributions with fitild
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157073/
https://www.ncbi.nlm.nih.gov/pubmed/29722882
http://dx.doi.org/10.1093/bioinformatics/bty353