Cargando…
Modeling one thousand intron length distributions with fitild
MOTIVATION: Intron length distribution (ILD) is a specific feature of a genome that exhibits extensive species-specific variation. Whereas ILD contributes to up to 30% of the total information content for intron recognition in some species, rendering it an important component of computational gene p...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157073/ https://www.ncbi.nlm.nih.gov/pubmed/29722882 http://dx.doi.org/10.1093/bioinformatics/bty353 |
_version_ | 1783358204434972672 |
---|---|
author | Gotoh, Osamu |
author_facet | Gotoh, Osamu |
author_sort | Gotoh, Osamu |
collection | PubMed |
description | MOTIVATION: Intron length distribution (ILD) is a specific feature of a genome that exhibits extensive species-specific variation. Whereas ILD contributes to up to 30% of the total information content for intron recognition in some species, rendering it an important component of computational gene prediction, very few studies have been conducted to quantitatively characterize ILDs of various species. RESULTS: We developed a set of computer programs (fitild, compild, etc.) to build statistical models of ILDs and compare them with one another. Each ILD of more than 1000 genomes was fitted with fitild to a statistical model consisting of one, two, or three components of Frechet distributions. Several measures of distances between ILDs were calculated by compild. A theoretical model was presented to better understand the origin of the observed shape of an ILD. AVAILABILITY AND IMPLEMENTATION: The C++ source codes are available at https://github.com/ogotoh/fitild.git/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6157073 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-61570732018-10-01 Modeling one thousand intron length distributions with fitild Gotoh, Osamu Bioinformatics Original Papers MOTIVATION: Intron length distribution (ILD) is a specific feature of a genome that exhibits extensive species-specific variation. Whereas ILD contributes to up to 30% of the total information content for intron recognition in some species, rendering it an important component of computational gene prediction, very few studies have been conducted to quantitatively characterize ILDs of various species. RESULTS: We developed a set of computer programs (fitild, compild, etc.) to build statistical models of ILDs and compare them with one another. Each ILD of more than 1000 genomes was fitted with fitild to a statistical model consisting of one, two, or three components of Frechet distributions. Several measures of distances between ILDs were calculated by compild. A theoretical model was presented to better understand the origin of the observed shape of an ILD. AVAILABILITY AND IMPLEMENTATION: The C++ source codes are available at https://github.com/ogotoh/fitild.git/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-10-01 2018-05-02 /pmc/articles/PMC6157073/ /pubmed/29722882 http://dx.doi.org/10.1093/bioinformatics/bty353 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Gotoh, Osamu Modeling one thousand intron length distributions with fitild |
title | Modeling one thousand intron length distributions with fitild |
title_full | Modeling one thousand intron length distributions with fitild |
title_fullStr | Modeling one thousand intron length distributions with fitild |
title_full_unstemmed | Modeling one thousand intron length distributions with fitild |
title_short | Modeling one thousand intron length distributions with fitild |
title_sort | modeling one thousand intron length distributions with fitild |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157073/ https://www.ncbi.nlm.nih.gov/pubmed/29722882 http://dx.doi.org/10.1093/bioinformatics/bty353 |
work_keys_str_mv | AT gotohosamu modelingonethousandintronlengthdistributionswithfitild |