Cargando…

IPRStats: visualization of the functional potential of an InterProScan run

BACKGROUND: InterPro is a collection of protein signatures for the classification and automated annotation of proteins. Interproscan is a software tool that scans protein sequences against Interpro member databases using a variety of profile-based, hidden markov model and positional specific score m...

Descripción completa

Detalles Bibliográficos
Autores principales: Kelly, Ryan J, Vincent, David E, Friedberg, Iddo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3040527/
https://www.ncbi.nlm.nih.gov/pubmed/21210980
http://dx.doi.org/10.1186/1471-2105-11-S12-S13
_version_ 1782198329977864192
author Kelly, Ryan J
Vincent, David E
Friedberg, Iddo
author_facet Kelly, Ryan J
Vincent, David E
Friedberg, Iddo
author_sort Kelly, Ryan J
collection PubMed
description BACKGROUND: InterPro is a collection of protein signatures for the classification and automated annotation of proteins. Interproscan is a software tool that scans protein sequences against Interpro member databases using a variety of profile-based, hidden markov model and positional specific score matrix methods. It not only combines a set of analysis tools, but also performs data look-up from various sources, as well as some redundancy removal. Interproscan is robust and scalable, able to perform on any machine from a netbook to a large cluster. However, when performing whole-genome or metagenome analysis, there is a need for a fast statistical visualization of the results to have good initial grasp on the functional potential of the sequences in the analyzed data set. This is especially important when analyzing and comparing metagenomic or metaproteomic data-sets. RESULTS: IPRStats is a tool for the visualization of Interproscan results. Interproscan results are parsed from the Interproscan XML or EBIXML file into an SQLite or MySQL database. The results for each signature database scan are read and displayed as pie-charts or bar charts as summary statistics. A table is also provided, where each entry is a signature (e.g. a Pfam entry) accompanied by one or more Gene Ontology terms, if Interproscan was run using the Gene Ontology option. CONCLUSIONS: We present an platform-independent, open source licensed tool that is useful for Interproscan users who wish to view the summary of their results in a rapid and concise fashion.
format Text
id pubmed-3040527
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30405272011-02-18 IPRStats: visualization of the functional potential of an InterProScan run Kelly, Ryan J Vincent, David E Friedberg, Iddo BMC Bioinformatics Proceedings BACKGROUND: InterPro is a collection of protein signatures for the classification and automated annotation of proteins. Interproscan is a software tool that scans protein sequences against Interpro member databases using a variety of profile-based, hidden markov model and positional specific score matrix methods. It not only combines a set of analysis tools, but also performs data look-up from various sources, as well as some redundancy removal. Interproscan is robust and scalable, able to perform on any machine from a netbook to a large cluster. However, when performing whole-genome or metagenome analysis, there is a need for a fast statistical visualization of the results to have good initial grasp on the functional potential of the sequences in the analyzed data set. This is especially important when analyzing and comparing metagenomic or metaproteomic data-sets. RESULTS: IPRStats is a tool for the visualization of Interproscan results. Interproscan results are parsed from the Interproscan XML or EBIXML file into an SQLite or MySQL database. The results for each signature database scan are read and displayed as pie-charts or bar charts as summary statistics. A table is also provided, where each entry is a signature (e.g. a Pfam entry) accompanied by one or more Gene Ontology terms, if Interproscan was run using the Gene Ontology option. CONCLUSIONS: We present an platform-independent, open source licensed tool that is useful for Interproscan users who wish to view the summary of their results in a rapid and concise fashion. BioMed Central 2010-12-21 /pmc/articles/PMC3040527/ /pubmed/21210980 http://dx.doi.org/10.1186/1471-2105-11-S12-S13 Text en Copyright ©2010 Kelly et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Kelly, Ryan J
Vincent, David E
Friedberg, Iddo
IPRStats: visualization of the functional potential of an InterProScan run
title IPRStats: visualization of the functional potential of an InterProScan run
title_full IPRStats: visualization of the functional potential of an InterProScan run
title_fullStr IPRStats: visualization of the functional potential of an InterProScan run
title_full_unstemmed IPRStats: visualization of the functional potential of an InterProScan run
title_short IPRStats: visualization of the functional potential of an InterProScan run
title_sort iprstats: visualization of the functional potential of an interproscan run
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3040527/
https://www.ncbi.nlm.nih.gov/pubmed/21210980
http://dx.doi.org/10.1186/1471-2105-11-S12-S13
work_keys_str_mv AT kellyryanj iprstatsvisualizationofthefunctionalpotentialofaninterproscanrun
AT vincentdavide iprstatsvisualizationofthefunctionalpotentialofaninterproscanrun
AT friedbergiddo iprstatsvisualizationofthefunctionalpotentialofaninterproscanrun