Cargando…
IPRStats: visualization of the functional potential of an InterProScan run
BACKGROUND: InterPro is a collection of protein signatures for the classification and automated annotation of proteins. Interproscan is a software tool that scans protein sequences against Interpro member databases using a variety of profile-based, hidden markov model and positional specific score m...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3040527/ https://www.ncbi.nlm.nih.gov/pubmed/21210980 http://dx.doi.org/10.1186/1471-2105-11-S12-S13 |
_version_ | 1782198329977864192 |
---|---|
author | Kelly, Ryan J Vincent, David E Friedberg, Iddo |
author_facet | Kelly, Ryan J Vincent, David E Friedberg, Iddo |
author_sort | Kelly, Ryan J |
collection | PubMed |
description | BACKGROUND: InterPro is a collection of protein signatures for the classification and automated annotation of proteins. Interproscan is a software tool that scans protein sequences against Interpro member databases using a variety of profile-based, hidden markov model and positional specific score matrix methods. It not only combines a set of analysis tools, but also performs data look-up from various sources, as well as some redundancy removal. Interproscan is robust and scalable, able to perform on any machine from a netbook to a large cluster. However, when performing whole-genome or metagenome analysis, there is a need for a fast statistical visualization of the results to have good initial grasp on the functional potential of the sequences in the analyzed data set. This is especially important when analyzing and comparing metagenomic or metaproteomic data-sets. RESULTS: IPRStats is a tool for the visualization of Interproscan results. Interproscan results are parsed from the Interproscan XML or EBIXML file into an SQLite or MySQL database. The results for each signature database scan are read and displayed as pie-charts or bar charts as summary statistics. A table is also provided, where each entry is a signature (e.g. a Pfam entry) accompanied by one or more Gene Ontology terms, if Interproscan was run using the Gene Ontology option. CONCLUSIONS: We present an platform-independent, open source licensed tool that is useful for Interproscan users who wish to view the summary of their results in a rapid and concise fashion. |
format | Text |
id | pubmed-3040527 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30405272011-02-18 IPRStats: visualization of the functional potential of an InterProScan run Kelly, Ryan J Vincent, David E Friedberg, Iddo BMC Bioinformatics Proceedings BACKGROUND: InterPro is a collection of protein signatures for the classification and automated annotation of proteins. Interproscan is a software tool that scans protein sequences against Interpro member databases using a variety of profile-based, hidden markov model and positional specific score matrix methods. It not only combines a set of analysis tools, but also performs data look-up from various sources, as well as some redundancy removal. Interproscan is robust and scalable, able to perform on any machine from a netbook to a large cluster. However, when performing whole-genome or metagenome analysis, there is a need for a fast statistical visualization of the results to have good initial grasp on the functional potential of the sequences in the analyzed data set. This is especially important when analyzing and comparing metagenomic or metaproteomic data-sets. RESULTS: IPRStats is a tool for the visualization of Interproscan results. Interproscan results are parsed from the Interproscan XML or EBIXML file into an SQLite or MySQL database. The results for each signature database scan are read and displayed as pie-charts or bar charts as summary statistics. A table is also provided, where each entry is a signature (e.g. a Pfam entry) accompanied by one or more Gene Ontology terms, if Interproscan was run using the Gene Ontology option. CONCLUSIONS: We present an platform-independent, open source licensed tool that is useful for Interproscan users who wish to view the summary of their results in a rapid and concise fashion. BioMed Central 2010-12-21 /pmc/articles/PMC3040527/ /pubmed/21210980 http://dx.doi.org/10.1186/1471-2105-11-S12-S13 Text en Copyright ©2010 Kelly et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Kelly, Ryan J Vincent, David E Friedberg, Iddo IPRStats: visualization of the functional potential of an InterProScan run |
title | IPRStats: visualization of the functional potential of an InterProScan run |
title_full | IPRStats: visualization of the functional potential of an InterProScan run |
title_fullStr | IPRStats: visualization of the functional potential of an InterProScan run |
title_full_unstemmed | IPRStats: visualization of the functional potential of an InterProScan run |
title_short | IPRStats: visualization of the functional potential of an InterProScan run |
title_sort | iprstats: visualization of the functional potential of an interproscan run |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3040527/ https://www.ncbi.nlm.nih.gov/pubmed/21210980 http://dx.doi.org/10.1186/1471-2105-11-S12-S13 |
work_keys_str_mv | AT kellyryanj iprstatsvisualizationofthefunctionalpotentialofaninterproscanrun AT vincentdavide iprstatsvisualizationofthefunctionalpotentialofaninterproscanrun AT friedbergiddo iprstatsvisualizationofthefunctionalpotentialofaninterproscanrun |