Cargando…

InterProScan 5: genome-scale protein function classification

Motivation: Robust large-scale sequence analysis is a major challenge in modern genomic science, where biologists are frequently trying to characterize many millions of sequences. Here, we describe a new Java-based architecture for the widely used protein function prediction software package InterPr...

Descripción completa

Detalles Bibliográficos
Autores principales: Jones, Philip, Binns, David, Chang, Hsin-Yu, Fraser, Matthew, Li, Weizhong, McAnulla, Craig, McWilliam, Hamish, Maslen, John, Mitchell, Alex, Nuka, Gift, Pesseat, Sebastien, Quinn, Antony F., Sangrador-Vegas, Amaia, Scheremetjew, Maxim, Yong, Siew-Yit, Lopez, Rodrigo, Hunter, Sarah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3998142/
https://www.ncbi.nlm.nih.gov/pubmed/24451626
http://dx.doi.org/10.1093/bioinformatics/btu031
Descripción
Sumario:Motivation: Robust large-scale sequence analysis is a major challenge in modern genomic science, where biologists are frequently trying to characterize many millions of sequences. Here, we describe a new Java-based architecture for the widely used protein function prediction software package InterProScan. Developments include improvements and additions to the outputs of the software and the complete reimplementation of the software framework, resulting in a flexible and stable system that is able to use both multiprocessor machines and/or conventional clusters to achieve scalable distributed data analysis. InterProScan is freely available for download from the EMBl-EBI FTP site and the open source code is hosted at Google Code. Availability and implementation: InterProScan is distributed via FTP at ftp://ftp.ebi.ac.uk/pub/software/unix/iprscan/5/ and the source code is available from http://code.google.com/p/interproscan/. Contact: http://www.ebi.ac.uk/support or interhelp@ebi.ac.uk or mitchell@ebi.ac.uk