Cargando…

An optimistic protein assembly from sequence reads salvaged an uncharacterized segment of mouse picobirnavirus

Advances in Next Generation Sequencing technologies have enabled the generation of millions of sequences from microorganisms. However, distinguishing the sequence of a novel species from sequencing errors remains a technical challenge when the novel species is highly divergent from the closest known...

Descripción completa

Detalles Bibliográficos
Autores principales: Gonzalez, Gabriel, Sasaki, Michihito, Burkitt-Gray, Lucy, Kamiya, Tomonori, Tsuji, Noriko M., Sawa, Hirofumi, Ito, Kimihito
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5223137/
https://www.ncbi.nlm.nih.gov/pubmed/28071766
http://dx.doi.org/10.1038/srep40447
Descripción
Sumario:Advances in Next Generation Sequencing technologies have enabled the generation of millions of sequences from microorganisms. However, distinguishing the sequence of a novel species from sequencing errors remains a technical challenge when the novel species is highly divergent from the closest known species. To solve such a problem, we developed a new method called Optimistic Protein Assembly from Reads (OPAR). This method is based on the assumption that protein sequences could be more conserved than the nucleotide sequences encoding them. By taking advantage of metagenomics, bioinformatics and conventional Sanger sequencing, our method successfully identified all coding regions of the mouse picobirnavirus for the first time. The salvaged sequences indicated that segment 1 of this virus was more divergent from its homologues in other Picobirnaviridae species than segment 2. For this reason, only segment 2 of mouse picobirnavirus has been detected in previous studies. OPAR web tool is available at http://bioinformatics.czc.hokudai.ac.jp/opar/.