Cargando…

A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling

BACKGROUND: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cel...

Descripción completa

Detalles Bibliográficos
Autores principales: Pinheiro, Daniel G, Galante, Pedro AF, de Souza, Sandro J, Zago, Marco A, Silva, Wilson A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2701951/
https://www.ncbi.nlm.nih.gov/pubmed/19500384
http://dx.doi.org/10.1186/1471-2105-10-170
_version_ 1782168724641415168
author Pinheiro, Daniel G
Galante, Pedro AF
de Souza, Sandro J
Zago, Marco A
Silva, Wilson A
author_facet Pinheiro, Daniel G
Galante, Pedro AF
de Souza, Sandro J
Zago, Marco A
Silva, Wilson A
author_sort Pinheiro, Daniel G
collection PubMed
description BACKGROUND: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. RESULTS: This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. CONCLUSION: These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at . S3T source code and datasets can also be downloaded from the aforementioned website.
format Text
id pubmed-2701951
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27019512009-06-26 A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling Pinheiro, Daniel G Galante, Pedro AF de Souza, Sandro J Zago, Marco A Silva, Wilson A BMC Bioinformatics Software BACKGROUND: High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. RESULTS: This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. CONCLUSION: These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at . S3T source code and datasets can also be downloaded from the aforementioned website. BioMed Central 2009-06-06 /pmc/articles/PMC2701951/ /pubmed/19500384 http://dx.doi.org/10.1186/1471-2105-10-170 Text en Copyright © 2009 Pinheiro et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Pinheiro, Daniel G
Galante, Pedro AF
de Souza, Sandro J
Zago, Marco A
Silva, Wilson A
A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling
title A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling
title_full A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling
title_fullStr A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling
title_full_unstemmed A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling
title_short A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling
title_sort score system for quality evaluation of rna sequence tags: an improvement for gene expression profiling
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2701951/
https://www.ncbi.nlm.nih.gov/pubmed/19500384
http://dx.doi.org/10.1186/1471-2105-10-170
work_keys_str_mv AT pinheirodanielg ascoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling
AT galantepedroaf ascoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling
AT desouzasandroj ascoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling
AT zagomarcoa ascoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling
AT silvawilsona ascoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling
AT pinheirodanielg scoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling
AT galantepedroaf scoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling
AT desouzasandroj scoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling
AT zagomarcoa scoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling
AT silvawilsona scoresystemforqualityevaluationofrnasequencetagsanimprovementforgeneexpressionprofiling