Cargando…

Masking repeats while clustering ESTs

A problem in EST clustering is the presence of repeat sequences. To avoid false matches, repeats have to be masked. This can be a time-consuming process, and it depends on available repeat libraries. We present a fast and effective method that aims to eliminate the problems repeats cause in the proc...

Descripción completa

Detalles Bibliográficos
Autores principales:	Schneeberger, Korbinian, Malde, Ketil, Coward, Eivind, Jonassen, Inge
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2005
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1079970/ https://www.ncbi.nlm.nih.gov/pubmed/15831790 http://dx.doi.org/10.1093/nar/gki511

_version_	1782123469614350336
author	Schneeberger, Korbinian Malde, Ketil Coward, Eivind Jonassen, Inge
author_facet	Schneeberger, Korbinian Malde, Ketil Coward, Eivind Jonassen, Inge
author_sort	Schneeberger, Korbinian
collection	PubMed
description	A problem in EST clustering is the presence of repeat sequences. To avoid false matches, repeats have to be masked. This can be a time-consuming process, and it depends on available repeat libraries. We present a fast and effective method that aims to eliminate the problems repeats cause in the process of clustering. Unlike traditional methods, repeats are inferred directly from the EST data, we do not rely on any external library of known repeats. This makes the method especially suitable for analysing the ESTs from organisms without good repeat libraries. We demonstrate that the result is very similar to performing standard repeat masking before clustering.
format	Text
id	pubmed-1079970
institution	National Center for Biotechnology Information
language	English
publishDate	2005
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-10799702005-04-14 Masking repeats while clustering ESTs Schneeberger, Korbinian Malde, Ketil Coward, Eivind Jonassen, Inge Nucleic Acids Res Article A problem in EST clustering is the presence of repeat sequences. To avoid false matches, repeats have to be masked. This can be a time-consuming process, and it depends on available repeat libraries. We present a fast and effective method that aims to eliminate the problems repeats cause in the process of clustering. Unlike traditional methods, repeats are inferred directly from the EST data, we do not rely on any external library of known repeats. This makes the method especially suitable for analysing the ESTs from organisms without good repeat libraries. We demonstrate that the result is very similar to performing standard repeat masking before clustering. Oxford University Press 2005 2005-04-14 /pmc/articles/PMC1079970/ /pubmed/15831790 http://dx.doi.org/10.1093/nar/gki511 Text en © The Author 2005. Published by Oxford University Press. All rights reserved
spellingShingle	Article Schneeberger, Korbinian Malde, Ketil Coward, Eivind Jonassen, Inge Masking repeats while clustering ESTs
title	Masking repeats while clustering ESTs
title_full	Masking repeats while clustering ESTs
title_fullStr	Masking repeats while clustering ESTs
title_full_unstemmed	Masking repeats while clustering ESTs
title_short	Masking repeats while clustering ESTs
title_sort	masking repeats while clustering ests
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1079970/ https://www.ncbi.nlm.nih.gov/pubmed/15831790 http://dx.doi.org/10.1093/nar/gki511
work_keys_str_mv	AT schneebergerkorbinian maskingrepeatswhileclusteringests AT maldeketil maskingrepeatswhileclusteringests AT cowardeivind maskingrepeatswhileclusteringests AT jonasseninge maskingrepeatswhileclusteringests

Masking repeats while clustering ESTs

Ejemplares similares