Masking repeats while clustering ESTs

Bibliographic Details
Main authors: Schneeberger, Korbinian; Malde, Ketil; Coward, Eivind; Jonassen, Inge
Format: Text
Language: English
Published: Oxford University Press 2005
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1079970/
https://www.ncbi.nlm.nih.gov/pubmed/15831790
http://dx.doi.org/10.1093/nar/gki511
_version_ 1782123469614350336
author Schneeberger, Korbinian
Malde, Ketil
Coward, Eivind
Jonassen, Inge
collection PubMed
description A problem in EST clustering is the presence of repeat sequences. To avoid false matches, repeats have to be masked. This can be a time-consuming process, and it depends on available repeat libraries. We present a fast and effective method that aims to eliminate the problems repeats cause in the process of clustering. Unlike traditional methods, repeats are inferred directly from the EST data; we do not rely on any external library of known repeats. This makes the method especially suitable for analysing the ESTs from organisms without good repeat libraries. We demonstrate that the result is very similar to performing standard repeat masking before clustering.
format Text
id pubmed-1079970
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-1079970; 2005-04-14; Masking repeats while clustering ESTs; Schneeberger, Korbinian; Malde, Ketil; Coward, Eivind; Jonassen, Inge; Nucleic Acids Res; Article; Oxford University Press 2005; 2005-04-14; /pmc/articles/PMC1079970/; /pubmed/15831790; http://dx.doi.org/10.1093/nar/gki511; Text; en; © The Author 2005. Published by Oxford University Press. All rights reserved
title Masking repeats while clustering ESTs
topic Article