Cargando…
Masking repeats while clustering ESTs
A problem in EST clustering is the presence of repeat sequences. To avoid false matches, repeats have to be masked. This can be a time-consuming process, and it depends on available repeat libraries. We present a fast and effective method that aims to eliminate the problems repeats cause in the proc...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1079970/ https://www.ncbi.nlm.nih.gov/pubmed/15831790 http://dx.doi.org/10.1093/nar/gki511 |
_version_ | 1782123469614350336 |
---|---|
author | Schneeberger, Korbinian Malde, Ketil Coward, Eivind Jonassen, Inge |
author_facet | Schneeberger, Korbinian Malde, Ketil Coward, Eivind Jonassen, Inge |
author_sort | Schneeberger, Korbinian |
collection | PubMed |
description | A problem in EST clustering is the presence of repeat sequences. To avoid false matches, repeats have to be masked. This can be a time-consuming process, and it depends on available repeat libraries. We present a fast and effective method that aims to eliminate the problems repeats cause in the process of clustering. Unlike traditional methods, repeats are inferred directly from the EST data, we do not rely on any external library of known repeats. This makes the method especially suitable for analysing the ESTs from organisms without good repeat libraries. We demonstrate that the result is very similar to performing standard repeat masking before clustering. |
format | Text |
id | pubmed-1079970 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-10799702005-04-14 Masking repeats while clustering ESTs Schneeberger, Korbinian Malde, Ketil Coward, Eivind Jonassen, Inge Nucleic Acids Res Article A problem in EST clustering is the presence of repeat sequences. To avoid false matches, repeats have to be masked. This can be a time-consuming process, and it depends on available repeat libraries. We present a fast and effective method that aims to eliminate the problems repeats cause in the process of clustering. Unlike traditional methods, repeats are inferred directly from the EST data, we do not rely on any external library of known repeats. This makes the method especially suitable for analysing the ESTs from organisms without good repeat libraries. We demonstrate that the result is very similar to performing standard repeat masking before clustering. Oxford University Press 2005 2005-04-14 /pmc/articles/PMC1079970/ /pubmed/15831790 http://dx.doi.org/10.1093/nar/gki511 Text en © The Author 2005. Published by Oxford University Press. All rights reserved |
spellingShingle | Article Schneeberger, Korbinian Malde, Ketil Coward, Eivind Jonassen, Inge Masking repeats while clustering ESTs |
title | Masking repeats while clustering ESTs |
title_full | Masking repeats while clustering ESTs |
title_fullStr | Masking repeats while clustering ESTs |
title_full_unstemmed | Masking repeats while clustering ESTs |
title_short | Masking repeats while clustering ESTs |
title_sort | masking repeats while clustering ests |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1079970/ https://www.ncbi.nlm.nih.gov/pubmed/15831790 http://dx.doi.org/10.1093/nar/gki511 |
work_keys_str_mv | AT schneebergerkorbinian maskingrepeatswhileclusteringests AT maldeketil maskingrepeatswhileclusteringests AT cowardeivind maskingrepeatswhileclusteringests AT jonasseninge maskingrepeatswhileclusteringests |