Cargando…

Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)

BACKGROUND: Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar d...

Descripción completa

Detalles Bibliográficos
Autores principales: Andreassen, Rune, Lunner, Sigbjørn, Høyheim, Bjørn
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2774873/
https://www.ncbi.nlm.nih.gov/pubmed/19878547
http://dx.doi.org/10.1186/1471-2164-10-502
_version_ 1782173976991105024
author Andreassen, Rune
Lunner, Sigbjørn
Høyheim, Bjørn
author_facet Andreassen, Rune
Lunner, Sigbjørn
Høyheim, Bjørn
author_sort Andreassen, Rune
collection PubMed
description BACKGROUND: Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to be helpful in order to distinguish between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP), the number of sequences where the full-length of the cDNA insert has been determined has been small. RESULTS: High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91%) of the transcripts were annotated using Gene Ontology (GO) terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS). The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. CONCLUSION: This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS). This suggests that the remaining cDNA libraries generated by SGP represent a valuable cCDS FLIc source. The conservation of 7-mers in 3'UTRs indicates that these motifs are functionally important. Identity between some of these 7-mers and miRNA target sequences suggests that they are miRNA targets in Salmo salar transcripts as well.
format Text
id pubmed-2774873
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27748732009-11-10 Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar) Andreassen, Rune Lunner, Sigbjørn Høyheim, Bjørn BMC Genomics Research Article BACKGROUND: Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to be helpful in order to distinguish between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP), the number of sequences where the full-length of the cDNA insert has been determined has been small. RESULTS: High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91%) of the transcripts were annotated using Gene Ontology (GO) terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS). The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. CONCLUSION: This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS). This suggests that the remaining cDNA libraries generated by SGP represent a valuable cCDS FLIc source. The conservation of 7-mers in 3'UTRs indicates that these motifs are functionally important. Identity between some of these 7-mers and miRNA target sequences suggests that they are miRNA targets in Salmo salar transcripts as well. BioMed Central 2009-10-30 /pmc/articles/PMC2774873/ /pubmed/19878547 http://dx.doi.org/10.1186/1471-2164-10-502 Text en Copyright © 2009 Andreassen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Andreassen, Rune
Lunner, Sigbjørn
Høyheim, Bjørn
Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)
title Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)
title_full Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)
title_fullStr Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)
title_full_unstemmed Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)
title_short Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)
title_sort characterization of full-length sequenced cdna inserts (flics) from atlantic salmon (salmo salar)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2774873/
https://www.ncbi.nlm.nih.gov/pubmed/19878547
http://dx.doi.org/10.1186/1471-2164-10-502
work_keys_str_mv AT andreassenrune characterizationoffulllengthsequencedcdnainsertsflicsfromatlanticsalmonsalmosalar
AT lunnersigbjørn characterizationoffulllengthsequencedcdnainsertsflicsfromatlanticsalmonsalmosalar
AT høyheimbjørn characterizationoffulllengthsequencedcdnainsertsflicsfromatlanticsalmonsalmosalar