Cargando…

A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions

In the present study, a positive training set of 30 known human imprinted gene coding regions are compared with a set of 72 randomly sampled human nonimprinted gene coding regions (negative training set) to identify genomic features common to human imprinted genes. The most important feature of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Daura-Oller, Elias, Cabré, Maria, Montero, Miguel A., Paternáin, José L., Romeu, Antoni
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2666875/
https://www.ncbi.nlm.nih.gov/pubmed/19360135
http://dx.doi.org/10.1155/2009/549387
_version_ 1782166087193853952
author Daura-Oller, Elias
Cabré, Maria
Montero, Miguel A.
Paternáin, José L.
Romeu, Antoni
author_facet Daura-Oller, Elias
Cabré, Maria
Montero, Miguel A.
Paternáin, José L.
Romeu, Antoni
author_sort Daura-Oller, Elias
collection PubMed
description In the present study, a positive training set of 30 known human imprinted gene coding regions are compared with a set of 72 randomly sampled human nonimprinted gene coding regions (negative training set) to identify genomic features common to human imprinted genes. The most important feature of the present work is its ability to use multivariate analysis to look at variation, at coding region DNA level, among imprinted and non-imprinted genes. There is a force affecting genomic parameters that appears through the use of the appropriate multivariate methods (principle components analysis (PCA) and quadratic discriminant analysis (QDA)) to analyse quantitative genomic data. We show that variables, such as CG content, [bp]% CpG islands, [bp]% Large Tandem Repeats, and [bp]% Simple Repeats, are able to distinguish coding regions of human imprinted genes.
format Text
id pubmed-2666875
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-26668752009-04-09 A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions Daura-Oller, Elias Cabré, Maria Montero, Miguel A. Paternáin, José L. Romeu, Antoni Comp Funct Genomics Research Article In the present study, a positive training set of 30 known human imprinted gene coding regions are compared with a set of 72 randomly sampled human nonimprinted gene coding regions (negative training set) to identify genomic features common to human imprinted genes. The most important feature of the present work is its ability to use multivariate analysis to look at variation, at coding region DNA level, among imprinted and non-imprinted genes. There is a force affecting genomic parameters that appears through the use of the appropriate multivariate methods (principle components analysis (PCA) and quadratic discriminant analysis (QDA)) to analyse quantitative genomic data. We show that variables, such as CG content, [bp]% CpG islands, [bp]% Large Tandem Repeats, and [bp]% Simple Repeats, are able to distinguish coding regions of human imprinted genes. Hindawi Publishing Corporation 2009 2009-04-08 /pmc/articles/PMC2666875/ /pubmed/19360135 http://dx.doi.org/10.1155/2009/549387 Text en Copyright © 2009 Elias Daura-Oller et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Daura-Oller, Elias
Cabré, Maria
Montero, Miguel A.
Paternáin, José L.
Romeu, Antoni
A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions
title A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions
title_full A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions
title_fullStr A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions
title_full_unstemmed A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions
title_short A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions
title_sort first-stage approximation to identify new imprinted genes through sequence analysis of its coding regions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2666875/
https://www.ncbi.nlm.nih.gov/pubmed/19360135
http://dx.doi.org/10.1155/2009/549387
work_keys_str_mv AT dauraollerelias afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions
AT cabremaria afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions
AT monteromiguela afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions
AT paternainjosel afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions
AT romeuantoni afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions
AT dauraollerelias firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions
AT cabremaria firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions
AT monteromiguela firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions
AT paternainjosel firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions
AT romeuantoni firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions