Cargando…

A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions

In the present study, a positive training set of 30 known human imprinted gene coding regions are compared with a set of 72 randomly sampled human nonimprinted gene coding regions (negative training set) to identify genomic features common to human imprinted genes. The most important feature of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Daura-Oller, Elias, Cabré, Maria, Montero, Miguel A., Paternáin, José L., Romeu, Antoni
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2666875/
https://www.ncbi.nlm.nih.gov/pubmed/19360135
http://dx.doi.org/10.1155/2009/549387
Descripción
Sumario:In the present study, a positive training set of 30 known human imprinted gene coding regions are compared with a set of 72 randomly sampled human nonimprinted gene coding regions (negative training set) to identify genomic features common to human imprinted genes. The most important feature of the present work is its ability to use multivariate analysis to look at variation, at coding region DNA level, among imprinted and non-imprinted genes. There is a force affecting genomic parameters that appears through the use of the appropriate multivariate methods (principle components analysis (PCA) and quadratic discriminant analysis (QDA)) to analyse quantitative genomic data. We show that variables, such as CG content, [bp]% CpG islands, [bp]% Large Tandem Repeats, and [bp]% Simple Repeats, are able to distinguish coding regions of human imprinted genes.