Cargando…
A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions
In the present study, a positive training set of 30 known human imprinted gene coding regions are compared with a set of 72 randomly sampled human nonimprinted gene coding regions (negative training set) to identify genomic features common to human imprinted genes. The most important feature of the...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2666875/ https://www.ncbi.nlm.nih.gov/pubmed/19360135 http://dx.doi.org/10.1155/2009/549387 |
_version_ | 1782166087193853952 |
---|---|
author | Daura-Oller, Elias Cabré, Maria Montero, Miguel A. Paternáin, José L. Romeu, Antoni |
author_facet | Daura-Oller, Elias Cabré, Maria Montero, Miguel A. Paternáin, José L. Romeu, Antoni |
author_sort | Daura-Oller, Elias |
collection | PubMed |
description | In the present study, a positive training set of 30 known human imprinted gene coding regions are compared with a set of 72 randomly sampled human nonimprinted gene coding regions (negative training set) to identify genomic features common to human imprinted genes. The most important feature of the present work is its ability to use multivariate analysis to look at variation, at coding region DNA level, among imprinted and non-imprinted genes. There is a force affecting genomic parameters that appears through the use of the appropriate multivariate methods (principle components analysis (PCA) and quadratic discriminant analysis (QDA)) to analyse quantitative genomic data. We show that variables, such as CG content, [bp]% CpG islands, [bp]% Large Tandem Repeats, and [bp]% Simple Repeats, are able to distinguish coding regions of human imprinted genes. |
format | Text |
id | pubmed-2666875 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-26668752009-04-09 A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions Daura-Oller, Elias Cabré, Maria Montero, Miguel A. Paternáin, José L. Romeu, Antoni Comp Funct Genomics Research Article In the present study, a positive training set of 30 known human imprinted gene coding regions are compared with a set of 72 randomly sampled human nonimprinted gene coding regions (negative training set) to identify genomic features common to human imprinted genes. The most important feature of the present work is its ability to use multivariate analysis to look at variation, at coding region DNA level, among imprinted and non-imprinted genes. There is a force affecting genomic parameters that appears through the use of the appropriate multivariate methods (principle components analysis (PCA) and quadratic discriminant analysis (QDA)) to analyse quantitative genomic data. We show that variables, such as CG content, [bp]% CpG islands, [bp]% Large Tandem Repeats, and [bp]% Simple Repeats, are able to distinguish coding regions of human imprinted genes. Hindawi Publishing Corporation 2009 2009-04-08 /pmc/articles/PMC2666875/ /pubmed/19360135 http://dx.doi.org/10.1155/2009/549387 Text en Copyright © 2009 Elias Daura-Oller et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Daura-Oller, Elias Cabré, Maria Montero, Miguel A. Paternáin, José L. Romeu, Antoni A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions |
title | A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions |
title_full | A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions |
title_fullStr | A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions |
title_full_unstemmed | A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions |
title_short | A First-Stage Approximation to Identify New Imprinted Genes through Sequence Analysis of Its Coding Regions |
title_sort | first-stage approximation to identify new imprinted genes through sequence analysis of its coding regions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2666875/ https://www.ncbi.nlm.nih.gov/pubmed/19360135 http://dx.doi.org/10.1155/2009/549387 |
work_keys_str_mv | AT dauraollerelias afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions AT cabremaria afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions AT monteromiguela afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions AT paternainjosel afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions AT romeuantoni afirststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions AT dauraollerelias firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions AT cabremaria firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions AT monteromiguela firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions AT paternainjosel firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions AT romeuantoni firststageapproximationtoidentifynewimprintedgenesthroughsequenceanalysisofitscodingregions |