Cargando…

Two-Phase Analysis in Consensus Genetic Mapping

Numerous mapping projects conducted on different species have generated an abundance of mapping data. Consequently, many multilocus maps have been constructed using diverse mapping populations and marker sets for the same organism. The quality of maps varies broadly among populations, marker sets, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Ronin, Y., Mester, D., Minkov, D., Belotserkovski, R., Jackson, B. N., Schnable, P. S., Aluru, S., Korol, A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3362937/
https://www.ncbi.nlm.nih.gov/pubmed/22670224
http://dx.doi.org/10.1534/g3.112.002428
_version_ 1782234280929263616
author Ronin, Y.
Mester, D.
Minkov, D.
Belotserkovski, R.
Jackson, B. N.
Schnable, P. S.
Aluru, S.
Korol, A.
author_facet Ronin, Y.
Mester, D.
Minkov, D.
Belotserkovski, R.
Jackson, B. N.
Schnable, P. S.
Aluru, S.
Korol, A.
author_sort Ronin, Y.
collection PubMed
description Numerous mapping projects conducted on different species have generated an abundance of mapping data. Consequently, many multilocus maps have been constructed using diverse mapping populations and marker sets for the same organism. The quality of maps varies broadly among populations, marker sets, and software used, necessitating efforts to integrate the mapping information and generate consensus maps. The problem of consensus genetic mapping (MCGM) is by far more challenging compared with genetic mapping based on a single dataset, which by itself is also cumbersome. The additional complications introduced by consensus analysis include inter-population differences in recombination rate and exchange distribution along chromosomes; variations in dominance of the employed markers; and use of different subsets of markers in different labs. Hence, it is necessary to handle arbitrary patterns of shared sets of markers and different level of mapping data quality. In this article, we introduce a two-phase approach for solving MCGM. In phase 1, for each dataset, multilocus ordering is performed combined with iterative jackknife resampling to evaluate the stability of marker orders. In this phase, the ordering problem is reduced to the well-known traveling salesperson problem (TSP). Namely, for each dataset, we look for order that gives minimum sum of recombination distances between adjacent markers. In phase 2, the optimal consensus order of shared markers is selected from the set of allowed orders and gives the minimal sum of total lengths of nonconflicting maps of the chromosome. This criterion may be used in different modifications to take into account the variation in quality of the original data (population size, marker quality, etc.). In the foregoing formulation, consensus mapping is considered as a specific version of TSP that can be referred to as “synchronized TSP.” The conflicts detected after phase 1 are resolved using either a heuristic algorithm over the entire chromosome or an exact/heuristic algorithm applied subsequently to the revealed small non-overlapping regions with conflicts separated by non-conflicting regions. The proposed approach was tested on a wide range of simulated data and real datasets from maize.
format Online
Article
Text
id pubmed-3362937
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-33629372012-06-05 Two-Phase Analysis in Consensus Genetic Mapping Ronin, Y. Mester, D. Minkov, D. Belotserkovski, R. Jackson, B. N. Schnable, P. S. Aluru, S. Korol, A. G3 (Bethesda) Investigations Numerous mapping projects conducted on different species have generated an abundance of mapping data. Consequently, many multilocus maps have been constructed using diverse mapping populations and marker sets for the same organism. The quality of maps varies broadly among populations, marker sets, and software used, necessitating efforts to integrate the mapping information and generate consensus maps. The problem of consensus genetic mapping (MCGM) is by far more challenging compared with genetic mapping based on a single dataset, which by itself is also cumbersome. The additional complications introduced by consensus analysis include inter-population differences in recombination rate and exchange distribution along chromosomes; variations in dominance of the employed markers; and use of different subsets of markers in different labs. Hence, it is necessary to handle arbitrary patterns of shared sets of markers and different level of mapping data quality. In this article, we introduce a two-phase approach for solving MCGM. In phase 1, for each dataset, multilocus ordering is performed combined with iterative jackknife resampling to evaluate the stability of marker orders. In this phase, the ordering problem is reduced to the well-known traveling salesperson problem (TSP). Namely, for each dataset, we look for order that gives minimum sum of recombination distances between adjacent markers. In phase 2, the optimal consensus order of shared markers is selected from the set of allowed orders and gives the minimal sum of total lengths of nonconflicting maps of the chromosome. This criterion may be used in different modifications to take into account the variation in quality of the original data (population size, marker quality, etc.). In the foregoing formulation, consensus mapping is considered as a specific version of TSP that can be referred to as “synchronized TSP.” The conflicts detected after phase 1 are resolved using either a heuristic algorithm over the entire chromosome or an exact/heuristic algorithm applied subsequently to the revealed small non-overlapping regions with conflicts separated by non-conflicting regions. The proposed approach was tested on a wide range of simulated data and real datasets from maize. Genetics Society of America 2012-05-01 /pmc/articles/PMC3362937/ /pubmed/22670224 http://dx.doi.org/10.1534/g3.112.002428 Text en Copyright © 2012 Ronin et al. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution Unported License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Ronin, Y.
Mester, D.
Minkov, D.
Belotserkovski, R.
Jackson, B. N.
Schnable, P. S.
Aluru, S.
Korol, A.
Two-Phase Analysis in Consensus Genetic Mapping
title Two-Phase Analysis in Consensus Genetic Mapping
title_full Two-Phase Analysis in Consensus Genetic Mapping
title_fullStr Two-Phase Analysis in Consensus Genetic Mapping
title_full_unstemmed Two-Phase Analysis in Consensus Genetic Mapping
title_short Two-Phase Analysis in Consensus Genetic Mapping
title_sort two-phase analysis in consensus genetic mapping
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3362937/
https://www.ncbi.nlm.nih.gov/pubmed/22670224
http://dx.doi.org/10.1534/g3.112.002428
work_keys_str_mv AT roniny twophaseanalysisinconsensusgeneticmapping
AT mesterd twophaseanalysisinconsensusgeneticmapping
AT minkovd twophaseanalysisinconsensusgeneticmapping
AT belotserkovskir twophaseanalysisinconsensusgeneticmapping
AT jacksonbn twophaseanalysisinconsensusgeneticmapping
AT schnableps twophaseanalysisinconsensusgeneticmapping
AT alurus twophaseanalysisinconsensusgeneticmapping
AT korola twophaseanalysisinconsensusgeneticmapping