Cargando…
Two-Phase Analysis in Consensus Genetic Mapping
Numerous mapping projects conducted on different species have generated an abundance of mapping data. Consequently, many multilocus maps have been constructed using diverse mapping populations and marker sets for the same organism. The quality of maps varies broadly among populations, marker sets, a...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3362937/ https://www.ncbi.nlm.nih.gov/pubmed/22670224 http://dx.doi.org/10.1534/g3.112.002428 |
_version_ | 1782234280929263616 |
---|---|
author | Ronin, Y. Mester, D. Minkov, D. Belotserkovski, R. Jackson, B. N. Schnable, P. S. Aluru, S. Korol, A. |
author_facet | Ronin, Y. Mester, D. Minkov, D. Belotserkovski, R. Jackson, B. N. Schnable, P. S. Aluru, S. Korol, A. |
author_sort | Ronin, Y. |
collection | PubMed |
description | Numerous mapping projects conducted on different species have generated an abundance of mapping data. Consequently, many multilocus maps have been constructed using diverse mapping populations and marker sets for the same organism. The quality of maps varies broadly among populations, marker sets, and software used, necessitating efforts to integrate the mapping information and generate consensus maps. The problem of consensus genetic mapping (MCGM) is by far more challenging compared with genetic mapping based on a single dataset, which by itself is also cumbersome. The additional complications introduced by consensus analysis include inter-population differences in recombination rate and exchange distribution along chromosomes; variations in dominance of the employed markers; and use of different subsets of markers in different labs. Hence, it is necessary to handle arbitrary patterns of shared sets of markers and different level of mapping data quality. In this article, we introduce a two-phase approach for solving MCGM. In phase 1, for each dataset, multilocus ordering is performed combined with iterative jackknife resampling to evaluate the stability of marker orders. In this phase, the ordering problem is reduced to the well-known traveling salesperson problem (TSP). Namely, for each dataset, we look for order that gives minimum sum of recombination distances between adjacent markers. In phase 2, the optimal consensus order of shared markers is selected from the set of allowed orders and gives the minimal sum of total lengths of nonconflicting maps of the chromosome. This criterion may be used in different modifications to take into account the variation in quality of the original data (population size, marker quality, etc.). In the foregoing formulation, consensus mapping is considered as a specific version of TSP that can be referred to as “synchronized TSP.” The conflicts detected after phase 1 are resolved using either a heuristic algorithm over the entire chromosome or an exact/heuristic algorithm applied subsequently to the revealed small non-overlapping regions with conflicts separated by non-conflicting regions. The proposed approach was tested on a wide range of simulated data and real datasets from maize. |
format | Online Article Text |
id | pubmed-3362937 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-33629372012-06-05 Two-Phase Analysis in Consensus Genetic Mapping Ronin, Y. Mester, D. Minkov, D. Belotserkovski, R. Jackson, B. N. Schnable, P. S. Aluru, S. Korol, A. G3 (Bethesda) Investigations Numerous mapping projects conducted on different species have generated an abundance of mapping data. Consequently, many multilocus maps have been constructed using diverse mapping populations and marker sets for the same organism. The quality of maps varies broadly among populations, marker sets, and software used, necessitating efforts to integrate the mapping information and generate consensus maps. The problem of consensus genetic mapping (MCGM) is by far more challenging compared with genetic mapping based on a single dataset, which by itself is also cumbersome. The additional complications introduced by consensus analysis include inter-population differences in recombination rate and exchange distribution along chromosomes; variations in dominance of the employed markers; and use of different subsets of markers in different labs. Hence, it is necessary to handle arbitrary patterns of shared sets of markers and different level of mapping data quality. In this article, we introduce a two-phase approach for solving MCGM. In phase 1, for each dataset, multilocus ordering is performed combined with iterative jackknife resampling to evaluate the stability of marker orders. In this phase, the ordering problem is reduced to the well-known traveling salesperson problem (TSP). Namely, for each dataset, we look for order that gives minimum sum of recombination distances between adjacent markers. In phase 2, the optimal consensus order of shared markers is selected from the set of allowed orders and gives the minimal sum of total lengths of nonconflicting maps of the chromosome. This criterion may be used in different modifications to take into account the variation in quality of the original data (population size, marker quality, etc.). In the foregoing formulation, consensus mapping is considered as a specific version of TSP that can be referred to as “synchronized TSP.” The conflicts detected after phase 1 are resolved using either a heuristic algorithm over the entire chromosome or an exact/heuristic algorithm applied subsequently to the revealed small non-overlapping regions with conflicts separated by non-conflicting regions. The proposed approach was tested on a wide range of simulated data and real datasets from maize. Genetics Society of America 2012-05-01 /pmc/articles/PMC3362937/ /pubmed/22670224 http://dx.doi.org/10.1534/g3.112.002428 Text en Copyright © 2012 Ronin et al. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution Unported License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigations Ronin, Y. Mester, D. Minkov, D. Belotserkovski, R. Jackson, B. N. Schnable, P. S. Aluru, S. Korol, A. Two-Phase Analysis in Consensus Genetic Mapping |
title | Two-Phase Analysis in Consensus Genetic Mapping |
title_full | Two-Phase Analysis in Consensus Genetic Mapping |
title_fullStr | Two-Phase Analysis in Consensus Genetic Mapping |
title_full_unstemmed | Two-Phase Analysis in Consensus Genetic Mapping |
title_short | Two-Phase Analysis in Consensus Genetic Mapping |
title_sort | two-phase analysis in consensus genetic mapping |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3362937/ https://www.ncbi.nlm.nih.gov/pubmed/22670224 http://dx.doi.org/10.1534/g3.112.002428 |
work_keys_str_mv | AT roniny twophaseanalysisinconsensusgeneticmapping AT mesterd twophaseanalysisinconsensusgeneticmapping AT minkovd twophaseanalysisinconsensusgeneticmapping AT belotserkovskir twophaseanalysisinconsensusgeneticmapping AT jacksonbn twophaseanalysisinconsensusgeneticmapping AT schnableps twophaseanalysisinconsensusgeneticmapping AT alurus twophaseanalysisinconsensusgeneticmapping AT korola twophaseanalysisinconsensusgeneticmapping |