Cargando…

AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets

Analysis of an individual’s immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to b...

Descripción completa

Detalles Bibliográficos
Autores principales: Lees, William D., Christley, Scott, Peres, Ayelet, Kos, Justin T., Corrie, Brian, Ralph, Duncan, Breden, Felix, Cowell, Lindsay G., Yaari, Gur, Corcoran, Martin, Karlsson Hedestam, Gunilla B., Ohlin, Mats, Collins, Andrew M., Watson, Corey T., Busse, Christian E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10310305/
https://www.ncbi.nlm.nih.gov/pubmed/37388275
http://dx.doi.org/10.1016/j.immuno.2023.100025
Descripción
Sumario:Analysis of an individual’s immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.