BioC2024

Unraveling Immunogenomic Diversity in Single-Cell Data
07-25, 16:00–16:45 (US/Eastern), Room 3104-5

The human immune system is governed by a complex interplay of molecules encoded by highly diverse genetic loci. Immune genes such as B and T cell receptors, human leukocyte antigens (HLAs), and killer Ig-like receptors (KIRs) exhibit remarkable allelic diversity across populations. However, conventional single-cell analysis methods often overlook this diversity, leading to erroneous quantification of immune mediators and compromised inter-donor comparability.

To address these challenges and unlock deeper insights from single-cell studies, we present a comprehensive workflow comprising two software and one data packages:

  1. scIGD (single-cell ImmunoGenomic Diversity): A Snakemake workflow designed to automate allele-typing processes for immune genes, with a focus on key targets like HLAs. In addition, it facilitates allele-specific quantification from single-cell RNA-sequencing (scRNA-seq) data using donor-specific references.

  2. SingleCellAlleleExperiment: This (soon-to-be-submitted) R/Bioconductor package maximizes the analytical potential of results obtained from scIGD. It offers a versatile multi-layer data structure, allowing representation of immune genes at various levels, from alleles to genes to functionally similar gene groups. This enables comprehensive analysis across different layers of immunologically-relevant annotation.

  3. scaeData: An R/ExperimentHub data package housing datasets processed by scIGD. These datasets can be utilized to perform exploratory and downstream analysis using the novel SingleCellAlleleExperiment data structure.

Preliminary findings demonstrate accurate quantification of different HLA allele groups in (amplicon-based and whole-transcriptome-based) scRNA-seq datasets from diverse sources, including cancer patients and human atlas samples. This not only enhances the comparability of immune profiles across donors but also sheds light on population-specific susceptibilities to infections. Our work lays the groundwork for precise immunological analysis of multi-omics data, particularly in elucidating allele-specific interactions.

If accepted as a package demo, I intend to showcase all three tools, emphasizing the utilization of SingleCellAlleleExperiment and its functionalities on one of the example datasets available in scaeData, for exploratory and downstream analysis across the three layers offered by the data structure.