skip to main content

A Bayesian multivariate mixture model for high throughput spatial transcriptomics

Allen, Carter ; Chang, Yuzhou ; Neelon, Brian ; Chang, Won ; Kim, Hang J ; Li, Zihai ; Ma, Qin ; Chung, Dongjun

Biometrics, 2023-09, Vol.79 (3), p.1775-1787 [Periódico revisado por pares]

United States: Blackwell Publishing Ltd

Texto completo disponível

Citações Citado por
  • Título:
    A Bayesian multivariate mixture model for high throughput spatial transcriptomics
  • Autor: Allen, Carter ; Chang, Yuzhou ; Neelon, Brian ; Chang, Won ; Kim, Hang J ; Li, Zihai ; Ma, Qin ; Chung, Dongjun
  • Assuntos: Bayesian analysis ; Biological computing ; Brain ; Breast cancer ; Clustering ; Computational neuroscience ; Data augmentation ; Gene expression ; Heterogeneity ; Mathematical models ; Mixtures ; Multivariate analysis ; Populations ; Probabilistic models ; Skewed distributions ; Skewness ; Spatial data ; Spatial heterogeneity ; Statistical models ; Transcriptomics ; Tumor microenvironment
  • É parte de: Biometrics, 2023-09, Vol.79 (3), p.1775-1787
  • Notas: ObjectType-Article-1
    SourceType-Scholarly Journals-1
    ObjectType-Feature-2
    content type line 23
  • Descrição: High throughput spatial transcriptomics (HST) is a rapidly emerging class of experimental technologies that allow for profiling gene expression in tissue samples at or near single-cell resolution while retaining the spatial location of each sequencing unit within the tissue sample. Through analyzing HST data, we seek to identify sub-populations of cells within a tissue sample that may inform biological phenomena. Existing computational methods either ignore the spatial heterogeneity in gene expression profiles, fail to account for important statistical features such as skewness, or are heuristic-based network clustering methods that lack the inferential benefits of statistical modeling. To address this gap, we develop SPRUCE: a Bayesian spatial multivariate finite mixture model based on multivariate skew-normal distributions, which is capable of identifying distinct cellular sub-populations in HST data. We further implement a novel combination of Pólya-Gamma data augmentation and spatial random effects to infer spatially correlated mixture component membership probabilities without relying on approximate inference techniques. Via a simulation study, we demonstrate the detrimental inferential effects of ignoring skewness or spatial correlation in HST data. Using publicly available human brain HST data, SPRUCE outperforms existing methods in recovering expertly annotated brain layers. Finally, our application of SPRUCE to human breast cancer HST data indicates that SPRUCE can distinguish distinct cell populations within the tumor microenvironment. An R package spruce for fitting the proposed models is available through The Comprehensive R Archive Network.
  • Editor: United States: Blackwell Publishing Ltd
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.