spotBinning 在空间数据中的应用 / 开普饭

男，

一个长大了才会遇到的帅哥，

稳健，潇洒，大方，靠谱。

一段生信缘，一棵技能树。

生信技能树核心成员，单细胞天地特约撰稿人，简书创作者，单细胞数据科学家。

我们知道，空间分析中常见的是解析每个spot中的细胞数，这是往细了做。而临近的细胞放到一个bin中获得概览，不仅是在可视化方面，在数据集大了之后，这种分箱的操作可以减少维度。这种分析方法或可叫做spotbinning抑或是pseudospot。之所以产生这个联想是因为之前做宏基因组的时候做过Contigbinning。今天我们就来看看spotBinning 在空间数据中的应用，主角是：同属于Seurat生态的schex ，起初，schex 拟解决单细胞转录组图谱（tsne/umap）中细胞重叠的问题。如：

library(schex) library(Seurat) library(SeuratData) data("pbmc_small") pbmc3k.final <- make_hexbin(pbmc3k.final, 10, dimension_reduction = "PCA") p1 <- DimPlot(pbmc3k.final,reduction = 'pca') p2 <- plot_hexbin_density(pbmc3k.final)


p1+p2

可以看到schex在pca空间中将细胞点划分为不同的区域，并计算了该区域的细胞数。当然，如果我们把pca空间换成空间位置信息，自然也是可以做类似的操作的。而空间位置的点，往往是均匀分布的，所以每个bin（箱子）可以看作是重采样的过程。下面我们就来做一下这个演示：

载入空间数据并作标准计算：

SeuratData::AvailableData()


stxBrain.SeuratData::anterior1  -> Brain

Brain <- FindVariableFeatures(Brain)

Brain %>%  NormalizeData() %>% 

  ScaleData() %>%

  RunPCA() ->Brain
Brain %>% FindNeighbors(dims=1:20)  %>%

  FindClusters()  ->Brain

坐标替换：

Brain@reductions$sptial <- Brain@reductions$pca # head(Brain@reductions$sptial@cell.embeddings) Brain@reductions$sptial@cell.embeddings <- as.matrix(Brain@images$anterior1@coordinates[2:3]) Brain <- make_hexbin(Brain, 20, dimension_reduction = "sptial") Brain


An object of class Seurat 

31053 features across 2696 samples within 1 assay 

Active assay: Spatial (31053 features, 2000 variable features)

 2 dimensional reductions calculated: pca, sptial

下面就可以调用schex了

plot_hexbin_density(Brain) + theme_bw()

可以看到明显的边际差异：处在边界上的bin中的spot更少。

绘制分群信息：

plot_hexbin_meta(Brain, col="Spatial_snn_res.0.8", action="majority") + theme_bw()

当然不止于此，我们可以在上面画基因啊。如，bin中的基因平均表达量：

VariableFeatures(Brain) plot_hexbin_feature(Brain, mod="Spatial", type="scale.data", feature="Ttr", action="mean", xlab="row", ylab="col", title=paste0("Mean of expression of Ttr")) + theme_bw()

还记得Seurat空间变异基因如何计算的吗？

Brain <- FindSpatiallyVariableFeatures(Brain, assay = "Spatial", features = VariableFeatures(Brain)[1:100], selection.method = "markvariogram") grep("Cc",SpatiallyVariableFeatures(Brain, selection.method = "markvariogram"),value = T)


[1] "Cck"  "Ccl5"

用Seurat画两个基因：

SpatialFeaturePlot(Brain,features = c("Cck", "Ttr"))

第二种方式：

FeaturePlot(Brain, features = c("Cck", "Ttr"), blend = TRUE)

schex 提供三种方式，这里看一下在空间中绘制差异基因：

plot_hexbin_interact(Brain, type=c("scale.data", "scale.data"), mod=c("Spatial", "Spatial"), feature=c("Cck","Ttr"), interact="fc", ylab="row", xlab="col", title="Interaction ") + scale_fill_gradient2(midpoint=0, low="blue", mid="white", high="red", space ="Lab") + theme_bw()

看到指定差异的interact="fc"了吗？

两基因在bin中的相关性：

plot_hexbin_interact(Brain, type=c("scale.data", "scale.data"), mod=c("Spatial", "Spatial"), feature=c("Cck","Ttr"), interact="corr_spearman", ylab="row", xlab="col", title="Interaction ") + scale_fill_gradient2(midpoint=0, low="blue", mid="white", high="red", space ="Lab") + theme_bw()

除了简单的绘图 schex还可以调用shiny更灵活地调节

plot_hexbin_feature_shiny(Brain, type="counts", feature="Cck", action="prop_0", min_nbins=10, max_nbins=50, dimension_reduction="sptial", mod="Spatial")