Skip to content

Principles of SCimilarity's integration function (or say batch effect removal) #34

@LiuCanidk

Description

@LiuCanidk

Thanks for developing this nice tool!

I have tried SCimilarity on my own dataset, which consists of 6 time series samples from a cell line cultured under nutrient deprivation. And SCimilarity's result seems confused, and when I checked the query coherence and the embedding computed by SCimilarity, I found it strange. (figure below).

Image

Embedding was blurring. And my most similar cells was come from xenograft studies, and most importantly, many cells were overlapped, as shown by Jaccard Index between different cluster groups, calculated using cell index provided in result metadata.

LUAD xenograft studies, Jaccard Index of similar cells' index:

Image

And when I score the cell with signature derived from previous seurat's clustering marker, I found the SCimilarity embedding seperated the Cluster3 into two subclusters, which made me even more confused because my original UMAP using seurat did not show any clues about Cluster3's subclusters.

Image

my original UMAP plot computed using Harmony-corrected embeddings:

Image

So I started to think about the integration principle, or batch effect removal within SCimilarity. In the tutorial, it seems that we only need input the expression matrix into the SCimilarity's function. However, other batch effect removal methods, such as Harmony, which I used with my seurat object, need the sample metadata or group.by metadata, i.e., different batch. So I am curious about how SCimilarity can accomplish the batch effect removal without such information provided. In addition, I notice that in the original paper Fig.2b, the batch effect removal performance of SCimilarity only win in the cell type cluster coherence but fail in removing study batchiness when compared to other methods:

Image

Therefore, I wonder how SCimilarity perform integration without group or sample information provided. Hopefully, any suggestion of interpreting my result from SCimilarity would be greatly appreciated!

Thanks in advance.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions