pySCENIC cannot reproduce AUCell + tSNE projection from a tutorial #386
Unanswered
ninhleba
asked this question in
* General SCENIC questions
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I've been testing pySCENIC by following this tutorial: http://htmlpreview.github.io/?https://github.com/aertslab/SCENICprotocol/blob/master/notebooks/SCENIC%20Protocol%20-%20Case%20study%20-%20Cancer%20data%20sets.html. The dataset I tried was the first one (Accession ID: GSE115978, Cancer type: SKCM). Everything looked pretty compatible until the AUCell + tSNE projection, which didn't seem to be able to cluster cells by cell type at all, in contrast with the corresponding plot in the tutorial.
![tsne - GSE115978 - AUCell+tSNE](https://user-images.githubusercontent.com/86318293/226442886-ba26333e-542a-4d4f-a50a-ecc171422125.svg)
Previous steps such as grn, ctx and aucell ran smoothly, and the clustermap based on binarized AUCell matrix I obtained also suggests tSNE computed from AUCell scores should be able to cluster out at least half the number of cell types.
![clustermap - GSE115978](https://user-images.githubusercontent.com/86318293/226441981-c97565e8-884c-4872-8f3d-2bad19218937.png)
![legend - GSE115978 - cell_type_colors](https://user-images.githubusercontent.com/86318293/226442106-ede41316-eed1-4b9d-820f-fe374a6c74f4.svg)
![legend - GSE115978 - on_off](https://user-images.githubusercontent.com/86318293/226442111-4ac96225-81a3-492c-9b91-1902dc657a17.svg)
I understand the nature of the algorithm makes the outputs of different runs vary slightly from one another but I don't think they should by this much. I don't think the error stems from tSNE because the PCA + tSNE projection looks like it's supposed to.
![tsne - GSE115978 - PCA+tSNE](https://user-images.githubusercontent.com/86318293/226442677-9ae58a58-8ade-44e2-9ef2-4d2b866478a9.svg)
I would really appreciate it if someone could share their thoughts on this.
Here are the databases that I used:
Human TFs: https://github.com/aertslab/pySCENIC/blob/master/resources/lambert2018.txt
Ranking databases: https://resources.aertslab.org/cistarget/databases/homo_sapiens/hg19/refseq_r45/mc9nr/gene_based/
Motifs annotation: https://resources.aertslab.org/cistarget/motif2tf/motifs-v9-nr.hgnc-m0.001-o0.0.tbl
Here is my session info:
Beta Was this translation helpful? Give feedback.
All reactions