The simulated training dataset for the Human Brain. #104

Closed
hathawayxxh opened this issue Jul 19, 2021 · 7 comments

Comments

@hathawayxxh

Hi Kevin,

I am trying to reimplement the experiments in your paper. However, I couldn't find the simulated training data for the human brain at the following URL:
https://scaden.readthedocs.io/en/latest/datasets.html
Could you release the simulated dataset so I can conduct the experiments?

Thank you so much.

Best regards,
Xiaohan

@Tushar-87

I am also interested in human brain single cell data...

@KevinMenden
Owner

Hi,

Yes, unfortunately we didn't publish them on Figshare. I will see if I can make a new dataset and publish it there or, alternatively, publish the scripts for creating it from scratch. I currently don't have much time to maintain Scaden, but hopefully I can do this fairly soon.

Best,
Kevin

@hathawayxxh
Author

Hi Kevin,

It would be great if you could publish the scripts for creating the dataset.
BTW, I am also curious about the single-cell annotation part. Since the raw data you downloaded is not annotated, I assume you first clustered the cells and then annotated them using marker genes. Could you also publish the scripts for this cell annotation part?

Best,
Xiaohan

@KevinMenden
Owner

Hi Both,

have a look here: https://figshare.com/projects/Scaden/62834

This contains the scripts used to generate some of the datasets. Look in the preprocessing folder. There should be a script for every dataset used (you just need to download them). Those scripts contain the cell type annotation and generation of processed datasets, which can then be used with scaden simulate to make your own datasets.

Hope that helps!

Cheers,
Kevin
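
For readers following along, here is a minimal sketch of what a "processed dataset" handed to scaden simulate might look like on disk. It is not taken from the Figshare scripts. The file layout (a tab-separated `<prefix>_counts.txt` with cells as rows and genes as columns, plus a `<prefix>_celltypes.txt` with a `Celltype` column) is an assumption based on my reading of the Scaden documentation, so please check it against the docs for your version. The function name, gene names, and cell type labels are hypothetical placeholders.

```python
import csv
import os


def write_scaden_inputs(out_dir, prefix, genes, cells):
    """Write one annotated dataset as a counts file and a cell type file.

    genes: list of gene names (the columns of the counts matrix).
    cells: list of (celltype_label, counts) pairs, one per cell, where
           counts is a list aligned with `genes`.
    The file names and the 'Celltype' header are ASSUMED from the Scaden
    docs; they are not confirmed anywhere in this thread.
    """
    counts_path = os.path.join(out_dir, f"{prefix}_counts.txt")
    celltypes_path = os.path.join(out_dir, f"{prefix}_celltypes.txt")

    with open(counts_path, "w", newline="") as fh:
        writer = csv.writer(fh, delimiter="\t", lineterminator="\n")
        # Header: empty index column followed by the gene names.
        writer.writerow([""] + list(genes))
        for i, (_, counts) in enumerate(cells):
            if len(counts) != len(genes):
                raise ValueError(f"cell {i} has {len(counts)} counts, expected {len(genes)}")
            writer.writerow([i] + list(counts))

    with open(celltypes_path, "w", newline="") as fh:
        writer = csv.writer(fh, delimiter="\t", lineterminator="\n")
        writer.writerow(["Celltype"])
        # One label per cell, in the same order as the counts rows.
        for label, _ in cells:
            writer.writerow([label])

    return counts_path, celltypes_path


if __name__ == "__main__":
    import tempfile

    # Toy data only: two hypothetical genes and two hypothetical cells.
    demo_dir = tempfile.mkdtemp()
    paths = write_scaden_inputs(
        demo_dir,
        "brain",
        ["GENE_A", "GENE_B"],
        [("Neurons", [3, 0]), ("Astrocytes", [1, 5])],
    )
    print(paths)
```

As far as I understand the CLI, a directory of such file pairs can then be passed to scaden simulate via its data directory and file pattern options. Check `scaden simulate --help` for the exact flags in your installed version.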

@hathawayxxh
Author

Hi Kevin,

Thanks, but I cannot find the "preprocessing" folder in the scripts.

image

@hathawayxxh
Author

Do you mean this folder?
image

@KevinMenden
Owner

Sorry, yes, that's the correct folder! 👍
