Issues with Regtools Statistical Analysis Workflow #190

Open
mguaita opened this issue Sep 16, 2024 · 0 comments

mguaita commented Sep 16, 2024

Dear Griffith Lab,

Thank you very much for this tool.

Following the Example workflow (https://regtools.readthedocs.io/en/latest/workflow/), I have encountered some issues during the statistical analysis of the Regtools comparison outputs.

  1. Does the compare_junctions_hist.py script work properly with the recommended option variant_grouping_mode == 'exclude'? I get all zeros for the non-mutated scores.
    In lines 235-236, every row of samples_wout_variant_df whose junction also appears in samples_w_variant_df is removed, which discards the scores of the non-mutated samples for the junctions of interest. This makes sense for variants associated with the same junction, but the script does not seem to restore the non-mutated scores afterwards and ends up producing a list of all zeros for them. I do get p-values with variant_grouping_mode == 'strict'.

  2. Is the R script compare_junctions_hist_v2.R a good alternative for the statistical analysis?

I would really appreciate some assistance with this final step, as the rest of the tool works well.
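
To illustrate point 1, here is a minimal pandas sketch of how a filter of the kind described could produce all-zero non-mutated scores. It is not the code from compare_junctions_hist.py: the toy data, the column names (`sample`, `junction`, `score`), the helper `non_mutated_scores`, and the zero-padding step are all assumptions made for illustration only.

```python
import pandas as pd

# Hypothetical toy data; the column names are assumptions, not the script's schema.
samples_w_variant_df = pd.DataFrame({
    "sample": ["S1"],
    "junction": ["chr1:100-200"],
    "score": [12],
})
samples_wout_variant_df = pd.DataFrame({
    "sample": ["S2", "S3", "S2"],
    "junction": ["chr1:100-200", "chr1:100-200", "chr1:300-400"],
    "score": [3, 5, 7],
})

# A filter like the one described: drop every row of the non-mutated table
# whose junction also appears among the variant-carrying samples.
shared = samples_wout_variant_df["junction"].isin(samples_w_variant_df["junction"])
filtered = samples_wout_variant_df[~shared]


def non_mutated_scores(df, junction, samples):
    """Return one score per sample for a junction, padding missing rows with 0.

    The zero padding is an assumption about how empty entries might be filled.
    """
    per_sample = df[df["junction"] == junction].set_index("sample")["score"]
    return per_sample.reindex(samples, fill_value=0).tolist()


non_mutated_samples = ["S2", "S3"]
before = non_mutated_scores(samples_wout_variant_df, "chr1:100-200", non_mutated_samples)
after = non_mutated_scores(filtered, "chr1:100-200", non_mutated_samples)

print(before)  # scores taken from the unfiltered table
print(after)   # scores after the junction-level filter
```

In this toy case the junction of interest loses all of its non-mutated rows after filtering, so any later padding step can only return zeros for it, which matches the symptom described above.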
