Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

question regarding input vcf #70

Open
lyj95618 opened this issue Sep 22, 2021 · 4 comments
Open

question regarding input vcf #70

lyj95618 opened this issue Sep 22, 2021 · 4 comments

Comments

@lyj95618
Copy link

Hello,

I have a question regarding the input vcf. When the input vcf is unphased, can I just use phaser to perform the read-backed phasing and then take the haplotypic_count.txt for ASE analysis using phaser gene AE script?

From the 2016 paper, if I didn't misunderstand, it seems like I can use phaser for variant phasing. But in the tutorial "Generate ASE Data with phASER", the input vcf is a phased vcf. I am just wondering whether a phased vcf is a must.

Any help would be much appreciated!
Laur

@secastel
Copy link
Owner

Hi Laur,

You can absolutely use an unphased VCF as input. It will just reduce the maximum length (and number of variants) that can be summed across when generating the counts. The phaser gene AE script will automatically select the phased haplotype with the greatest read depth and use that to generate the counts.

  • Stephane

@lyj95618
Copy link
Author

lyj95618 commented Sep 23, 2021

Thanks for answering my question!

Just two more things:

  1. when WGS/WES is unavailable, is it possible to use panel sequence VCF file as the input VCF? will it work if I am only interested in a particular gene?

  2. Would you recommend to perform "remove duplicated read (e.g. Picard)" before inputting the bam to phaser?

Thanks,
Laur

@secastel
Copy link
Owner

If by panel you mean genotyping array, then yes it should work. Theoretically it will work with any genotyping platform, provided of course that the genotype calls are accurate.

As for duplicate reads, I'd suggest reading our 2015 paper, where we discussing duplicate reads (https://link.springer.com/article/10.1186/s13059-015-0762-6)

@lyj95618
Copy link
Author

Thank you very much for the reply!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants