$LIFTOVERSV/bin/liftoverSV -I $INPUT_FILE -C $CHAIN_FILE -O $OUTPUT_FILE
--BCFTOOLS,-F <File> The bcftools path
See https://samtools.github.io/bcftools/howtos/install.html
Default: "bcftools"
--BEDTOOLS,-B <File> The bedtools path
See https://bedtools.readthedocs.io/en/latest/content/installation.html
Default: "bedtools"
--CHAIN,-C <File> The liftover chain file
See https://genome.ucsc.edu/goldenPath/help/chain.html for a description of chain files
See http://hgdownload.soe.ucsc.edu/downloads.html#terms for where to download chain files.
Required
--help,-h <Boolean> Display the help message
Default value: false. Possible values: {true, false}
--INPUTFILE,-I <File> The SV VCF input file.
Gzipped VCF file is supported.
Multi-allelic lines are not allowed
Required
--LIFTOVER,-L <File> The UCSC Liftover tool path
Default: "liftOver"
--OUTPUTFILE,-O <File> The liftover SV VCF output file
Required
--PERCENT,-P <float> Variation in length authorized for a lifted SV (e.g. difference max between SVLEN < 5%)
Default value: 0.05
--REFFASTASEQ,-R <File> The reference sequence (fasta) for the TARGET genome build (i.e., the new one after the liftover)
Please cite the following doi if you are using this tool in your research:
Lifts over a SV VCF file from one reference build to a target build:
-
Lifts over #CHROM, POS, REF, ALT, INFO/END and INFO/SVEND
(chromosomes, coordinates and sequences are lifted) -
Lifts over INFO/SVLEN, INFO/SVSIZE:
- Lifts over for deletion, duplication and inversion (SVLEN_lifted = End_lifted - Start_Lifted)
- Keep the same SVLEN/SVSIZE for insertion (the number of the inserted bases remains the same)
- Set SVLEN/SVSIZE to "." for SV type not equal to DEL, DUP, INV or INS (TRA, CPX...)
-
Drop the SV if:
- Case1: One position (start or end) is lifted while the other doesn't
- Case2: One position (start or end) goes to a different chrom from the other (except for translocation)
- Case3: "lifted start" > "lifted end"
- Case4: The distance between the two lifted positions changes significantly (Default: difference between both SVLENs > 5%)
- Case5: The ALT feature is not a square or an angle bracketed notation.
Square-bracketed notation examples:A]chr2:32156]
orACCCCC[chr2:32156[
Angle-bracketed notation examples:<INS>
or<CN0>
Non authorized format example:REF="A" and ALT="ACGGTAG"
=> See "OUTPUTFILE.unmapped" file for details
-
Check/Update INFO/CIPOS and INFO/CIEND, so that in the target build:
- POS-CIPOS >= 1 (VCF coordinates are 1-based)
- END+CIEND <= chromosome length
- POS-CIPOS >= 1 (VCF coordinates are 1-based)
-
Update/create and sort some VCF header lines:
- Checks that the "contig" field includes all the ID attributes (do not include additional optional attributes)
e.g.##contig=<ID=chr22>
added after a lift from chr1 to chr22 - Create/update the "assembly" field
e.g.##assembly=liftoverSV used with hg19ToHg38.over.chain
- Create the "liftoverSV_version" field
e.g.##liftoverSV_version=0.1.2_beta; hg19ToHg38.over.chain; August 30 2024 12:30
- Update the "INFO" and "FORMAT field if one value is missing.
As the format (Number, String) is not known, "Number=." and "Type=String" values are used by default:
e.g.##FORMAT=<ID=XXX,Number=.,Type=String,Description="XXX">
e.g.##INFO=<ID=YYY,Number=.,Type=String,Description="YYY">
- Checks that the "contig" field includes all the ID attributes (do not include additional optional attributes)
The UCSC Liftover tool needs to be locally installed to lift over chrom/positions
https://hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/liftOver
The “bedtools” toolset needs to be locally installed to lift over sequences (e.g. ACGGTTG]chr1:12569863])
The “bcftools” toolset needs to be locally installed to sort the VCF output file
cf https://samtools.github.io/hts-specs/VCFv4.4.pdf
See section 3: "INFO keys used for structural variants"
-
Lifts over INFO/MEINFO and INFO/METRANS
-
Lifts over INFO/HOMLEN, INFO/HOMSEQ
-
Lifts over ALT when described with square bracket notation. For example, G]17:198982] or ]chr1:3000]A
cf https://github.com/EUCANCan/variant-extractor for notation rules