-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #5 from PedroBarbosa/dev_expl
Version 0.1.0
- Loading branch information
Showing
946 changed files
with
194,932 additions
and
1,534 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,48 @@ | ||
name: pytest | ||
|
||
on: | ||
push: | ||
branches: | ||
- main | ||
pull_request: | ||
branches: | ||
- main | ||
|
||
jobs: | ||
build: | ||
runs-on: ubuntu-latest | ||
strategy: | ||
max-parallel: 5 | ||
fail-fast: false | ||
matrix: | ||
python-version: ["3.10", "3.11"] | ||
|
||
steps: | ||
- uses: actions/checkout@v4 | ||
- name: Set up Python | ||
uses: actions/setup-python@v5 | ||
with: | ||
python-version: ${{ matrix.python-version }} | ||
cache: 'pip' | ||
- name: Add conda to system path | ||
run: | | ||
# $CONDA is an environment variable pointing to the root of the miniconda directory | ||
echo $CONDA/bin >> $GITHUB_PATH | ||
- name: Install dependencies | ||
run: | | ||
conda config --add channels conda-forge | ||
conda config --add channels bioconda | ||
conda install -y python=${{ matrix.python-version }} meme=5.5.5 | ||
if [ -f requirements.txt ]; then pip install -r requirements.txt; fi | ||
- name: Lint with flake8 | ||
run: | | ||
pip install flake8 | ||
# stop the build if there are Python syntax errors or undefined names | ||
flake8 . --count --select=E9,F63,F7,F82 --show-source --statistics | ||
# exit-zero treats all errors as warnings. The GitHub editor is 127 chars wide | ||
flake8 . --count --exit-zero --max-complexity=10 --max-line-length=127 --statistics | ||
- name: Test with pytest | ||
run: | | ||
pip install pytest | ||
# Ignore tests that should use GPU | ||
pytest --ignore=tests/black_box_test.py --ignore=tests/prune_archive_test.py |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,19 +1,33 @@ | ||
[![pytest](https://github.com/PedroBarbosa/dress/actions/workflows/run_tests.yml/badge.svg)](https://github.com/PedroBarbosa/dress/actions/workflows/run_tests.yml) | ||
|
||
# DRESS - Deep learning-based Resource for Exploring Splicing Signatures | ||
|
||
A toolkit for generating synthetic datasets related to RNA splicing. | ||
|
||
## Running example | ||
|
||
As for now, the package contains two commands: | ||
- `generate` to generate synthetic datasets from a start sequence. | ||
- `filter` to filter datasets by desired levels of PSI or dPSI. | ||
|
||
- `filter` to filter datasets by desired levels of splice site probability, PSI or dPSI. | ||
|
||
## Installation | ||
|
||
Clone the repo, take care of dependencies with `conda` or `mamba` and install the package with `pip`: | ||
|
||
``` | ||
git clone https://github.com/PedroBarbosa/dress.git | ||
cd dress | ||
conda env create -f conda_env.yml | ||
conda activate dress | ||
pip install . | ||
``` | ||
|
||
## Running example | ||
|
||
To run an evolutionary search with exon 6 of FAS gene: | ||
|
||
`dress generate data/examples/generate/raw_input/FAS_exon6/data.tsv` | ||
|
||
To skip running the black box model (e.g, SpliceAI), run with `--dry_run`, which will return as fitness the proportional index (between 0 and 1) of the individual in the population. | ||
The required transcript structure cache (from GENCODE v44) can be downloaded from [here](https://app.box.com/s/tbh293kqh1s9nbi624esl0c18maxuhss). Then, download the human genome hg38 (for example from [here](https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_45/GRCh38.primary_assembly.genome.fa.gz), uncompress it and optionally simplify chromosome headers with `sed '/^>/s/ .*//'`. Then, put both files in a single directory, which is given in `--cache_dir`. By default, it expects this data to be in `data/cache`. | ||
|
||
The full list of argument options can be inspected with `dress generate --help` or by looking at the yaml configuration file at `dress/configs/generate.yaml`. | ||
The full list of argument options can be inspected with `dress generate --help` or by looking at one of the pre-configured yaml files at `dress/configs/generate*`. | ||
|
||
The required transcript structure cache (from GENCODE v44) can be downloaded from [here](https://app.box.com/s/tbh293kqh1s9nbi624esl0c18maxuhss). Then, download the human genome hg38 (for example from [here](https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_45/GRCh38.primary_assembly.genome.fa.gz), uncompress it and optionally simplify chromosome headers with `sed '/^>/s/ .*//'`. Then, put both files in a single directory, which is given in `--cache_dir`. By default, it expects this data to be in `data/cache`. | ||
Full documentation and tutorials will be available soon. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Binary file added
BIN
+6.57 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_30_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+173 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_30_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+4.55 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_31_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+191 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_31_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.74 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_32_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+178 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_32_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.45 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_33_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+184 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_33_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.73 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_34_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+180 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_34_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.01 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_35_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+185 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_35_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.92 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_36_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+180 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_36_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.54 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_37_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+186 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_37_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.61 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_38_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+169 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_38_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.05 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_39_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+177 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_39_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.72 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_40_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+180 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_40_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.26 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_41_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+178 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_41_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.87 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_42_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+178 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_42_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.71 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_43_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+180 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_43_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.66 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_44_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+182 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_44_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.63 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_45_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+185 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_45_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.69 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_46_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+180 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_46_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.2 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_47_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+178 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_47_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.89 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_48_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+183 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_48_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.92 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_49_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+187 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_49_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+8.44 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_50_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+174 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_50_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.71 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_51_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+177 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_51_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.25 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_52_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+173 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_52_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.19 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_53_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+179 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_53_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.72 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_54_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+183 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_54_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.25 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_55_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+183 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_55_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+8.11 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_56_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+180 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_56_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+6.63 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_57_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+178 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_57_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+7.53 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_58_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+182 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_58_dataset.csv.gz
Binary file not shown.
Binary file added
BIN
+5.95 KB
...as_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_59_archive_logger.csv.gz
Binary file not shown.
Binary file added
BIN
+186 KB
...asets_fas_exon6/bin_filler/chr10_89010439_89012181_ENST00000652046_seed_59_dataset.csv.gz
Binary file not shown.
1 change: 1 addition & 0 deletions
1
data/examples/explain/datasets_fas_exon6/bin_filler/preprocessing_extracted_from_cache.bed
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
chr10 89010751 89010815 chr10:89010753-89010815 . + FAS ENST00000652046 ENSG00000026103 protein_coding |
2 changes: 2 additions & 0 deletions
2
...amples/explain/datasets_fas_exon6/bin_filler/preprocessing_sequences_trimmed_at_5000bp.fa
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
>chr10:89010439-89012181(+)_ENST00000652046 | ||
AAGATGAATAAAATGGCCCCTAATTTACAAAGTGCCATTGAAAATTATAAAGGAATTATTCTGCCAGGCTTTTGAATTTCTCCTGTATTTTTTTTTCTAGATGTGAACATGGAATCATCAAGGAATGCACACTCACCAGCAACACCAAGTGCAAAGAGGAAGGTAATTATTTTTTTACGGTTATATTCTCCTTTCCCCCAACCCCATGGAAAGATGTGAAGAAAAACCAATCACTCTTGATTACTAGAAAGTCCTTTATTTAATCTTAAAGATTGCTTATTTTCATATAAAATGTCCAATGTTCCAACCTACAGGATCCAGATCTAACTTGGGGTGGCTTTGTCTTCTTCTTTTGCCAATTCCACTAATTGTTTGGGGTAAGTTCTTGCTTTGTTCAAACTGCAGATTGAAATAACTTGGGAAGTAGTTCACAAAGATTTGCCTCATTCTTACCTATAAAAAGCTACCACTTTGGTAGATTTATGTATTGTTAATTTCTTGCCCCTGAATGCAGCCTTGAGAGCTGACTGATAAGAACAAATGAAATTATTCCTCAGCTAGTTTCTGAGCAACAGTTTTGGGGCATTGAGTGGTATTCTCATCCTTCCTATGAACAGGTGTTCTCTGCAGCAGCAGAATTGGCCAAAAATCAGAAGCAATTCTTCACTATTCATTGAGATCTCCCTATGCAAAAAGAGAACACAAGAAGCAAAGGCATTCCCAGGAAACACATTGCAGGGAACACTTTAAAAACTTGTACTTCACTGCCTCCTCTTCCTCGGCCTAATTGCTTGTTTTTAATTATTTCTCCTTCTTAACTTAAAATACTATGGGGACACATGTTATACAAAGGTGACTTAGTAGAGTCAGTAGAAAAGCCAAAATTAGATATTATCATAATTAGTCTAGAAAAATCCCTTTAAGTCATTCATCAACTACAGGGTCACACCAACTTTCAGTAACTTAGAAGTATTCAATTTTCCCTTCTCAGAACAATTATCTGTTTCTTCAGTTCAGTTGAAGAAGAAAGTTTGCCTTGCCTTTAGCGGTTGTTTAGCTGAAAATACATTTGGGATATTTAAGCACTGTAATTGTGCTCAGAGACATACAGATTCTTCTATCTCACATTGACTTTAATGCATACACCTATTGAGTATGTATGCTTGAGTTATTTGTGTGTGTATTTCATTTCTGGGCATCCATAGCAAGTTGATGTTGACTTGCTTGTCCTACGGCTTCTGCATCCTGCCATAGTCTTGCCGTCCACATCTTTGCTGGACAGAGAGTGGTGCTTGCCATATGGTAAGTCAAAAGCCATCTCCTTGCTAGGCCAGCCTGTGGTAATTAGATGACTAATTAAGATATGTCCTTTCACTAGAACACTTGACTTAGTAGTACGAAAGTTCCAAAATCAGCGGTCTCCTGCGATGTTTGGCCACTTTTAAGTTTCACTGAATTTCTCCTTTTTCCTTCTTATATTTCTCTTAGTGTGAAAGTATGTTCTCACATGCATTCTACAAGGCTGAGACCTGAGTTGATAAAATTTCTTTGTTCTTTCAGTGAAGAGAAAGGAAGTACAGAAAACATGCAGAAAGCACAGAAAGGAAAACCAAGGTTCTCATGAATCTCCAACTTTAAATCCTGTAGGTATTGAAATAGGTATCAGCTTTCCTTGAAAAGAAAAATAGAGAAATTAGTGATTTGGCTTTTTGTTACTTCCTTTTACTTTTTTGTTTCTTGTTT |
2 changes: 2 additions & 0 deletions
2
...xplain/datasets_fas_exon6/bin_filler/preprocessing_sequences_trimmed_at_5000bp_ss_idx.tsv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
header acceptor_idx donor_idx tx_id exon | ||
chr10:89010439-89012181(+) 100;314;1560 161;376;1642 ENST00000652046 chr10:89010753-89010815 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
exon Strand gene_name transcript_id transcript_type dPSI | ||
chr5:82058495-82058602 + ATG10 ENST00000282185 protein_coding -0.24 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,77 @@ | ||
# Mandatory arguments (input) should be provided as command line arguments to the script. | ||
generate: | ||
dry_run: false | ||
verbosity: 0 | ||
shuffle_input: | ||
seed: 0 | ||
model: spliceai | ||
model_scoring_metric: mean | ||
pangolin_mode: ss_usage | ||
pangolin_tissue: null | ||
disable_gpu: false | ||
outdir: outdir | ||
outbasename: | ||
preprocessing: | ||
cache_dir: data/cache/ | ||
genome: data/cache/Homo_sapiens.GRCh38.dna.primary_assembly.fa | ||
use_full_sequence: false | ||
fitness: | ||
minimize_fitness: false | ||
fitness_function: bin_filler | ||
fitness_threshold: 0.0 | ||
archive: | ||
archive_size: 5000 | ||
archive_diversity_metric: normalized_shannon | ||
prune_archive_individuals: true | ||
prune_at_generations: | ||
population: | ||
population_size: 500 | ||
selection: | ||
selection_method: tournament | ||
tournament_size: 5 | ||
custom_mutation_operator: false | ||
custom_mutation_operator_weight: 0.8 | ||
mutation_probability: 0.7 | ||
crossover_probability: 0.25 | ||
operators_weight: | ||
- 0.8 | ||
elitism_weight: | ||
- 0.0 | ||
novelty_weight: | ||
- 0.1 | ||
update_weights_at_generation: | ||
stopping: | ||
stopping_criterium: | ||
- archive_size | ||
- time | ||
stop_at_value: | ||
- 5000 | ||
- 5 | ||
stop_when_all: false | ||
tracking_evolution: | ||
disable_tracking: false | ||
track_full_population: false | ||
track_full_archive: false | ||
grammar: | ||
which_grammar: motif_based | ||
max_diff_units: 6 | ||
snv_weight: 0.1 | ||
insertion_weight: 0.25 | ||
deletion_weight: 0.25 | ||
motif_ablation_weight: 0.2 | ||
motif_substitution_weight: 0.2 | ||
max_insertion_size: 5 | ||
max_deletion_size: 5 | ||
acceptor_untouched_range: | ||
- -10 | ||
- 2 | ||
donor_untouched_range: | ||
- -3 | ||
- 6 | ||
untouched_regions: | ||
motif_db: ATtRACT | ||
motif_search: fimo | ||
subset_rbps: encode | ||
min_nucleotide_probability: 0.15 | ||
min_motif_length: 5 | ||
pvalue_threshold: 0.001 |
Oops, something went wrong.