- Mar 06, 2020
-
-
-
BIOPZ-Gypas Foivos authored
-
-
- Feb 24, 2020
-
-
Alex Kanitz authored
Generate gene and transcript count tables from salmon Closes #51 See merge request zavolan_group/pipelines/rnaseqpipeline!33
-
- compile Salmon gene and transcript count summary tables across all samples in workflow run - add `pandas` to `install/environment.dev.yml` - update rule graph and DAG images
-
- Feb 21, 2020
-
-
Alex Kanitz authored
Use minified docker images Closes #67 See merge request zavolan_group/pipelines/rnaseqpipeline!32
-
Alex Kanitz authored
-
Alex Kanitz authored
Rule to collect all TPM values from Salmon added See merge request zavolan_group/pipelines/rnaseqpipeline!30
-
- Remove files with non-deterministic output from `tests/test_integration_workflow/expected_output.files` - Update MD5 sums in `tests/test_integration_workflow/expected_output.md5` - Update new workflow DAG and rule graph images
-
Alex Kanitz authored
Handle polyA processing in input preparation script Closes #37 See merge request zavolan_group/pipelines/rnaseqpipeline!31
-
- fixes some functions in `labkey_to_snakemake.py` - add optional argument for trimming polyA tails; they are trimmed as follows: - if mate is sense, oligo-A is added to sample table for `cutadapt` rule to trim - if mate is antisense, oligo-T is added to sample table for `cutadapt` rule to trim - if option is set to `--trim_polya`, oligo-X stretch is added to sample table and `cutadapt` will not trim
-
- Feb 20, 2020
-
-
Alex Kanitz authored
create log directories in Snakefile Closes #70 See merge request zavolan_group/pipelines/rnaseqpipeline!28
-
Alex Kanitz authored
- log and, if workflow is executed on cluster, cluster log directories are explicitly created in `Snakefile` - location of main log directory can be configured in `config.yaml` (field `log_dir`, previously: `local_log`; requires change in script `labkey_to_snakemake.py` as well as subworkflows as field name is hard-coded there) - location of cluster log directory can be configured in `cluster.json` (in field `__default__` -> `out`) - `config.yaml` and `cluster.json` in `tests/input_files` are set such that a directory `logs/` is created in the directory where Snakemake is run (i.e., the directory of each test); cluster logs are stored in a subdirectory `logs/cluster` - removes instructions to explicitly create log directories from docs and all test scripts - cleans up main `Snakefile` (apart from Snakemake-specific syntax, now passes `flake8` linter test)
-
- Feb 18, 2020
-
-
Alex Kanitz authored
fix broken links in docs See merge request zavolan_group/pipelines/rnaseqpipeline!27
-
Alex Kanitz authored
-
Alex Kanitz authored
install dependencies via Conda Closes #56 See merge request zavolan_group/pipelines/rnaseqpipeline!24
-
Alex Kanitz authored
- add 3 separate Conda environment files for installing - essential dependencies without Singularity (most users) - essential dependencies including Singularity (root users on Linux only) - non-essential dependencies (required for tests etc.) - install Python dependencies via `pip` _within_ Conda environment to avoid #56 - update Conda-based installation instructions - describe usage of different environment files - add supported OS versions (macOS not yet added, because only installation works) - update CI config to use same setup (based on Miniconda Docker image)
-
Alex Kanitz authored
run tests in verbose mode Closes #68 See merge request zavolan_group/pipelines/rnaseqpipeline!26
-
Alex Kanitz authored
- trap call functionalized through cleanup() function - function added to all test scripts - function prints out exit status of last command before trap - flag `--verbose` added to Snakemake calls in all test scripts - script tests rename to follow naming convention 'test_script_<script_name>_<script_run_mode>
-
- Feb 17, 2020
-
-
Alex Kanitz authored
TIN score calculation added See merge request zavolan_group/pipelines/rnaseqpipeline!25
-
- add rule for input preparation (GTF to BED12) - add rule for TIN score calculation - update rule graph and DAG image - update Slurm cluster config
-
- Feb 15, 2020
-
-
Alex Kanitz authored
get Snakemake input from LabKey API Closes #47 See merge request zavolan_group/pipelines/rnaseqpipeline!23
-
- add script that prepares Snakemake input files 'samples.tsv' and 'config.yaml' from LabKey table - script either connects to API directly (with '--remote' and related options) or processes a tab-separated LabKey dump file - add tests for both use cases - common input files for tests now in 'tests/input_files' - update all other tests to account for new file locations - update documentation
-
- Feb 14, 2020
-
-
Alex Kanitz authored
repo follows recommended structure Closes #31 See merge request zavolan_group/pipelines/rnaseqpipeline!22
-
Alex Kanitz authored
-
Alex Kanitz authored
hot fix integration test script See merge request zavolan_group/pipelines/rnaseqpipeline!21
-
Alex Kanitz authored
-
Alex Kanitz authored
LabKey-like input to Snakmake input See merge request zavolan_group/pipelines/rnaseqpipeline!20
-
- separate organism genome architecture (different input folder) - change MD5 checksums to match the new output
-
Alex Kanitz authored
update install instructions See merge request zavolan_group/pipelines/rnaseqpipeline!17
-
- describe installation of Singularity inside `conda` environment when user has root access - put clone repo section before install section
-
Alex Kanitz authored
display rule graph instead of DAG Closes #52 See merge request zavolan_group/pipelines/rnaseqpipeline!18
-
- add script `tests/test_rule_graph/test.sh` to generate a rule graph in `images/rule_graph.svg` - display rule graph created in `README.md` instead of specific workflow DAG - add test script to GitLab CI config - renamed test to create workflow DAG from `test_create_dag_chart` to `test_create_dag_image` (also output file is renamed from `images/workflow_dag.svg` to `images/dag_test_workflow.svg`
-
Alex Kanitz authored
Ground truth checks Closes #49 See merge request zavolan_group/pipelines/rnaseqpipeline!19
-
Alex Kanitz authored
add tests to workflow integration test `tests/test_workflow_integration` that - verify that STAR alignments match expected alignments (based on "ground truth" files) - verify that Salmon gene quantification assign the correct number of reads to each gene (based on "ground truth files) resolves #49
-
- Feb 09, 2020
-
-
Alex Kanitz authored
Replace test files with small synthetic ones See merge request zavolan_group/pipelines/rnaseqpipeline!16
-
Alex Kanitz authored
- replaces existing larger libraries and annotations in test cases `test_create_dag_chart` and `test_integration_workflow` - adds the following new test files: - `chr1-10000-20000.fa`: artificial chromosome of length 10'000 (based on human chromosome 1) - `chr1-10000-20000.gtf`: matching gene annotation file with two gene and three multi-exon transcripts entries - `chr1-10000-20000.transcripts.fa`: sequences of the transcripts listed in the gene annotation file - `synthetic.mate_?.fastq.gz`: 10 read pairs randomly sampled from the genic regions of the artificial chromosome - `synthetic.*.bed`: BED files with expected alignments for each read; names of overlapping genes are specified in a 7th column - updates file paths in the relevant sample tables - extends and updates checksum checking of result files in CI/CD pipeline
-
- Feb 08, 2020
-
-
Alex Kanitz authored
Wire all available rules See merge request zavolan_group/pipelines/rnaseqpipeline!15
-
- Feb 07, 2020
-
-
Alex Kanitz authored
Labkey input merge See merge request zavolan_group/pipelines/rnaseqpipeline!14
-