- Jun 15, 2020
-
-
Alex Kanitz authored
refactor: rename LabKey/input table column See merge request !70
-
-
- Jun 12, 2020
-
-
Alex Kanitz authored
Add Snakemake report See merge request !64
-
-
Alex Kanitz authored
docs: rename project/workflow See merge request !69
-
Alex Kanitz authored
-
Alex Kanitz authored
fix(prepare_inputs): support relative paths See merge request !68
-
Alex Kanitz authored
-
BIOPZ-Gypas Foivos authored
Salmon argument fix See merge request !67
-
fix: added arguments for fragment size mean and sd required for the salmon quantification approach in single-end mode
-
BIOPZ-Gypas Foivos authored
-
- Jun 11, 2020
-
-
Dominik Burri authored
-
- Jun 10, 2020
-
-
Alex Kanitz authored
Gtf2bed update See merge request !65
-
-
- Apr 28, 2020
-
-
Alex Kanitz authored
Extend workflow description See merge request zavolan_group/pipelines/rhea!63
-
Alex Kanitz authored
- add linked ToC - add table of third party software - add rule graph - clean up sample table description - formalize structure for each rule description - cross-reference rules via inputs/outputs - clean list of references
-
- Apr 27, 2020
-
-
Alex Kanitz authored
- clean up command line interface - improve descriptions - add consistent structure - remove or merge superfluous CLI arguments - set defaults - update test calls - update docs - when importing data from LabKey, table is saved to 'samples.tsv.labkey' in same directory as Snakemake sample table - allow user to specify environment variables and relative paths in input table and on CLI - relative paths in the input table are interpreted with respect to the directory containing the input table - relative paths will are interpreted with respect to the current working directory; this is to achieve portability with respect to tests but is discouraged in production because its behavior is not very predictable from the user's perspective; consequently a warning is thrown - set STAR index size to read length - 1 - remove `gtf_filtered` and `tr_fasta_filtered` and update Snakefiles and test sample tables accordingly - rename some MultiQC report-related parameters and update Snakefiles and test config files accordingly - add logging - add docstrings to module and all functions - add typing definitions to all functions - restructure and comment code to improve readability - linters `flake8` and `mypy` pass
-
* Sequencing mode-related changes: * allowed sequencing modes in Snakemake input table changed from `paired_end` and `single_end` to `pe` and `se`, respectively * remove sequencing mode from output paths for each rule * corresponding wild cards removed entirely from all rules that do not depend on sequencing mode (currently all rules that are defined in the main `Snakefile` in the project root directory) * where absolutely necessary, sequencing mode is added as part of output file or directory instead * remove dependency of sequencing mode for rule for `FastQC`; now runs separately for each strand * Changes related to MultiQC and output file/directory structure * moving and renaming outputs for MultiQC is no longer required * code to create MultiQC custom config externalized into script `scripts/rhea_multiqc_config.py` * add MultiQC output files with deterministic output to md5 sum checks performed during execution of `tests/test_integration_workflow/test.{local,slurm}.sh` * output filenames for each rule now follow this general structure: `samples/{sample_name}/{rule}/{output_file}` * change log directory structure matches results directory structure * Miscellaneous changes * consistent, PEP8-compliant formatting in most parts, including Snakemake files, where allowed * remove rule `extract_decoys_salmon`; equivalent file `chrName.txt` produced by `star_index` is used instead * add rule `start` which copies sample data to the results directory and enforces uniform naming * refactoring of ALFA rules and modification of the CI/CD test to ensure compatibility
-
- Mar 25, 2020
-
-
Alex Kanitz authored
Update salmon transcriptome index generation See merge request zavolan_group/pipelines/rhea!59
-
-
- Mar 24, 2020
-
-
Alex Kanitz authored
Second fix in results restructuring See merge request zavolan_group/pipelines/rhea!57
-
-
- Mar 22, 2020
-
-
Alex Kanitz authored
fix absolute / relative path issue in fastqc results parsing See merge request zavolan_group/pipelines/rhea!56
-
-
- Mar 21, 2020
-
-
Alex Kanitz authored
update documentation See merge request zavolan_group/pipelines/rhea!54
-
Alex Kanitz authored
Use minified container images Closes #105 See merge request zavolan_group/pipelines/rhea!55
-
Alex Kanitz authored
-
- Mar 20, 2020
-
-
BIOPZ-Katsantoni Maria authored
-
Alex Kanitz authored
ALFA merged plots included in the MultiQC report See merge request zavolan_group/pipelines/rhea!53
-
BIOPZ-Bak Maciej authored
-
Alex Kanitz authored
ALFA extensions Closes #100 and #93 See merge request zavolan_group/pipelines/rhea!52
-
- generate nucleotide distribution for unique reads only - new rule to generate PNG image for MultiQC
-
Alex Kanitz authored
Polya trimming correction Closes #98 See merge request zavolan_group/pipelines/rhea!50
-
In labkey_to_snakemake.py fixed the parameters so that there is 3p as well 5p polya feature for every mate, which can be matched to the -a -g -A and -G options of cutadapt depending on which is the sense or antisense mate the appropriate variable is populated and the rest of variables are filled with 'XXXXXXXXXXXX' which leads to no trimming by cutadapt. The poly-A trimming rules are fixed to contain all -a -g -A -G options.
-
- Mar 19, 2020
-
-
Alex Kanitz authored
MultiQC See merge request zavolan_group/pipelines/rhea!49
-
-
Alex Kanitz authored
Rhea naming Closes #94 See merge request zavolan_group/pipelines/rhea!51
-
-