Commits · 34622aa1ad5500dec43e71d5daa71ae201d5af64 · zavolan_group / pipelines / ZARP

Jul 14, 2021
- properly raise TypeError and rephrase error message. · 34622aa1
  Dominik Burri authored 3 years ago
  
  34622aa1
- separate empty and erroneous optional field. · b7dc184a
  Dominik Burri authored 3 years ago
  
  b7dc184a
- correct error to create empty optional dict. · 3baa76e2
  Dominik Burri authored 3 years ago
  
  3baa76e2
- Report missing required keys, restructure rule config handling. · efa86659
  Dominik Burri authored 3 years ago
  
  efa86659
Jul 13, 2021
- adjust all config files and test cases to new optional field. · 1e5e058d
  Dominik Burri authored 3 years ago
  
  1e5e058d
- validate config on required and optional fields #174 · ef68b765
  Dominik Burri authored 3 years ago
  
  ef68b765
Jul 12, 2021
- add author information (#78 ) · b5db2001
  Dominik Burri authored 3 years ago
  
  b5db2001
Jun 11, 2021
- Add conda support in ZARP · 10885dca
  BIOPZ-Gypas Foivos authored 3 years ago
  
  10885dca
May 28, 2021
- Add biocontainers image for multiqc-plugins · 8ea4e8ff
  BIOPZ-Gypas Foivos authored 3 years ago
  
  8ea4e8ff
- Add biocontainers image for tin-score-calculation · d4a37587
  BIOPZ-Gypas Foivos authored 3 years ago
  
  d4a37587
- Add biocontainers image for merge_kallisto.R · 6ded6ada
  BIOPZ-Gypas Foivos authored 3 years ago
  
  6ded6ada
- Replace zpca container from zavolab to biocontainers. · a81803b9
  BIOPZ-Gypas Foivos authored 3 years ago
  
  a81803b9
May 12, 2021
- Add zgtf biocontainers image · f8028a93
  BIOPZ-Gypas Foivos authored 3 years ago
  
  f8028a93
May 11, 2021
- MultiQC report: custom plugin for zpca. Fixes #146 · 36c02328
  BIOPZ-Bak Maciej authored 3 years ago and BIOPZ-Gypas Foivos committed 3 years ago
  
  36c02328
May 07, 2021

Use biocontainers images for star, gffread, salmon, kallisto, cutadapt,... · be8eb02d

Use biocontainers images for star, gffread, salmon, kallisto, cutadapt, samtools, fastqc, alfa, bedtools, bedgraphtobigwig. Change container from bash to ubuntu. Fixes #149

be8eb02d

Apr 15, 2021
- feat: enable user to configure CLI params per rule · d35d5500
  CJHerrmann authored 3 years ago and Alex Kanitz committed 3 years ago
  
  d35d5500
Mar 25, 2021

Remove extra parameters, that had or should have the default values · 5396adf5

BIOPZ-Katsantoni Maria authored 4 years ago and

BIOPZ-Gypas Foivos committed 4 years ago

and are therefore not required in the rules.
- Snakefile
 star_rpm: --outWigNorm (default RPM was used)
 star_rpm: --outWigStrand (default Stranded was used)
 rename_star_rpm_for_alfa: orientation in params is redundant (Fixes #152)
- single_end.snakefile.smk
 map_genome_star: outFilterMismatchNoverLmax
 map_genome_star: outFilterScoreMinOverLread
 map_genome_star: outFilterMatchNminOverLread
 quantification_salmon: --writeUnmappedNames
- paired_end.snakefile.smk
  pe_map_genome_star: outFilterMismatchNoverLmax
  pe_map_genome_star: outFilterScoreMinOverLread
  pe_map_genome_star: outFilterMatchNminOverLread
  quantification_salmon: --writeUnmappedNames

5396adf5

Feb 26, 2021
- Remove unnecessary files in the results directory · 6804ea67
  Dominik Burri authored 4 years ago and BIOPZ-Gypas Foivos committed 4 years ago
  
  6804ea67
Feb 11, 2021
- MultiQC plugins for TIN scores and ALFA Fixes #138 · fab75506
  BIOPZ-Bak Maciej authored 4 years ago and BIOPZ-Gypas Foivos committed 4 years ago
  
  fab75506
Oct 16, 2020

- Delete unused files scripts/fg_extract_transcripts.py,... · c535a892

BIOPZ-Gypas Foivos authored 4 years ago

- Delete unused files scripts/fg_extract_transcripts.py, scripts/heatmap_and_clustermap.py, scripts/perform_PCA.py
- Add rules (pca_kallisto, pca_salmon) that run zpca (https://github.com/zavolanlab/zpca) on genes and transcripts TPM tables from kallisto and salmon.
- The output is wired to multiqc_report but the plots are not visualized to multiqc. Update documentation.
- Update dag and rulegraph.
Fixes #140 #142

c535a892

Jun 23, 2020
- Merge kallisto rules: kallisto_merge_genes and kallisto_merge_transcript · 1f007e19
  BIOPZ-Iborra de Toledo Paula authored 4 years ago and BIOPZ-Gypas Foivos committed 4 years ago
  
  The rules rely on https://github.com/zavolanlab/merge_kallisto Update info in pipeline_documentation.md
  1f007e19
Jun 15, 2020

fix: Renamed samples_concat.tsv to samples.multiple_lanes.tsv. Renamed rows... · 05c167fd

BIOPZ-Katsantoni Maria authored 4 years ago and

Alex Kanitz committed 4 years ago

fix: Renamed samples_concat.tsv to samples.multiple_lanes.tsv. Renamed rows with split with the same name as the other test samples, so that I do not change the tests (md5 and sunch). Removed the one lane samples. Created config that uses this tsv file

05c167fd

Jun 12, 2020
- docs: rename project/workflow · a07e1173
  Alex Kanitz authored 4 years ago
  
  a07e1173
Jun 10, 2020
- refactor: use zgtf for GTF to BED12 conversion · b62f3bf1
  BIOPZ-Gypas Foivos authored 4 years ago and Alex Kanitz committed 4 years ago
  
  b62f3bf1
Apr 27, 2020

Refactor LabKey to Snakemake script · 556f1e12

Alex Kanitz authored 4 years ago

- clean up command line interface
  - improve descriptions
  - add consistent structure
  - remove or merge superfluous CLI arguments
  - set defaults
  - update test calls
  - update docs
  - when importing data from LabKey, table is saved to 'samples.tsv.labkey' in same directory as Snakemake sample table
- allow user to specify environment variables and relative paths in input table and on CLI
  - relative paths in the input table are interpreted with respect to the directory containing the input table
  - relative paths will are interpreted with respect to the current working directory; this is to achieve portability with respect to tests but is discouraged in production because its behavior is not very predictable from the user's perspective; consequently a warning is thrown
- set STAR index size to read length - 1
- remove `gtf_filtered` and `tr_fasta_filtered` and update Snakefiles and test sample tables accordingly
- rename some MultiQC report-related parameters and update Snakefiles and test config files accordingly
- add logging
- add docstrings to module and all functions
- add typing definitions to all functions
- restructure and comment code to improve readability
- linters `flake8` and `mypy` pass

556f1e12

Major refactoring · 6cf28511

BIOPZ-Katsantoni Maria authored 5 years ago and

Alex Kanitz committed 4 years ago

* Sequencing mode-related changes:
  * allowed sequencing modes in Snakemake input table changed from `paired_end` and `single_end` to `pe` and `se`, respectively
  * remove sequencing mode from output paths for each rule
  * corresponding wild cards removed entirely from all rules that do not depend on sequencing mode (currently all rules that are defined in the main `Snakefile` in the project root directory)
  * where absolutely necessary, sequencing mode is added as part of output file or directory instead
  * remove dependency of sequencing mode for rule for `FastQC`; now runs separately for each strand
* Changes related to MultiQC and output file/directory structure
  * moving and renaming outputs for MultiQC is no longer required
  * code to create MultiQC custom config externalized into script `scripts/rhea_multiqc_config.py`
  * add MultiQC output files with deterministic output to md5 sum checks performed during execution of `tests/test_integration_workflow/test.{local,slurm}.sh`
  * output filenames for each rule now follow this general structure: `samples/{sample_name}/{rule}/{output_file}`
  * change log directory structure matches results directory structure
* Miscellaneous changes
  * consistent, PEP8-compliant formatting in most parts, including Snakemake files, where allowed
  * remove rule `extract_decoys_salmon`; equivalent file `chrName.txt` produced by `star_index` is used instead
  * add rule `start` which copies sample data to the results directory and enforces uniform naming
  * refactoring of ALFA rules and modification of the CI/CD test to ensure compatibility

6cf28511

Add rules for bigWig creation · 907082c3
CJHerrmann authored 4 years ago and Alex Kanitz committed 4 years ago

907082c3

Mar 25, 2020
- Update salmon transcriptome index generation · 4e3cac05
  BIOPZ-Bak Maciej authored 5 years ago and Alex Kanitz committed 5 years ago
  
  4e3cac05
Mar 24, 2020
- Second fix in results restructuring · 27b38068
  BIOPZ-Bak Maciej authored 5 years ago and Alex Kanitz committed 5 years ago
  
  27b38068
Mar 22, 2020
- fix absolute / relative path issue in fastqc results parsing · 2792d1e6
  BIOPZ-Bak Maciej authored 5 years ago and Alex Kanitz committed 5 years ago
  
  2792d1e6
Mar 21, 2020
- use minified container images · e9d50454
  Alex Kanitz authored 5 years ago
  
  e9d50454
Mar 20, 2020
- renamed plot file 4 proper parsing · b0a1a53c
  BIOPZ-Bak Maciej authored 5 years ago
  
  b0a1a53c
- extend ALFA functionality · f5e2f6ac
  Dominik Burri authored 5 years ago and Alex Kanitz committed 5 years ago
  
  - generate nucleotide distribution for unique reads only - new rule to generate PNG image for MultiQC
  f5e2f6ac
Mar 19, 2020
- MultiQC · fd1e3123
  BIOPZ-Bak Maciej authored 5 years ago and Alex Kanitz committed 5 years ago
  
  fd1e3123
Mar 12, 2020

add TIN score merge and plot steps · a0babc83
BIOPZ-Bak Maciej authored 5 years ago

a0babc83
included tests for ALFA qc · ad3a8e52
Dominik Burri authored 5 years ago
```
corrected md5sum for config.yaml

remove unnecessary file
```
ad3a8e52

added rule for · 37fb0fd0

Dominik Burri authored 5 years ago

- renaming bedgraph
- creating ALFA qc plots

removed conda dependence, moved import statement.

included ALFA in finish rule, corrected annotation.gtf and config.yaml, created new .svg

37fb0fd0

Mar 06, 2020
- Merged paired end and single end rules for star_rpm and... · fb784999
  BIOPZ-Gypas Foivos authored 5 years ago
  
  Merged paired end and single end rules for star_rpm and index_genomic_alignment_samtools. Fixed wiring of calculate tin score: bam should be input and not params.
  fb784999
- Replace cufflinks image with gffread image · c3d15275
  BIOPZ-Gypas Foivos authored 5 years ago
  
  c3d15275
- Generate bedgraph file of normalised coverage. Fixes #45 · a54ff3e8
  Dominik Burri authored 5 years ago and BIOPZ-Gypas Foivos committed 5 years ago
  
  a54ff3e8