Add final rule for multiqc
Add final rule for multiqc
- Show closed items
Activity
-
Newest first Oldest first
-
Show all activity Show comments only Show history only
- Author Maintainer
################################################################################ ### Multiqc ################################################################################ rule multiqc: input: fastqc_1 = expand(os.path.join(config["output_dir"], "{sample}", "fastqc_1"), sample=get_samples()), fastqc_2 = expand(os.path.join(config["output_dir"], "{sample}", "fastqc_2"), sample=get_samples()), STAR = expand(STAR) output: multiqc_dir = directory(os.path.join(config["output_dir"], "summary", "qc")) log: os.path.join(config["local_log"], "multiqc_fastqc.log") singularity: "docker://zavolab/multiqc:1.8" shell: "(multiqc --outdir {output.multiqc_dir} results logs) &> {log}"
Edited by BIOPZ-Gypas Foivos - MihaelaZavolan assigned to @gypas
assigned to @gypas
- Owner
Is multiqc an alternative to fastp? Probably not because fastp also subsumes cutadapt.
- Author Maintainer
No multiqc collects logs files from other tools and generates general reports at the end (e.g. parses the logs from STAR, fastqc and creates an html file). fastp generates quality statistics plots (similar to fastqc.
- Maintainer
OK, so as I just asked Mihaela - I will work on integrating results from distinct steps into MultiQC report at the end of the workflow.
- BIOPZ-Bak Maciej assigned to @bakma and unassigned @gypas
- BIOPZ-Bak Maciej assigned to @gypas and unassigned @bakma
- BIOPZ-Bak Maciej assigned to @bakma and unassigned @gypas
- BIOPZ-Bak Maciej added Doing label
added Doing label
- Maintainer
- Prepare dummy 1-rule MultiQC workflow
- Branch from master and add dummy MultiQC rule to the workflow
- Test execution: local/slurm with singularity
- Construct the initial version of the report: adjust the YAML config
Edited by BIOPZ-Bak Maciej - Alex Kanitz added 1 deleted label
added 1 deleted label
- Alex Kanitz mentioned in issue #6 (closed)
mentioned in issue #6 (closed)
- Alex Kanitz mentioned in issue #51 (closed)
mentioned in issue #51 (closed)
- Alex Kanitz changed milestone to %v0.1.0 release
changed milestone to %v0.1.0 release
- BIOPZ-Bak Maciej added To Do label and removed Doing label
- BIOPZ-Bak Maciej added Doing label and removed To Do label
- Maintainer
I have branched the master and added two rules at the end of the workflow:
- gathers logs, re-formats (if necessary) and creates config for MultiQC
- MultiQC execution (runs in the author's docker container)
Dummy runs finish successfully.
It seems that this aggregator is highly customizable:
https://multiqc.info/docs/#configuring-multiqcI believe that if we ensure a proper output/log directory structure and prepare a proper YAML config for the tool we can easily have very appealing reports. I will have to really dive into how to create this configuration file for our pipeline now.
1 - Author Maintainer
@bakma Does this mean that we can add reports for custom tools and scripts? (e.g. output of https://git.scicore.unibas.ch/zavolan_group/pipelines/rnaseqpipeline/blob/master/scripts/perform_PCA.py)
- Maintainer
Yes, but it is not advised and significantly limited:
https://multiqc.info/docs/#custom-content - Owner
Well, guess we should write modules for our own tools then. Well, first we should package our tools properly, in their own repos :)