Skip to content

PCA analysis on genes and transcripts TPM for kallisto and salmon

BIOPZ-Gypas Foivos requested to merge pca into dev
  • Delete unused files scripts/fg_extract_transcripts.py, scripts/heatmap_and_clustermap.py, scripts/perform_PCA.py.
  • Add rules (pca_kallisto, pca_salmon) that use zpca on genes and transcripts TPM tables from kallisto and salmon.
  • The output is wired to multiqc_report but the plots are not visualized to multiqc.
  • Update documentation.
  • Update dag and rulegraph.
  • Note: No tests are added, because of random numbers used. No fixed md5sum.

Fixes #140 (closed) #142 (closed)

Before merging I would ask:

  • @boersch check if the current PCA analysis gives similar results to the standard PCA analysis she is performing on a real dataset.
  • @katsanto to review the new snakemake rules to make sure that they are according to the snakemake standards.
  • @bakma Check is there are further changes required, in order to integrate the output of the PCA analysis in multiqc.
  • @kanitz have a look at zpca and work on the merge request.

Thank you in advance.

Edited by BIOPZ-Gypas Foivos

Merge request reports