Workflow for the analysis of the synthetic data
The workflow automates the analysis of synthetic data. Its inputs and intermediate outputs are:
Inputs (I#)
- I1. Path to genome sequence file
- I3. Path to gene expression table (csv)
- I14. Output directory
- I16. Pattern specifying the reads file name for an individual cell
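The inputs above could be gathered in a single object so that every step of the workflow receives them in one place. The following is only a minimal sketch: the class name, the field names and the `reads_files` helper are hypothetical, and it assumes that the pattern in I16 is a shell-style glob (for example `cell_*.fastq`), which the list above does not specify.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class WorkflowInputs:
    """Container for the workflow inputs (all field names are hypothetical)."""

    genome: Path            # I1: genome sequence file
    expression_table: Path  # I3: gene expression table (csv)
    output_dir: Path        # I14: output directory
    reads_pattern: str      # I16: pattern for per-cell reads file names (assumed to be a glob)

    def reads_files(self, reads_dir: Path) -> list[Path]:
        """Return the files in reads_dir that match the pattern, sorted by path."""
        return sorted(reads_dir.glob(self.reads_pattern))
```

With such an object, the per-cell reads files would be obtained with something like `inputs.reads_files(Path("reads"))`, where `reads` is a placeholder directory name.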
Intermediate outputs
- O1. Sampled transcript structures (gtf)
- O11. Files with read-to-genome alignments (bam)
- O12. Path to estimates of gene expression
- O13. Plot with mean vs. standard deviation for individual genes
- O14. Path to table with mean vs. standard deviation for individual genes (csv)
- O15. Plot with initial vs. inferred expression levels for all genes
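As an illustration of how the table in O14 might be produced, the sketch below computes the mean and the sample standard deviation of each gene across cells and writes them to a csv file. It is an assumption-laden example, not the workflow's actual implementation: the function names and the column names `gene`, `mean` and `sd` are hypothetical, and the expression estimates are assumed to be available as a mapping from gene identifier to a list of per-cell values.

```python
import csv
import statistics
from pathlib import Path


def mean_sd_per_gene(estimates: dict[str, list[float]]) -> list[tuple[str, float, float]]:
    """Return (gene, mean, sd) rows, sorted by gene identifier.

    The standard deviation is the sample standard deviation; it is set to 0.0
    for genes with a single value, where it is undefined.
    """
    rows = []
    for gene, values in sorted(estimates.items()):
        mean = statistics.fmean(values)
        sd = statistics.stdev(values) if len(values) > 1 else 0.0
        rows.append((gene, mean, sd))
    return rows


def write_mean_sd_table(rows: list[tuple[str, float, float]], path: Path) -> None:
    """Write the rows to a csv file with a header line (hypothetical column names)."""
    with path.open("w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["gene", "mean", "sd"])
        writer.writerows(rows)
```

The same rows could then be passed to a plotting routine to obtain the figure in O13.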
This will be done as follows: