Entry Points Sub-workflow¶
- Workflow File:
Config Options: see The config.yml for more details
user_files.gtc_pattern
user_files.idat_pattern
user_files.ped
user_files.map
user_files.bed
user_files.bim
user_files.fam
user_files.bcf
Major Outputs:
sample_level/samples.bed
sample_level/samples.bim
sample_level/samples.fam
The pipeline accepts either per-sample GTC files or an aggregated dataset file:
per-sample GTC files:
The pipeline supports two different methods for converting per-sample GTCs to aggregated BED/BIM/FAM:
If GTC files are provided using
user_files.gtc_pattern
andworkflow_params.convert_gtc2bcf=false
(default) then following rulegraph will be followed:
If GTC files are provided using
user_files.gtc_pattern
andworkflow_params.convert_gtc2bcf=true
then the following rulegraph will be followed:
Aggregated dataset:
If a reanalysis for previous dataset is desired or GTC files are unavailable, an aggregated file encoding genotypes for all samples can be provided. The pipeline currently supports following three aggregated file formats:
If an aggregated PED/MAP is provided using
user_files.ped
anduser_files.map
then we will convert the PED/MAP to BED/BIM/FAM.If an aggregated BED/BIM/FAM is provided using
user_files.bed
,user_files.bim
,user_files.fam
then we will create a symbolic link.If an aggregated BCF file is provided using
user_files.bcf
then we will convert the BCF to BED/BIM/FAM.