Create your config.yaml file for ProHap

General parameters

Ensembl release

Included chromosomes

Select transcripts

Use the default set of transcripts

Use only MANE Select transcripts
Only available for Ensembl v.108 and above. For genes that do not include any MANE Select transcript in Ensembl, "Ensembl Canonical" transcripts will be selected.

Select transcripts by biotype (provide below)

User-defined list of transcripts (provide below)

Transcript biotypes to be included (Comma-separated list, use biotypes from Gencode)

Path to the custom transcript list (CSV file, see example data/transcripts_reference_108.csv)

Path to the contaminant FASTA file The default contaminant database is provided in the crap.fasta file in this repository.

Path to the final FASTA file

Simplify FASTA headers (extract all information from the FASTA protein headers in to a tab-separated file)

Check that no fields below are labbeled as MISSING and or copy the content below to your config.yaml file:

Create your config.yaml file for ProHap

General parameters

ProHap

ProVar

Add your VCF files: