A guide to running SMaSH. If you have a question that isn't answered here, please visit our support forum.
How to run the SMaSH evaluation scripts.
Tips on setting up EC2 instances and gathering metrics.
Documentation for our rescue and normalization algorithms.
The evaluation scripts can be run on consumer hardware; allow about 30 minutes and about 5 GB of RAM
perl smash/scripts/sortByRef.pl /PATH/TO/YOUR/VCF /PATH/TO/YOUR/REFERENCE.FA(STA).FAI
python smash/smashbenchmarking/normalize_vcf.py /PATH/TO/YOUR/VCF /PATH/TO/REFERENCE.FA(STA) myvcf 50or you can normalize as part of the evaluation script by adding the option
--normalize
smash/smashbenchmarking/bench.py
on the normalized VCFs, or add the option --normalize
if your VCF file are not normalized:
python smash/smashbenchmarking/bench.py /PATH/TO/TRUE_VCF /PATH/TO/PREDICTED_VCF /PATH/TO/REFERENCE /PATH/TO/REFERENCE/INDEX --snp_err 0.0 --indel_err 0.0 --sv_err 0.0 --sv_bp 100 -w 50
python smash/smashbenchmarking/bench.py /PATH/TO/TRUE_VCF /PATH/TO/PREDICTED_VCF /PATH/TO/REFERENCE /PATH/TO/REFERENCE/INDEX --snp_err 0.002 --indel_err 0.002 --sv_err 0.003 --sv_bp 100 -w 50
python smash/smashbenchmarking/bench.py /PATH/TO/TRUE_VCF /PATH/TO/PREDICTED_VCF /PATH/TO/REFERENCE /PATH/TO/REFERENCE/INDEX --snp_err 0.0004 --indel_err 0 --sv_err 0.01 --sv_bp 100 -w 50
Here are some recommendations for benchmarking performance on EC2 for aligners or variant callers.