
Generating consensus sequences for each sample each chromosme
1 |
DATAPATH="/users/lbuggiotti/WGresRussia/umd_3_1" |
Combining samples for each chromosme(high coverage)
1 |
for i in Chr{1..29} ChrX; do /users/hzhou/miniconda3/bin/python3.6 /users/hzhou/tools/msmc-tools/generate_multihetsep.py --chr ChrX --mask=./A7.${i}_mask.bed.gz --mask=./A25.${i}_mask.bed.gz --mask=./A1.${i}_mask.bed.gz --mask=./A5.${i}_mask.bed.gz --mask=./A17.${i}_mask.bed.gz --mask=./A13.${i}_mask.bed.gz --mask=./A3.${i}_mask.bed.gz --mask=./A15.${i}_mask.bed.gz --mask=./A8.${i}_mask.bed.gz --mask=./A16.${i}_mask.bed.gz --mask=./A41.${i}_mask.bed.gz --mask=./A42.${i}_mask.bed.gz --mask=./A38.${i}_mask.bed.gz --mask=./A26.${i}_mask.bed.gz --mask=./A43.${i}_mask.bed.gz --mask=./A27.${i}_mask.bed.gz --mask=./A37.${i}_mask.bed.gz --mask=./A40.${i}_mask.bed.gz --mask=./A33.${i}_mask.bed.gz --mask=./A39.${i}_mask.bed.gz --mask=/users/hzhou/tools/seqbility-20091110/umd_3_1_35_chr${i}.mask.bed.gz A7.${i}.vcf.gz A25.${i}.vcf.gz A1.${i}.vcf.gz A5.${i}.vcf.gz A17.${i}.vcf.gz A13.${i}.vcf.gz A3.${i}.vcf.gz A15.${i}.vcf.gz A8.${i}.vcf.gz A16.${i}.vcf.gz A41.${i}.vcf.gz A42.${i}.vcf.gz A38.${i}.vcf.gz A26.${i}.vcf.gz A43.${i}.vcf.gz A27.${i}.vcf.gz A37.${i}.vcf.gz A40.${i}.vcf.gz A33.${i}.vcf.gz A39.${i}.vcf.gz > ./Y10K10.high.msmcinput.txt/${i}.YandK.msmcinput.txt ;done |
- mask files are in /users/hzhou/tools/seqbility-20091110/
the following script to build bos taurus mask files
1 |
module load apps/bwa/0.7.10/gcc-4.4.7 |
Estimating the effective population size
module load apps/msmc
for K
1 |
msmc2 -t 5 -o k.h.10h_msmc_output -I 0,1,2,3,4,5,6,7,8,9 Chr1.YandK.msmcinput.txt Chr2.YandK.msmcinput.txt Chr3.YandK.msmcinput.txt Chr4.YandK.msmcinput.txt Chr5.YandK.msmcinput.txt Chr6.YandK.msmcinput.txt Chr7.YandK.msmcinput.txt Chr8.YandK.msmcinput.txt Chr9.YandK.msmcinput.txt Chr10.YandK.msmcinput.txt Chr11.YandK.msmcinput.txt Chr12.YandK.msmcinput.txt Chr13.YandK.msmcinput.txt Chr14.YandK.msmcinput.txt Chr15.YandK.msmcinput.txt Chr16.YandK.msmcinput.txt Chr17.YandK.msmcinput.txt Chr18.YandK.msmcinput.txt Chr19.YandK.msmcinput.txt Chr20.YandK.msmcinput.txt Chr21.YandK.msmcinput.txt Chr22.YandK.msmcinput.txt Chr23.YandK.msmcinput.txt Chr24.YandK.msmcinput.txt Chr25.YandK.msmcinput.txt Chr26.YandK.msmcinput.txt Chr27.YandK.msmcinput.txt Chr28.YandK.msmcinput.txt Chr29.YandK.msmcinput.txt ChrX.YandK.msmcinput.txt |
for Y
1 |
msmc2 -t 5 -o y.h.10h_msmc_output -I 20-20,20-21,20-22,20-23,20-24,20-25,20-26,20-27,20-28,20-29,21-21,21-22,21-23,21-24,21-25,21-26,21-27,21-28,21-29,22-22,22-23,22-24,22-25,22-26,22-27,22-28,22-29,23-23,23-24,23-25,23-26,23-27,23-28,23-29,24-24,24-25,24-26,24-27,24-28,24-29,25-25,25-26,25-27,25-28,25-29,26-26,26-27,26-28,26-29,27-27,27-28,27-29,28-28,28-29 Chr1.YandK.msmcinput.txt Chr2.YandK.msmcinput.txt Chr3.YandK.msmcinput.txt Chr4.YandK.msmcinput.txt Chr5.YandK.msmcinput.txt Chr6.YandK.msmcinput.txt Chr7.YandK.msmcinput.txt Chr8.YandK.msmcinput.txt Chr9.YandK.msmcinput.txt Chr10.YandK.msmcinput.txt Chr11.YandK.msmcinput.txt Chr12.YandK.msmcinput.txt Chr13.YandK.msmcinput.txt Chr14.YandK.msmcinput.txt Chr15.YandK.msmcinput.txt Chr16.YandK.msmcinput.txt Chr17.YandK.msmcinput.txt Chr18.YandK.msmcinput.txt Chr19.YandK.msmcinput.txt Chr20.YandK.msmcinput.txt Chr21.YandK.msmcinput.txt Chr22.YandK.msmcinput.txt Chr23.YandK.msmcinput.txt Chr24.YandK.msmcinput.txt Chr25.YandK.msmcinput.txt Chr26.YandK.msmcinput.txt Chr27.YandK.msmcinput.txt Chr28.YandK.msmcinput.txt Chr29.YandK.msmcinput.txt ChrX.YandK.msmcinput.txt |
- There is a problem with MSMC -I parameter, when using 10,11,12,…19 will produce the error: [email protected](139): Range violation—————-
??:? _d_arrayboundsp [0x5b2da6], so I used the alternative approach.
plot using R script
1 |
setwd("D:/Data_analysis/evolution/MSMC/yh10.29chr/") |
- set the u and g here
reference:
https://github.com/stschiff/msmc-tools/blob/master/msmc-tutorial/guide.md




近期评论