• plink 软件中 recode 01、recode 12、outputmissinggenotype的作用


    1、准备测试数据,8个样本,8个位点

    [root@linuxprobe test]# cat outcome.ped
    DOR     1       0       0       0       -9      A G     G G     G G     G C     G G     C C     C C     0 0
    DOR     2       0       0       0       -9      G G     G G     A G     C C     G G     G C     C C     0 0
    DOR     3       0       0       0       -9      G G     G G     A G     G C     C G     C C     G G     0 0
    DOR     4       0       0       0       -9      G G     G G     G G     G G     C G     G G     G G     G G
    DOR     5       0       0       0       -9      G G     G G     A G     G C     G G     C C     G G     G G
    DOR     6       0       0       0       -9      G G     G G     A A     C C     G G     C C     G G     G G
    DOR     7       0       0       0       -9      A A     G G     G G     C C     G G     C C     G G     A G
    DOR     9       0       0       0       -9      A A     G G     G G     C C     G G     C C     G G     A G
    [root@linuxprobe test]# cat outcome.map
    1       snp1    0       55910
    1       snp2    0       85204
    1       snp3    0       122948
    1       snp4    0       203750
    1       snp5    0       312707
    1       snp6    0       356863
    1       snp7    0       400518
    1       snp8    0       487423

    2、 --recode 01

    plink --file outcome --recode 12 --out test;rm *.log *.nosex  ## --recode 12 的作用是将次等位基因转换为1,主等位基因转换为2,缺失基因型扔为0
    [root@linuxprobe test]# cat test.map
    1       snp1    0       55910
    1       snp2    0       85204
    1       snp3    0       122948
    1       snp4    0       203750
    1       snp5    0       312707
    1       snp6    0       356863
    1       snp7    0       400518
    1       snp8    0       487423
    [root@linuxprobe test]# cat test.ped
    DOR 1 0 0 0 -9 1 2 2 2 2 2 1 2 2 2 2 2 1 1 0 0
    DOR 2 0 0 0 -9 2 2 2 2 1 2 2 2 2 2 1 2 1 1 0 0
    DOR 3 0 0 0 -9 2 2 2 2 1 2 1 2 1 2 2 2 2 2 0 0
    DOR 4 0 0 0 -9 2 2 2 2 2 2 1 1 1 2 1 1 2 2 2 2
    DOR 5 0 0 0 -9 2 2 2 2 1 2 1 2 2 2 2 2 2 2 2 2
    DOR 6 0 0 0 -9 2 2 2 2 1 1 2 2 2 2 2 2 2 2 2 2
    DOR 7 0 0 0 -9 1 1 2 2 2 2 2 2 2 2 2 2 2 2 1 2
    DOR 9 0 0 0 -9 1 1 2 2 2 2 2 2 2 2 2 2 2 2 1 2

    3、--recode 01

    [root@linuxprobe test]# plink --file outcome --recode 01 --out test;rm *.log *.nosex  ## 直接使用,报错了
    PLINK v1.90b6.19 64-bit (16 Sep 2020)          www.cog-genomics.org/plink/1.9/
    (C) 2005-2020 Shaun Purcell, Christopher Chang   GNU General Public License v3
    Logging to test.log.
    Options in effect:
      --file outcome
      --out test
      --recode 01
    
    23700 MB RAM detected; reserving 11850 MB for main workspace.
    .ped scan complete (for binary autoconversion).
    Performing single-pass .bed write (8 variants, 8 people).
    --file: test-temporary.bed + test-temporary.bim + test-temporary.fam written.
    8 variants loaded from .bim file.
    8 people (0 males, 0 females, 8 ambiguous) loaded from .fam.
    Ambiguous sex IDs written to test.nosex .
    Using 1 thread (no multithreaded calculations invoked).
    Before main variant filters, 8 founders and 0 nonfounders present.
    Calculating allele frequencies... done.
    Total genotyping rate is 0.953125.
    8 variants and 8 people pass filters and QC.
    Note: No phenotypes present.
    Error: The --recode '01' modifier normally has to be used with a nonzero
    --output-missing-genotype setting.

    4、--recode 01  +  --output-missing-genotype 

    plink --file outcome --recode 01 --output-missing-genotype 9 --out test;rm *.log *.nosex  ## 加参数 --output-missing-genotype
    [root@linuxprobe test]# cat test.ped  ## 此等位基因变为0,主等位基因变为1,缺失基因型变为9
    DOR 1 0 0 0 -9 0 1 1 1 1 1 0 1 1 1 1 1 0 0 9 9
    DOR 2 0 0 0 -9 1 1 1 1 0 1 1 1 1 1 0 1 0 0 9 9
    DOR 3 0 0 0 -9 1 1 1 1 0 1 0 1 0 1 1 1 1 1 9 9
    DOR 4 0 0 0 -9 1 1 1 1 1 1 0 0 0 1 0 0 1 1 1 1
    DOR 5 0 0 0 -9 1 1 1 1 0 1 0 1 1 1 1 1 1 1 1 1
    DOR 6 0 0 0 -9 1 1 1 1 0 0 1 1 1 1 1 1 1 1 1 1
    DOR 7 0 0 0 -9 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 1
    DOR 9 0 0 0 -9 0 0 1 1 1 1 1 1 1 1 1 1 1 1 0 1
    [root@linuxprobe test]# cat test.map
    1       snp1    0       55910
    1       snp2    0       85204
    1       snp3    0       122948
    1       snp4    0       203750
    1       snp5    0       312707
    1       snp6    0       356863
    1       snp7    0       400518
    1       snp8    0       487423

     

    结论:--recode 12 :将次等位基因变为1,主等位基因变为2

             --recode 01 :需结合--output-missing-genotype使用,将次等位基因变为0,主等位基因变为1,--output-missing-genotype作用是设定缺失基因型的代表字符。

  • 相关阅读:
    4.终端
    **如何让CI框架支持service层
    *CI框架装载器Loader.php源码分析
    如何在CI中写工具类,在哪个目录写
    ***在PHP语言中使用JSON和将json还原成数组(json_decode()的常见错误)
    DedeCMS常见问题和技巧
    ***Linux系统下如何查看及修改文件读写权限
    ***linux下安装xampp,XAMPP目录结构(阿里云安装xampp)
    PHP5.2至5.6的新增功能详解
    PHP 5.2、5.3、5.4、5.5、5.6 对比以及功能详解
  • 原文地址:https://www.cnblogs.com/liujiaxin2018/p/13795230.html
Copyright © 2020-2023  润新知