• 7. RNAseq Downstream Analysis


    Created by Dennis C Wylie, last modified on Jun 29, 2015

    Methods offered include machine learning (clustering, dimensionality reduction, classification and regression modeling, resampling techniques, etc.), ANOVA modeling, and empirical Bayes analysis.

    Unsupervised Analysis

    Unsupervised methods support exploratory data analysis and give a big-picture view of your data: they can provide valuable QC information, help assess expected trends, and identify unexpected patterns.

    • Deliverables: 
      • Plots in PNG and PDF format
      • Results from any additional algorithms applied may be provided as tab-delimited or Excel tables, as appropriate
    • Tools Used:
      • Hierarchical clustering: of both genes and samples.
      • Principal components analysis (PCA): biplot of the data after centering along both the gene and sample axes (optionally also scaling along the gene axis).
      • Other methods (e.g., k-means clustering, self-organizing maps, multidimensional scaling) are available if desired.
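    The PCA step described above can be sketched as follows. This is a minimal illustration using only numpy, not the pipeline actually used for the service: the function names (double_center, pca_scores), the genes-by-samples layout, the simulated data, and the choice to scale genes after centering are all assumptions made for the example.

```python
import numpy as np

def double_center(x):
    """Center a genes x samples matrix along both axes.

    Subtracts each gene's mean and each sample's mean, then adds back the
    grand mean so that every row and every column averages to zero.
    """
    x = np.asarray(x, dtype=float)
    row_means = x.mean(axis=1, keepdims=True)
    col_means = x.mean(axis=0, keepdims=True)
    return x - row_means - col_means + x.mean()

def pca_scores(expr, n_components=2, scale_genes=False):
    """PCA via SVD of the double-centered matrix.

    Returns sample scores, gene loadings, and the fraction of variance
    explained by each retained component.
    """
    x = double_center(expr)
    if scale_genes:
        # Assumption: scaling is applied after centering, one gene at a time.
        sd = x.std(axis=1, ddof=1, keepdims=True)
        x = x / np.where(sd > 0, sd, 1.0)
    u, s, vt = np.linalg.svd(x, full_matrices=False)
    scores = vt[:n_components].T * s[:n_components]   # samples x components
    loadings = u[:, :n_components]                    # genes x components
    explained = (s ** 2) / np.sum(s ** 2)
    return scores, loadings, explained[:n_components]

# Simulated log-scale expression: 200 genes, 6 samples in two groups of 3.
rng = np.random.default_rng(0)
expr = rng.normal(scale=0.5, size=(200, 6))
expr[:50, 3:] += 4.0   # 50 genes are higher in the second group

scores, loadings, explained = pca_scores(expr)
print(scores[:, 0])    # first-component score of each sample
print(explained)
```

    Plotting the two columns of scores against each other gives the sample half of a biplot; the rows of loadings give the gene half.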

    Empirical Bayes Differential Expression Analysis

    RNAseq experiments yield simultaneous measurements of many intrinsically similar variables (gene expression levels), often with limited sample sizes. Empirical Bayes methods are a statistical approach designed for just such situations: they "borrow strength" across genes to increase statistical power and reduce false discoveries.
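    To make "borrowing strength" concrete, the sketch below shrinks each gene's variance estimate toward a common prior value before computing a two-group t-statistic, in the spirit of a moderated t-test. It is an illustration only and is not Limma: the prior degrees of freedom (prior_df) and prior variance (prior_var) are hypothetical inputs here, with the median of the per-gene variances used as a crude stand-in, whereas a real empirical Bayes fit estimates both from the data.

```python
import numpy as np

def moderated_t(expr, group, prior_df=4.0, prior_var=None):
    """Two-group t-statistics with variances shrunk toward a common prior.

    expr  : genes x samples matrix of (log-scale) expression values
    group : boolean vector, True for samples in the first group
    """
    expr = np.asarray(expr, dtype=float)
    group = np.asarray(group, dtype=bool)
    a, b = expr[:, group], expr[:, ~group]
    n1, n2 = a.shape[1], b.shape[1]
    resid_df = n1 + n2 - 2

    diff = a.mean(axis=1) - b.mean(axis=1)
    # Ordinary pooled variance, estimated separately for every gene.
    s2 = ((n1 - 1) * a.var(axis=1, ddof=1)
          + (n2 - 1) * b.var(axis=1, ddof=1)) / resid_df

    if prior_var is None:
        prior_var = np.median(s2)   # crude stand-in, NOT a fitted prior

    # Posterior variance: weighted average of the prior and the gene's own
    # estimate, weighted by their degrees of freedom.
    s2_post = (prior_df * prior_var + resid_df * s2) / (prior_df + resid_df)

    v = 1.0 / n1 + 1.0 / n2
    t_ordinary = diff / np.sqrt(s2 * v)
    t_moderated = diff / np.sqrt(s2_post * v)
    return t_ordinary, t_moderated, s2, s2_post, prior_df + resid_df

# Simulated data: 500 genes, 3 samples per group.
rng = np.random.default_rng(1)
expr = rng.normal(size=(500, 6))
group = np.array([True, True, True, False, False, False])

t_ord, t_mod, s2, s2_post, total_df = moderated_t(expr, group)
print(np.abs(t_ord).max(), np.abs(t_mod).max())
```

    With only a few samples per group, some genes get very small variance estimates by chance and therefore very large ordinary t-statistics; shrinking the variances pulls those extreme values back in.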

    • Deliverables:
      • Tables of model parameters, p-values, and FDR q-values (in tab-delimited and Excel formats)
      • Boxplots (stratified by sample group) and pairs plots of top genes, provided in PNG and PDF format

    • Tools Used:
      • Limma: applies empirical Bayes methods when fitting linear models (e.g., t-tests, ANOVA) for a wide variety of experimental designs. Originally designed for microarray data analysis, Limma has since been substantially extended by its developers to handle RNAseq data as well.
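    The FDR q-values listed among the deliverables adjust the per-gene p-values for multiple testing. The text does not say which procedure is used; the sketch below assumes the Benjamini-Hochberg step-up method, and the function name bh_qvalues and the example p-values are made up for illustration.

```python
import numpy as np

def bh_qvalues(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    p = np.asarray(pvalues, dtype=float)
    m = p.size
    order = np.argsort(p)
    # Scale the i-th smallest p-value by m / i.
    adjusted = p[order] * m / np.arange(1, m + 1)
    # Enforce monotonicity: running minimum from the largest p-value down.
    adjusted = np.minimum.accumulate(adjusted[::-1])[::-1]
    q = np.empty(m)
    q[order] = np.clip(adjusted, 0.0, 1.0)
    return q

pvals = [0.01, 0.04, 0.03, 0.005]
qvals = bh_qvalues(pvals)
print(qvals)
```

    Calling a gene significant when its q-value is below 0.05 aims to keep the expected proportion of false discoveries among the called genes at or below 5%.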

    Supervised Analysis

    Many methods are available for classification and regression, as appropriate to your analysis. Model performance may be assessed using standard metrics evaluated under cross-validation, or on independent test sets if available. Analysis will be conducted using R and/or Python scripts.

    • Deliverables:
      • Tables of results (in tab-delimited and Excel formats)
      • Plots in PNG and PDF format
      • R and/or Python source files
      • Binary, JSON, or XML representations of R or Python objects can be made available if desired
      • Further reports in the form of slides or text documents may be provided in standard formats (pdf, doc, ppt) if desired
    • Methods Available:
      • Diagonal linear discriminant analysis (DLDA, a form of linear naive Bayes classification)
      • Linear and quadratic discriminant analysis
      • Logistic regression including L1/lasso and/or L2/ridge regularization if desired
      • Partial least squares (PLS) discriminant analysis and regression
      • k-nearest neighbors (KNN)
      • Support vector machines (SVM)
      • Decision tree ensembles (random forests or AdaBoost)
      • Other methods are available on request.
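    As an illustration of the first method in the list, together with the cross-validated assessment mentioned earlier, here is a minimal numpy sketch of diagonal linear discriminant analysis scored by k-fold cross-validation. The class name DLDA, the helper cross_val_accuracy, the samples-by-genes layout, the variance floor, and the simulated data are all assumptions made for this example rather than the scripts delivered with an analysis.

```python
import numpy as np

class DLDA:
    """Diagonal LDA: class means plus one pooled variance per gene."""

    def fit(self, x, y):
        x, y = np.asarray(x, dtype=float), np.asarray(y)
        self.classes_ = np.unique(y)
        self.means_ = np.array([x[y == c].mean(axis=0) for c in self.classes_])
        # Residuals around each sample's own class mean.
        resid = x - self.means_[np.searchsorted(self.classes_, y)]
        pooled = (resid ** 2).sum(axis=0) / (len(y) - len(self.classes_))
        self.var_ = np.maximum(pooled, 1e-8)   # floor avoids division by zero
        self.log_prior_ = np.log([np.mean(y == c) for c in self.classes_])
        return self

    def predict(self, x):
        x = np.asarray(x, dtype=float)
        # Variance-scaled squared distance from each sample to each class mean.
        dist = ((x[:, None, :] - self.means_[None, :, :]) ** 2 / self.var_).sum(axis=2)
        return self.classes_[np.argmax(self.log_prior_ - 0.5 * dist, axis=1)]

def cross_val_accuracy(x, y, n_folds=5, seed=0):
    """Plain (unstratified) k-fold cross-validated accuracy."""
    x, y = np.asarray(x, dtype=float), np.asarray(y)
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    correct = 0
    for test_idx in folds:
        train = np.ones(len(y), dtype=bool)
        train[test_idx] = False
        model = DLDA().fit(x[train], y[train])
        correct += int(np.sum(model.predict(x[test_idx]) == y[test_idx]))
    return correct / len(y)

# Simulated data: 40 samples x 50 genes (note: samples are rows here).
rng = np.random.default_rng(2)
y = np.repeat([0, 1], 20)
x = rng.normal(size=(40, 50))
x[y == 1, :10] += 2.0   # 10 informative genes

accuracy = cross_val_accuracy(x, y)
print(accuracy)
```

    If genes are filtered or selected before modeling, that step needs to be repeated inside each training fold; selecting genes on the full data set first makes the cross-validated accuracy optimistic.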
     