Difference between revisions of "Linear models for Microarray analysis"

From Organic Design wiki
m (Object orintated programming environment)
m (Object orintated programming environment)
Line 29: Line 29:
 
**<font color="Red">R (''Red foreground'')</font>
 
**<font color="Red">R (''Red foreground'')</font>
 
**<font color="Green">G (''Green foreground'')</font>
 
**<font color="Green">G (''Green foreground'')</font>
:: ''Foreground intensities range ~ 1 &rarr; 65535''
+
*''Foreground intensities range ~ 1 &rarr; 65535''
 
 
 
**<font color="Red">Rb (''Red background'')</font>
 
**<font color="Red">Rb (''Red background'')</font>
 
**<font color="Green">Gb (''Green background'')</font>
 
**<font color="Green">Gb (''Green background'')</font>
:: ''Background intensities range ~ 1 &rarr; 1000''
+
* ''Background intensities range ~ 1 &rarr; 1000''
 
 
 
**genes (''Spot annotation list'')
 
**genes (''Spot annotation list'')
 
**weights (''prior weights weights given to each spot'')  
 
**weights (''prior weights weights given to each spot'')  
  
*MAList data transformation:
+
*MAList data transformation
 
**M = log2(R) - log2(G) (''minus'')
 
**M = log2(R) - log2(G) (''minus'')
 
**A = (log2(R) + log2(G))/2) (''add - abundance'')
 
**A = (log2(R) + log2(G))/2) (''add - abundance'')
  
*''Backtransforming to Normalized R', G' values''
+
*Backtransforming to Normalized R', G' values
 
**log2(R') = A + M/2
 
**log2(R') = A + M/2
 
**log2(G') = A - M/2
 
**log2(G') = A - M/2

Revision as of 20:57, 14 March 2006

Overview of Limma package for R
  • Fits a linear model for each spot (gene)
  • An open source software package for the R programming environment
  • Focus on normalization and statistical analysis of cDNA microarray gene expression data
  • OOP environment for handling information in a microarray experiment
  • Statistical analysis approach can be used for Affymetrix microarray experiments

Origin
  • Written and maintained by Gordon Smyth with contributions From WEHI, Melbourne, Australia
  • Software made public at the Australian Genstat Conference, Perth, in Dec 2002
  • Became available in the Bioconductor open source bioinformatics project April 2003
  • Limma integrates with other Bioconductor software packages, affy, marray, using convert package
  • Active development cycle

File:Limma versions.tiff


Statistical approach

  • Parallel inference for each gene
    • univariate analysis approach
  • Computationally fast/robust
  • Handles missing information/use defined flag information
  • Linear models are essentially t-statistics for each spot/gene (signal/noise)
  • Also makes use of between gene information (moderated t-statistics)

Object orintated programming environment

File:OOP.tiff

  • Uploading data into the R programming language automatically populates elements of RGList
    • R (Red foreground)
    • G (Green foreground)
  • Foreground intensities range ~ 1 → 65535
    • Rb (Red background)
    • Gb (Green background)
  • Background intensities range ~ 1 → 1000
    • genes (Spot annotation list)
    • weights (prior weights weights given to each spot)
  • MAList data transformation
    • M = log2(R) - log2(G) (minus)
    • A = (log2(R) + log2(G))/2) (add - abundance)
  • Backtransforming to Normalized R', G' values
    • log2(R') = A + M/2
    • log2(G') = A - M/2

Scratchpad

  • benefits/limitations?
  • FDR control? → ranking better than selecting cutoff