Difference between revisions of "Linear models for Microarray analysis"
From Organic Design wiki
m (→Object orintated programming environment) |
m (→Object orintated programming environment) |
||
Line 29: | Line 29: | ||
**<font color="Red">R (''Red foreground'')</font> | **<font color="Red">R (''Red foreground'')</font> | ||
**<font color="Green">G (''Green foreground'')</font> | **<font color="Green">G (''Green foreground'')</font> | ||
− | + | *''Foreground intensities range ~ 1 → 65535'' | |
− | |||
**<font color="Red">Rb (''Red background'')</font> | **<font color="Red">Rb (''Red background'')</font> | ||
**<font color="Green">Gb (''Green background'')</font> | **<font color="Green">Gb (''Green background'')</font> | ||
− | + | * ''Background intensities range ~ 1 → 1000'' | |
− | |||
**genes (''Spot annotation list'') | **genes (''Spot annotation list'') | ||
**weights (''prior weights weights given to each spot'') | **weights (''prior weights weights given to each spot'') | ||
− | *MAList data transformation | + | *MAList data transformation |
**M = log2(R) - log2(G) (''minus'') | **M = log2(R) - log2(G) (''minus'') | ||
**A = (log2(R) + log2(G))/2) (''add - abundance'') | **A = (log2(R) + log2(G))/2) (''add - abundance'') | ||
− | * | + | *Backtransforming to Normalized R', G' values |
**log2(R') = A + M/2 | **log2(R') = A + M/2 | ||
**log2(G') = A - M/2 | **log2(G') = A - M/2 |
Revision as of 20:57, 14 March 2006
Overview of Limma package for R
- Fits a linear model for each spot (gene)
- An open source software package for the R programming environment
- Focus on normalization and statistical analysis of cDNA microarray gene expression data
- OOP environment for handling information in a microarray experiment
- Statistical analysis approach can be used for Affymetrix microarray experiments
Origin
- Written and maintained by Gordon Smyth with contributions From WEHI, Melbourne, Australia
- Software made public at the Australian Genstat Conference, Perth, in Dec 2002
- Became available in the Bioconductor open source bioinformatics project April 2003
- Limma integrates with other Bioconductor software packages, affy, marray, using convert package
- Active development cycle
Statistical approach
- Parallel inference for each gene
- univariate analysis approach
- Computationally fast/robust
- Handles missing information/use defined flag information
- Linear models are essentially t-statistics for each spot/gene (signal/noise)
- Also makes use of between gene information (moderated t-statistics)
Object orintated programming environment
- Uploading data into the R programming language automatically populates elements of RGList
- R (Red foreground)
- G (Green foreground)
- Foreground intensities range ~ 1 → 65535
- Rb (Red background)
- Gb (Green background)
- Background intensities range ~ 1 → 1000
- genes (Spot annotation list)
- weights (prior weights weights given to each spot)
- MAList data transformation
- M = log2(R) - log2(G) (minus)
- A = (log2(R) + log2(G))/2) (add - abundance)
- Backtransforming to Normalized R', G' values
- log2(R') = A + M/2
- log2(G') = A - M/2
Scratchpad
- benefits/limitations?
- FDR control? → ranking better than selecting cutoff