Difference between revisions of "Limma analysis"
m |
(→Algorithm details: List elements) |
||
Line 9: | Line 9: | ||
The linear model reduces to effectively estimating average M values using a categorical design matrix. If correlation between rows (spots) is estimated, then the function ''duplicateCorrelation'' is called. This fits a reml model on all genes to estimate a ''rho'' correlation matrix. A ''[[Wikipedia:Fisher transformation|fisher transformation]]'' (identical to ''atanh(x)'') is then applied to the ''rho'' matrix and an average ''rho'' calculating an mean correlation with trim=0.15 by default, which is then backtransformed to give a ''consensus correlation''. This correlation can be utilised in lmFit by calling ''gls.series'' which fits a generalized least squares model. | The linear model reduces to effectively estimating average M values using a categorical design matrix. If correlation between rows (spots) is estimated, then the function ''duplicateCorrelation'' is called. This fits a reml model on all genes to estimate a ''rho'' correlation matrix. A ''[[Wikipedia:Fisher transformation|fisher transformation]]'' (identical to ''atanh(x)'') is then applied to the ''rho'' matrix and an average ''rho'' calculating an mean correlation with trim=0.15 by default, which is then backtransformed to give a ''consensus correlation''. This correlation can be utilised in lmFit by calling ''gls.series'' which fits a generalized least squares model. | ||
+ | |||
+ | ===List elements from lmFit=== | ||
+ | <table class=document-code><tr><td> | ||
+ | > names(fit) | ||
+ | [1] "coefficients" "rank" "assign" "qr" | ||
+ | [5] "df.residual" "sigma" "cov.coefficients" "stdev.unscaled" | ||
+ | [9] "pivot" "method" "design" "Amean" | ||
+ | [13] "genes" | ||
+ | |||
+ | *coefficients= ''estimated M values'' | ||
+ | *Amean = ''Estimated unweighted A values'' | ||
+ | </table> |
Revision as of 23:06, 19 July 2006
Linear models for microarray analysis
Linear models for microarray analysis (Limma) is a R and Bioconductor package for organising and analysing cDNA and Affymetrix microarray data. It is written by Gordon Smyth at WEHI.
Algorithm details
For a p * n matrix of expression intensities, Limma is fitting p linear models (one for each row). The lmFit function does this by calling functions such as lm.series which use lm.fit in Package:Stats. For cDNA/oligo two spotted technologies the matrix of expression intensities is usually the marix M values with respect to treatments. For Affymetrix single channel arrays the expression intensities are directly analysed comparing two treatments.
The linear model reduces to effectively estimating average M values using a categorical design matrix. If correlation between rows (spots) is estimated, then the function duplicateCorrelation is called. This fits a reml model on all genes to estimate a rho correlation matrix. A fisher transformation (identical to atanh(x)) is then applied to the rho matrix and an average rho calculating an mean correlation with trim=0.15 by default, which is then backtransformed to give a consensus correlation. This correlation can be utilised in lmFit by calling gls.series which fits a generalized least squares model.
List elements from lmFit
> names(fit) [1] "coefficients" "rank" "assign" "qr" [5] "df.residual" "sigma" "cov.coefficients" "stdev.unscaled" [9] "pivot" "method" "design" "Amean" [13] "genes"
|