Difference between revisions of "Talk:RP.R"

From Organic Design wiki
 
 
Line 1: Line 1:
 +
<table class=document-code><tr><td>
 
RP                package:RankProd                R Documentation
 
RP                package:RankProd                R Documentation
  
Line 119: Line 120:
 
           #pfp for identifying genes that are down-regulated in class 2  
 
           #pfp for identifying genes that are down-regulated in class 2  
 
           head(RP.out$pfp)
 
           head(RP.out$pfp)
 +
</table>
 +
 +
=RankProd1=
 +
The following line is suspect as it always pads out data.rep NA's with 1 in rank.rep
 +
  rank.rep[is.na(data.rep)] = 1

Latest revision as of 02:58, 20 March 2007

RP package:RankProd R Documentation

Rank Product Analysis of Microarray

Description:

    Perform rank product method to identify differentially  expressed
    genes. It is possible to do either a one-class or two-class
    analysis.

Usage:

        RP(data,cl,num.perm=100,logged=TRUE,
           na.rm=FALSE,gene.names=NULL,plot=FALSE, rand=NULL)

Arguments:

   data: the data set that should be analyzed. Every  row of this data
         set must correspond to a gene.
     cl: a vector containing the class labels of the samples. In the
         two class unpaired case, the label of a  sample is either 0
         (e.g., control group) or 1  (e.g., case group). For one class
          data, the label for  each sample should be 1.

num.perm: number of permutations used in the calculation of the null

         density. Default is 'num.perm=100'.
 logged: if "TRUE", data has bee logged, otherwise set it  to "FALSE"
  na.rm: if 'FALSE' (default), the NA value will not be used in
         computing rank. If 'TRUE', the missing  values will be
         replaced by the gene-wise mean of the non-missing values.
         Gene with all values missing  will be assigned "NA"

gene.names: if "NULL", no gene name will be assigned to the estimated

         percentage of  false positive predictions (pfp).
   plot: If "TRUE", plot the estimated pfp verse the  rank of each
         gene.
   rand: if specified, the random number generator will  be put in a
         reproducible state using the rand value as seed.

Value:

    A result of identifying differentially expressed genes  between
    two classes. The identification consists of two parts, the
    identification of  up-regulated  and down-regulated genes in 
    class 2 compared to class 1, respectively.
    pfp: estimated percentage of false positive predictions (pfp) up
         to  the position of each gene under two  identificaiton each
   pval: estimated pvalue for each gene being up- and down-regulated
    RPs: Original rank-product of each genes for two  dentificaiton
         each 
 RPrank: rank of the rank product of each genes
Orirank: original rank in each comparison, which  is used to construct
         rank product
  AveFC: fold change of average expression under class 1 over  that
         under class 2. log-fold change if data is in log  scaled,
         original fold change if data is unlogged. 

Note:

    Percentage of false prediction (pfp), in theory, is  equivalent of
    false  discovery rate (FDR), and it is possible to be large than
    1.
    The function looks for up- and down- regulated genes in two
    seperate steps, thus two pfps and pvalues are computed and used to
    identify  gene that belong to each group.   
    This function is suitable to deal with data from a  single origin,
    e.g. single  experiment. If the data has  different origin, e.g.
    generated at different  laboratories, please refer RP.advance.

Author(s):

    Fangxin Hong fhong@salk.edu

References:

    Breitling, R., Armengaud, P., Amtmann, A., and Herzyk,  P.(2004)
    Rank Products:A simple, yet powerful, new method to  detect
    differentially regulated genes in replicated microarray
    experiments, _FEBS Letter_, 57383-92

See Also:

    'topGene'   'RPadvance'   'plotRP'

Examples:


          # Load the data of Golub et al. (1999). data(golub) 
          # contains a 3051x38 gene expression
          # matrix called golub, a vector of length called golub.cl 
          # that consists of the 38 class labels,
          # and a matrix called golub.gnames whose third column 
          # contains the gene names.
          data(golub)


          #use a subset of data as example, apply the rank 
          #product method
          subset <- c(1:4,28:30)
          #Setting rand=123, to make the results reproducible,
          RP.out <- RP(golub[,subset],golub.cl[subset],rand=123) 
          
          # class 2: label =1, class 1: label = 0
          #pfp for identifying genes that are up-regulated in class 2 
          #pfp for identifying genes that are down-regulated in class 2 
          head(RP.out$pfp)

RankProd1

The following line is suspect as it always pads out data.rep NA's with 1 in rank.rep

 rank.rep[is.na(data.rep)] = 1