singh2002 {sda} | R Documentation
Gene expression data (6033 genes for 102 samples) from the microarray study of Singh et al. (2002).
data(singh2002)
singh2002$x
is a 102 x 6033 matrix containing the expression levels.
The rows contain the samples and the columns the genes.
singh2002$y
is a factor containing the diagnosis for each sample ("cancer" or "healthy").
This data set contains measurements of the gene expression of 6033 genes for 102 observations: 52 prostate cancer patients and 50 healthy men.
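As a minimal sketch of how the two components fit together, the following checks that the labels line up with the rows of the expression matrix, tabulates the diagnoses, and splits the samples by group. The variable names (x, y, class_counts, x_cancer, x_healthy, mean_diff) are illustrative and not part of the package; the per-gene mean difference is just one simple summary, not the analysis used in the cited papers.

```r
# Load the package and the data set
library("sda")
data(singh2002)

x <- singh2002$x   # numeric matrix: samples in rows, genes in columns
y <- singh2002$y   # factor of diagnoses, one entry per row of x

# The number of labels should equal the number of samples (rows)
stopifnot(length(y) == nrow(x))

# Samples per diagnosis; the description above gives 52 cancer and 50 healthy
class_counts <- table(y)
print(class_counts)

# Split the expression matrix by diagnosis (drop = FALSE keeps matrix shape)
x_cancer  <- x[y == "cancer",  , drop = FALSE]
x_healthy <- x[y == "healthy", , drop = FALSE]

# Per-gene difference in mean expression between the two groups
mean_diff <- colMeans(x_cancer) - colMeans(x_healthy)
```

Here mean_diff has one entry per gene (column), so it can be sorted or plotted to get a rough first look at which genes differ between groups.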
The data are described in Singh et al. (2002) and are provided in exactly the form used by Efron (2008); see http://www-stat.stanford.edu/~ckirby/brad/papers/Ebaydata.R.
Singh, D., et al. 2002. Gene expression correlates of clinical prostate cancer behavior. Cancer Cell 1:203–209.
Efron, B. 2008. Empirical Bayes estimates for large-scale prediction problems. Technical Report, Stanford University.
# load sda library
library("sda")

# load Singh et al. (2002) data set
data(singh2002)

dim(singh2002$x) # 102 6033
hist(singh2002$x)
singh2002$y # 2 levels