Golub {mpm} | R Documentation |
Golub (1999) Data
Description
Golub et al. (1999) data on gene expression profiles of 38
patients suffering from acute leukemia and a validation sample
of 34 patients.
Usage
data(Golub)
Format
The expression data are available in data frame Golub
with 5327 observations on the following 73 variables.
Gene
- a character vector with gene identifiers
1
- gene expression data for sample 1
2
- gene expression data for sample 2
3
- gene expression data for sample 3
4
- gene expression data for sample 4
5
- gene expression data for sample 5
6
- gene expression data for sample 6
7
- gene expression data for sample 7
8
- gene expression data for sample 8
9
- gene expression data for sample 9
10
- gene expression data for sample 10
11
- gene expression data for sample 11
12
- gene expression data for sample 12
13
- gene expression data for sample 13
14
- gene expression data for sample 14
15
- gene expression data for sample 15
16
- gene expression data for sample 16
17
- gene expression data for sample 17
18
- gene expression data for sample 18
19
- gene expression data for sample 19
20
- gene expression data for sample 20
21
- gene expression data for sample 21
22
- gene expression data for sample 22
23
- gene expression data for sample 23
24
- gene expression data for sample 24
25
- gene expression data for sample 25
26
- gene expression data for sample 26
27
- gene expression data for sample 27
34
- gene expression data for sample 34
35
- gene expression data for sample 35
36
- gene expression data for sample 36
37
- gene expression data for sample 37
38
- gene expression data for sample 38
28
- gene expression data for sample 28
29
- gene expression data for sample 29
30
- gene expression data for sample 30
31
- gene expression data for sample 31
32
- gene expression data for sample 32
33
- gene expression data for sample 33
39
- gene expression data for sample 39
40
- gene expression data for sample 40
42
- gene expression data for sample 42
47
- gene expression data for sample 47
48
- gene expression data for sample 48
49
- gene expression data for sample 49
41
- gene expression data for sample 41
43
- gene expression data for sample 43
44
- gene expression data for sample 44
45
- gene expression data for sample 45
46
- gene expression data for sample 46
70
- gene expression data for sample 70
71
- gene expression data for sample 71
72
- gene expression data for sample 72
68
- gene expression data for sample 68
69
- gene expression data for sample 69
67
- gene expression data for sample 67
55
- gene expression data for sample 55
56
- gene expression data for sample 56
59
- gene expression data for sample 59
52
- gene expression data for sample 52
53
- gene expression data for sample 53
51
- gene expression data for sample 51
50
- gene expression data for sample 50
54
- gene expression data for sample 54
57
- gene expression data for sample 57
58
- gene expression data for sample 58
60
- gene expression data for sample 60
61
- gene expression data for sample 61
65
- gene expression data for sample 65
66
- gene expression data for sample 66
63
- gene expression data for sample 63
64
- gene expression data for sample 64
62
- gene expression data for sample 62
The classes are in a separate numeric vector Golub.grp
with values
1
for the 38 ALL B-Cell samples, 2
for the 9 ALL T-Cell samples
and 3
for the 25 AML samples.
Details
The original data of Golub et al. (1999) were preprocessed
as follows: genes that were called 'absent' in all samples
were removed from the data sets, since these measurements
are considered unreliable by the manufacturer of the technology.
Negative measurements in the data were set to 1.
The resulting data frame contains 5327 genes of the 6817
originally reported by Golub et al. (1999).
Note
Luc Wouters et al. (2003), p. 1134 contains a typo
concerning the sample sizes of AML- and ALL-type and erroneously reported
Source
Golub, T. R., Slonim, D. K., Tamayo, P., et al. (1999). Molecular
classification of cancer: Class discovery and class prediction by
gene expression monitoring. Science 286, 531 – 537.
References
Luc Wouters et al. (2003). Graphical Exploration of Gene Expression Data:
A Comparative Study of Three Multivariate Methods, Biometrics, 59, 1131-1139.
[Package
mpm version 1.0-12
Index]