Golub {mpm}    R Documentation

Golub (1999) Data

Description

Golub et al. (1999) data on gene expression profiles of 38 patients suffering from acute leukemia and a validation sample of 34 patients.

Usage

data(Golub)

Format

The expression data are available in the data frame Golub, with 5327 observations (genes) on the following 73 variables (one gene identifier column and 72 sample columns).

Gene
a character vector with gene identifiers
1
gene expression data for sample 1
2
gene expression data for sample 2
3
gene expression data for sample 3
4
gene expression data for sample 4
5
gene expression data for sample 5
6
gene expression data for sample 6
7
gene expression data for sample 7
8
gene expression data for sample 8
9
gene expression data for sample 9
10
gene expression data for sample 10
11
gene expression data for sample 11
12
gene expression data for sample 12
13
gene expression data for sample 13
14
gene expression data for sample 14
15
gene expression data for sample 15
16
gene expression data for sample 16
17
gene expression data for sample 17
18
gene expression data for sample 18
19
gene expression data for sample 19
20
gene expression data for sample 20
21
gene expression data for sample 21
22
gene expression data for sample 22
23
gene expression data for sample 23
24
gene expression data for sample 24
25
gene expression data for sample 25
26
gene expression data for sample 26
27
gene expression data for sample 27
34
gene expression data for sample 34
35
gene expression data for sample 35
36
gene expression data for sample 36
37
gene expression data for sample 37
38
gene expression data for sample 38
28
gene expression data for sample 28
29
gene expression data for sample 29
30
gene expression data for sample 30
31
gene expression data for sample 31
32
gene expression data for sample 32
33
gene expression data for sample 33
39
gene expression data for sample 39
40
gene expression data for sample 40
42
gene expression data for sample 42
47
gene expression data for sample 47
48
gene expression data for sample 48
49
gene expression data for sample 49
41
gene expression data for sample 41
43
gene expression data for sample 43
44
gene expression data for sample 44
45
gene expression data for sample 45
46
gene expression data for sample 46
70
gene expression data for sample 70
71
gene expression data for sample 71
72
gene expression data for sample 72
68
gene expression data for sample 68
69
gene expression data for sample 69
67
gene expression data for sample 67
55
gene expression data for sample 55
56
gene expression data for sample 56
59
gene expression data for sample 59
52
gene expression data for sample 52
53
gene expression data for sample 53
51
gene expression data for sample 51
50
gene expression data for sample 50
54
gene expression data for sample 54
57
gene expression data for sample 57
58
gene expression data for sample 58
60
gene expression data for sample 60
61
gene expression data for sample 61
65
gene expression data for sample 65
66
gene expression data for sample 66
63
gene expression data for sample 63
64
gene expression data for sample 64
62
gene expression data for sample 62

The class labels are stored in a separate numeric vector, Golub.grp, with value 1 for the 38 ALL B-cell samples, 2 for the 9 ALL T-cell samples, and 3 for the 25 AML samples.
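
As a rough illustration of this layout, the sketch below builds a small toy data frame with the same shape (a Gene column followed by sample columns named by sample number, not in numeric order) together with a stand-in for the class vector. The toy values and the names toy, toy.grp, expr, and expr.sorted are assumptions made for illustration only and are not part of the package. With mpm installed, data(Golub, package = "mpm") would supply the real data frame instead.

```r
# Toy stand-in for the documented layout: one identifier column plus
# sample columns whose names are sample numbers (deliberately unsorted).
toy <- data.frame(
  Gene = c("g1", "g2", "g3"),
  `1`  = c(120, 15, 300),
  `3`  = c(80, 22, 410),
  `2`  = c(95, 18, 350),
  check.names = FALSE,
  stringsAsFactors = FALSE
)

# Hypothetical class labels, one per sample column, in column order
# (1 = ALL B-cell, 2 = ALL T-cell, 3 = AML, the coding documented above).
toy.grp <- c(1, 3, 2)

# Expression matrix: drop the identifier column, keep gene ids as row names.
expr <- as.matrix(toy[, -1])
rownames(expr) <- toy$Gene

# There must be exactly one label per sample column.
stopifnot(length(toy.grp) == ncol(expr))

# Reorder the columns (and the labels with them) by numeric sample number.
ord <- order(as.integer(colnames(expr)))
expr.sorted <- expr[, ord, drop = FALSE]
grp.sorted  <- toy.grp[ord]

# Number of samples per class.
class.counts <- table(factor(toy.grp, levels = 1:3,
                             labels = c("ALL-B", "ALL-T", "AML")))

# The documented class sizes account for all 72 sample columns.
n.samples.documented <- 38 + 9 + 25
```

Keeping the labels in the same order as the sample columns matters whenever the columns are rearranged, which is why the sketch reorders both with the same index.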

Details

The original data of Golub et al. (1999) were preprocessed as follows: genes that were called 'absent' in all samples were removed from the data sets, since these measurements are considered unreliable by the manufacturer of the technology. Negative measurements in the data were set to 1.

The resulting data frame contains 5327 genes of the 6817 originally reported by Golub et al. (1999).
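
The two preprocessing steps described above can be sketched on toy data as follows. This is a minimal illustration under stated assumptions, not the code used to build the data set: the intensity values, the matrix of detection calls coded as "P" (present) and "A" (absent), and the names raw, calls, and kept are all hypothetical.

```r
# Toy raw intensities (rows = genes, columns = samples); values are made up.
raw <- matrix(c( 50, -20, 130,
                 -5,  -8,  12,
                200, 310, -40),
              nrow = 3, byrow = TRUE,
              dimnames = list(c("g1", "g2", "g3"), c("1", "2", "3")))

# Hypothetical detection calls of the same shape: "P" present, "A" absent.
calls <- matrix(c("P", "A", "P",
                  "A", "A", "A",
                  "P", "P", "A"),
                nrow = 3, byrow = TRUE, dimnames = dimnames(raw))

# Step 1: drop genes that were called absent in every sample.
all.absent <- rowSums(calls == "A") == ncol(calls)
kept <- raw[!all.absent, , drop = FALSE]

# Step 2: set the remaining negative measurements to 1.
kept[kept < 0] <- 1
```

In this toy example gene g2 is removed because it is absent in all three samples, and the negative values left in g1 and g3 are replaced by 1.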

Note

Luc Wouters et al. (2003), p. 1134, contains a typo: the sample sizes of the AML and ALL types are reported erroneously there.

Source

Golub, T. R., Slonim, D. K., Tamayo, P., et al. (1999). Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science, 286, 531-537.

References

Luc Wouters et al. (2003). Graphical Exploration of Gene Expression Data: A Comparative Study of Three Multivariate Methods, Biometrics, 59, 1131-1139.


[Package mpm version 1.0-12 Index]