phoneme {ElemStatLearn} | R Documentation |
Data From a Acoustic-Phonetic Continuous Speech Corpus
Description
See Details.
Usage
data(phoneme)
Format
A data frame with 4509 observations on the following 258 variables.
- x.1
- a numeric vector
- x.2
- a numeric vector
- x.3
- a numeric vector
- x.4
- a numeric vector
- x.5
- a numeric vector
- x.6
- a numeric vector
- x.7
- a numeric vector
- x.8
- a numeric vector
- x.9
- a numeric vector
- x.10
- a numeric vector
- x.11
- a numeric vector
- x.12
- a numeric vector
- x.13
- a numeric vector
- x.14
- a numeric vector
- x.15
- a numeric vector
- x.16
- a numeric vector
- x.17
- a numeric vector
- x.18
- a numeric vector
- x.19
- a numeric vector
- x.20
- a numeric vector
- x.21
- a numeric vector
- x.22
- a numeric vector
- x.23
- a numeric vector
- x.24
- a numeric vector
- x.25
- a numeric vector
- x.26
- a numeric vector
- x.27
- a numeric vector
- x.28
- a numeric vector
- x.29
- a numeric vector
- x.30
- a numeric vector
- x.31
- a numeric vector
- x.32
- a numeric vector
- x.33
- a numeric vector
- x.34
- a numeric vector
- x.35
- a numeric vector
- x.36
- a numeric vector
- x.37
- a numeric vector
- x.38
- a numeric vector
- x.39
- a numeric vector
- x.40
- a numeric vector
- x.41
- a numeric vector
- x.42
- a numeric vector
- x.43
- a numeric vector
- x.44
- a numeric vector
- x.45
- a numeric vector
- x.46
- a numeric vector
- x.47
- a numeric vector
- x.48
- a numeric vector
- x.49
- a numeric vector
- x.50
- a numeric vector
- x.51
- a numeric vector
- x.52
- a numeric vector
- x.53
- a numeric vector
- x.54
- a numeric vector
- x.55
- a numeric vector
- x.56
- a numeric vector
- x.57
- a numeric vector
- x.58
- a numeric vector
- x.59
- a numeric vector
- x.60
- a numeric vector
- x.61
- a numeric vector
- x.62
- a numeric vector
- x.63
- a numeric vector
- x.64
- a numeric vector
- x.65
- a numeric vector
- x.66
- a numeric vector
- x.67
- a numeric vector
- x.68
- a numeric vector
- x.69
- a numeric vector
- x.70
- a numeric vector
- x.71
- a numeric vector
- x.72
- a numeric vector
- x.73
- a numeric vector
- x.74
- a numeric vector
- x.75
- a numeric vector
- x.76
- a numeric vector
- x.77
- a numeric vector
- x.78
- a numeric vector
- x.79
- a numeric vector
- x.80
- a numeric vector
- x.81
- a numeric vector
- x.82
- a numeric vector
- x.83
- a numeric vector
- x.84
- a numeric vector
- x.85
- a numeric vector
- x.86
- a numeric vector
- x.87
- a numeric vector
- x.88
- a numeric vector
- x.89
- a numeric vector
- x.90
- a numeric vector
- x.91
- a numeric vector
- x.92
- a numeric vector
- x.93
- a numeric vector
- x.94
- a numeric vector
- x.95
- a numeric vector
- x.96
- a numeric vector
- x.97
- a numeric vector
- x.98
- a numeric vector
- x.99
- a numeric vector
- x.100
- a numeric vector
- x.101
- a numeric vector
- x.102
- a numeric vector
- x.103
- a numeric vector
- x.104
- a numeric vector
- x.105
- a numeric vector
- x.106
- a numeric vector
- x.107
- a numeric vector
- x.108
- a numeric vector
- x.109
- a numeric vector
- x.110
- a numeric vector
- x.111
- a numeric vector
- x.112
- a numeric vector
- x.113
- a numeric vector
- x.114
- a numeric vector
- x.115
- a numeric vector
- x.116
- a numeric vector
- x.117
- a numeric vector
- x.118
- a numeric vector
- x.119
- a numeric vector
- x.120
- a numeric vector
- x.121
- a numeric vector
- x.122
- a numeric vector
- x.123
- a numeric vector
- x.124
- a numeric vector
- x.125
- a numeric vector
- x.126
- a numeric vector
- x.127
- a numeric vector
- x.128
- a numeric vector
- x.129
- a numeric vector
- x.130
- a numeric vector
- x.131
- a numeric vector
- x.132
- a numeric vector
- x.133
- a numeric vector
- x.134
- a numeric vector
- x.135
- a numeric vector
- x.136
- a numeric vector
- x.137
- a numeric vector
- x.138
- a numeric vector
- x.139
- a numeric vector
- x.140
- a numeric vector
- x.141
- a numeric vector
- x.142
- a numeric vector
- x.143
- a numeric vector
- x.144
- a numeric vector
- x.145
- a numeric vector
- x.146
- a numeric vector
- x.147
- a numeric vector
- x.148
- a numeric vector
- x.149
- a numeric vector
- x.150
- a numeric vector
- x.151
- a numeric vector
- x.152
- a numeric vector
- x.153
- a numeric vector
- x.154
- a numeric vector
- x.155
- a numeric vector
- x.156
- a numeric vector
- x.157
- a numeric vector
- x.158
- a numeric vector
- x.159
- a numeric vector
- x.160
- a numeric vector
- x.161
- a numeric vector
- x.162
- a numeric vector
- x.163
- a numeric vector
- x.164
- a numeric vector
- x.165
- a numeric vector
- x.166
- a numeric vector
- x.167
- a numeric vector
- x.168
- a numeric vector
- x.169
- a numeric vector
- x.170
- a numeric vector
- x.171
- a numeric vector
- x.172
- a numeric vector
- x.173
- a numeric vector
- x.174
- a numeric vector
- x.175
- a numeric vector
- x.176
- a numeric vector
- x.177
- a numeric vector
- x.178
- a numeric vector
- x.179
- a numeric vector
- x.180
- a numeric vector
- x.181
- a numeric vector
- x.182
- a numeric vector
- x.183
- a numeric vector
- x.184
- a numeric vector
- x.185
- a numeric vector
- x.186
- a numeric vector
- x.187
- a numeric vector
- x.188
- a numeric vector
- x.189
- a numeric vector
- x.190
- a numeric vector
- x.191
- a numeric vector
- x.192
- a numeric vector
- x.193
- a numeric vector
- x.194
- a numeric vector
- x.195
- a numeric vector
- x.196
- a numeric vector
- x.197
- a numeric vector
- x.198
- a numeric vector
- x.199
- a numeric vector
- x.200
- a numeric vector
- x.201
- a numeric vector
- x.202
- a numeric vector
- x.203
- a numeric vector
- x.204
- a numeric vector
- x.205
- a numeric vector
- x.206
- a numeric vector
- x.207
- a numeric vector
- x.208
- a numeric vector
- x.209
- a numeric vector
- x.210
- a numeric vector
- x.211
- a numeric vector
- x.212
- a numeric vector
- x.213
- a numeric vector
- x.214
- a numeric vector
- x.215
- a numeric vector
- x.216
- a numeric vector
- x.217
- a numeric vector
- x.218
- a numeric vector
- x.219
- a numeric vector
- x.220
- a numeric vector
- x.221
- a numeric vector
- x.222
- a numeric vector
- x.223
- a numeric vector
- x.224
- a numeric vector
- x.225
- a numeric vector
- x.226
- a numeric vector
- x.227
- a numeric vector
- x.228
- a numeric vector
- x.229
- a numeric vector
- x.230
- a numeric vector
- x.231
- a numeric vector
- x.232
- a numeric vector
- x.233
- a numeric vector
- x.234
- a numeric vector
- x.235
- a numeric vector
- x.236
- a numeric vector
- x.237
- a numeric vector
- x.238
- a numeric vector
- x.239
- a numeric vector
- x.240
- a numeric vector
- x.241
- a numeric vector
- x.242
- a numeric vector
- x.243
- a numeric vector
- x.244
- a numeric vector
- x.245
- a numeric vector
- x.246
- a numeric vector
- x.247
- a numeric vector
- x.248
- a numeric vector
- x.249
- a numeric vector
- x.250
- a numeric vector
- x.251
- a numeric vector
- x.252
- a numeric vector
- x.253
- a numeric vector
- x.254
- a numeric vector
- x.255
- a numeric vector
- x.256
- a numeric vector
- g
- a factor with levels
aa
ao
dcl
iy
sh
- speaker
- a factor with 437 levels
Details
These data arose from a collaboration between Andreas Buja, Werner
Stuetzle and Martin Maechler, and we used as an illustration in the
paper on Penalized Discriminant Analysis by Hastie, Buja and
Tibshirani (1995), referenced in the text.
The data were extracted from the TIMIT database (TIMIT
Acoustic-Phonetic Continuous Speech Corpus, NTIS, US Dept of Commerce)
which is a widely used resource for research in speech recognition. A
dataset was formed by selecting five phonemes for
classification based on digitized speech from this database. The
phonemes are transcribed as follows: "sh" as in "she", "dcl" as in
"dark", "iy" as the vowel in "she", "aa" as the vowel in "dark", and
"ao" as the first vowel in "water". From continuous speech of 50 male
speakers, 4509 speech frames of 32 msec duration were selected,
approximately 2 examples of each phoneme from each speaker. Each
speech frame is represented by 512 samples at a 16kHz sampling rate,
and each frame represents one of the above five phonemes. The
breakdown of the 4509 speech frames into phoneme frequencies is as
follows:
aa ao dcl iy sh
695 1022 757 1163 872
From each speech frame, we computed a log-periodogram, which is one of
several widely used methods for casting speech data in a form suitable
for speech recognition. Thus the data used in what follows consist of
4509 log-periodograms of length 256, with known class (phoneme)
memberships.
The data contain 256 columns labelled "x.1" - "x.256", a response
column labelled "g", and a column labelled "speaker" identifying the
different speakers.
Examples
head(str(phoneme))
[Package
ElemStatLearn version 0.1-6
Index]