births2006.smpl {nutshell}    R Documentation

Births in the United States, 2006

Description

This data set contains information on babies born in the United States during 2006, with one record per birth. It is a random ten percent sample of the original data.

This data set is used as an example in the book "R in a Nutshell" from O'Reilly Media. The indices of the observations drawn from the original data set for this sample are stored in births2006.idx.
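The relationship between a sample and its index vector can be sketched as follows. This is a minimal illustration on a toy data frame, not the code used to build the package data; the names `full`, `idx`, and `smpl`, the seed, and the toy values are all hypothetical.

```r
# Toy stand-in for a full data set (values are made up for illustration).
set.seed(1)
full <- data.frame(id = 1:1000,
                   DBWT = round(rnorm(1000, mean = 3300, sd = 500)))

# Draw a random ten percent of the row indices, without replacement.
idx <- sort(sample(nrow(full), size = round(0.1 * nrow(full))))

# Keeping the index vector allows the sample to be rebuilt from the full data.
smpl <- full[idx, ]
nrow(smpl)
```

Storing the indices rather than only the sampled rows makes it possible to trace each sampled record back to its position in the original data.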

Usage

data(births2006.smpl)

Format

A data frame with 427323 observations on the following 13 variables.

DOB_MM
Month of date of birth
DOB_WK
Day of the week of birth
MAGER
Mother's age
TBO_REC
Total birth order
WTGAIN
Weight gain by mother
SEX
a factor with levels F M, representing the sex of the child
APGAR5
Five-minute APGAR score
DMEDUC
Mother's education level
UPREVIS
Number of prenatal visits
ESTGEST
Estimated weeks of gestation
DMETH_REC
Delivery method
DPLURAL
Plural births; a factor whose levels include 1 Single, 2 Twin, 3 Triplet, 4 Quadruplet, and 5 Quintuplet or higher
DBWT
Birth weight, in grams

Source

http://www.cdc.gov/nchs/data_access/Vitalstatsonline.htm

Examples

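# Load the sample and the lattice graphics package
data(births2006.smpl)
library(lattice)
# Histogram of birth weight, one panel per plurality level
histogram(~ DBWT | DPLURAL, data = births2006.smpl)
# Overlaid density curves of birth weight, grouped by plurality
densityplot(~ DBWT, groups = DPLURAL, data = births2006.smpl,
  plot.points = FALSE, auto.key = TRUE)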
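
A numeric companion to the plots above might look like the sketch below, which computes the mean birth weight for each plurality level. The helper name `mean_weight_by_plurality` is hypothetical and not part of the nutshell package; `na.rm = TRUE` is included on the assumption that DBWT may contain missing values.

```r
# Hypothetical helper: mean birth weight (grams) per plurality level,
# ignoring any missing weights.
mean_weight_by_plurality <- function(births) {
  tapply(births$DBWT, births$DPLURAL, mean, na.rm = TRUE)
}

# Run on the packaged sample only if the nutshell package is installed.
if (requireNamespace("nutshell", quietly = TRUE)) {
  data(births2006.smpl, package = "nutshell")
  print(round(mean_weight_by_plurality(births2006.smpl)))
}
```

Because the helper takes the data frame as an argument, it can also be applied to any subset of the sample, for example to births in a single month.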

[Package nutshell version 1.0 Index]