HSB {candisc} | R Documentation |
The High School and Beyond Project was a longitudinal study of students in the U.S. carried out in 1980 by the National Center for Education Statistics. Data were collected from 58,270 high school students (28,240 seniors and 30,030 sophomores) and 1,015 secondary schools. The HSB data frame is sample of 600 observations, of unknown characteristics, originally taken from Tatsuoka (1988).
data(HSB)
A data frame with 600 observations on the following 15 variables. There is no missing data.
id
gender
male
female
race
hispanic
asian
african-amer
white
ses
low
middle
high
sch
public
private
prog
general
academic
vocation
locus
concept
mot
career
clerical
craftsman
farmer
homemaker
laborer
manager
military
operative
prof1
prof2
proprietor
protective
sales
school
service
technical
not working
read
write
math
sci
ss
Tatsuoka, M. M. (1988). Multivariate Analysis: Techniques for Educational and Psychological Research (2nd ed.). New York: Macmillan, Appendix F, 430-442.
Retrieved from: http://www.gseis.ucla.edu/courses/data/hbs6.dta
High School and Beyond data files: http://www.sscnet.ucla.edu/issr/da/index/techinfo/I78961.HTM
str(HSB) # main effects model hsb.mod <- lm( cbind(read, write, math, sci, ss) ~ gender + race + ses + sch + prog, data=HSB) Anova(hsb.mod) # Add some interactions hsb.mod1 <- update(hsb.mod, . ~ . + gender:race + ses:prog) heplot(hsb.mod1, col=palette()[c(2,1,3:6)], variables=c("read","math")) hsb.can1 <- candisc(hsb.mod1, term="race") heplot(hsb.can1, col=c("red", "black")) # show canonical results for all terms hsb.can <- candiscList(hsb.mod) hsb.can