haplo.long {SimHap} | R Documentation |
haplo.long
performes a series of linear mixed effects models using a simulation-based approach to account for uncertainty in haplotype assignment when phase is unknown.
haplo.long(fixed, random, pheno, haplo, cor="corCAR1", value = 0.2, form=~1, sim, effect = "add", sub = NULL)
fixed |
as per lme . A two-sided linear formula object describing the fixed-effects part of the model including SNP parameters, with the response on the left of a ~ operator and the terms, separated by + operators. |
random |
as per lme . A one-sided formula of the form ~x1+...+xn | g1/.../gm , with x1+...+xn specifying the model for the random effects and g1/.../gm the grouping structure (m may be equal to 1, in which case no / is required). The random effects formula will be repeated for all levels of grouping, in the case of multiple levels of grouping. |
pheno |
a dataframe containing phenotype data. |
haplo |
a haplotype object made by make.haplo.rare . |
cor |
a corStruct object describing the within-group correlation structure. Available correlation structures are corAR1 , corCAR1 , and corCompSymm . See the documentation of corClasses for a description of these. Defaults to corCAR1 . |
value |
for corAR1 - the value of the lag 1 autocorrelation, which must be between -1 and 1. For corCAR1 - the correlation between two observations one unit of time apart. Must be between 0 and 1. For corCompSymm - the correlation between any two correlated observations. Defaults to 0.2 . |
form |
a one sided formula of the form ~ t , or ~ t | g , specifying a time covariate t and, optionally, a grouping factor g . A covariate for this correlation structure must be integer valued. When a grouping factor is present in form , the correlation structure is assumed to apply only to observations within the same grouping level; observations with different grouping levels are assumed to be uncorrelated. Defaults to ~ 1 , which corresponds to using the order of the observations in the data as a covariate, and no groups. |
sim |
the number of simulations from which to evaluate the results. |
effect |
the haplotypic effect type: "add" for additive, "dom" for dominant and "rec" for recessive. Defaults to additive. See note. |
sub |
optional. An expression representing a subset of individuals on which to perform analysis. e.g. sub=expression(sex==1) . |
haplo.long
returns an object of class hapLong
.
The summary
function can be used to obtain and print a
summary of the results.
An object of class hapLong
is a list containing the
following components:
fixed_formula |
fixed effects formula. |
random_formula |
random effects formula. |
results |
a table containing the coefficients, standard errors and p-values of the parameter estimates, averaged over the sim models performed. |
empiricalResults |
a list containing the coefficients, standard errors and p-values calculated at each simulation. |
ANOD |
analysis of deviance table for the fitted model. |
logLik |
the log-likelihood for the fitted model. |
AIC |
Akaike Information Criterion for the linear model fit using formula. |
aicEmpirical |
Akaike Information Criteria calculated at each simulation. |
corStruct |
correlation structure used in the fitted model. |
effect |
the haplotypic effect modelled, `ADDITIVE', `DOMINANT' or `RECESSIVE' |
To model a codominant haplotypic effect, define the desired haplotype as a factor in the formula1
argument. e.g. factor(h.AAA)
, and use the default option for effect
Pamela A. McCaskie
Bates, D.M., Pinheiro, J.C. (1998) Computational methods for multilevel models. Available in PostScript or PDF formats at http://franz.stat.wisc.edu/pub/NLME/
Box, G.E.P., Jenkins, G.M., Reinsel, G.C. (1994) Time Series Analysis: Forecasting and Control, 3rd Edition, Holden-Day.
Davidian, M., Giltinan, D.M. (1995) Nonlinear Mixed Effects Models for Repeated Measurement Data, Chapman and Hall.
Laird, N.M., Ware, J.H. (1982) Random-Effects Models for Longitudinal Data, Biometrics, 38, 963-974.
Lindstrom, M.J., Bates, D.M. (1988) Newton-Raphson and EM Algorithms for Linear Mixed-Effects Models for Repeated-Measures Data, Journal of the American Statistical Association, 83, 1014-1022.
Littel, R.C., Milliken, G.A., Stroup, W.W., Wolfinger, R.D. (1996) SAS Systems for Mixed Models, SAS Institute.
Little, R.J.A., Rubin, D.B. (2002) Statistical Analysis with Missing Data. John Wiley and Sons, New Jersey.
McCaskie, P.A., Carter, K.W, Hazelton, M., Palmer, L.J. (2007) SimHap: A comprehensive modeling framework for epidemiological outcomes and a multiple imputation approach to haplotypic analysis of population-based data, [online] www.genepi.org.au/simhap.
Pinheiro, J.C., Bates, D.M. (1996) Unconstrained Parametrizations for Variance-Covariance Matrices, Statistics and Computing, 6, 289-296.
Pinheiro, J.C., Bates, D.M. (2000) Mixed-Effects Models in S and S-PLUS, Springer.
Rubin, D.B. (1996) Multiple imputation after 18+ years (with discussion). Journal of the American Statistical Society, 91:473-489.
snp.long
, haplo.quant
, haplo.quant
, haplo.long
data(SNPlong.dat) # convert SNP.dat to format required by infer.haplos longHaplo.dat <- SNP2Haplo(SNPlong.dat) data(longPheno.dat) # generate haplotype frequencies and haplotype design matrix myinfer<-infer.haplos(longHaplo.dat) # print haplotype frequencies generated by infer.haplos myinfer$hap.freq # generate haplo object where haplotypes with a frequency # below min.freq are grouped as a category called "rare" myhaplo<-make.haplo.rare(myinfer,min.freq=0.05) mymodel <- haplo.long(fixed=fev1f~h.ACV2, random=~1|id, pheno=longPheno.dat, haplo=myhaplo, form=~year|id, sim=10) summary(mymodel) # example with a subsetting variable - looking at males only mymodel <- haplo.long(fixed=fev1f~height+h.ACV2, random=~1|id, pheno=longPheno.dat, haplo=myhaplo, form=~year|id, sim=10, sub=expression(sex==1)) summary(mymodel)