writtenVariationLijk {languageR} | R Documentation
This dataset documents variation in the use of the 80 most frequent words ending in the suffix -lijk in written Dutch.
data(writtenVariationLijk)
A data frame with 560 observations on the following 5 variables.
Corpus
    the newspaper: belang (Het Belang van Limburg), gazet (De Gazet van Antwerpen), laatnieu (Het Laatste Nieuws), limburg (De Limburger), nrc (NRC Handelsblad), stand (De Standaard), and tele (De Telegraaf).

Word
    one of the 80 words ending in -lijk.

Count
    the frequency of the word in the newspaper.

Country
    Flanders and Netherlands.

Register
    National, Quality and Regional, coding the type of newspaper.

Keune, K., Ernestus, M., Van Hout, R. and Baayen, R.H. (2005) Social, geographical, and register variation in Dutch: From written 'mogelijk' to spoken 'mok', Corpus Linguistics and Linguistic Theory, 1, 183-223.
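## Not run: 
data(writtenVariationLijk)
library(lme4)

# Poisson mixed model with by-word random intercepts.
# glmer() replaces the deprecated lmer(..., family = ) call; its default
# (nAGQ = 1) is the Laplace approximation, so no method argument is needed.
writtenVariationLijk.lmer = glmer(Count ~ Country * Register + (1|Word),
  data = writtenVariationLijk, family = "poisson")

# Same fixed effects, adding by-word random slopes for Country.
writtenVariationLijk.lmerA = glmer(Count ~ Country * Register + (Country|Word),
  data = writtenVariationLijk, family = "poisson")

# Likelihood ratio test comparing the two random-effects structures.
anova(writtenVariationLijk.lmer, writtenVariationLijk.lmerA)
writtenVariationLijk.lmerA
## End(Not run)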
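To check the layout of the data frame described above, the following is a minimal sketch. It assumes the languageR package is installed and uses only base R functions; the object name `design` is introduced here for illustration and is not part of the package.

```r
library(languageR)
data(writtenVariationLijk)

# Dimensions and column types of the data frame.
str(writtenVariationLijk)

# List the Country and Register recorded for each newspaper.
design = unique(writtenVariationLijk[, c("Corpus", "Country", "Register")])
design

# Number of rows contributed by each newspaper.
table(writtenVariationLijk$Corpus)
```

Looking at `design` before fitting shows which Country by Register combinations are actually present in the data, which is useful when interpreting the interaction term in the models.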