Statistics and data sets for corpus frequency data


[Up] [Top]

Documentation for package ‘corpora’ version 0.4-3

Help Pages

corpora-package corpora: statistical inference from corpus frequency data
binom.pval P-values of the binomial test for frequency counts (corpora)
BNCbiber Biber's (1988) register features for the British National Corpus
BNCcomparison Comparison of written and spoken frequencies (BNC)
BNCdomains Distribution of domains in the British National Corpus (BNC)
BNCInChargeOf Collocations of the phrase "in charge of" (BNC)
BNCmeta Metadata for the British National Corpus (XML edition)
chisq Pearson's chi-squared statistic for frequency comparisons (corpora)
chisq.pval P-values of Pearson's chi-squared test for frequency comparisons (corpora)
cont.table Build contingency tables for frequency comparison (corpora)
corpora corpora: statistical inference from corpus frequency data
FakeCensus Simulated census data for examples and illustrations (corpora)
fisher.pval P-values of Fisher's exact test for frequency comparisons (corpora)
prop.cint Confidence interval for proportion based on frequency counts (corpora)
sample.df Random samples from data frames (corpora)
simulated.census Simulated census data for examples and illustrations (corpora)
simulated.wikipedia Simulated type and token counts for Wikipedia articles (corpora)
VSS A small corpus of very short stories with linguistic annotations
WackypediaStats Simulated type and token counts for Wikipedia articles (corpora)
z.score The z-score statistic for frequency counts (corpora)
z.score.pval P-values of the z-score test for frequency counts (corpora)