Quantitative Analysis of Textual Data


[Up] [Top]

Documentation for package ‘quanteda’ version 0.7.2-1

Help Pages

A B C D E F I K L M N P Q R S T U W misc

quanteda-package An R package for the quantitative analysis of textual data.

-- A --

as.data.frame-method Virtual class "dfm" for a document-feature matrix
as.dfm create a document-feature matrix
as.DocumentTermMatrix convert a dfm to a non-quanteda format
as.DocumentTermMatrix.dfm convert a dfm to a non-quanteda format
as.matrix-method Virtual class "dfm" for a document-feature matrix
as.wfm convert a dfm to a non-quanteda format
as.wfm.dfm convert a dfm to a non-quanteda format

-- B --

bigrams Create bigrams

-- C --

changeunits change the document units of a corpus
clean simple cleaning of text before processing
clean.character simple cleaning of text before processing
clean.corpus simple cleaning of text before processing
cleanC simple cleaning of text before processing
collocations Detect collocations from text
collocations.character Detect collocations from text
collocations.corpus Detect collocations from text
colSums-method Virtual class "dfm" for a document-feature matrix
convert convert a dfm to a non-quanteda format
convert.dfm convert a dfm to a non-quanteda format
corpus constructor for corpus objects
corpus.character constructor for corpus objects
corpus.corpusSource constructor for corpus objects
corpus.VCorpus constructor for corpus objects
corpusSource-class corpus source classes

-- D --

describeTexts summarize a corpus or a vector of texts
dfm create a document-feature matrix
dfm-class Virtual class "dfm" for a document-feature matrix
dfm.character create a document-feature matrix
dfm.corpus create a document-feature matrix
dfm2ldaformat convert a dfm to a non-quanteda format
dfm2ldaformat.dfm convert a dfm to a non-quanteda format
dfmDense-class Virtual class "dfm" for a document-feature matrix
dfmSparse-class Virtual class "dfm" for a document-feature matrix
dictionary create a dictionary
dictionary-class create a dictionary
docfreq get the document frequency of a feature
docfreq-method get the document frequency of a feature
docnames get or set document names
docnames.corpus get or set document names
docnames.dfm get or set document names
docnames<- get or set document names
docvars get or set for document-level variables
docvars<- get or set for document-level variables

-- E --

encoding get the encoding of documents in a corpus
encoding<- get the encoding of documents in a corpus
englishSyllables count syllables in a text
exampleString A paragraph of text for testing various text-based functions

-- F --

features extract the feature labels from a dfm
features.dfm extract the feature labels from a dfm

-- I --

ie2010Corpus Irish budget speeches from 2010
iebudgets Irish budget speeches from 2010
inaugCorpus A corpus of US presidential inaugural addresses from 1789-2013
inaugTexts A corpus of US presidential inaugural addresses from 1789-2013
is.corpus constructor for corpus objects
is.dfm create a document-feature matrix

-- K --

kwic List key words in context from a text or a corpus of texts.
kwic.character List key words in context from a text or a corpus of texts.
kwic.corpus List key words in context from a text or a corpus of texts.

-- L --

language get or set the language of corpus documents
language<- get or set the language of corpus documents
LBGexample dfm containing example data from Table 1 of Laver Benoit and Garry (2003)
lexdiv calculate lexical diversity
lexdiv.dfm calculate lexical diversity
lexdiv.numeric calculate lexical diversity

-- M --

metacorpus get or set corpus metadata
metacorpus<- get or set corpus metadata
metadoc get or set document-level meta-data
metadoc<- get or set document-level meta-data

-- N --

ndoc get the number of documents or features
ndoc.corpus get the number of documents or features
ndoc.dfm get the number of documents or features
nfeature get the number of documents or features
nfeature.corpus get the number of documents or features
nfeature.dfm get the number of documents or features
ngrams Create ngrams
ntoken count the number of tokens
ntoken.character count the number of tokens
ntoken.corpus count the number of tokens

-- P --

phrasetotoken convert phrases into single tokens
phrasetotoken-method convert phrases into single tokens
phrasetotoken.corpus convert phrases into single tokens
plot.dfm plot features as a wordcloud
predict.textmodel_wordscores_fitted Wordscores text model
print-method print a dfm object
print.dfm print a dfm object
print.settings Get or set the corpus settings
print.textmodel_wordfish_fitted wordfish text model
print.textmodel_wordscores_fitted Wordscores text model
print.textmodel_wordscores_predicted Wordscores text model

-- Q --

quantedaformat2dtm convert a dfm to a non-quanteda format
quantedaformat2dtm.dfm convert a dfm to a non-quanteda format

-- R --

removeFeatures remove features from an object
removeFeatures.character remove features from an object
removeFeatures.collocations remove features from an object
removeFeatures.dfm remove features from an object
rowSums-method Virtual class "dfm" for a document-feature matrix

-- S --

segment segment texts into component elements
segment.character segment texts into component elements
segment.corpus segment texts into component elements
settings Get or set the corpus settings
settings.corpus Get or set the corpus settings
settings.default Get or set the corpus settings
settings.dfm Get or set the corpus settings
settings<- Get or set the corpus settings
show-method print a dfm object
show-method wordfish text model
show-method Wordscores text model
similarity compute similarities between documents and/or features
similarity-method compute similarities between documents and/or features
smoother Weight the feature frequencies in a dfm by various methods
sort.dfm sort a dfm by one or more margins
stopwords access built-in stopwords
stopwordsGet access built-in stopwords
stopwordsRemove remove features from an object
subset.corpus extract a subset of a corpus
summary.character summarize a corpus or a vector of texts
summary.corpus summarize a corpus or a vector of texts
syllables count syllables in a text
syllables.character count syllables in a text

-- T --

t-method Virtual class "dfm" for a document-feature matrix
textfile read a text corpus source from a file
textfile-method read a text corpus source from a file
textmodel fit a text model
textmodel-method fit a text model
textmodel_ca correspondence analysis of a document-feature matrix
textmodel_fitted-class the fitted textmodel classes
textmodel_wordfish wordfish text model
textmodel_wordfish_fitted-class the fitted textmodel classes
textmodel_wordfish_predicted-class the fitted textmodel classes
textmodel_wordscores Wordscores text model
textmodel_wordscores_fitted-class the fitted textmodel classes
textmodel_wordscores_predicted-class the fitted textmodel classes
texts get or set corpus texts
texts<- get or set corpus texts
tf Weight the feature frequencies in a dfm by various methods
tfidf Weight the feature frequencies in a dfm by various methods
tokenise tokenize a set of texts
tokenize tokenize a set of texts
tokenize.character tokenize a set of texts
tokenize.corpus tokenize a set of texts
tokenizeOnly tokenizeOnly
tokenizeOnlyCppKB tokenizeOnly
tokenizeOnlyScan tokenizeOnly
topFeatures list the most frequent features
topfeatures list the most frequent features
topfeatures.dfm list the most frequent features
topfeatures.dgCMatrix list the most frequent features
trim Trim a dfm using threshold-based or random feature selection
trim-method Trim a dfm using threshold-based or random feature selection
trimdfm Trim a dfm using threshold-based or random feature selection

-- U --

ukimmigTexts Immigration-related sections of 2010 UK party manifestos

-- W --

weight Weight the feature frequencies in a dfm by various methods
weight-method Weight the feature frequencies in a dfm by various methods
weighting Weight the feature frequencies in a dfm by various methods
weighting-method Weight the feature frequencies in a dfm by various methods
wordstem stem words

-- misc --

+-method Virtual class "dfm" for a document-feature matrix
+.corpus constructor for corpus objects
.stopwords access built-in stopwords
[-method Virtual class "dfm" for a document-feature matrix
[.dfm Virtual class "dfm" for a document-feature matrix