quanteda-package | An R package for the quantitative analysis of textual data. |
as.data.frame-method | Virtual class "dfm" for a document-feature matrix |
as.dfm | create a document-feature matrix |
as.DocumentTermMatrix | convert a dfm to a non-quanteda format |
as.DocumentTermMatrix.dfm | convert a dfm to a non-quanteda format |
as.matrix-method | Virtual class "dfm" for a document-feature matrix |
as.wfm | convert a dfm to a non-quanteda format |
as.wfm.dfm | convert a dfm to a non-quanteda format |
bigrams | Create bigrams |
changeunits | change the document units of a corpus |
clean | simple cleaning of text before processing |
clean.character | simple cleaning of text before processing |
clean.corpus | simple cleaning of text before processing |
cleanC | simple cleaning of text before processing |
collocations | Detect collocations from text |
collocations.character | Detect collocations from text |
collocations.corpus | Detect collocations from text |
colSums-method | Virtual class "dfm" for a document-feature matrix |
convert | convert a dfm to a non-quanteda format |
convert.dfm | convert a dfm to a non-quanteda format |
corpus | constructor for corpus objects |
corpus.character | constructor for corpus objects |
corpus.corpusSource | constructor for corpus objects |
corpus.VCorpus | constructor for corpus objects |
corpusSource-class | corpus source classes |
describeTexts | summarize a corpus or a vector of texts |
dfm | create a document-feature matrix |
dfm-class | Virtual class "dfm" for a document-feature matrix |
dfm.character | create a document-feature matrix |
dfm.corpus | create a document-feature matrix |
dfm2ldaformat | convert a dfm to a non-quanteda format |
dfm2ldaformat.dfm | convert a dfm to a non-quanteda format |
dfmDense-class | Virtual class "dfm" for a document-feature matrix |
dfmSparse-class | Virtual class "dfm" for a document-feature matrix |
dictionary | create a dictionary |
dictionary-class | create a dictionary |
docfreq | get the document frequency of a feature |
docfreq-method | get the document frequency of a feature |
docnames | get or set document names |
docnames.corpus | get or set document names |
docnames.dfm | get or set document names |
docnames<- | get or set document names |
docvars | get or set document-level variables |
docvars<- | get or set document-level variables |
encoding | get the encoding of documents in a corpus |
encoding<- | get the encoding of documents in a corpus |
englishSyllables | count syllables in a text |
exampleString | A paragraph of text for testing various text-based functions |
features | extract the feature labels from a dfm |
features.dfm | extract the feature labels from a dfm |
ie2010Corpus | Irish budget speeches from 2010 |
iebudgets | Irish budget speeches from 2010 |
inaugCorpus | A corpus of US presidential inaugural addresses from 1789-2013 |
inaugTexts | A corpus of US presidential inaugural addresses from 1789-2013 |
is.corpus | constructor for corpus objects |
is.dfm | create a document-feature matrix |
kwic | List key words in context from a text or a corpus of texts. |
kwic.character | List key words in context from a text or a corpus of texts. |
kwic.corpus | List key words in context from a text or a corpus of texts. |
language | get or set the language of corpus documents |
language<- | get or set the language of corpus documents |
LBGexample | dfm containing example data from Table 1 of Laver, Benoit and Garry (2003) |
lexdiv | calculate lexical diversity |
lexdiv.dfm | calculate lexical diversity |
lexdiv.numeric | calculate lexical diversity |
metacorpus | get or set corpus metadata |
metacorpus<- | get or set corpus metadata |
metadoc | get or set document-level metadata |
metadoc<- | get or set document-level metadata |
ndoc | get the number of documents or features |
ndoc.corpus | get the number of documents or features |
ndoc.dfm | get the number of documents or features |
nfeature | get the number of documents or features |
nfeature.corpus | get the number of documents or features |
nfeature.dfm | get the number of documents or features |
ngrams | Create ngrams |
ntoken | count the number of tokens |
ntoken.character | count the number of tokens |
ntoken.corpus | count the number of tokens |
phrasetotoken | convert phrases into single tokens |
phrasetotoken-method | convert phrases into single tokens |
phrasetotoken.corpus | convert phrases into single tokens |
plot.dfm | plot features as a wordcloud |
predict.textmodel_wordscores_fitted | Wordscores text model |
print-method | print a dfm object |
print.dfm | print a dfm object |
print.settings | Get or set the corpus settings |
print.textmodel_wordfish_fitted | wordfish text model |
print.textmodel_wordscores_fitted | Wordscores text model |
print.textmodel_wordscores_predicted | Wordscores text model |
quantedaformat2dtm | convert a dfm to a non-quanteda format |
quantedaformat2dtm.dfm | convert a dfm to a non-quanteda format |
removeFeatures | remove features from an object |
removeFeatures.character | remove features from an object |
removeFeatures.collocations | remove features from an object |
removeFeatures.dfm | remove features from an object |
rowSums-method | Virtual class "dfm" for a document-feature matrix |
segment | segment texts into component elements |
segment.character | segment texts into component elements |
segment.corpus | segment texts into component elements |
settings | Get or set the corpus settings |
settings.corpus | Get or set the corpus settings |
settings.default | Get or set the corpus settings |
settings.dfm | Get or set the corpus settings |
settings<- | Get or set the corpus settings |
show-method | print a dfm object |
show-method | wordfish text model |
show-method | Wordscores text model |
similarity | compute similarities between documents and/or features |
similarity-method | compute similarities between documents and/or features |
smoother | Weight the feature frequencies in a dfm by various methods |
sort.dfm | sort a dfm by one or more margins |
stopwords | access built-in stopwords |
stopwordsGet | access built-in stopwords |
stopwordsRemove | remove features from an object |
subset.corpus | extract a subset of a corpus |
summary.character | summarize a corpus or a vector of texts |
summary.corpus | summarize a corpus or a vector of texts |
syllables | count syllables in a text |
syllables.character | count syllables in a text |
t-method | Virtual class "dfm" for a document-feature matrix |
textfile | read a text corpus source from a file |
textfile-method | read a text corpus source from a file |
textmodel | fit a text model |
textmodel-method | fit a text model |
textmodel_ca | correspondence analysis of a document-feature matrix |
textmodel_fitted-class | the fitted textmodel classes |
textmodel_wordfish | wordfish text model |
textmodel_wordfish_fitted-class | the fitted textmodel classes |
textmodel_wordfish_predicted-class | the fitted textmodel classes |
textmodel_wordscores | Wordscores text model |
textmodel_wordscores_fitted-class | the fitted textmodel classes |
textmodel_wordscores_predicted-class | the fitted textmodel classes |
texts | get or set corpus texts |
texts<- | get or set corpus texts |
tf | Weight the feature frequencies in a dfm by various methods |
tfidf | Weight the feature frequencies in a dfm by various methods |
tokenise | tokenize a set of texts |
tokenize | tokenize a set of texts |
tokenize.character | tokenize a set of texts |
tokenize.corpus | tokenize a set of texts |
tokenizeOnly | tokenizeOnly |
tokenizeOnlyCppKB | tokenizeOnly |
tokenizeOnlyScan | tokenizeOnly |
topFeatures | list the most frequent features |
topfeatures | list the most frequent features |
topfeatures.dfm | list the most frequent features |
topfeatures.dgCMatrix | list the most frequent features |
trim | Trim a dfm using threshold-based or random feature selection |
trim-method | Trim a dfm using threshold-based or random feature selection |
trimdfm | Trim a dfm using threshold-based or random feature selection |
ukimmigTexts | Immigration-related sections of 2010 UK party manifestos |
weight | Weight the feature frequencies in a dfm by various methods |
weight-method | Weight the feature frequencies in a dfm by various methods |
weighting | Weight the feature frequencies in a dfm by various methods |
weighting-method | Weight the feature frequencies in a dfm by various methods |
wordstem | stem words |
+-method | Virtual class "dfm" for a document-feature matrix |
+.corpus | constructor for corpus objects |
.stopwords | access built-in stopwords |
[-method | Virtual class "dfm" for a document-feature matrix |
[.dfm | Virtual class "dfm" for a document-feature matrix |