Text Mining Package


[Package List] [Top]

Documentation for package ‘tm’ version 0.3-4.1

User Guides and Package Vignettes

Read overview or browse directory.

Help Pages

A C D E F G H I L M N O P R S T U V W X misc

%IN% Methods for Function %IN% in Package ‘tm’
%IN%,TextDocument,Corpus-method Methods for Function %IN% in Package ‘tm’
%IN%-methods Methods for Function %IN% in Package ‘tm’

-- A --

acq 50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq
activateCluster Allow ‘tm’ to Use a Cluster If Available
appendElem Methods for Function appendElem in Package ‘tm’
appendElem,Corpus,TextDocument-method Methods for Function appendElem in Package ‘tm’
appendElem,TextRepository,Corpus-method Methods for Function appendElem in Package ‘tm’
appendElem-methods Methods for Function appendElem in Package ‘tm’
appendMeta Methods for Function appendMeta in Package ‘tm’
appendMeta,Corpus-method Methods for Function appendMeta in Package ‘tm’
appendMeta,TextRepository-method Methods for Function appendMeta in Package ‘tm’
appendMeta-methods Methods for Function appendMeta in Package ‘tm’
asPlain Methods for Function asPlain in Package ‘tm’
asPlain,NewsgroupDocument-method Methods for Function asPlain in Package ‘tm’
asPlain,PlainTextDocument-method Methods for Function asPlain in Package ‘tm’
asPlain,RCV1Document-method Methods for Function asPlain in Package ‘tm’
asPlain,Reuters21578Document-method Methods for Function asPlain in Package ‘tm’
asPlain,StructuredTextDocument-method Methods for Function asPlain in Package ‘tm’
asPlain,XMLTextDocument-method Methods for Function asPlain in Package ‘tm’
asPlain-methods Methods for Function asPlain in Package ‘tm’
Author Text Document
Author,TextDocument-method Text Document
Author<-,TextDocument-method Text Document

-- C --

c,Corpus-method Methods for Function c in Package ‘tm’
c,TextDocument-method Methods for Function c in Package ‘tm’
c-methods Methods for Function c in Package ‘tm’
Cached Plain Text Document
Cached,NewsgroupDocument-method Newsgroup Text Document
Cached,PlainTextDocument-method Plain Text Document
Cached,StructuredTextDocument-method Structured Text Document
Cached,XMLTextDocument-method Text document
Cached<-,NewsgroupDocument-method Newsgroup Text Document
Cached<-,PlainTextDocument-method Plain Text Document
Cached<-,StructuredTextDocument-method Structured Text Document
Cached<-,XMLTextDocument-method Text document
CMetaData Corpus
CMetaData,Corpus-method Corpus
coerce,list,Corpus-method Corpus
colnames.DocumentTermMatrix Row, Column, Dim Names, Document IDs, and Terms
colnames.TermDocumentMatrix Row, Column, Dim Names, Document IDs, and Terms
Content Plain Text Document
Content,NewsgroupDocument-method Newsgroup Text Document
Content,PlainTextDocument-method Plain Text Document
Content,StructuredTextDocument-method Structured Text Document
Content,XMLTextDocument-method Text document
Content<-,NewsgroupDocument-method Newsgroup Text Document
Content<-,PlainTextDocument-method Plain Text Document
Content<-,StructuredTextDocument-method Structured Text Document
Content<-,XMLTextDocument-method Text document
convertMboxEml Convert E-Mails From mbox Format To eml Format
convertRCV1Plain Transform a RCV1 Document to a Plain Text Document
convertReut21578XMLPlain Transform a Reuters21578 XML Document to a Plain Text Document
Corpus Corpus
Corpus,Source-method Corpus
Corpus-class Corpus
crude 20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude
CSVSource Data Frame Source

-- D --

DataframeSource Data Frame Source
DataframeSource-class Source for Data Frames
DateTimeStamp Text Document
DateTimeStamp,TextDocument-method Text Document
DateTimeStamp<-,TextDocument-method Text Document
DBControl Corpus
DBControl,Corpus-method Corpus
deactivateCluster Disallow ‘tm’ to Use a Cluster
Description Text Document
Description,TextDocument-method Text Document
Description<-,TextDocument-method Text Document
Dictionary Dictionary
Dictionary-class Dictionary
Dictionary.character Dictionary
Dictionary.TermDocumentMatrix Dictionary
dim.DocumentTermMatrix The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
dim.TermDocumentMatrix The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
dimnames.DocumentTermMatrix Row, Column, Dim Names, Document IDs, and Terms
dimnames.TermDocumentMatrix Row, Column, Dim Names, Document IDs, and Terms
DirSource Directory Source
DirSource,character-method Directory Source
DirSource-class Source for Directories
dissimilarity Dissimilarity
DMetaData Corpus
DMetaData,Corpus-method Corpus
DMetaData<-,Corpus-method Corpus
Docs Row, Column, Dim Names, Document IDs, and Terms
DocumentTermMatrix Term-Document Matrix
DublinCore Methods for Function DublinCore in Package ‘tm’
DublinCore,TextDocument-method Methods for Function DublinCore in Package ‘tm’
DublinCore-methods Methods for Function DublinCore in Package ‘tm’
DublinCore<-,TextDocument-method Methods for Function DublinCore in Package ‘tm’

-- E --

eoi Methods for Function eoi in Package ‘tm’
eoi,DataframeSource-method Methods for Function eoi in Package ‘tm’
eoi,DirSource-method Methods for Function eoi in Package ‘tm’
eoi,GmaneSource-method Methods for Function eoi in Package ‘tm’
eoi,ReutersSource-method Methods for Function eoi in Package ‘tm’
eoi,URISource-method Methods for Function eoi in Package ‘tm’
eoi,VectorSource-method Methods for Function eoi in Package ‘tm’
eoi,XMLSource-method Methods for Function eoi in Package ‘tm’
eoi-methods Methods for Function eoi in Package ‘tm’

-- F --

findAssocs Find Associations in a Term-Document Matrix
findFreqTerms Find Frequent Terms
FunctionGenerator Function Generator Constructor
FunctionGenerator,function-method Function Generator Constructor
FunctionGenerator-class Function Generator

-- G --

getElem Methods for Function getElem in Package ‘tm’
getElem,DataframeSource-method Methods for Function getElem in Package ‘tm’
getElem,DirSource-method Methods for Function getElem in Package ‘tm’
getElem,GmaneSource-method Methods for Function getElem in Package ‘tm’
getElem,ReutersSource-method Methods for Function getElem in Package ‘tm’
getElem,URISource-method Methods for Function getElem in Package ‘tm’
getElem,VectorSource-method Methods for Function getElem in Package ‘tm’
getElem,XMLSource-method Methods for Function getElem in Package ‘tm’
getElem-methods Methods for Function getElem in Package ‘tm’
getFilters Get Available Filters
getReaders Get Available Readers
getSources Get Available Sources
getTransformations Get Available Transformations
GmaneSource Gmane Source
GmaneSource,ANY-method Gmane Source
GmaneSource,character-method Gmane Source
GmaneSource-class Source for Gmane Feeds

-- H --

Heading Text Document
Heading,TextDocument-method Text Document
Heading<-,TextDocument-method Text Document

-- I --

ID Text Document
ID,TextDocument-method Text Document
ID<-,TextDocument-method Text Document
inspect Inspect Objects

-- L --

Language Text Document
Language,TextDocument-method Text Document
Language<-,TextDocument-method Text Document
length Methods for Function length in Package ‘tm’
length,Corpus-method Methods for Function length in Package ‘tm’
length,TextRepository-method Methods for Function length in Package ‘tm’
length-methods Methods for Function length in Package ‘tm’
loadDoc Methods for Function loadDoc in Package ‘tm’
loadDoc,NewsgroupDocument-method Methods for Function loadDoc in Package ‘tm’
loadDoc,PlainTextDocument-method Methods for Function loadDoc in Package ‘tm’
loadDoc,StructuredTextDocument-method Methods for Function loadDoc in Package ‘tm’
loadDoc,XMLTextDocument-method Methods for Function loadDoc in Package ‘tm’
loadDoc-methods Methods for Function loadDoc in Package ‘tm’
LocalMetaData Text Document
LocalMetaData,TextDocument-method Text Document

-- M --

makeChunks Split a Corpus into Chunks
materialize Materialize Lazy Mappings
meta Methods for Function meta in Package ‘tm’
meta,Corpus-method Methods for Function meta in Package ‘tm’
meta,TextDocument-method Methods for Function meta in Package ‘tm’
meta,TextRepository-method Methods for Function meta in Package ‘tm’
meta-methods Methods for Function meta in Package ‘tm’
meta<-,Corpus-method Methods for Function meta in Package ‘tm’
meta<-,TextDocument-method Methods for Function meta in Package ‘tm’
meta<-,TextRepository-method Methods for Function meta in Package ‘tm’
MetaDataNode-class Metadata Node

-- N --

ncol.DocumentTermMatrix The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
ncol.TermDocumentMatrix The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
nDocs The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
NewsgroupDocument-class Newsgroup Text Document
nrow.DocumentTermMatrix The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
nrow.TermDocumentMatrix The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
nTerms The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix

-- O --

Origin Text Document
Origin,TextDocument-method Text Document
Origin<-,TextDocument-method Text Document

-- P --

pGetElem Methods for Function pGetElem in Package ‘tm’
pGetElem,DataframeSource-method Methods for Function pGetElem in Package ‘tm’
pGetElem,DirSource-method Methods for Function pGetElem in Package ‘tm’
pGetElem,VectorSource-method Methods for Function pGetElem in Package ‘tm’
pGetElem-methods Methods for Function pGetElem in Package ‘tm’
PlainTextDocument-class Plain Text Document
plot.TermDocumentMatrix Visualize a Term-Document Matrix
preprocessReut21578XML Preprocess the Reuters21578 XML archive.
prescindMeta Methods for Function prescindMetadata in Package ‘tm’
prescindMeta,Corpus,character-method Methods for Function prescindMetadata in Package ‘tm’
prescindMeta-methods Methods for Function prescindMetadata in Package ‘tm’

-- R --

RCV1Document-class RCV1 Text Document
readDOC Read In a MS Word Document
readGmane Read In A Newsgroup Document
readHTML Read In a Simple HTML Document
readNewsgroup Read In a Newsgroup Document
readPDF Read In a PDF Document
readPlain Read In a Text Document
readRCV1 Read In a Reuters Corpus Volume 1 Document
readReut21578XML Read In a Reuters21578 XML Document
readTabular Read In a Text Document
readXML Read In an XML Document
removeCitation Methods for Function removeCitation in Package ‘tm’
removeCitation,PlainTextDocument-method Methods for Function removeCitation in Package ‘tm’
removeCitation-methods Methods for Function removeCitation in Package ‘tm’
removeMeta Methods for Function removeMeta in Package ‘tm’
removeMeta,Corpus-method Methods for Function removeMeta in Package ‘tm’
removeMeta,TextRepository-method Methods for Function removeMeta in Package ‘tm’
removeMeta-methods Methods for Function removeMeta in Package ‘tm’
removeMultipart Methods for Function removeMultipart in Package ‘tm’
removeMultipart,PlainTextDocument-method Methods for Function removeMultipart in Package ‘tm’
removeMultipart-methods Methods for Function removeMultipart in Package ‘tm’
removeNumbers Methods for Function removeNumbers in Package ‘tm’
removeNumbers,PlainTextDocument-method Methods for Function removeNumbers in Package ‘tm’
removeNumbers-methods Methods for Function removeNumbers in Package ‘tm’
removePunctuation Methods for Function removePunctuation in Package ‘tm’
removePunctuation,PlainTextDocument-method Methods for Function removePunctuation in Package ‘tm’
removePunctuation-methods Methods for Function removePunctuation in Package ‘tm’
removeSignature Methods for Function removeSignature in Package ‘tm’
removeSignature,PlainTextDocument-method Methods for Function removeSignature in Package ‘tm’
removeSignature-methods Methods for Function removeSignature in Package ‘tm’
removeSparseTerms Remove Sparse Terms from a Term-Document Matrix
removeWords Methods for Function removeWords in Package ‘tm’
removeWords,PlainTextDocument,character-method Methods for Function removeWords in Package ‘tm’
removeWords-methods Methods for Function removeWords in Package ‘tm’
replacePatterns Methods for Function replacePatterns in Package ‘tm’
replacePatterns,PlainTextDocument,character,character-method Methods for Function replacePatterns in Package ‘tm’
replacePatterns-methods Methods for Function replacePatterns in Package ‘tm’
RepoMetaData Text Repository
RepoMetaData,TextRepository-method Text Repository
Reuters21578Document-class Reuters21578 Text Document
ReutersSource Reuters Source
ReutersSource,ANY-method Reuters Source
ReutersSource,character-method Reuters Source
rownames.DocumentTermMatrix Row, Column, Dim Names, Document IDs, and Terms
rownames.TermDocumentMatrix Row, Column, Dim Names, Document IDs, and Terms

-- S --

searchFullText Methods for Function searchFullText in Package ‘tm’
searchFullText,PlainTextDocument,character-method Methods for Function searchFullText in Package ‘tm’
searchFullText-methods Methods for Function searchFullText in Package ‘tm’
sFilter Statement Filter
show,Corpus-method Methods for Function show in Package ‘tm’
show,PlainTextDocument-method Methods for Function show in Package ‘tm’
show,TermDocumentMatrix-method Methods for Function show in Package ‘tm’
show,TextRepository-method Methods for Function show in Package ‘tm’
show-methods Methods for Function show in Package ‘tm’
Source-class Source
stemCompletion Complete Stems
stemDoc Methods for Function stemDoc in Package ‘tm’
stemDoc,PlainTextDocument-method Methods for Function stemDoc in Package ‘tm’
stemDoc-methods Methods for Function stemDoc in Package ‘tm’
stepNext Methods for Function stepNext in Package ‘tm’
stepNext,Source-method Methods for Function stepNext in Package ‘tm’
stepNext-methods Methods for Function stepNext in Package ‘tm’
stopwords Multilingual Stopwords
stripWhitespace Methods for Function stripWhitespace in Package ‘tm’
stripWhitespace,PlainTextDocument-method Methods for Function stripWhitespace in Package ‘tm’
stripWhitespace-methods Methods for Function stripWhitespace in Package ‘tm’
StructuredTextDocument-class Structured Text Document
summary,Corpus-method Methods for Function summary in Package ‘tm’
summary,TermDocumentMatrix-method Methods for Function summary in Package ‘tm’
summary,TextRepository-method Methods for Function summary in Package ‘tm’
summary-methods Methods for Function summary in Package ‘tm’

-- T --

TermDocMatrix Term-Document Matrix
TermDocumentMatrix Term-Document Matrix
termFreq Term Frequency Vector
Terms Row, Column, Dim Names, Document IDs, and Terms
TextDocument-class Text Document
TextRepository Text Repository
TextRepository,Corpus-method Text Repository
TextRepository-class Text Repository
tmFilter Methods for Function tmFilter in Package ‘tm’
tmFilter,Corpus-method Methods for Function tmFilter in Package ‘tm’
tmFilter-methods Methods for Function tmFilter in Package ‘tm’
tmIndex Methods for Function tmIndex in Package ‘tm’
tmIndex,Corpus-method Methods for Function tmIndex in Package ‘tm’
tmIndex-methods Methods for Function tmIndex in Package ‘tm’
tmIntersect Methods for Function tmIntersect in Package ‘tm’
tmIntersect,PlainTextDocument,character-method Methods for Function tmIntersect in Package ‘tm’
tmIntersect-methods Methods for Function tmIntersect in Package ‘tm’
tmMap Methods for Function tmMap in Package ‘tm’
tmMap,Corpus,function-method Methods for Function tmMap in Package ‘tm’
tmMap-methods Methods for Function tmMap in Package ‘tm’
tmReduce Combine Transformations
tmTolower Methods for Function tmTolower in Package ‘tm’
tmTolower,PlainTextDocument-method Methods for Function tmTolower in Package ‘tm’
tmTolower-methods Methods for Function tmTolower in Package ‘tm’
tmUpdate Methods for Function tmUpdate in Package ‘tm’
tmUpdate,Corpus,DirSource-method Methods for Function tmUpdate in Package ‘tm’
tmUpdate-methods Methods for Function tmUpdate in Package ‘tm’

-- U --

URI Plain Text Document
URI,NewsgroupDocument-method Newsgroup Text Document
URI,PlainTextDocument-method Plain Text Document
URI,StructuredTextDocument-method Structured Text Document
URI,XMLTextDocument-method Text document
URISource Uniform Resource Identifier Source
URISource,ANY-method Uniform Resource Identifier Source
URISource,character-method Uniform Resource Identifier Source
URISource-class Source for Directories

-- V --

VectorSource Gmane Source
VectorSource,ANY-method Gmane Source
VectorSource,vector-method Gmane Source
VectorSource-class Source for Vectors

-- W --

weightBin Weight Binary
WeightFunction Weighting Function Constructor
WeightFunction,function,character,character-method Weighting Function Constructor
WeightFunction-class Weighting Function
weightTf Weight By Term Frequency
weightTfIdf Weight By Term Frequency Inverse Document Frequency
writeCorpus Methods for Function writeCorpus in Package ‘tm’
writeCorpus,Corpus-method Methods for Function writeCorpus in Package ‘tm’
writeCorpus-methods Methods for Function writeCorpus in Package ‘tm’

-- X --

XMLSource XML Source
XMLSource-class Source for XML Files
XMLTextDocument-class Text document

-- misc --

[,Corpus,ANY,ANY,ANY-method Methods for Subset Functions in Package ‘tm’
[,Corpus-method Methods for Subset Functions in Package ‘tm’
[-methods Methods for Subset Functions in Package ‘tm’
[.TermDocumentMatrix Methods for Subset Functions in Package ‘tm’
[<-,Corpus,ANY,ANY,ANY-method Methods for Subset Functions in Package ‘tm’
[<-,Corpus-method Methods for Subset Functions in Package ‘tm’
[<--methods Methods for Subset Functions in Package ‘tm’
[[,Corpus,ANY,ANY-method Methods for Subset Functions in Package ‘tm’
[[,Corpus-method Methods for Subset Functions in Package ‘tm’
[[-methods Methods for Subset Functions in Package ‘tm’
[[<-,Corpus,ANY,ANY-method Methods for Subset Functions in Package ‘tm’
[[<-,Corpus-method Methods for Subset Functions in Package ‘tm’
[[<--methods Methods for Subset Functions in Package ‘tm’