Corpus-class {tm} | R Documentation |
A class representing a collection of text documents (denoted as corpus in linguistics).
Objects can be created by calls of the form
new("Corpus",...)
or by calling the function
Corpus
.
CMetaData
:MetaDataNode
containing
the document collection (corpus) specific meta data for the
collection in form of tag-value pairs and information about
children in form of a binary tree. This information is useful for
reconstructing meta data after e.g. merging document collections.DMetaData
:data.frame
containing
the document specific meta data for the collection. This dataframe
typically encompasses clustering or classification results which
basically are metadata for documents but form an own entity (e.g.,
with its name, the value range, etc.).DBControl
:list
with three
named components: useDb
indicates whether database support
is activated, dbName
holds the path to the database
storage, and dbType
stores the database type.
Class list
, directly.
signature(object = "Corpus")
: Returns the
corpus specific metadata in form of a tag-value
paired list.signature(object = "Corpus")
: Returns the
document specific metadata in form of a data frame.signature(object = "Corpus")
: Returns
the database configuration settings.Ingo Feinerer