makeChunks {tm}R Documentation

Split a Corpus into Chunks

Description

Split a corpus into equally sized chunks conserving document boundaries.

Usage

makeChunks(corpus, chunksize)

Arguments

corpus The corpus to be split into chunks.
chunksize The chunk size.

Value

A corpus consisting of the chunks. Note that corpus meta data is not passed on to the newly created chunk corpus.

Author(s)

Ingo Feinerer

Examples

txt <- system.file("texts", "txt", package = "tm")
ovid <- Corpus(DirSource(txt))
sapply(ovid, length)
ovidChunks <- makeChunks(ovid, 5)
sapply(ovidChunks, length)

[Package tm version 0.3-3 Index]