convertRCV1Plain {tm}R Documentation

Transform A RCV1 Document To A Plain Text Document

Description

Transform a Reuters Corpus Volume 1 XML document to a plain text document.

Usage

convertRCV1Plain(node, ...)

Arguments

node an XML node representing a <newsitem></newsitem> element from a well-formed RCV1 XML file.
... Arguments passed over by calling functions.

Value

A PlainTextDocument representing node.

Author(s)

Ingo Feinerer

See Also

asPlain

Examples

rcv1 <- system.file("texts", "rcv1", package = "tm")
rcv1TDC <- TextDocCol(DirSource(rcv1), readerControl = list(reader = readRCV1, language = "en_US", load = TRUE))
rcv1TDC[[1]]
asPlain(rcv1TDC[[1]], convertRCV1Plain)

[Package tm version 0.2 Index]