convertReut21578XMLPlain {tm}R Documentation

Transform a Reuters21578 XML Document to a Plain Text Document

Description

Transform a Reuters21578 XML document to a plain text document.

Usage

convertReut21578XMLPlain(node, ...)

Arguments

node an XML node representing a <REUTERS></REUTERS> element from a well-formed Reuters-21578 XML file.
... Arguments passed over by calling functions.

Value

A PlainTextDocument representing node.

Author(s)

Ingo Feinerer

See Also

asPlain

Examples

reut21578 <- system.file("texts", "reut21578", package = "tm")
reut21578TDC <- Corpus(DirSource(reut21578), readerControl = list(reader = readReut21578XML, language = "en_US", load = TRUE))
reut21578TDC[[1]]
asPlain(reut21578TDC[[1]], convertReut21578XMLPlain)

[Package tm version 0.3-3 Index]