Text Analysis Utilities


[Up] [Top]

Documentation for package ‘tau’ version 0.0-7

Help Pages

fixEncoding Adapt the (Declared) Encoding of a Character Vector
format.textcnt Term or Pattern Counting of Text Documents
is.ascii Adapt the (Declared) Encoding of a Character Vector
is.locale Adapt the (Declared) Encoding of a Character Vector
is.utf8 Adapt the (Declared) Encoding of a Character Vector
parse_IETF_language_tag Parse IETF Language Tag
remove_stopwords Preprocessing of Text Documents
textcnt Term or Pattern Counting of Text Documents
tokenize Preprocessing of Text Documents
translate Adapt the (Declared) Encoding of a Character Vector
translate_Unicode_latin_ligatures Translate Unicode Latin Ligatures