stopwords {lsa}R Documentation

Stopwordlists in German, English, and Dutch

Description

This data sets contain very common lists of words that want to be ignored when building up a document-term matrix. The stop word lists can be loaded by calling data(stopwords_en), data(stopwords_de), or data(stopwords_nl). The objects stopwords_de, stopwords_en, and data(stopwords_nl) must already exist before being handed over to textmatrix().

Usage

   data(stopwords_de)
   data(stopwords_en)
   data(stopwords_nl)

Format

A vector containing 424 (370) English (German) stop words (e.g. 'he', 'she', 'a').

Author(s)

Fridolin Wild fridolin.wild@wu-wien.ac.at, Marco Kalz marco.kalz@ou.nl (for Dutch)


[Package lsa version 0.59 Index]