stopwords {lsa}R Documentation

Stopwordlists in German and English

Description

This data sets contain very common lists of words that want to be ignored when building up a document-term matrix. The stop word lists can be loaded by calling data(stopwords_en) and data(stopwords_de). The objects stopwords_de and stopwords_en must already exist before being handed over to textmatrix().

Usage

   data(stopwords_de)
   data(stopwords_en)

Format

A vector containing 424 (370) English (German) stop words (e.g. 'he', 'she', 'a').

Author(s)

Fridolin Wild fridolin.wild@wu-wien.ac.at


[Package lsa version 0.57 Index]