okc_text | OkCupid Text Data |
step_lda | Calculates LDA dimension estimates |
step_sequence_onehot | Positional one-hot encoding of tokens |
step_stem | Stemming of list-column variables |
step_stopwords | Filtering of stopwords from a list-column variable |
step_textfeature | Generate the basic set of text features |
step_texthash | Feature hashing of tokens |
step_tf | Term frequency of tokens |
step_tfidf | Term frequency-inverse document frequency of tokens |
step_tokenfilter | Filter the tokens based on term frequency |
step_tokenize | Tokenization of character variables |
step_tokenmerge | Combine multiple token variables into one |
step_untokenize | Untokenization of list-column variables |
step_word_embeddings | Pretrained word embeddings of tokens |
tidy.step_lda | Calculates LDA dimension estimates |
tidy.step_sequence_onehot | Positional one-hot encoding of tokens |
tidy.step_stem | Stemming of list-column variables |
tidy.step_stopwords | Filtering of stopwords from a list-column variable |
tidy.step_textfeature | Generate the basic set of text features |
tidy.step_texthash | Feature hashing of tokens |
tidy.step_tf | Term frequency of tokens |
tidy.step_tfidf | Term frequency-inverse document frequency of tokens |
tidy.step_tokenfilter | Filter the tokens based on term frequency |
tidy.step_tokenize | Tokenization of character variables |
tidy.step_tokenmerge | Combine multiple token variables into one |
tidy.step_untokenize | Untokenization of list-column variables |
tidy.step_word_embeddings | Pretrained word embeddings of tokens |