Chinese Text Segmentation


[Up] [Top]

Documentation for package ‘jiebaR’ version 0.8

Help Pages

jiebaR-package A package for Chinese text segmentation
<=.keywords Keywords symbol
<=.qseg Quick mode symbol
<=.segment Text segmentation symbol
<=.simhash Simhash symbol
<=.tagger Tagger symbol
DICTPATH The path of dictionary
distance Hamming distance of words
edit_dict Edit default user dictionary
filecoding Files encoding detection
filter_segment Filter segmentation result This function helps remove some words in the segmentation result.
freq The frequency of words
get_idf generate IDF dict
get_qsegmodel Set quick mode model
get_tuple get tuple from the segmentation result
HMMPATH The path of dictionary
IDFPATH The path of dictionary
jiebaR A package for Chinese text segmentation
keywords Keyword extraction
new_user_word Add user word
print.inv Print worker settings
print.jieba Print worker settings
print.keywords Print worker settings
print.qseg Print worker settings
print.simhash Print worker settings
qseg Quick mode symbol
query_threshold Set query threshold
reset_qsegmodel Set quick mode model
segment Chinese text segmentation function
set_qsegmodel Set quick mode model
show_dictpath Show default path of dictionaries
simhash Simhash computation
STOPPATH The path of dictionary
tagging Speech Tagging
tobin simhash value to binary
USERPATH The path of dictionary
vector_distance Hamming distance of words
vector_keywords Keyword extraction
vector_simhash Simhash computation
words_locate Get text location
worker Initialize jiebaR worker
[.keywords Keywords symbol
[.qseg Quick mode symbol
[.segment Text segmentation symbol
[.simhash Simhash symbol
[.tagger Tagger symbol