The hardware and bandwidth for this mirror is donated by METANET, the Webhosting and Full Service-Cloud Provider.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]metanet.ch.
global_idf3
.bind_tf_idf2
.
norm=TRUE
. Cosine nomalization is now performed on tf_idf
values as in the RMeCab package.tf="itf"
and idf="df"
options.pack
for performance.tokenize_mecab
and tokenize_sudachipy
.bind_lr
function which can calculate the ‘LR’ value of bigrams.pack
now always returns a tibble, not a data.frame.bind_tf_idf2
can calculate and bind the term frequency, inverse document frequency, and tf-idf of the tidy text dataset.collapse_tokens
, mute_tokens
, and lexical_density
can be used for handling a tidy text dataset of tokens.strj_tokenize
now preserves the original order of text names.prettify
now can get delim
argument.strj_fill_iter_mark
function.
strj_fill_iter_mark
now replaces a sequence of iteration marks recursively.strj_tokenize
function.
strj_tokenize
now can retrieve engine
argument to switch tokenizers for splitting text into tokens.ngram_tokenizer
function.pack
function.
pack
function. pack
now accepts pull
as its second argument and n
as its third argument.pull
now can accept a symbol.NEWS.md
file to track changes to the package.These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.