The hardware and bandwidth for this mirror is donated by METANET, the Webhosting and Full Service-Cloud Provider.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]metanet.ch.
failure: length > 1 in coercion to logical
issue in n_gram_merge()
, in commit 4336ee6.stringdist
C API, and calling C functions in place of using stringdist::stringdistmatrix()
. This change results in speed improvements in function n_gram_merge()
, and requires that stringdist
v0.9.5.1 or greater be installed.n_gram_merge()
, renamed arg edit_dist_weights
to weight
. The only purpose of this arg is to be passed along to function stringdistmatrix
from the stringdist package (which uses the name weight
, so this change is simply to match that).Fixed issue in which input strings that contained accent marks were not being properly handled/clustered (#9). The fix involved adding stringi to Imports
and using stringi::stri_trans_general()
.
Fixed issue in n_gram_merge()
in which incorrect values were being return when input arg ignore_strings
was not NULL
, and arg bus_suffix = FALSE
(#7).
Fixed issue in which input strings that contained punctuation that was NOT surrounded by spaces was returning incorrect values (#6).
Fixed issue in which the edit value assigned to a cluster was sometimes not the most frequent string in that cluster (#5).
std::unordered_map()
, resulting in a substantial speed improvement when passing large character vectors (length 100,000+) to either of the exported functions (#8).These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.