The hardware and bandwidth for this mirror is donated by METANET, the Webhosting and Full Service-Cloud Provider.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]metanet.ch.

audubon: Japanese Text Processing Tools

A collection of Japanese text processing tools for filling Japanese iteration marks, Japanese character type conversions, segmentation by phrase, and text normalization which is based on rules for the 'Sudachi' morphological analyzer and the 'NEologd' (Neologism dictionary for 'MeCab'). These features are specific to Japanese and are not implemented in 'ICU' (International Components for Unicode).

Version: 0.5.2
Depends: R (≥ 2.10)
Imports: dplyr (≥ 1.1.0), magrittr, Matrix, memoise, purrr, readr, rlang (≥ 0.4.11), stringi, V8
Suggests: roxygen2, spelling, testthat (≥ 3.0.0)
Published: 2024-04-27
DOI: 10.32614/CRAN.package.audubon
Author: Akiru Kato [cre, aut], Koki Takahashi [cph] (Author of japanese.js), Shuhei Iitsuka [cph] (Author of budoux), Taku Kudo [cph] (Author of TinySegmenter)
Maintainer: Akiru Kato <paithiov909 at gmail.com>
BugReports: https://github.com/paithiov909/audubon/issues
License: Apache License (≥ 2)
URL: https://github.com/paithiov909/audubon, https://paithiov909.github.io/audubon/
NeedsCompilation: no
Language: en-US
Materials: README NEWS
CRAN checks: audubon results

Documentation:

Reference manual: audubon.pdf

Downloads:

Package source: audubon_0.5.2.tar.gz
Windows binaries: r-devel: audubon_0.5.2.zip, r-release: audubon_0.5.2.zip, r-oldrel: audubon_0.5.2.zip
macOS binaries: r-release (arm64): audubon_0.5.2.tgz, r-oldrel (arm64): audubon_0.5.2.tgz, r-release (x86_64): audubon_0.5.2.tgz, r-oldrel (x86_64): audubon_0.5.2.tgz
Old sources: audubon archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=audubon to link to this page.

These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.