The hardware and bandwidth for this mirror is donated by METANET, the Webhosting and Full Service-Cloud Provider.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]metanet.ch.

stringdist: Approximate String Matching, Fuzzy Text Search, and String Distance Functions

Implements an approximate string matching version of R's native 'match' function. Also offers fuzzy text search based on various string distance measures. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well. Reference: MPJ van der Loo (2014) <doi:10.32614/RJ-2014-011>.

Version: 0.9.12
Depends: R (≥ 2.15.3)
Imports: parallel
Suggests: tinytest
Published: 2023-11-28
DOI: 10.32614/CRAN.package.stringdist
Author: Mark van der Loo ORCID iD [aut, cre], Jan van der Laan [ctb], R Core Team [ctb], Nick Logan [ctb], Chris Muir [ctb], Johannes Gruber [ctb], Brian Ripley [ctb]
Maintainer: Mark van der Loo <mark.vanderloo at gmail.com>
BugReports: https://github.com/markvanderloo/stringdist/issues
License: GPL-3
URL: https://github.com/markvanderloo/stringdist
NeedsCompilation: yes
Citation: stringdist citation info
Materials: README NEWS
In views: NaturalLanguageProcessing, OfficialStatistics
CRAN checks: stringdist results

Documentation:

Reference manual: stringdist.pdf
Vignettes: RJournal 6 111-122 (2014)
stringdist C/C++ API

Downloads:

Package source: stringdist_0.9.12.tar.gz
Windows binaries: r-devel: stringdist_0.9.12.zip, r-release: stringdist_0.9.12.zip, r-oldrel: stringdist_0.9.12.zip
macOS binaries: r-release (arm64): stringdist_0.9.12.tgz, r-oldrel (arm64): stringdist_0.9.12.tgz, r-release (x86_64): stringdist_0.9.12.tgz, r-oldrel (x86_64): stringdist_0.9.12.tgz
Old sources: stringdist archive

Reverse dependencies:

Reverse depends: AurieLSHGaussian, blink, diathor, VDJgermlines
Reverse imports: APCalign, available, bdc, bdlp, bibliometrix, biblioverlap, BiocCheck, bioseq, CAGEr, campfin, ChromSCape, clickR, ClustIRR, clustringr, ColOpenData, cols4all, concatipede, Conigrave, countries, CTDquerier, daqapo, deductive, DeepPINCS, discoverableresearch, dupree, exams.forge, fastLink, fctutils, fcuk, fedmatch, flora, fossilbrush, fuzzyjoin, genBaRcode, geneHapR, GeneStructureTools, GetLattesData, immunarch, immuneSIM, KOR.addrlink, labourR, levitate, LexisNexisTools, lingtypology, LinTInd, longreadvqs, LymphoSeq, MEDseq, messy.cats, mmstat4, occupationMeasurement, ogrdbstats, ontophylo, openPrimeR, palaeoverse, PGRdup, Platypus, poldis, qdap, r2dii.match, rabi, rassta, rdiversity, reclin2, refinr, RepertoiR, revtools, scRepertoire, scruff, SemNetCleaner, SentimentAnalysis, synthesisr, taxlist, ThomasJeffersonUniv, tidystringdist, tripr, ulex, utilsIPEA
Reverse linking to: refinr
Reverse suggests: embed, epiCleanr, GenProSeq, googleLanguageR, lavaanExtra, LSTbook, rlist, RPatternJoin, seqtrie, sjmisc, WorldFlora, zoomerjoin

Linking:

Please use the canonical form https://CRAN.R-project.org/package=stringdist to link to this page.

These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.