stringdist

Approximate String Matching and String Distance Functions

Implements an approximate string matching version of R's native 'match' function. Calculates various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal string alignment), q-grams (q-gram, cosine, Jaccard distance), or heuristic metrics (Jaro, Jaro-Winkler). An implementation of Soundex is provided as well. Distances can be computed between character vectors, taking proper care of encoding, or between integer vectors representing generic sequences. The package is built for speed and runs in parallel using 'OpenMP'. An API for C or C++ is exposed as well.
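As a rough illustration of the features listed above, the following sketch calls one function from each distance family, the approximate matcher, and the Soundex encoder. The input strings are made-up examples, and the argument values (`q = 2`, `p = 0.1`, `maxDist = 2`) are illustrative choices rather than recommendations; the package's reference manual is the authority on defaults and on the full list of methods.

```r
library(stringdist)

# Edit-based distances between pairs of strings.
lv  <- stringdist("kitten", "sitting", method = "lv")        # Levenshtein
ham <- stringdist("karolin", "kathrin", method = "hamming")  # Hamming (equal lengths)
osa <- stringdist("ab", "ba", method = "osa")                # adjacent transposition = one edit

# q-gram based distance; q = 2 compares sets of character bigrams.
jac <- stringdist("night", "nacht", method = "jaccard", q = 2)

# Heuristic metric; with method = "jw", p > 0 applies the Winkler prefix adjustment.
jw <- stringdist("MARTHA", "MATHRA", method = "jw", p = 0.1)

# Approximate version of match(): index of the closest entry in the
# lookup table that lies within maxDist, or NA if there is none.
idx <- amatch("leia", c("uhura", "leela"), maxDist = 2)

# Soundex code of a name.
code <- phonetic("Robert", method = "soundex")

print(c(lv = lv, ham = ham, osa = osa, jac = jac, jw = jw))
print(idx)
print(code)
```

The functions are vectorized, so passing character vectors instead of single strings computes the distances element by element.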

Downloads

Total: 762,395
Last month: 26,468
Last week: 5,860
Average per day: 882
Description file content

Package: stringdist
Maintainer: Mark van der Loo
License: GPL-3
Title: Approximate String Matching and String Distance Functions
LazyData: no
Type: Package
LazyLoad: yes
Description: Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal string alignment), q-grams (q-gram, cosine, Jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of Soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'OpenMP'. An API for C or C++ is exposed as well.
Version: 0.9.5.1
Depends: R (>= 2.15.3)
Imports: parallel
URL:
BugReports: https://github.com/markvanderloo/stringdist/issues
Suggests: testthat
RoxygenNote: 6.0.1
NeedsCompilation: yes
Packaged: 2018-06-08 11:22:22 UTC; mark
Author: Mark van der Loo [aut, cre], Jan van der Laan [ctb], R Core Team [ctb], Nick Logan [ctb], Chris Muir [ctb]
Repository: CRAN
Date/Publication: 2018-06-08 13:52:58 UTC

install.packages('stringdist')

0.9.5.1

2 months ago

https://github.com/markvanderloo/stringdist

Mark van der Loo

GPL-3

Depends on: R (>= 2.15.3)

Imports: parallel

Suggests: testthat
