stringdist

Approximate String Matching and String Distance Functions

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal string alignment), q-grams (q-gram, cosine, Jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding, or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'OpenMP'. An API for C or C++ is exposed as well.

Downloads, total: 1,060,978
Downloads, last month: 28,775
Downloads, last week: 7,528
Downloads, average per day: 959

Description file content

Package: stringdist
Maintainer: Mark van der Loo
License: GPL-3
Title: Approximate String Matching and String Distance Functions
LazyData: no
Type: Package
LazyLoad: yes
Description: Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal string alignment), q-grams (q-gram, cosine, Jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding, or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'OpenMP'. An API for C or C++ is exposed as well.
Version: 0.9.5.5
Depends: R (>= 2.15.3)
URL:
BugReports: https://github.com/markvanderloo/stringdist/issues
Suggests: tinytest
Imports: parallel
Encoding: UTF-8
RoxygenNote: 6.1.1
NeedsCompilation: yes
Packaged: 2019-10-21 06:45:42 UTC; mark
Author: Mark van der Loo [aut, cre], Jan van der Laan [ctb], R Core Team [ctb], Nick Logan [ctb], Chris Muir [ctb]
Repository: CRAN
Date/Publication: 2019-10-21 07:20:03 UTC

install.packages('stringdist')
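
Once the package is installed, the features listed in the description can be tried out with a short session like the one below. This is a minimal sketch rather than an excerpt from the package documentation: the input strings are arbitrary illustrations, and the parameter values (`q = 2`, `p = 0.1`, `maxDist = 2`) are assumptions chosen for the example, not recommended settings.

```r
library(stringdist)

# Edit-based distances between two character strings.
d_lv  <- stringdist("kitten", "sitting", method = "lv")  # Levenshtein
d_osa <- stringdist("ab", "ba", method = "osa")          # optimal string alignment:
                                                         # a transposition is one edit
d_lv2 <- stringdist("ab", "ba", method = "lv")           # Levenshtein needs two edits

# q-gram based and heuristic distances.
d_jac <- stringdist("night", "nacht", method = "jaccard", q = 2)
d_jw  <- stringdist("martha", "marhta", method = "jw", p = 0.1)  # Jaro-Winkler

# Approximate version of match(): index of the closest table entry
# within maxDist, or NA when nothing is close enough.
idx <- amatch("leia", c("uhura", "leela"), maxDist = 2)

# Soundex codes.
codes <- phonetic(c("Robert", "Rupert"), method = "soundex")

# Distance between integer vectors representing generic sequences.
d_seq <- seq_dist(c(1L, 2L, 3L), c(1L, 3L, 2L), method = "lv")

print(c(lv = d_lv, osa = d_osa, lv2 = d_lv2, seq = d_seq))
print(idx)
print(codes)
```

Note that `amatch` needs an explicit `maxDist` here, because its default tolerance is too small to accept any inexact match under an edit-based distance.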

https://github.com/markvanderloo/stringdist
