parallelDist

Parallel Distance Matrix Computation using Multiple Threads

A fast parallelized alternative to R's native 'dist' function to calculate distance matrices for continuous, binary, and multi-dimensional input matrices, which supports a broad variety of 39 predefined distance functions from the 'stats', 'proxy' and 'dtw' R packages, as well as user- defined functions written in C++. For ease of use, the 'parDist' function extends the signature of the 'dist' function and uses the same parameter naming conventions as distance methods of existing R packages. The package is mainly implemented in C++ and leverages the 'RcppParallel' package to parallelize the distance computations with the help of the 'TinyThread' library. Furthermore, the 'Armadillo' linear algebra library is used for optimized matrix operations during distance calculations. The curiously recurring template pattern (CRTP) technique is applied to avoid virtual functions, which improves the Dynamic Time Warping calculations while the implementation stays flexible enough to support different DTW step patterns and normalization methods.

Total

1,883

Last month

423

Last week

105

Average per day

14

Daily downloads

Total downloads

Description file content

Package
parallelDist
Type
Package
Title
Parallel Distance Matrix Computation using Multiple Threads
Version
0.2.1
Author
Alexander Eckert [aut, cre]
Maintainer
Alexander Eckert
Description
A fast parallelized alternative to R's native 'dist' function to calculate distance matrices for continuous, binary, and multi-dimensional input matrices, which supports a broad variety of 39 predefined distance functions from the 'stats', 'proxy' and 'dtw' R packages, as well as user- defined functions written in C++. For ease of use, the 'parDist' function extends the signature of the 'dist' function and uses the same parameter naming conventions as distance methods of existing R packages. The package is mainly implemented in C++ and leverages the 'RcppParallel' package to parallelize the distance computations with the help of the 'TinyThread' library. Furthermore, the 'Armadillo' linear algebra library is used for optimized matrix operations during distance calculations. The curiously recurring template pattern (CRTP) technique is applied to avoid virtual functions, which improves the Dynamic Time Warping calculations while the implementation stays flexible enough to support different DTW step patterns and normalization methods.
License
GPL (>= 2)
LazyData
TRUE
URL
BugReports
https://github.com/alexeckert/parallelDist/issues
NeedsCompilation
yes
Depends
R (>= 3.0.2)
Imports
Rcpp (>= 0.12.6), RcppParallel (>= 4.3.20)
LinkingTo
Rcpp, RcppParallel, RcppArmadillo
SystemRequirements
C++11
Suggests
dtw, ggplot2, proxy, testthat, RcppArmadillo, RcppXPtrUtils
Packaged
2017-12-04 23:02:18 UTC; user
Repository
CRAN
Date/Publication
2017-12-04 23:47:31 UTC

install.packages('parallelDist')

0.2.1

9 days ago

https://github.com/alexeckert/parallelDist

Alexander Eckert

GPL (>= 2)

Depends on

R (>= 3.0.2)

Imports

Rcpp (>= 0.12.6), RcppParallel (>= 4.3.20)

Suggests

dtw, ggplot2, proxy, testthat, RcppArmadillo, RcppXPtrUtils

Discussions