fastLink

Fast Probabilistic Record Linkage with Missing Data

Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionalities to conduct a merge of two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. In addition, tools for preparing, adjusting, and summarizing data merges are included. The package implements methods described in Enamorado, Fifield, and Imai (2017) ''Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records'', available at <http://imai.princeton.edu/research/linkage.html>.

Total

5,213

Last month

225

Last week

36

Average per day

8

Daily downloads

Total downloads

Description file content

Package
fastLink
Type
Package
Title
Fast Probabilistic Record Linkage with Missing Data
Version
0.4.0
Date
2018-05-15
Description
Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionalities to conduct a merge of two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. In addition, tools for preparing, adjusting, and summarizing data merges are included. The package implements methods described in Enamorado, Fifield, and Imai (2017) ''Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records'', available at .
License
GPL (>= 3)
Imports
Matrix, parallel, foreach, doParallel, gtools, data.table, stringdist, stringr, stringi, Rcpp (>= 0.12.7), FactoClass, adagio, dplyr, plotrix, grDevices, graphics
Depends
R (>= 2.14.0)
LinkingTo
RcppArmadillo, Rcpp, RcppEigen
Encoding
UTF-8
LazyData
true
BugReports
https://github.com/kosukeimai/fastLink/issues
RoxygenNote
6.0.1
Suggests
testthat
NeedsCompilation
yes
Packaged
2018-05-15 19:36:45 UTC; benfifield
Author
Ted Enamorado [aut, cre], Ben Fifield [aut], Kosuke Imai [aut]
Maintainer
Ted Enamorado
Repository
CRAN
Date/Publication
2018-05-15 20:22:48 UTC

install.packages('fastLink')

0.4.0

2 months ago

Ted Enamorado

GPL (>= 3)

Depends on

R (>= 2.14.0)

Imports

Matrix, parallel, foreach, doParallel, gtools, data.table, stringdist, stringr, stringi, Rcpp (>= 0.12.7), FactoClass, adagio, dplyr, plotrix, grDevices, graphics

Suggests

testthat

Discussions