fastLink

Fast Probabilistic Record Linkage with Missing Data

Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functions to merge two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm, as well as tools for preparing, adjusting, and summarizing data merges. The package implements methods described in Enamorado, Fifield, and Imai (2017), "Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records", available at <http://imai.princeton.edu/research/linkage.html>.
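
To make the idea behind the description concrete, here is a minimal base-R sketch of an EM fit for a two-class Fellegi-Sunter model with binary agreement indicators, where missing comparisons are simply dropped from the likelihood. It is an illustration only and not the package's implementation: the function name `fs_em`, its arguments, the starting values, and the simulated data are all assumptions made for this example, and it does not use the package's own functions, partial agreement levels, or auxiliary information.

```r
# Toy EM for a two-class Fellegi-Sunter model (illustrative, not fastLink code).
# gamma: N x K matrix of field comparisons for N record pairs:
#        1 = agree, 0 = disagree, NA = field missing in either record.
fs_em <- function(gamma, lambda = 0.1, max_iter = 500, tol = 1e-8) {
  K        <- ncol(gamma)
  obs      <- !is.na(gamma)
  agree    <- (obs & gamma == 1) * 1   # 1 where observed and agreeing
  disagree <- (obs & gamma == 0) * 1   # 1 where observed and disagreeing
  m   <- rep(0.9, K)                   # P(agree | match), starting values
  u   <- rep(0.1, K)                   # P(agree | non-match), starting values
  eps <- 1e-6                          # keeps probabilities away from 0 and 1

  for (iter in seq_len(max_iter)) {
    # E-step: posterior match probability, using observed fields only
    log_match <- log(lambda) +
      drop(agree %*% log(m) + disagree %*% log(1 - m))
    log_non <- log(1 - lambda) +
      drop(agree %*% log(u) + disagree %*% log(1 - u))
    xi <- 1 / (1 + exp(log_non - log_match))

    # M-step: weighted agreement rates among observed comparisons
    lambda_new <- mean(xi)
    m_new <- colSums(xi * agree) / colSums(xi * (agree + disagree))
    u_new <- colSums((1 - xi) * agree) / colSums((1 - xi) * (agree + disagree))
    m_new <- pmin(pmax(m_new, eps), 1 - eps)
    u_new <- pmin(pmax(u_new, eps), 1 - eps)

    change <- max(abs(c(lambda_new - lambda, m_new - m, u_new - u)))
    lambda <- lambda_new; m <- m_new; u <- u_new
    if (change < tol) break
  }
  # posterior comes from the final E-step
  list(lambda = lambda, m = m, u = u, posterior = xi, iterations = iter)
}

# Simulated comparison data (all values below are made up for the example)
set.seed(1)
N        <- 5000
m_true   <- c(0.95, 0.90, 0.90, 0.85)
u_true   <- c(0.05, 0.10, 0.10, 0.20)
is_match <- rbinom(N, 1, 0.1)
prob     <- outer(is_match, m_true) + outer(1 - is_match, u_true)
gamma    <- matrix(rbinom(length(prob), 1, as.vector(prob)), nrow = N)
gamma[sample(length(gamma), 0.1 * length(gamma))] <- NA  # 10% missing at random

fit <- fs_em(gamma)
print(round(fit$lambda, 3))
print(round(rbind(m = fit$m, u = fit$u), 3))
```

Pairs with a high value in `fit$posterior` would be the candidate matches. Treating a missing comparison as contributing nothing to the likelihood is the simplifying assumption this sketch relies on.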

Downloads

Total: 5,792
Last month: 361
Last week: 152
Average per day: 12

Description file content

Package: fastLink
Type: Package
Title: Fast Probabilistic Record Linkage with Missing Data
Version: 0.4.0
Date: 2018-05-15
Description: Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionalities to conduct a merge of two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. In addition, tools for preparing, adjusting, and summarizing data merges are included. The package implements methods described in Enamorado, Fifield, and Imai (2017) ''Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records'', available at <http://imai.princeton.edu/research/linkage.html>.
License: GPL (>= 3)
Imports: Matrix, parallel, foreach, doParallel, gtools, data.table, stringdist, stringr, stringi, Rcpp (>= 0.12.7), FactoClass, adagio, dplyr, plotrix, grDevices, graphics
Depends: R (>= 2.14.0)
LinkingTo: RcppArmadillo, Rcpp, RcppEigen
Encoding: UTF-8
LazyData: true
BugReports: https://github.com/kosukeimai/fastLink/issues
RoxygenNote: 6.0.1
Suggests: testthat
NeedsCompilation: yes
Packaged: 2018-05-15 19:36:45 UTC; benfifield
Author: Ted Enamorado [aut, cre], Ben Fifield [aut], Kosuke Imai [aut]
Maintainer: Ted Enamorado
Repository: CRAN
Date/Publication: 2018-05-15 20:22:48 UTC

install.packages('fastLink')
