About
Functions to assist in performing probabilistic record linkage and deduplication: generating pairs, comparing records, estimating m- and u-probabilities with the EM algorithm (I. Fellegi & A. Sunter (1969), doi:10.1080/01621459.1969.10501049; T.N. Herzog, F.J. Scheuren, & W.E. Winkler (2007), "Data Quality and Record Linkage Techniques", ISBN:978-0-387-69502-0), and forcing one-to-one matching. Can also be used for pre- and post-processing for machine learning methods for record linkage. The focus is on memory use, CPU performance, and flexibility.
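To make the EM step mentioned in the description concrete, here is a minimal, language-agnostic sketch in Python. It is not the package's API: the function name `estimate_m_u`, its parameters, and the toy comparison vectors are all hypothetical. It assumes binary agree/disagree comparisons and conditional independence between fields, as in the basic Fellegi-Sunter model.

```python
def estimate_m_u(patterns, n_iter=200, p=0.1, m_init=0.9, u_init=0.1, eps=1e-6):
    """Hypothetical helper: EM estimates of m- and u-probabilities.

    patterns: list of equal-length tuples of 0/1, one per record pair
              (1 = the field agrees, 0 = it disagrees).
    Returns (m, u, p): per-field agreement probabilities among matches (m)
    and non-matches (u), and the estimated share of matches (p).
    """
    k = len(patterns[0])
    n = len(patterns)
    m = [m_init] * k
    u = [u_init] * k

    def clamp(v):
        # Keep probabilities away from 0 and 1 so no likelihood becomes zero.
        return min(max(v, eps), 1.0 - eps)

    for _ in range(n_iter):
        # E-step: posterior probability that each pair is a match.
        g = []
        for x in patterns:
            like_match = p
            like_non = 1.0 - p
            for j, agrees in enumerate(x):
                like_match *= m[j] if agrees else 1.0 - m[j]
                like_non *= u[j] if agrees else 1.0 - u[j]
            g.append(like_match / (like_match + like_non))

        # M-step: weighted agreement rates in each latent class.
        total = sum(g)
        non_total = max(n - total, eps)
        m = [clamp(sum(gi * x[j] for gi, x in zip(g, patterns)) / total)
             for j in range(k)]
        u = [clamp(sum((1.0 - gi) * x[j] for gi, x in zip(g, patterns)) / non_total)
             for j in range(k)]
        p = clamp(total / n)
    return m, u, p


# Toy data (made up for illustration): three compared fields per pair.
pairs = ([(1, 1, 1)] * 20 + [(1, 1, 0)] * 5 +
         [(0, 0, 0)] * 60 + [(1, 0, 0)] * 10 + [(0, 1, 0)] * 10)
m, u, p = estimate_m_u(pairs)
```

With estimates like these, a pair's linkage weight is typically the sum over fields of log(m/u) where the field agrees and log((1-m)/(1-u)) where it disagrees; pairs above a chosen threshold are treated as matches.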
github.com/djvanderlaan/reclin2 | |
Bug report | File report |
Key Metrics
Downloads
Period | Downloads | Change |
Yesterday | 17 | 0% |
Last 7 days | 133 | -26% |
Last 30 days | 586 | +1% |
Last 90 days | 2,010 | +29% |
Last 365 days | 6,287 | +60% |