privacy-engineering-tools

De-identification Tools using R

We found the following R packages for de-identifying data, or checking how well data are de-identified (in alphabetic order):

Name Description Data type Privacy models More info License Maintenance GitHub stars
Datacheck Open source R package and web app to check the presence of common identifiers Tabular data (CSV) - Project report and demo MIT Active 0-10
sdcMicro R package and web app to apply generalization, top- and bottom coding, recoding + analyze privacy risks and utility Tabular microdata (.Rdata, .sav, .sasb7dat, .csv, .txt, .dta) k-anonymity Documentation, demo GPL-v2 Active 10-100
sdcTable R package to apply statistical disclosure control to tables Tables (e.g., frequency tables) Suppression Documentation GPL-v2 Active 0-10