K-anonymized location data

The table k_anon_2_taxi_rides_condensed in the database ma_anonymized is a K-anonymized version of the time and location columns of the taxi database (trip start and end times, and trip start and end longitude and latitude). For this table, K=2.

The database can be found at db001.gda-score.org.
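
As an illustration of what K=2 means for the time and location columns, the following is a minimal sketch of a K-anonymity check: every combination of quasi-identifier values that appears in the table must be shared by at least K rows. The column names and ride values here are hypothetical and are not taken from k_anon_2_taxi_rides_condensed; the function is_k_anonymous is likewise an illustrative helper, not part of any GDA Score tooling.

```python
from collections import Counter


def is_k_anonymous(rows, quasi_identifiers, k):
    """Return True if every combination of quasi-identifier values
    that occurs in rows is shared by at least k rows."""
    groups = Counter(
        tuple(row[col] for col in quasi_identifiers) for row in rows
    )
    return all(count >= k for count in groups.values())


# Hypothetical column names and made-up values, for illustration only.
QUASI_IDENTIFIERS = ["pickup_time", "pickup_lat", "pickup_lon"]

rides = [
    {"pickup_time": "08:15", "pickup_lat": 40.75, "pickup_lon": -73.99, "fare": 9.5},
    {"pickup_time": "08:15", "pickup_lat": 40.75, "pickup_lon": -73.99, "fare": 14.0},
    {"pickup_time": "09:40", "pickup_lat": 40.71, "pickup_lon": -74.01, "fare": 7.0},
    {"pickup_time": "09:40", "pickup_lat": 40.71, "pickup_lon": -74.01, "fare": 22.5},
]

print(is_k_anonymous(rides, QUASI_IDENTIFIERS, k=2))  # True: each group has 2 rows
print(is_k_anonymous(rides, QUASI_IDENTIFIERS, k=3))  # False: no group has 3 rows
```

Columns outside the quasi-identifier list (here the made-up fare column) are deliberately ignored by the check, since K-anonymity only constrains the columns that were anonymized.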

Other databases

The GDA Score project offers a number of real databases that can be used to test and measure anonymization methods.

Raw NYC Taxi Database

This database contains four hours of New York City taxi rides (January 8, 2013, from 8 AM to noon).


Raw Czech Banking Data

This dataset contains a set of banking transactions and other data from a Czech bank.


Pseudonymization, Column Suppression

The column-suppressed pseudonymized tables are generated by simply deleting columns that contain Personally Identifying Information (PII).
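
Column suppression as described above can be sketched in a few lines. This is only an illustration under stated assumptions: the record fields, the list of PII columns, and the helper suppress_columns are hypothetical and do not reflect the actual schema or the scripts used to build the GDA Score tables.

```python
def suppress_columns(rows, pii_columns):
    """Return copies of the rows with the listed PII columns removed."""
    pii = set(pii_columns)
    return [
        {col: val for col, val in row.items() if col not in pii}
        for row in rows
    ]


# Hypothetical records and column names, for illustration only.
accounts = [
    {"uid": "a91f", "name": "Jana Novak", "birth_number": "123456", "district": 18},
    {"uid": "c07e", "name": "Petr Svoboda", "birth_number": "654321", "district": 5},
]

suppressed = suppress_columns(accounts, pii_columns=["name", "birth_number"])
print(suppressed)
```

The original rows are left unmodified, so the raw and suppressed versions can be compared side by side.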
