American Journal of Public Health Research
ISSN (Print): 2327-669X ISSN (Online): 2327-6703 Website: https://www.sciepub.com/journal/ajphr Editor-in-chief: Apply for this position
Open Access
Journal Browser
Go
American Journal of Public Health Research. 2025, 13(6), 257-262
DOI: 10.12691/ajphr-13-6-1
Open AccessArticle

Anonymization of Georeferenced Public Health Microdata

Simon Cremer1, Lydia Jehmlich1 and Rainer Lenz1,

1Cologne university of technology, arts and sciences, Cologne, Germany

Pub. Date: November 17, 2025

Cite this paper:
Simon Cremer, Lydia Jehmlich and Rainer Lenz. Anonymization of Georeferenced Public Health Microdata. American Journal of Public Health Research. 2025; 13(6):257-262. doi: 10.12691/ajphr-13-6-1

Abstract

Whether for the use of targeted advertising measures or tracing the spatial spread of viruses such as the recent corona virus: georeferenced microdata can - depending on the attributes it is provided with - hold enormous added value for society, science and research. However, the desired information can often not be extracted despite the inherent analytical content. The reason for this is that access to personal georeferenced datasets is severely restricted, as these are subject to statutory data protection. One way out of this dilemma is to apply a suitable anonymization method that guarantees data protection without significantly reducing the analytical validity of data. Based on the EU INSPIRE directive, the statistical offices of the EU are successively implementing the georeferencing of their surveys. This paper discusses selected anonymization methods being most promising for anonymizing georeferenced health data for research purposes, as they offer scope for combinations or more specific adaptations in order to balance out the trade-off between privacy and analytical validity of georeferenced health microdata.

Keywords:
data anonymization disclosure risk health microdata location privacy spatial analysis

Creative CommonsThis work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

References:

[1]  Ronning, G., Sturm, R., Höhne, J., Lenz, R., Rosemann, M., Scheffler, M. and Vorgrimler, D., Handbuch zur Anonymisierung wirtschaftsstatistischer Mikrodaten, Statistisches Bundesamt, Wiesbaden, Series ‘Statistik und Wissenschaft’, vol. 4, 2005.
 
[2]  JdS 2025, 56 ème Journées de Statistique de la Societé Francaise de Statistique, Université Aix-Marseille, june 2025, see: https://jds2025.sciencesconf.org/?lang=fr.
 
[3]  Lenz, R., Measuring the disclosure protection of micro aggregated business microdata - an analysis taking as an example the German Structure of Costs Survey, Journal of Official Statistics 22 (4), 681-710, 2006.
 
[4]  Eurostat, Statistical confidentiality and personal data protection, 2023. Available at: https:// ec.europa.eu/ eurostat/ de/web/ microdata/statistical-confidentiality-and-personal-data-protection.
 
[5]  Broen, K., Rob, T. and Jon, Z., Measuring the impact of spatial perturbations on the relationship between data privacy and validity of descriptive Statistics, 2021, International Journal of Health Geographics, 20 (3).
 
[6]  Dwork, C., Differential privacy, International colloquium on automata, languages, and programming, 1-12, 2006.
 
[7]  Gao, S., Rao, J., Liu, X., Kang, Y., Huang, Q., App, J., Exploring the effectiveness of geomasking techniques for protecting the geoprivacy of Twitter users, Journal of spatial inform. science, 19, 105–129, 2019.
 
[8]  Armstrong, M.P., Rushton, G. and Zimmermann, D.L., Geographically Masking Health Data to preserve Confidentiality, Statistics in Medicine, 18, 497-525, 1999.
 
[9]  Kwan, M., Casas, I. and Schmitz, B. (2004), Protection of Geoprivacy and Accuracy of Spa-tial Information: How Effective Are Geographical Masks? Cartographica, 39(2), 15-28, 2004.
 
[10]  Houfaf-Khoufaf, W. and Touya, G., Geographically Masking Addresses to Study COVID-19 Clusters, Univ. Gustave Eiffel, 2021.
 
[11]  Delmelle, E. M., Desjardins, M. R.; Jung, P., Owusu, C., Hohl, A., Dony, C., Uncertainty in geospatial health: challenges and oppurtunities ahead, Annals of Epidemology, 65, 15-30, 2022.
 
[12]  Hampton, K.H., Fitch, M.K., Allshouse, W.B., Doherty, I.A., Gesink, D.C., Leone, P.A. and Miller, W.C., Mapping Health Data: Improved Privacy Protection with Donut Method Geomasking, American Journal of Epidemiology, 172 (9), 1062–1069, 2010.
 
[13]  Ocaña-Riola, R., Common errors in disease mapping, Geospatial Health, 4 (2), 139-154, 2010.
 
[14]  Seidl, D.E., Paulus, G., Jankowski, P., and Regenfelder, M., Spatial obfuscation methods for privacy protection of household-level data, Applied Geography, 63, 253-263, 2015.
 
[15]  Kounadi, O. and Leitner, M., Adaptive areal elimination: A transparent way of disclo-sing protected spatial datasets, Computers, Environment and Urban Systems, 57, 59–67.
 
[16]  Koller, D., Wohlrab, D., Sedlmeir, G. and Augustin, J., Geografische Ansätze in der Gesundheitsberichterstattung, Bundesgesundheitsblatt, 63, 1108–1117, 2020.
 
[17]  Erfuhrt, K., Groß, M., Rendtel, U., Schmid, T., Kernel density smoothing of composite spatial data on administrative area level - A case study of voting data in Berlin, AStA Wirtsch Sozialstat Arch, 16, 25–49, 2022.
 
[18]  MacEachren, AM., How Maps Work: Representation, Visualization, and Design, The Guilford Press, 1995.
 
[19]  Zandbergen, P. A., Ensuring Confidentiality of Geocoded Health Data: Assessing Geographic Masking Strategies for Individual-Level Data, Advances in Medicine, 2014.
 
[20]  Swanlund, D., Schuurman, N., Zandbergen, P., Brussoni, M., Street masking: a network-based geographic mask for easily protecting geoprivacy, International Journal of Health Geographics, 19 (26), 1-11, 2020.
 
[21]  Wang, J., Kim, J., Kwan, M.-P., An exploratory assessment of the effectiveness of geomasking methods on privacy protection and analytical accuracy for individual-level geospatial data, Cartography and Geographic Information Society, 49 (5), 385-406, 2022.
 
[22]  Kounadi, O., Towards geoprivacy guidelines for spatial data, ETH Zürich Research Collection, ETH Library, 2015.
 
[23]  Leitner, M., Curtis, A., A first step towards a framework for presenting the location of confidential point data on maps—results of an empirical perceptual study, International Journal of Geographical Information Science, 20 (7), 813-822. 2006.
 
[24]  Polzin, F., Kounadi, O., Adaptive Voronoi Masking: A Method to Protect Confidential Discrete Spatial Data, 11th International Conference on Geographic Information Science, 2021.
 
[25]  Cremer, S., Jehmlich, L. and Lenz, R., Masking georeferenced health data - an analysis taking the example of partially synthetic data on sleep disorder, Privacy in Statistical Databases, Domingo-Ferrer and M. Önen (Eds.), LNCS 14915, 297–309, 2024.
 
[26]  Dalenius, T., Reiss, S. P., Data Swapping: A Technique for Disclosure Control, Journal of Statistical Planning and Inference, 6, 73-85, 1982.
 
[27]  Gutmann, M. P., Witkowski, K., Colyer, C., McFarland O’Rourke, J., McNally, J., Providing Spatial Data for Secondary Analysis: Issues and Current Practices Relating to Confidentiality, Population Research and Policy Review 27 (6), 639-665.
 
[28]  Zhang, S., Freundschuh, S.M., Lenzer, K., Zandbergen, P.A., The Location Swapping Method for Geomasking, Cartography and Geographic Information Science,44 (1), 22-34, 2017.
 
[29]  Richter, W., The verified neighbor approach to geoprivacy: An improved method for geographic masking, Journal of Exposure Science and Environmental Epidemiology 28, 109–118, 2018.
 
[30]  Swanlund, D., Schuurman, N., Zandbergen, P., Brussoni, M., Street masking: a network-based geographic mask for easily protecting geoprivacy, International Journal of Health Geographics, 19 (26), 1-11, 2020.
 
[31]  Li, N., Li, T. and Venkatasubramanian, S. (Eds.), t-closeness: Privacy beyond k-anonymity and l-diversity, IEEE 23rd international conference on data engineering.
 
[32]  Lenz, R., On the way to remote access to German official microdata, Statistique et nouvelles technologies de l’information, Revue des Nouvelles Technologies de l’information, 125-138, 2011 (paper presented at the 41ème Journées de Statistique de la Societé Francaise de Statistique, Bordeaux 2009).
 
[33]  Research data centres of the Federal Statistical Office and the Statistical Offices of the Federal States, https://www.forschungs-datenzentrum.de/de, last accessed on 3 september 2025.
 
[34]  Wenzel, M., Corr, D., Riedel, N., Hapfelmeier, J., Zimmermann, L., State of efficient research on secondary health data in the German health data lab, Georg Thieme Verlag, Stuttgart, 2025.