help button home button JAMIA Bigger figures
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

First published December 15, 2005 as JAMIA PrePrint; doi:10.1197/jamia.M1920
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M1920v1
13/2/160    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Cassa, C. A.
Right arrow Articles by Mandl, K. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cassa, C. A.
Right arrow Articles by Mandl, K. D.
J Am Med Inform Assoc. 2006;13:160-165. DOI 10.1197/jamia.M1920.
© 2006 American Medical Informatics Association


Application of Information Technology

A Context-sensitive Approach to Anonymizing Spatial Surveillance Data: Impact on Outbreak Detection

Christopher A. Cassa, MEng, Shaun J. Grannis, MD, MS, J. Marc Overhage, MD, PhD and Kenneth D. Mandl, MD, MPH

Affiliations of the authors: Children's Hospital Informatics Program, Children's Hospital Boston, Boston, MA (CAC, KDM); Clinical Decision Making Group, Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA (CAC); Harvard-MIT Division of Health Sciences and Technology, Cambridge, MA (CAC, KDM); Indiana University School of Medicine, Indianapolis, IN (SJG, JMO); The Regenstrief Institute, Inc., Indianapolis, IN (SJG, JMO); Harvard Medical School, Boston, MA (KDM).

Correspondence and reprints: Christopher A. Cassa, Children's Hospital Boston, Informatics Program–Mandl Group, 1 Autumn Street, #721, Boston, MA 02215-5362; e-mail: <cassa{at}mit.edu>.

Received for publication: 07/27/05; accepted for publication: 11/28/05.

Objective: The use of spatially based methods and algorithms in epidemiology and surveillance presents privacy challenges for researchers and public health agencies. We describe a novel method for anonymizing individuals in public health data sets by transposing their spatial locations through a process informed by the underlying population density. Further, we measure the impact of the skew on detection of spatial clustering as measured by a spatial scanning statistic.

Design: Cases were emergency department (ED) visits for respiratory illness. Baseline ED visit data were injected with artificially created clusters ranging in magnitude, shape, and location. The geocoded locations were then transformed using a de-identification algorithm that accounts for the local underlying population density.

Measurements: A total of 12,600 separate weeks of case data with artificially created clusters were combined with control data and the impact on detection of spatial clustering identified by a spatial scan statistic was measured.

Results: The anonymization algorithm produced an expected skew of cases that resulted in high values of data set k-anonymity. De-identification that moves points an average distance of 0.25 km lowers the spatial cluster detection sensitivity by less than 4% and lowers the detection specificity less than 1%.

Conclusion: A population-density–based Gaussian spatial blurring markedly decreases the ability to identify individuals in a data set while only slightly decreasing the performance of a standardly used outbreak detection tool. These findings suggest new approaches to anonymizing data for spatial epidemiology and surveillance.




This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
S. C. Wieland, C. A. Cassa, K. D. Mandl, and B. Berger
Revealing the spatial distribution of a disease while preserving privacy
PNAS, November 18, 2008; 105(46): 17608 - 17613.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
S. Sengupta, N. S. Calman, and G. Hripcsak
A Model for Expanded Public Health Reporting in the Context of HIPAA
J. Am. Med. Inform. Assoc., September 1, 2008; 15(5): 569 - 574.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
B. Y. Reis, C. Kirby, L. E. Hadden, K. Olson, A. J. McMurry, J. B. Daniel, and K. D. Mandl
AEGIS: A Robust and Scalable Real-time Public Health Surveillance System
J. Am. Med. Inform. Assoc., September 1, 2007; 14(5): 581 - 588.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
A. J. McMurry, C. A. Gilbert, B. Y. Reis, H. C. Chueh, I. S. Kohane, and K. D. Mandl
A Self-scaling, Distributed Information Architecture for Public Health, Research, and Clinical Care
J. Am. Med. Inform. Assoc., July 1, 2007; 14(4): 527 - 533.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. C. Wieland, J. S. Brownstein, B. Berger, and K. D. Mandl
Density-equalizing Euclidean minimum spanning trees for the detection of all disease cluster shapes
PNAS, May 29, 2007; 104(22): 9404 - 9409.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Public HealthHome page
K. L. Olson, S. J. Grannis, and K. D. Mandl
Privacy Protection Versus Cluster Detection in Spatial Epidemiology
Am J Public Health, November 1, 2006; 96(11): 2002 - 2008.
[Abstract] [Full Text] [PDF]


Home page
NEJMHome page
J. S. Brownstein, C. A. Cassa, and K. D. Mandl
No Place to Hide -- Reverse Identification of Patients from Published Maps
N. Engl. J. Med., October 19, 2006; 355(16): 1741 - 1742.
[Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2006 by the American Medical Informatics Association.