Article citationsMore >>

Moskowitz D RC, Adler N, Karter AJ, Moffet HH, Schillinger D. : Effect of the social hierarchy on patient-physician communication: results from the DISTANCE study. In UCSF Disparities Symposium. San Francisco, CA.

has been cited by the following article:

Article

The DISTANCE Model for Collaborative Research: Distributing Analytic Effort Using Scrambled Data Sets

1Division of Research, Kaiser Permanente Northern California, 2000 Broadway, Oakland, CA 94612

2Center for Vulnerable Populations, University of California San Francisco, 1001 Potrero Ave., San Francisco, CA 94110


Information Security and Computer Fraud. 2014, Vol. 2 No. 3, 33-38
DOI: 10.12691/iscf-2-3-1
Copyright © 2014 Science and Education Publishing

Cite this paper:
Howard H. Moffet, E. Margaret Warton, Melissa M. Parker, Jennifer Y. Liu, Courtney R. Lyles, Andrew J. Karter. The DISTANCE Model for Collaborative Research: Distributing Analytic Effort Using Scrambled Data Sets. Information Security and Computer Fraud. 2014; 2(3):33-38. doi: 10.12691/iscf-2-3-1.

Correspondence to: Howard  H. Moffet, Division of Research, Kaiser Permanente Northern California, 2000 Broadway, Oakland, CA 94612. Email: Howard.H.Moffet@kp.org

Abstract

Background: Data-sharing is encouraged to fulfill the ethical responsibility to transform research data into public health knowledge, but data sharing carries risks of improper disclosure and potential harm from release of individually identifiable data. Methods: The study objective was to develop and implement a novel method for scientific collaboration and data sharing which distributes the analytic burden while protecting patient privacy. A procedure was developed where in an investigator who is external to an analytic coordinating center (ACC) can conduct original research following a protocol governed by a Publications and Presentations (P&P) Committee. The collaborating investigator submits a study proposal and, if approved, develops the analytic specifications using existing data dictionaries and templates. An original data set is prepared according to the specifications and the external investigator is provided with a complete but de-identified and shuffled data set which retains all key data fields but which obfuscates individually identifiable data and patterns; this “scrambled data set” provides a “sandbox” for the external investigator to develop and test analytic code for analyses. The analytic code is then run against the original data at the ACC to generate output which is used by the external investigator in preparing a manuscript for journal submission. Results: The method has been successfully used with collaborators to produce many published papers and conference reports. Conclusion: By distributing the analytic burden, this method can facilitate collaboration and expand analytic capacity, resulting in more science for less money.

Keywords