Privacy‐preserving quality control of neuroimaging datasets in federated environments. Issue 7 (4th March 2022)
- Record Type:
- Journal Article
- Title:
- Privacy‐preserving quality control of neuroimaging datasets in federated environments. Issue 7 (4th March 2022)
- Main Title:
- Privacy‐preserving quality control of neuroimaging datasets in federated environments
- Authors:
- Saha, Debbrata K.
Calhoun, Vince D.
Du, Yuhui
Fu, Zening
Kwon, Soo Min
Sarwate, Anand D.
Panta, Sandeep R.
Plis, Sergey M. - Abstract:
- Abstract: Privacy concerns for rare disease data, institutional or IRB policies, access to local computational or storage resources or download capabilities are among the reasons that may preclude analyses that pool data to a single site. A growing number of multisite projects and consortia were formed to function in the federated environment to conduct productive research under constraints of this kind. In this scenario, a quality control tool that visualizes decentralized data in its entirety via global aggregation of local computations is especially important, as it would allow the screening of samples that cannot be jointly evaluated otherwise. To solve this issue, we present two algorithms: decentralized data stochastic neighbor embedding, dSNE, and its differentially private counterpart, DP‐dSNE. We leverage publicly available datasets to simultaneously map data samples located at different sites according to their similarities. Even though the data never leaves the individual sites, dSNE does not provide any formal privacy guarantees. To overcome that, we rely on differential privacy: a formal mathematical guarantee that protects individuals from being identified as contributors to a dataset. We implement DP‐dSNE with AdaCliP, a method recently proposed to add less noise to the gradients per iteration. We introduce metrics for measuring the embedding quality and validate our algorithms on these metrics against their centralized counterpart on two toy datasets. OurAbstract: Privacy concerns for rare disease data, institutional or IRB policies, access to local computational or storage resources or download capabilities are among the reasons that may preclude analyses that pool data to a single site. A growing number of multisite projects and consortia were formed to function in the federated environment to conduct productive research under constraints of this kind. In this scenario, a quality control tool that visualizes decentralized data in its entirety via global aggregation of local computations is especially important, as it would allow the screening of samples that cannot be jointly evaluated otherwise. To solve this issue, we present two algorithms: decentralized data stochastic neighbor embedding, dSNE, and its differentially private counterpart, DP‐dSNE. We leverage publicly available datasets to simultaneously map data samples located at different sites according to their similarities. Even though the data never leaves the individual sites, dSNE does not provide any formal privacy guarantees. To overcome that, we rely on differential privacy: a formal mathematical guarantee that protects individuals from being identified as contributors to a dataset. We implement DP‐dSNE with AdaCliP, a method recently proposed to add less noise to the gradients per iteration. We introduce metrics for measuring the embedding quality and validate our algorithms on these metrics against their centralized counterpart on two toy datasets. Our validation on six multisite neuroimaging datasets shows promising results for the quality control tasks of visualization and outlier detection, highlighting the potential of our private, decentralized visualization approach. Abstract : Privacy concerns for rare disease data, institutional or IRB policies, access to local computational or storage resources or download capabilities are among the reasons that may preclude analyses that pool data to a single site. A growing number of multi‐site projects and consortia were formed to function in the federated environment to conduct productive research under constraints of this kind. In this scenario, a quality control tool that visualizes decentralized data in its entirety via global aggregation of local computations is especially important, as it would allow the screening of samples that cannot be jointly evaluated otherwise. To solve this issue, we present two algorithms: decentralized data stochastic neighbor embedding, dSNE, and its differentially private counterpart, DP‐dSNE. Our validation on six multisite neuroimaging datasets shows promising results for the quality control tasks of visualization and outlier detection, highlighting the potential of our private, decentralized visualization approach. … (more)
- Is Part Of:
- Human brain mapping. Volume 43:Issue 7(2022)
- Journal:
- Human brain mapping
- Issue:
- Volume 43:Issue 7(2022)
- Issue Display:
- Volume 43, Issue 7 (2022)
- Year:
- 2022
- Volume:
- 43
- Issue:
- 7
- Issue Sort Value:
- 2022-0043-0007-0000
- Page Start:
- 2289
- Page End:
- 2310
- Publication Date:
- 2022-03-04
- Subjects:
- federated neuroimaging -- fMRI -- quality control -- sMRI
Brain mapping -- Periodicals
611.81 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1097-0193 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/hbm.25788 ↗
- Languages:
- English
- ISSNs:
- 1065-9471
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4336.031000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 21254.xml