Application of Federated Analytics in Health Data Research for Reducing Risks Involved in Data Sharing

24 Pages Posted: 14 Dec 2022

See all articles by Solmaz Eradat Oskoui

Solmaz Eradat Oskoui

Aridhia Informatics Ltd

Matthew Retford

Health Data Research UK

Rodrigo Barnes

Aridhia Informatics Ltd

Neil Postlethwaite

Health Data Research UK

Karen J. Hunter

Health Data Research UK

Simon Thompson

Swansea University - Swansea University Medical School

Chris Orton

Swansea University - Swansea University Medical School

D.V. Ford

Swansea University - Swansea University Medical School

Sharon Heys

Swansea University - Swansea University Medical School

Julie Kennedy

Swansea University - Swansea University Medical School

Cynthia McNerney

Swansea University - Administrative Data Research Wales

Jeffrey Peng

Swansea University - Swansea University Medical School

Hamed Ghanbariadolat

Swansea University - Swansea University Medical School

Sarah Rees

Swansea University - Swansea University Medical School

Rachel H. Mulholland

University of Edinburgh - Usher Institute

Aziz Sheikh

University of Edinburgh - Usher Institute

David Burgner

University of Melbourne - Murdoch Children's Research Institute

Meredith Lee Brockway

University of Manitoba

Meghan B. Azad

University of Manitoba - Department of Pediatrics & Child Health

Natalie Rodriguez

University of Manitoba

Helga Zoega

University of New South Wales (UNSW)

Sarah J. Stock

University of Edinburgh - Usher Institute

Clara Calvert

University of Edinburgh - Centre for Global Health Research

Jessica Miller

University of Melbourne - Murdoch Children's Research Institute

Nicole Fiorentino

University of Manitoba

Amy Racine

Cytel, Inc

Jonas Haggstrom

Cytel, Inc

Abstract

Background: The use of federated networks can reduce the risk of disclosure for sensitive datasets by removing the requirement to physically transfer data. Federated networks support federated analytics, a type of privacy-enhancing technology (PET) enabling trustworthy data access and analysis. 

Objectives: We aim to outline the methodology used by the International COVID-19 Data Alliance (ICODA) and its partners the Secure Anonymised Information Linkage (SAIL) Databank and Aridhia Informatics in implementing a federated network infrastructure and consequently testing federated analytics using test data provided for an ICODA exemplar project, the International Perinatal Outcome in the Pandemic (iPOP) Study. The ICODA Workbench - a trusted research environment (TRE) - was used to send federated requests to access this test data held within SAIL Databank.

Results: This project is the first example for successfully implementing a federated network for ICODA. The integration testing made use of aggregate-level data from the iPOP Study as the first step in putting in place the necessary technical and user experiences for future project studies using individual-level datasets from multiple data nodes. While the federated network was established, federated analytics was not used in the analysis of the iPOP Study due to challenges from a data standard, data governance, technology, skills and project duration perspective.

Conclusions: Creating federated networks requires an extensive amount of investment from a funding, data governance, technology, training, and people perspective. For future data scalability and providing researchers with a secure and robust data analysis platform to perform joint multi-site collaboration, establishing a federated network should be built into the medium to long term plans for study projects who are interested in using federated analytics. Federated networks have an enormous potential in bringing together national and international health care datasets and aiding the collaborative research effort within the healthcare sector to address key public health questions.

Note:
Funding Declaration: This work was supported by International COVID-19 Data Alliance (ICODA), an initiative funded by the COVID-19 Therapeutics Accelerator and convened by Health Data Research UK (HDR UK). We acknowledge funding from the Bill and Melinda Gates Foundation (INV-017293), Microsoft Artificial Intelligence (AI) for Health, and the Minderoo Foundation (INV-017293). Aridhia Informatics Ltd was funded by the Bill and Melinda Gates Foundation (INV-017293). SAIL Databank and the Secure eResearch Platform (SeRP) UK, based at Swansea University, were funded by an award from Health Data Research UK (2020.112), supported by funds from the ICODA initiative, in order to develop the underlying infrastructure and providing expertise in establishing the federated analytics platform and governance models. This study makes use of anonymised data held in the Secure Anonymised Information Linkage (SAIL) Databank. We would like to acknowledge the iPOP data providers who made their anonymised data available for research (details provided in [20] (manuscript submitted)). This project was approved by the SAIL Information Governance Review Panel, under project numbers 1292 and 1299. Helga Zoega was supported by a UNSW Scientia Program Award during the conduct of this study. Sarah J Stock was funded by a Wellcome Trust Clinical Career Development Fellowship (209560/Z/17/Z). Meghan B. Azad is supported by a Canada Research Chair in the Developmental Origins of Chronic Disease. All authors approved the version of the manuscript to be published.

Conflict of Interests: The authors declare no conflicts of interest that could have appeared to influence the work reported in this paper.

Keywords: Federated Networks, Federated Analytics, Covid-19, Health Data Research, Privacy-Preserving, Secondary Data, Data Re-use

Suggested Citation

Eradat Oskoui, Solmaz and Retford, Matthew and Barnes, Rodrigo and Postlethwaite, Neil and Hunter, Karen J. and Thompson, Simon and Orton, Chris and Ford, D.V. and Heys, Sharon and Kennedy, Julie and McNerney, Cynthia and Peng, Jeffrey and Ghanbariadolat, Hamed and Rees, Sarah and Mulholland, Rachel H. and Sheikh, Aziz and Burgner, David and Brockway, Meredith Lee and Azad, Meghan B. and Rodriguez, Natalie and Zoega, Helga and Stock, Sarah J. and Calvert, Clara and Miller, Jessica and Fiorentino, Nicole and Racine, Amy and Haggstrom, Jonas, Application of Federated Analytics in Health Data Research for Reducing Risks Involved in Data Sharing. Available at SSRN: https://ssrn.com/abstract=4302481 or http://dx.doi.org/10.2139/ssrn.4302481

Solmaz Eradat Oskoui (Contact Author)

Aridhia Informatics Ltd ( email )

Matthew Retford

Health Data Research UK ( email )

London
United Kingdom

Rodrigo Barnes

Aridhia Informatics Ltd ( email )

Neil Postlethwaite

Health Data Research UK ( email )

London
United Kingdom

Karen J. Hunter

Health Data Research UK ( email )

London
United Kingdom

Simon Thompson

Swansea University - Swansea University Medical School ( email )

Singleton Park
Swansea, SA2 8PP
United Kingdom

Chris Orton

Swansea University - Swansea University Medical School ( email )

Singleton Park
Swansea, SA2 8PP
United Kingdom

D.V. Ford

Swansea University - Swansea University Medical School ( email )

Singleton Park
Swansea, Wales SA2 8PP
United Kingdom

Sharon Heys

Swansea University - Swansea University Medical School ( email )

Julie Kennedy

Swansea University - Swansea University Medical School ( email )

Singleton Park
Swansea, SA2 8PP
United Kingdom

Cynthia McNerney

Swansea University - Administrative Data Research Wales ( email )

Jeffrey Peng

Swansea University - Swansea University Medical School ( email )

Singleton Park
Swansea, SA2 8PP
United Kingdom

Hamed Ghanbariadolat

Swansea University - Swansea University Medical School ( email )

Singleton Park
Swansea, SA2 8PP
United Kingdom

Sarah Rees

Swansea University - Swansea University Medical School ( email )

Singleton Park
Swansea, SA2 8PP
United Kingdom

Rachel H. Mulholland

University of Edinburgh - Usher Institute ( email )

Aziz Sheikh

University of Edinburgh - Usher Institute ( email )

David Burgner

University of Melbourne - Murdoch Children's Research Institute ( email )

Parkville, Victoria
Australia

Meredith Lee Brockway

University of Manitoba ( email )

Meghan B. Azad

University of Manitoba - Department of Pediatrics & Child Health ( email )

Winnipeg
Canada

Natalie Rodriguez

University of Manitoba ( email )

Helga Zoega

University of New South Wales (UNSW) ( email )

Sarah J. Stock

University of Edinburgh - Usher Institute ( email )

Clara Calvert

University of Edinburgh - Centre for Global Health Research ( email )

Jessica Miller

University of Melbourne - Murdoch Children's Research Institute ( email )

Nicole Fiorentino

University of Manitoba ( email )

Amy Racine

Cytel, Inc ( email )

Jonas Haggstrom

Cytel, Inc ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
81
Abstract Views
543
Rank
665,716
PlumX Metrics