-
Privacy-Preserving Federated Learning over Vertically and Horizontally Partitioned Data for Financial Anomaly Detection
Authors:
Swanand Ravindra Kadhe,
Heiko Ludwig,
Nathalie Baracaldo,
Alan King,
Yi Zhou,
Keith Houck,
Ambrish Rawat,
Mark Purcell,
Naoise Holohan,
Mikio Takeuchi,
Ryo Kawahara,
Nir Drucker,
Hayim Shaul,
Eyal Kushnir,
Omri Soceanu
Abstract:
The effective detection of evidence of financial anomalies requires collaboration among multiple entities who own a diverse set of data, such as a payment network system (PNS) and its partner banks. Trust among these financial institutions is limited by regulation and competition. Federated learning (FL) enables entities to collaboratively train a model when data is either vertically or horizontal…
▽ More
The effective detection of evidence of financial anomalies requires collaboration among multiple entities who own a diverse set of data, such as a payment network system (PNS) and its partner banks. Trust among these financial institutions is limited by regulation and competition. Federated learning (FL) enables entities to collaboratively train a model when data is either vertically or horizontally partitioned across the entities. However, in real-world financial anomaly detection scenarios, the data is partitioned both vertically and horizontally and hence it is not possible to use existing FL approaches in a plug-and-play manner.
Our novel solution, PV4FAD, combines fully homomorphic encryption (HE), secure multi-party computation (SMPC), differential privacy (DP), and randomization techniques to balance privacy and accuracy during training and to prevent inference threats at model deployment time. Our solution provides input privacy through HE and SMPC, and output privacy against inference time attacks through DP. Specifically, we show that, in the honest-but-curious threat model, banks do not learn any sensitive features about PNS transactions, and the PNS does not learn any information about the banks' dataset but only learns prediction labels. We also develop and analyze a DP mechanism to protect output privacy during inference. Our solution generates high-utility models by significantly reducing the per-bank noise level while satisfying distributed DP. To ensure high accuracy, our approach produces an ensemble model, in particular, a random forest. This enables us to take advantage of the well-known properties of ensembles to reduce variance and increase accuracy. Our solution won second prize in the first phase of the U.S. Privacy Enhancing Technologies (PETs) Prize Challenge.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
A Tutorial Introduction to Lattice-based Cryptography and Homomorphic Encryption
Authors:
Yang Li,
Kee Siong Ng,
Michael Purcell
Abstract:
Why study Lattice-based Cryptography? There are a few ways to answer this question. 1. It is useful to have cryptosystems that are based on a variety of hard computational problems so the different cryptosystems are not all vulnerable in the same way. 2. The computational aspects of lattice-based cryptosystem are usually simple to understand and fairly easy to implement in practice. 3. Lattice-bas…
▽ More
Why study Lattice-based Cryptography? There are a few ways to answer this question. 1. It is useful to have cryptosystems that are based on a variety of hard computational problems so the different cryptosystems are not all vulnerable in the same way. 2. The computational aspects of lattice-based cryptosystem are usually simple to understand and fairly easy to implement in practice. 3. Lattice-based cryptosystems have lower encryption/decryption computational complexities compared to popular cryptosystems that are based on the integer factorisation or the discrete logarithm problems. 4. Lattice-based cryptosystems enjoy strong worst-case hardness security proofs based on approximate versions of known NP-hard lattice problems. 5. Lattice-based cryptosystems are believed to be good candidates for post-quantum cryptography, since there are currently no known quantum algorithms for solving lattice problems that perform significantly better than the best-known classical (non-quantum) algorithms, unlike for integer factorisation and (elliptic curve) discrete logarithm problems. 6. Last but not least, interesting structures in lattice problems have led to significant advances in Homomorphic Encryption, a new research area with wide-ranging applications.
△ Less
Submitted 28 September, 2022; v1 submitted 17 August, 2022;
originally announced August 2022.
-
Adaptive Aggregation For Federated Learning
Authors:
K. R. Jayaram,
Vinod Muthusamy,
Gegi Thomas,
Ashish Verma,
Mark Purcell
Abstract:
Advances in federated learning (FL) algorithms,along with technologies like differential privacy and homomorphic encryption, have led to FL being increasingly adopted and used in many application domains. This increasing adoption has led to rapid growth in the number, size (number of participants/parties) and diversity (intermittent vs. active parties) of FL jobs. Many existing FL systems, based o…
▽ More
Advances in federated learning (FL) algorithms,along with technologies like differential privacy and homomorphic encryption, have led to FL being increasingly adopted and used in many application domains. This increasing adoption has led to rapid growth in the number, size (number of participants/parties) and diversity (intermittent vs. active parties) of FL jobs. Many existing FL systems, based on centralized (often single) model aggregators are unable to scale to handle large FL jobs and adapt to parties' behavior.
In this paper, we present a new scalable and adaptive architecture for FL aggregation. First, we demonstrate how traditional tree overlay based aggregation techniques (from P2P, publish-subscribe and stream processing research) can help FL aggregation scale, but are ineffective from a resource utilization and cost standpoint. Next, we present the design and implementation of AdaFed, which uses serverless/cloud functions to adaptively scale aggregation in a resource efficient and fault tolerant manner. We describe how AdaFed enables FL aggregation to be dynamically deployed only when necessary, elastically scaled to handle participant joins/leaves and is fault tolerant with minimal effort required on the (aggregation) programmer side. We also demonstrate that our prototype based on Ray scales to thousands of participants, and is able to achieve a >90% reduction in resource requirements and cost, with minimal impact on aggregation latency.
△ Less
Submitted 6 November, 2022; v1 submitted 22 March, 2022;
originally announced March 2022.
-
Towards an Accountable and Reproducible Federated Learning: A FactSheets Approach
Authors:
Nathalie Baracaldo,
Ali Anwar,
Mark Purcell,
Ambrish Rawat,
Mathieu Sinn,
Bashar Altakrouri,
Dian Balta,
Mahdi Sellami,
Peter Kuhn,
Ulrich Schopp,
Matthias Buchinger
Abstract:
Federated Learning (FL) is a novel paradigm for the shared training of models based on decentralized and private data. With respect to ethical guidelines, FL is promising regarding privacy, but needs to excel vis-à-vis transparency and trustworthiness. In particular, FL has to address the accountability of the parties involved and their adherence to rules, law and principles. We introduce AF^2 Fra…
▽ More
Federated Learning (FL) is a novel paradigm for the shared training of models based on decentralized and private data. With respect to ethical guidelines, FL is promising regarding privacy, but needs to excel vis-à-vis transparency and trustworthiness. In particular, FL has to address the accountability of the parties involved and their adherence to rules, law and principles. We introduce AF^2 Framework, where we instrument FL with accountability by fusing verifiable claims with tamper-evident facts, into reproducible arguments. We build on AI FactSheets for instilling transparency and trustworthiness into the AI lifecycle and expand it to incorporate dynamic and nested facts, as well as complex model compositions in FL. Based on our approach, an auditor can validate, reproduce and certify a FL process. This can be directly applied in practice to address the challenges of AI engineering and ethics.
△ Less
Submitted 24 February, 2022;
originally announced February 2022.
-
Private Graph Data Release: A Survey
Authors:
Yang Li,
Michael Purcell,
Thierry Rakotoarivelo,
David Smith,
Thilina Ranbaduge,
Kee Siong Ng
Abstract:
The application of graph analytics to various domains has yielded tremendous societal and economical benefits in recent years. However, the increasingly widespread adoption of graph analytics comes with a commensurate increase in the need to protect private information in graph data, especially in light of the many privacy breaches in real-world graph data that was supposed to preserve sensitive i…
▽ More
The application of graph analytics to various domains has yielded tremendous societal and economical benefits in recent years. However, the increasingly widespread adoption of graph analytics comes with a commensurate increase in the need to protect private information in graph data, especially in light of the many privacy breaches in real-world graph data that was supposed to preserve sensitive information. This paper provides a comprehensive survey of private graph data release algorithms that seek to achieve the fine balance between privacy and utility, with a specific focus on provably private mechanisms. Many of these mechanisms are natural extensions of the Differential Privacy framework to graph data, but we also investigate more general privacy formulations like Pufferfish Privacy that address some of the limitations of Differential Privacy. We also provide a wide-ranging survey of the applications of private graph data release mechanisms to social networks, finance, supply chain, and health care. This survey paper and the taxonomy it provides should benefit practitioners and researchers alike in the increasingly important area of private analytics and data release.
△ Less
Submitted 4 June, 2022; v1 submitted 9 July, 2021;
originally announced July 2021.
-
Knowledge- and Data-driven Services for Energy Systems using Graph Neural Networks
Authors:
Francesco Fusco,
Bradley Eck,
Robert Gormally,
Mark Purcell,
Seshu Tirupathi
Abstract:
The transition away from carbon-based energy sources poses several challenges for the operation of electricity distribution systems. Increasing shares of distributed energy resources (e.g. renewable energy generators, electric vehicles) and internet-connected sensing and control devices (e.g. smart heating and cooling) require new tools to support accurate, datadriven decision making. Modelling th…
▽ More
The transition away from carbon-based energy sources poses several challenges for the operation of electricity distribution systems. Increasing shares of distributed energy resources (e.g. renewable energy generators, electric vehicles) and internet-connected sensing and control devices (e.g. smart heating and cooling) require new tools to support accurate, datadriven decision making. Modelling the effect of such growing complexity in the electrical grid is possible in principle using state-of-the-art power-power flow models. In practice, the detailed information needed for these physical simulations may be unknown or prohibitively expensive to obtain. Hence, datadriven approaches to power systems modelling, including feedforward neural networks and auto-encoders, have been studied to leverage the increasing availability of sensor data, but have seen limited practical adoption due to lack of transparency and inefficiencies on large-scale problems. Our work addresses this gap by proposing a data- and knowledge-driven probabilistic graphical model for energy systems based on the framework of graph neural networks (GNNs). The model can explicitly factor in domain knowledge, in the form of grid topology or physics constraints, thus resulting in sparser architectures and much smaller parameters dimensionality when compared with traditional machine-learning models with similar accuracy. Results obtained from a real-world smart-grid demonstration project show how the GNN was used to inform grid congestion predictions and market bidding services for a distribution system operator participating in an energy flexibility market.
△ Less
Submitted 12 March, 2021;
originally announced March 2021.
-
IBM Federated Learning: an Enterprise Framework White Paper V0.1
Authors:
Heiko Ludwig,
Nathalie Baracaldo,
Gegi Thomas,
Yi Zhou,
Ali Anwar,
Shashank Rajamoni,
Yuya Ong,
Jayaram Radhakrishnan,
Ashish Verma,
Mathieu Sinn,
Mark Purcell,
Ambrish Rawat,
Tran Minh,
Naoise Holohan,
Supriyo Chakraborty,
Shalisha Whitherspoon,
Dean Steuer,
Laura Wynter,
Hifaz Hassan,
Sean Laguna,
Mikhail Yurochkin,
Mayank Agarwal,
Ebube Chuba,
Annie Abay
Abstract:
Federated Learning (FL) is an approach to conduct machine learning without centralizing training data in a single place, for reasons of privacy, confidentiality or data volume. However, solving federated machine learning problems raises issues above and beyond those of centralized machine learning. These issues include setting up communication infrastructure between parties, coordinating the learn…
▽ More
Federated Learning (FL) is an approach to conduct machine learning without centralizing training data in a single place, for reasons of privacy, confidentiality or data volume. However, solving federated machine learning problems raises issues above and beyond those of centralized machine learning. These issues include setting up communication infrastructure between parties, coordinating the learning process, integrating party results, understanding the characteristics of the training data sets of different participating parties, handling data heterogeneity, and operating with the absence of a verification data set.
IBM Federated Learning provides infrastructure and coordination for federated learning. Data scientists can design and run federated learning jobs based on existing, centralized machine learning models and can provide high-level instructions on how to run the federation. The framework applies to both Deep Neural Networks as well as ``traditional'' approaches for the most common machine learning libraries. {\proj} enables data scientists to expand their scope from centralized to federated machine learning, minimizing the learning curve at the outset while also providing the flexibility to deploy to different compute environments and design custom fusion algorithms.
△ Less
Submitted 22 July, 2020;
originally announced July 2020.
-
Scalable Deployment of AI Time-series Models for IoT
Authors:
Bradley Eck,
Francesco Fusco,
Robert Gormally,
Mark Purcell,
Seshu Tirupathi
Abstract:
IBM Research Castor, a cloud-native system for managing and deploying large numbers of AI time-series models in IoT applications, is described. Modelling code templates, in Python and R, following a typical machine-learning workflow are supported. A knowledge-based approach to managing model and time-series data allows the use of general semantic concepts for expressing feature engineering tasks.…
▽ More
IBM Research Castor, a cloud-native system for managing and deploying large numbers of AI time-series models in IoT applications, is described. Modelling code templates, in Python and R, following a typical machine-learning workflow are supported. A knowledge-based approach to managing model and time-series data allows the use of general semantic concepts for expressing feature engineering tasks. Model templates can be programmatically deployed against specific instances of semantic concepts, thus supporting model reuse and automated replication as the IoT application grows. Deployed models are automatically executed in parallel leveraging a serverless cloud computing framework. The complete history of trained model versions and rolling-horizon predictions is persisted, thus enabling full model lineage and traceability. Results from deployments in real-world smart-grid live forecasting applications are reported. Scalability of executing up to tens of thousands of AI modelling tasks is also evaluated.
△ Less
Submitted 24 March, 2020;
originally announced March 2020.
-
AI Modelling and Time-series Forecasting Systems for Trading Energy Flexibility in Distribution Grids
Authors:
Bradley Eck,
Francesco Fusco,
Robert Gormally,
Mark Purcell,
Seshu Tirupathi
Abstract:
We demonstrate progress on the deployment of two sets of technologies to support distribution grid operators integrating high shares of renewable energy sources, based on a market for trading local energy flexibilities. An artificial-intelligence (AI) grid modelling tool, based on probabilistic graphs, predicts congestions and estimates the amount and location of energy flexibility required to avo…
▽ More
We demonstrate progress on the deployment of two sets of technologies to support distribution grid operators integrating high shares of renewable energy sources, based on a market for trading local energy flexibilities. An artificial-intelligence (AI) grid modelling tool, based on probabilistic graphs, predicts congestions and estimates the amount and location of energy flexibility required to avoid such events. A scalable time-series forecasting system delivers large numbers of short-term predictions of distributed energy demand and generation. We discuss the deployment of the technologies at three trial demonstration sites across Europe, in the context of a research project carried out in a consortium with energy utilities, technology providers and research institutions.
△ Less
Submitted 18 September, 2019;
originally announced September 2019.
-
Castor: Contextual IoT Time Series Data and Model Management at Scale
Authors:
Bei Chen,
Bradley Eck,
Francesco Fusco,
Robert Gormally,
Mark Purcell,
Mathieu Sinn,
Seshu Tirupathi
Abstract:
We demonstrate Castor, a cloud-based system for contextual IoT time series data and model management at scale. Castor is designed to assist Data Scientists in (a) exploring and retrieving all relevant time series and contextual information that is required for their predictive modelling tasks; (b) seamlessly storing and deploying their predictive models in a cloud production environment; (c) monit…
▽ More
We demonstrate Castor, a cloud-based system for contextual IoT time series data and model management at scale. Castor is designed to assist Data Scientists in (a) exploring and retrieving all relevant time series and contextual information that is required for their predictive modelling tasks; (b) seamlessly storing and deploying their predictive models in a cloud production environment; (c) monitoring the performance of all predictive models in production and (semi-)automatically retraining them in case of performance deterioration. The main features of Castor are: (1) an efficient pipeline for ingesting IoT time series data in real time; (2) a scalable, hybrid data management service for both time series and contextual data; (3) a versatile semantic model for contextual information which can be easily adopted to different application domains; (4) an abstract framework for develo** and storing predictive models in R or Python; (5) deployment services which automatically train and/or score predictive models upon user-defined conditions. We demonstrate Castor for a real-world Smart Grid use case and discuss how it can be adopted to other application domains such as Smart Buildings, Telecommunication, Retail or Manufacturing.
△ Less
Submitted 8 February, 2019; v1 submitted 20 November, 2018;
originally announced November 2018.
-
Holographic imaging through a scattering medium by diffuser-aided statistical averaging
Authors:
Michael J. Purcell,
Manish Kumar,
Stephen C. Rand,
Vasudevan Lakshminarayanan
Abstract:
We introduce a practical digital holographic method capable of imaging through a diffusive or scattering medium. The method relies on statistical averaging from a rotating ground glass diffuser to negate the adverse effects caused by speckle introduced by a first, static diffuser or scattering medium. In particular, a setup based on Fourier transform holography is used to show that an image can be…
▽ More
We introduce a practical digital holographic method capable of imaging through a diffusive or scattering medium. The method relies on statistical averaging from a rotating ground glass diffuser to negate the adverse effects caused by speckle introduced by a first, static diffuser or scattering medium. In particular, a setup based on Fourier transform holography is used to show that an image can be recovered after scattering by introducing an additional diffuser in the optical setup. This method is capable of recovering object information from behind a scattering layer in biomedical or military imaging applications.
△ Less
Submitted 16 May, 2016;
originally announced May 2016.
-
Second-generation PLINK: rising to the challenge of larger and richer datasets
Authors:
Christopher C. Chang,
Carson C. Chow,
Laurent C. A. M. Tellier,
Shashaank Vattikuti,
Shaun M. Purcell,
James J. Lee
Abstract:
PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic…
▽ More
PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic calls, phase information, and/or multiallelic variants, none of which can be represented by PLINK 1's primary data format.
To address these issues, we are develo** a second-generation codebase for PLINK. The first major release from this codebase, PLINK 1.9, introduces extensive use of bit-level parallelism, O(sqrt(n))-time/constant-space Hardy-Weinberg equilibrium and Fisher's exact tests, and many other algorithmic improvements. In combination, these changes accelerate most operations by 1-4 orders of magnitude, and allow the program to handle datasets too large to fit in RAM. This will be followed by PLINK 2.0, which will introduce (a) a new data format capable of efficiently representing probabilities, phase, and multiallelic variants, and (b) extensions of many functions to account for the new types of information.
The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility. For the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.
△ Less
Submitted 17 October, 2014;
originally announced October 2014.
-
Nonmetallic Gasket and Miniature Plastic Turnbuckle Diamond Anvil Cell for Pulsed Magnetic Field Studies at Cryogenic Temperatures
Authors:
David E. Graf,
Ryan L. Stillwell,
Kenneth M. Purcell,
Stanley W. Tozer
Abstract:
A plastic turnbuckle diamond anvil cell (DAC) and nonmetallic gasket have been developed for pulsed magnetic field studies to address issues of eddy current heating and Lorentz forces in metal cells. The plastic cell evolved from our 6.3 mm metal diameter turnbuckle DAC that was designed in 1993 to rotate in the 9 mm sample space of Quantum Design's MPMS. Attempts to use this metal DAC in pulsed m…
▽ More
A plastic turnbuckle diamond anvil cell (DAC) and nonmetallic gasket have been developed for pulsed magnetic field studies to address issues of eddy current heating and Lorentz forces in metal cells. The plastic cell evolved from our 6.3 mm metal diameter turnbuckle DAC that was designed in 1993 to rotate in the 9 mm sample space of Quantum Design's MPMS. Attempts to use this metal DAC in pulsed magnetic fields caused the sample temperature to rise to T>70 K, necessitating the construction of a nonconductive cell and gasket. Pressures of 3 GPa have been produced in the plastic cell with 0.8 mm culets in an optical study conducted at T = 4 K. Variations of the cell are now being used for fermiology studies of metallic systems in pulsed magnetic fields that have required the development of a rotator and a special He-3 cryostat which are also discussed.
△ Less
Submitted 20 October, 2011;
originally announced October 2011.
-
Pressure Evolution of a Field Induced Fermi Surface Reconstruction and of the Neel Critical Field in CeIn3
Authors:
K. M. Purcell,
D. Graf,
M. Kano,
J. Bourg,
E. C. Palm,
T. Murphy,
R. McDonald,
C. H Mielke,
M. M. Altarawneh,
C. Petrovic,
Rongwei Hu,
T. Ebihara,
J. Cooley,
P. Schlottmann,
S. W. Tozer
Abstract:
We report high-pressure skin depth measurements on the heavy fermion material CeIn3 in magnetic fields up to 64 T using a self-resonant tank circuit based on a tunnel diode oscillator. At ambient pressure, an anomaly in the skin depth is seen at 45 T. The field where this anomaly occurs decreases with applied pressure until approximately 1.0 GPa, where it begins to increase before merging with t…
▽ More
We report high-pressure skin depth measurements on the heavy fermion material CeIn3 in magnetic fields up to 64 T using a self-resonant tank circuit based on a tunnel diode oscillator. At ambient pressure, an anomaly in the skin depth is seen at 45 T. The field where this anomaly occurs decreases with applied pressure until approximately 1.0 GPa, where it begins to increase before merging with the antiferromagnetic phase boundary. Possible origins for this transport anomaly are explored in terms of a Fermi surface reconstruction. The critical magnetic field at which the Neel ordered phase is suppressed is also mapped as a function of pressure and extrapolates to the previous ambient pressure measurements at high magnetic fields and high pressure measurements at zero magnetic field.
△ Less
Submitted 30 March, 2009;
originally announced March 2009.
-
Magnetic field induced lattice anomaly inside the superconducting state of CeCoIn$_5$: evidence of the proposed Fulde-Ferrell-Larkin-Ovchinnikov state
Authors:
V. F. Correa,
T. P. Murphy,
C. Martin,
K. M. Purcell,
E. C. Palm,
G. M. Schmiedeshoff,
J. C. Cooley,
S. W. Tozer
Abstract:
We report high magnetic field linear magnetostriction experiments on CeCoIn$_5$ single crystals. Two features are remarkable: (i) a sharp discontinuity in all the crystallographic axes associated with the upper superconducting critical field $B_{c2}$ that becomes less pronounced as the temperature increases; (ii) a distinctive second order-like feature observed only along the c-axis in the high…
▽ More
We report high magnetic field linear magnetostriction experiments on CeCoIn$_5$ single crystals. Two features are remarkable: (i) a sharp discontinuity in all the crystallographic axes associated with the upper superconducting critical field $B_{c2}$ that becomes less pronounced as the temperature increases; (ii) a distinctive second order-like feature observed only along the c-axis in the high field (10 T $ \lesssim B \leq B_{c2}$) low temperature ($T \lesssim$ 0.35 K) region. This second order transition is observed only when the magnetic field lies within 20$^o$ of the ab-planes and there is no signature of it above $B_{c2}$, which raises questions regarding its interpretation as a field induced magnetically ordered phase. Good agreement with previous results suggests that this anomaly is related to the transition to the Fulde-Ferrel-Larkin-Ovchinnikov superconducting state.
△ Less
Submitted 19 September, 2006;
originally announced September 2006.