-
Deep Learning of Multivariate Extremes via a Geometric Representation
Authors:
Callum J. R. Murphy-Barltrop,
Reetam Majumder,
Jordan Richards
Abstract:
The study of geometric extremes, where extremal dependence properties are inferred from the deterministic limiting shapes of scaled sample clouds, provides an exciting approach to modelling the extremes of multivariate data. These shapes, termed limit sets, link together several popular extremal dependence modelling frameworks. Although the geometric approach is becoming an increasingly popular mo…
▽ More
The study of geometric extremes, where extremal dependence properties are inferred from the deterministic limiting shapes of scaled sample clouds, provides an exciting approach to modelling the extremes of multivariate data. These shapes, termed limit sets, link together several popular extremal dependence modelling frameworks. Although the geometric approach is becoming an increasingly popular modelling tool, current inference techniques are limited to a low dimensional setting (d < 4), and generally require rigid modelling assumptions. In this work, we propose a range of novel theoretical results to aid with the implementation of the geometric extremes framework and introduce the first approach to modelling limit sets using deep learning. By leveraging neural networks, we construct asymptotically-justified yet flexible semi-parametric models for extremal dependence of high-dimensional data. We showcase the efficacy of our deep approach by modelling the complex extremal dependencies between meteorological and oceanographic variables in the North Sea off the coast of the UK.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Optimal Stock Portfolio Selection with a Multivariate Hidden Markov Model
Authors:
Reetam Majumder,
Qing Ji,
Nagaraj K. Neerchal
Abstract:
The underlying market trends that drive stock price fluctuations are often referred to in terms of bull and bear markets. Optimal stock portfolio selection methods need to take into account these market trends; however, the bull and bear market states tend to be unobserved and can only be assigned retrospectively. We fit a linked hidden Markov model (LHMM) to relative stock price changes for S&P 5…
▽ More
The underlying market trends that drive stock price fluctuations are often referred to in terms of bull and bear markets. Optimal stock portfolio selection methods need to take into account these market trends; however, the bull and bear market states tend to be unobserved and can only be assigned retrospectively. We fit a linked hidden Markov model (LHMM) to relative stock price changes for S&P 500 stocks from 2011--2016 based on weekly closing values. The LHMM consists of a multivariate state process whose individual components correspond to HMMs for each of the 12 sectors of the S\&P 500 stocks. The state processes are linked using a Gaussian copula so that the states of the component chains are correlated at any given time point. The LHMM allows us to capture more heterogeneity in the underlying market dynamics for each sector. In this study, stock performances are evaluated in terms of capital gains using the LHMM by utilizing historical stock price data. Based on the fitted LHMM, optimal stock portfolios are constructed to maximize capital gain while balancing reward and risk. Under out-of-sample testing, the annual capital gain for the portfolios for 2016--2017 are calculated. Portfolios constructed using the LHMM are able to generate returns comparable to the S&P 500 index.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Authors:
Qi Chen,
Xiubo Geng,
Corby Rosset,
Carolyn Buractaon,
**gwen Lu,
Tao Shen,
Kun Zhou,
Chenyan Xiong,
Yeyun Gong,
Paul Bennett,
Nick Craswell,
Xing Xie,
Fan Yang,
Bryan Tower,
Nikhil Rao,
Anlei Dong,
Wenqi Jiang,
Zheng Liu,
Mingqin Li,
Chuanjie Liu,
Zengzhong Li,
Rangan Majumder,
Jennifer Neville,
Andy Oakley,
Knut Magne Risvik
, et al. (6 additional authors not shown)
Abstract:
Recent breakthroughs in large models have highlighted the critical significance of data scale, labels and modals. In this paper, we introduce MS MARCO Web Search, the first large-scale information-rich web dataset, featuring millions of real clicked query-document labels. This dataset closely mimics real-world web document and query distribution, provides rich information for various kinds of down…
▽ More
Recent breakthroughs in large models have highlighted the critical significance of data scale, labels and modals. In this paper, we introduce MS MARCO Web Search, the first large-scale information-rich web dataset, featuring millions of real clicked query-document labels. This dataset closely mimics real-world web document and query distribution, provides rich information for various kinds of downstream tasks and encourages research in various areas, such as generic end-to-end neural indexer models, generic embedding models, and next generation information access system with large language models. MS MARCO Web Search offers a retrieval benchmark with three web retrieval challenge tasks that demand innovations in both machine learning and information retrieval system research domains. As the first dataset that meets large, real and rich data requirements, MS MARCO Web Search paves the way for future advancements in AI and system research. MS MARCO Web Search dataset is available at: https://github.com/microsoft/MS-MARCO-Web-Search.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Stochastic Gradient MCMC for Massive Geostatistical Data
Authors:
Mohamed A. Abba,
Brian J. Reich,
Reetam Majumder,
Brandon Feng
Abstract:
Gaussian processes (GPs) are commonly used for prediction and inference for spatial data analyses. However, since estimation and prediction tasks have cubic time and quadratic memory complexity in number of locations, GPs are difficult to scale to large spatial datasets. The Vecchia approximation induces sparsity in the dependence structure and is one of several methods proposed to scale GP infere…
▽ More
Gaussian processes (GPs) are commonly used for prediction and inference for spatial data analyses. However, since estimation and prediction tasks have cubic time and quadratic memory complexity in number of locations, GPs are difficult to scale to large spatial datasets. The Vecchia approximation induces sparsity in the dependence structure and is one of several methods proposed to scale GP inference. Our work adds to the substantial research in this area by develo** a stochastic gradient Markov chain Monte Carlo (SGMCMC) framework for efficient computation in GPs. At each step, the algorithm subsamples a minibatch of locations and subsequently updates process parameters through a Vecchia-approximated GP likelihood. Since the Vecchia-approximated GP has a time complexity that is linear in the number of locations, this results in scalable estimation in GPs. Through simulation studies, we demonstrate that SGMCMC is competitive with state-of-the-art scalable GP algorithms in terms of computational time and parameter estimation. An application of our method is also provided using the Argo dataset of ocean temperature measurements.
△ Less
Submitted 3 June, 2024; v1 submitted 7 May, 2024;
originally announced May 2024.
-
Consistent Second-Order Treatment of Spin-Orbit Coupling and Dynamic Correlation in Quasidegenerate N-Electron Valence Perturbation Theory
Authors:
Rajat Majumder,
Alexander Yu. Sokolov
Abstract:
We present a formulation and implementation of second-order quasidegenerate N-electron valence perturbation theory (QDNEVPT2) that provides a balanced and accurate description of spin-orbit coupling and dynamic correlation effects in multiconfigurational electronic states. In our approach, the energies and wavefunctions of electronic states are computed by treating electron repulsion and spin-orbi…
▽ More
We present a formulation and implementation of second-order quasidegenerate N-electron valence perturbation theory (QDNEVPT2) that provides a balanced and accurate description of spin-orbit coupling and dynamic correlation effects in multiconfigurational electronic states. In our approach, the energies and wavefunctions of electronic states are computed by treating electron repulsion and spin-orbit coupling operators as equal perturbations to the non-relativistic complete active-space wavefunctions and their contributions are incorporated fully up to the second order. The spin-orbit effects are described using the Breit-Pauli (BP) or exact two-component Douglas-Kroll-Hess (DKH) Hamiltonians within spin-orbit mean-field approximation. The resulting second-order methods (BP2- and DKH2-QDNEVPT2) are capable of treating spin-orbit coupling effects in nearly degenerate electronic states by diagonalizing an effective Hamiltonian expanded in a compact non-relativistic basis. For a variety of atoms and small molecules across the entire periodic table, we demonstrate that DKH2-QDNEVPT2 is competitive in accuracy with variational two-component relativistic theories. BP2-QDNEVPT2 shows high accuracy for the second- and third-period elements, but its performance deteriorates for heavier atoms and molecules. We also consider the first-order spin-orbit QDNEVPT2 approximations (BP1- and DKH1-QDNEVPT2), among which DKH1-QDNEVPT2 is reliable but less accurate than DKH2-QDNEVPT2. Both DKH1- and DKH2-QDNEVPT2 hold promise as efficient and accurate electronic structure methods for treating electron correlation and spin-orbit coupling in a variety of applications.
△ Less
Submitted 13 May, 2024; v1 submitted 6 April, 2024;
originally announced April 2024.
-
Simulating Transient X-ray Photoelectron Spectra of Fe(CO)5 and Its Photodissociation Products With Multireference Algebraic Diagrammatic Construction Theory
Authors:
Nicholas P. Gaba,
Carlos E. V. de Moura,
Rajat Majumder,
Alexander Yu. Sokolov
Abstract:
Accurate simulations of transient X-ray photoelectron spectra (XPS) provide unique opportunities to bridge the gap between theory and experiment in understanding the photoactivated dynamics in molecules and materials. However, simulating X-ray photoelectron spectra along a photochemical reaction pathway is challenging as it requires accurate description of electronic structure incorporating core-h…
▽ More
Accurate simulations of transient X-ray photoelectron spectra (XPS) provide unique opportunities to bridge the gap between theory and experiment in understanding the photoactivated dynamics in molecules and materials. However, simulating X-ray photoelectron spectra along a photochemical reaction pathway is challenging as it requires accurate description of electronic structure incorporating core-hole screening, orbital relaxation, electron correlation, and spin-orbit coupling in excited states or at nonequilibrium ground-state geometries. In this work, we employ the recently developed multireference algebraic diagrammatic construction theory (MR-ADC) to investigate the core-ionized states and X-ray photoelectron spectra of Fe(CO)5 and its photodissociation products (Fe(CO)4, Fe(CO)3) following excitation with 266 nm light. The simulated transient Fe 3p and CO 3σ XPS spectra incorporating spin-orbit coupling and high-order electron correlation effects are shown to be in a good agreement with the experimental measurements by Leitner et al. [J. Chem. Phys. 149, 044307 (2018)]. Our calculations suggest that core-hole screening, spin-orbit coupling, and ligand-field splitting effects are similarly important in reproducing the experimentally observed chemical shifts in transient Fe 3p XPS spectra of iron carbonyl complexes. Our results also demonstrate that the MR-ADC methods can be very useful in interpreting the transient XPS spectra of transition metal compounds.
△ Less
Submitted 27 April, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
Kuramoto model subject to subsystem resetting: How resetting a part of the system may synchronize the whole of it
Authors:
Rupak Majumder,
Rohitashwa Chattopadhyay,
Shamik Gupta
Abstract:
We introduce and investigate the effects of a new class of stochastic resetting protocol called subsystem resetting, whereby a subset of the system constituents in a many-body interacting system undergoes bare evolution interspersed with simultaneous resets at random times, while the remaining constituents evolve solely under the bare dynamics. We pursue our investigation within the ambit of the w…
▽ More
We introduce and investigate the effects of a new class of stochastic resetting protocol called subsystem resetting, whereby a subset of the system constituents in a many-body interacting system undergoes bare evolution interspersed with simultaneous resets at random times, while the remaining constituents evolve solely under the bare dynamics. We pursue our investigation within the ambit of the well-known Kuramoto model of coupled phase-only oscillators of distributed natural frequencies. Here, the reset protocol corresponds to a chosen set of oscillators being reset to a synchronized state at random times. We find that the mean $ω_0$ of the natural frequencies plays a defining role in determining the long-time state of the system. For $ω_0=0$, the system reaches a synchronized stationary state at long times, characterized by a time-independent non-zero value of the synchronization order parameter. Moreover, we find that resetting even an infinitesimal fraction of the total number of oscillators has the drastic effect of synchronizing the entire system, even when the bare evolution does not support synchrony. By contrast, for $ω_0 \ne 0$, the dynamics allows at long times either a synchronized stationary state or an oscillatory synchronized state, with the latter characterized by an oscillatory behavior as a function of time of the order parameter, with a non-zero time-independent time average. Our results thus imply that the non-reset subsystem always gets synchronized at long times through the act of resetting of the reset subsystem. Our results, analytical using the Ott-Antonsen ansatz as well as those based on numerical simulations, are obtained for two representative oscillator frequency distributions, namely, a Lorentzian and a Gaussian. We discuss how subsystem resetting may be employed as an efficient mechanism to control attainment of global synchrony.
△ Less
Submitted 18 June, 2024; v1 submitted 22 February, 2024;
originally announced February 2024.
-
Multilingual E5 Text Embeddings: A Technical Report
Authors:
Liang Wang,
Nan Yang,
Xiaolong Huang,
Linjun Yang,
Rangan Majumder,
Furu Wei
Abstract:
This technical report presents the training methodology and evaluation results of the open-source multilingual E5 text embedding models, released in mid-2023. Three embedding models of different sizes (small / base / large) are provided, offering a balance between the inference efficiency and embedding quality. The training procedure adheres to the English E5 model recipe, involving contrastive pr…
▽ More
This technical report presents the training methodology and evaluation results of the open-source multilingual E5 text embedding models, released in mid-2023. Three embedding models of different sizes (small / base / large) are provided, offering a balance between the inference efficiency and embedding quality. The training procedure adheres to the English E5 model recipe, involving contrastive pre-training on 1 billion multilingual text pairs, followed by fine-tuning on a combination of labeled datasets. Additionally, we introduce a new instruction-tuned embedding model, whose performance is on par with state-of-the-art, English-only models of similar sizes. Information regarding the model release can be found at https://github.com/microsoft/unilm/tree/master/e5 .
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Thermodynamics of vison crystals in an anisotropic quantum spin liquid
Authors:
Ritwika Majumder,
Onur Erten,
Anamitra Mukherjee
Abstract:
Using unbiased Monte Carlo simulations and variational analysis, we present the ground state and finite temperature phase diagrams of an exactly solvable spin-orbital model with Kitaev-type interactions on a square lattice. We show that an array of new gapped and gapless vison crystals -- characterized by the periodic arrangement of $\mathbb{Z}_2$ flux excitations -- can be stabilized as a functio…
▽ More
Using unbiased Monte Carlo simulations and variational analysis, we present the ground state and finite temperature phase diagrams of an exactly solvable spin-orbital model with Kitaev-type interactions on a square lattice. We show that an array of new gapped and gapless vison crystals -- characterized by the periodic arrangement of $\mathbb{Z}_2$ flux excitations -- can be stabilized as a function of external magnetic field and exchange anisotropy. In particular, we discover a variety of `quarter phases' wherein new sixteen-site periodic patterns emerge, with only a quarter of the fluxes adopting 0-flux configurations. In contrast, the rest remain in $π$-flux configurations. Vison crystals break translational symmetry and undergo finite temperature phase transitions. We investigate the finite temperature properties of these phases and report the corresponding critical and crossover temperatures. Our results reveal an array of novel phases in exactly solvable extensions of the Kitaev model, wherein local and topological orders can coexist.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
Improving Text Embeddings with Large Language Models
Authors:
Liang Wang,
Nan Yang,
Xiaolong Huang,
Linjun Yang,
Rangan Majumder,
Furu Wei
Abstract:
In this paper, we introduce a novel and simple method for obtaining high-quality text embeddings using only synthetic data and less than 1k training steps. Unlike existing methods that often depend on multi-stage intermediate pre-training with billions of weakly-supervised text pairs, followed by fine-tuning with a few labeled datasets, our method does not require building complex training pipelin…
▽ More
In this paper, we introduce a novel and simple method for obtaining high-quality text embeddings using only synthetic data and less than 1k training steps. Unlike existing methods that often depend on multi-stage intermediate pre-training with billions of weakly-supervised text pairs, followed by fine-tuning with a few labeled datasets, our method does not require building complex training pipelines or relying on manually collected datasets that are often constrained by task diversity and language coverage. We leverage proprietary LLMs to generate diverse synthetic data for hundreds of thousands of text embedding tasks across 93 languages. We then fine-tune open-source decoder-only LLMs on the synthetic data using standard contrastive loss. Experiments demonstrate that our method achieves strong performance on highly competitive text embedding benchmarks without using any labeled data. Furthermore, when fine-tuned with a mixture of synthetic and labeled data, our model sets new state-of-the-art results on the BEIR and MTEB benchmarks.
△ Less
Submitted 31 May, 2024; v1 submitted 30 December, 2023;
originally announced January 2024.
-
Development and Evaluation of Ensemble Learning-based Environmental Methane Detection and Intensity Prediction Models
Authors:
Reek Majumder,
Jacquan Pollard,
M Sabbir Salek,
David Werth,
Gurcan Comert,
Adrian Gale,
Sakib Mahmud Khan,
Samuel Darko,
Mashrur Chowdhury
Abstract:
The environmental impacts of global warming driven by methane (CH4) emissions have catalyzed significant research initiatives in develo** novel technologies that enable proactive and rapid detection of CH4. Several data-driven machine learning (ML) models were tested to determine how well they identified fugitive CH4 and its related intensity in the affected areas. Various meteorological charact…
▽ More
The environmental impacts of global warming driven by methane (CH4) emissions have catalyzed significant research initiatives in develo** novel technologies that enable proactive and rapid detection of CH4. Several data-driven machine learning (ML) models were tested to determine how well they identified fugitive CH4 and its related intensity in the affected areas. Various meteorological characteristics, including wind speed, temperature, pressure, relative humidity, water vapor, and heat flux, were included in the simulation. We used the ensemble learning method to determine the best-performing weighted ensemble ML models built upon several weaker lower-layer ML models to (i) detect the presence of CH4 as a classification problem and (ii) predict the intensity of CH4 as a regression problem.
△ Less
Submitted 17 December, 2023;
originally announced December 2023.
-
Large Search Model: Redefining Search Stack in the Era of LLMs
Authors:
Liang Wang,
Nan Yang,
Xiaolong Huang,
Linjun Yang,
Rangan Majumder,
Furu Wei
Abstract:
Modern search engines are built on a stack of different components, including query understanding, retrieval, multi-stage ranking, and question answering, among others. These components are often optimized and deployed independently. In this paper, we introduce a novel conceptual framework called large search model, which redefines the conventional search stack by unifying search tasks with one la…
▽ More
Modern search engines are built on a stack of different components, including query understanding, retrieval, multi-stage ranking, and question answering, among others. These components are often optimized and deployed independently. In this paper, we introduce a novel conceptual framework called large search model, which redefines the conventional search stack by unifying search tasks with one large language model (LLM). All tasks are formulated as autoregressive text generation problems, allowing for the customization of tasks through the use of natural language prompts. This proposed framework capitalizes on the strong language understanding and reasoning capabilities of LLMs, offering the potential to enhance search result quality while simultaneously simplifying the existing cumbersome search stack. To substantiate the feasibility of this framework, we present a series of proof-of-concept experiments and discuss the potential challenges associated with implementing this approach within real-world search systems.
△ Less
Submitted 2 January, 2024; v1 submitted 23 October, 2023;
originally announced October 2023.
-
Exploring Diverse Co** Mechanisms in 2023: A Comprehensive Survey Across Backgrounds and Cultures
Authors:
Abhijit Paul,
Rony Majumder
Abstract:
This study presents a pioneering investigation into the wide array of co** mechanisms employed by individuals in the year 2023, with a focus on data collected through the popular social media platform TikTok. Co** mechanisms are essential strategies that people adopt to navigate the challenges and stressors of everyday life, yet little research has been conducted on their comprehensive compila…
▽ More
This study presents a pioneering investigation into the wide array of co** mechanisms employed by individuals in the year 2023, with a focus on data collected through the popular social media platform TikTok. Co** mechanisms are essential strategies that people adopt to navigate the challenges and stressors of everyday life, yet little research has been conducted on their comprehensive compilation across different backgrounds, countries, and experiences.
Using TikTok as a data collection tool allowed us to access a diverse and extensive pool of participants, representing various cultural, social, and demographic backgrounds. Our study collates co** mechanisms reported by users from different parts of the world, facilitating the identification of both universal and culture-specific strategies.
This research contributes to the existing literature by providing a holistic view of co** mechanisms without being limited to specific fields or populations. By analyzing the co** methods shared on TikTok, we reveal a comprehensive list of strategies employed by people from diverse walks of life. The findings of this study not only shed light on how individuals cope with challenges in the modern era but also offer insights into the evolving co** trends and the role of social media in disseminating co** strategies. Understanding these co** mechanisms can have implications for mental health professionals, practitioners, and policymakers seeking to provide support and resources to individuals facing different stressors and hardships.
△ Less
Submitted 4 September, 2023;
originally announced September 2023.
-
Semiparametric Estimation of the Shape of the Limiting Bivariate Point Cloud
Authors:
Reetam Majumder,
Benjamin A. Shaby,
Brian J. Reich,
Daniel Cooley
Abstract:
We propose a model to flexibly estimate joint tail properties by exploiting the convergence of an appropriately scaled point cloud onto a compact limit set. Characteristics of the shape of the limit set correspond to key tail dependence properties. We directly model the shape of the limit set using Bezier splines, which allow flexible and parsimonious specification of shapes in two dimensions. We…
▽ More
We propose a model to flexibly estimate joint tail properties by exploiting the convergence of an appropriately scaled point cloud onto a compact limit set. Characteristics of the shape of the limit set correspond to key tail dependence properties. We directly model the shape of the limit set using Bezier splines, which allow flexible and parsimonious specification of shapes in two dimensions. We then fit the Bezier splines to data in pseudo-polar coordinates using Markov chain Monte Carlo sampling, utilizing a limiting approximation to the conditional likelihood of the radii given angles. By imposing appropriate constraints on the parameters of the Bezier splines, we guarantee that each posterior sample is a valid limit set boundary, allowing direct posterior analysis of any quantity derived from the shape of the curve. Furthermore, we obtain interpretable inference on the asymptotic dependence class by using mixture priors with point masses on the corner of the unit box. Finally, we apply our model to bivariate datasets of extremes of variables related to fire risk and air pollution.
△ Less
Submitted 3 June, 2024; v1 submitted 22 June, 2023;
originally announced June 2023.
-
Inference with Reference: Lossless Acceleration of Large Language Models
Authors:
Nan Yang,
Tao Ge,
Liang Wang,
Binxing Jiao,
Daxin Jiang,
Linjun Yang,
Rangan Majumder,
Furu Wei
Abstract:
We propose LLMA, an LLM accelerator to losslessly speed up Large Language Model (LLM) inference with references. LLMA is motivated by the observation that there are abundant identical text spans between the decoding result by an LLM and the reference that is available in many real world scenarios (e.g., retrieved documents). LLMA first selects a text span from the reference and copies its tokens t…
▽ More
We propose LLMA, an LLM accelerator to losslessly speed up Large Language Model (LLM) inference with references. LLMA is motivated by the observation that there are abundant identical text spans between the decoding result by an LLM and the reference that is available in many real world scenarios (e.g., retrieved documents). LLMA first selects a text span from the reference and copies its tokens to the decoder and then efficiently checks the tokens' appropriateness as the decoding result in parallel within one decoding step. The improved computational parallelism allows LLMA to achieve over 2x speed-up for LLMs with identical generation results as greedy decoding in many practical generation scenarios where significant overlap between in-context reference and outputs exists (e.g., search engines and multi-turn conversations).
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
Quantum computation: Efficient network partitioning for large scale critical infrastructures
Authors:
Saikat Ray Majumder,
Annarita Giani,
Weiwei Shen,
Bogdan Neculaes,
Daiwei Zhu,
Sonika Johri
Abstract:
Quantum computers are emerging as a viable alternative to tackle certain computational problems that are challenging for classical computers. With the rapid development of quantum hardware such as those based on trapped ions, there is practical motivation for identifying risk management problems that are efficiently solvable with these systems. Here we focus on network partitioning as a means for…
▽ More
Quantum computers are emerging as a viable alternative to tackle certain computational problems that are challenging for classical computers. With the rapid development of quantum hardware such as those based on trapped ions, there is practical motivation for identifying risk management problems that are efficiently solvable with these systems. Here we focus on network partitioning as a means for analyzing risk in critical infrastructures and present a quantum approach for its implementation. It is based on the potential speedup quantum computers can provide in the identification of eigenvalues and eigenvectors of sparse graph Laplacians, a procedure which is constrained by time and memory on classical computers.
△ Less
Submitted 3 February, 2023;
originally announced February 2023.
-
A Deep Learning Synthetic Likelihood Approximation of a Non-stationary Spatial Model for Extreme Streamflow Forecasting
Authors:
Reetam Majumder,
Brian Reich
Abstract:
Extreme streamflow is a key indicator of flood risk, and quantifying the changes in its distribution under non-stationary climate conditions is key to mitigating the impact of flooding events. We propose a non-stationary process mixture model (NPMM) for annual streamflow maxima over the central US (CUS) which uses downscaled climate model precipitation projections to forecast extremal streamflow.…
▽ More
Extreme streamflow is a key indicator of flood risk, and quantifying the changes in its distribution under non-stationary climate conditions is key to mitigating the impact of flooding events. We propose a non-stationary process mixture model (NPMM) for annual streamflow maxima over the central US (CUS) which uses downscaled climate model precipitation projections to forecast extremal streamflow. Spatial dependence for the model is specified as a convex combination of transformed Gaussian and max-stable processes, indexed by a weight parameter which identifies the asymptotic regime of the process. The weight parameter is modeled as a function of the annual precipitation for each of the two hydrologic regions within the CUS, introducing spatio-temporal non-stationarity within the model. The NPMM is flexible with desirable tail dependence properties, but yields an intractable likelihood. To address this, we embed a neural network within a density regression model which is used to learn a synthetic likelihood function using simulations from the NPMM with different parameter settings. Our model is fitted using observational data for 1972--2021, and inference carried out in a Bayesian framework. The two regions within the CUS are estimated to be in different asymptotic regimes based on the posterior distribution of the weight parameter. Annual streamflow maxima estimates based on global climate models for two representative climate pathway scenarios suggest an overall increase in the frequency and magnitude of extreme streamflow for 2006-2035 compared to the historical period of 1972-2005.
△ Less
Submitted 26 April, 2023; v1 submitted 14 December, 2022;
originally announced December 2022.
-
LEAD: Liberal Feature-based Distillation for Dense Retrieval
Authors:
Hao Sun,
Xiao Liu,
Yeyun Gong,
Anlei Dong,
**gwen Lu,
Yan Zhang,
Linjun Yang,
Rangan Majumder,
Nan Duan
Abstract:
Knowledge distillation is often used to transfer knowledge from a strong teacher model to a relatively weak student model. Traditional methods include response-based methods and feature-based methods. Response-based methods are widely used but suffer from lower upper limits of performance due to their ignorance of intermediate signals, while feature-based methods have constraints on vocabularies,…
▽ More
Knowledge distillation is often used to transfer knowledge from a strong teacher model to a relatively weak student model. Traditional methods include response-based methods and feature-based methods. Response-based methods are widely used but suffer from lower upper limits of performance due to their ignorance of intermediate signals, while feature-based methods have constraints on vocabularies, tokenizers and model architectures. In this paper, we propose a liberal feature-based distillation method (LEAD). LEAD aligns the distribution between the intermediate layers of teacher model and student model, which is effective, extendable, portable and has no requirements on vocabularies, tokenizers, or model architectures. Extensive experiments show the effectiveness of LEAD on widely-used benchmarks, including MS MARCO Passage Ranking, TREC 2019 DL Track, MS MARCO Document Ranking and TREC 2020 DL Track. Our code is available in https://github.com/microsoft/SimXNS/tree/main/LEAD.
△ Less
Submitted 11 December, 2023; v1 submitted 10 December, 2022;
originally announced December 2022.
-
Text Embeddings by Weakly-Supervised Contrastive Pre-training
Authors:
Liang Wang,
Nan Yang,
Xiaolong Huang,
Binxing Jiao,
Linjun Yang,
Daxin Jiang,
Rangan Majumder,
Furu Wei
Abstract:
This paper presents E5, a family of state-of-the-art text embeddings that transfer well to a wide range of tasks. The model is trained in a contrastive manner with weak supervision signals from our curated large-scale text pair dataset (called CCPairs). E5 can be readily used as a general-purpose embedding model for any tasks requiring a single-vector representation of texts such as retrieval, clu…
▽ More
This paper presents E5, a family of state-of-the-art text embeddings that transfer well to a wide range of tasks. The model is trained in a contrastive manner with weak supervision signals from our curated large-scale text pair dataset (called CCPairs). E5 can be readily used as a general-purpose embedding model for any tasks requiring a single-vector representation of texts such as retrieval, clustering, and classification, achieving strong performance in both zero-shot and fine-tuned settings. We conduct extensive evaluations on 56 datasets from the BEIR and MTEB benchmarks. For zero-shot settings, E5 is the first model that outperforms the strong BM25 baseline on the BEIR retrieval benchmark without using any labeled data. When fine-tuned, E5 obtains the best results on the MTEB benchmark, beating existing embedding models with 40x more parameters.
△ Less
Submitted 22 February, 2024; v1 submitted 7 December, 2022;
originally announced December 2022.
-
Simulating Spin-Orbit Coupling With Quasidegenerate N-Electron Valence Perturbation Theory
Authors:
Rajat Majumder,
Alexander Yu. Sokolov
Abstract:
We present the first implementation of spin-orbit coupling effects in fully internally contracted second-order quasidegenerate N-electron valence perturbation theory (SO-QDNEVPT2). The SO-QDNEVPT2 approach enables the computations of ground- and excited-state energies and oscillator strengths combining the description of static electron correlation with an efficient treatment of dynamic correlatio…
▽ More
We present the first implementation of spin-orbit coupling effects in fully internally contracted second-order quasidegenerate N-electron valence perturbation theory (SO-QDNEVPT2). The SO-QDNEVPT2 approach enables the computations of ground- and excited-state energies and oscillator strengths combining the description of static electron correlation with an efficient treatment of dynamic correlation and spin-orbit coupling. In addition to SO-QDNEVPT2 with the full description of one- and two-body spin-orbit interactions at the level of two-component Breit-Pauli Hamiltonian, our implementation also features a simplified approach that takes advantage of spin-orbit mean-field approximation (SOMF-QDNEVPT2). The accuracy of these methods is tested for the group 14 and 16 hydrides, 3d and 4d transition metal ions, and two actinide dioxides (neptunyl and plutonyl dications). The zero-field splittings of group 14 and 16 molecules computed using SO-QDNEVPT2 and SOMF-QDNEVPT2 are in a good agreement with the available experimental data. For the 3d transition metal ions, the SO-QDNEVPT2 method is significantly more accurate than SOMF-QDNEVPT2, while no substantial difference in the performance of two methods is observed for the 4d ions. Finally, we demonstrate that for the actinide dioxides the results of SO-QDNEVPT2 and SOMF-QDNEVPT2 are in a good agreement with the data from previous theoretical studies of these systems. Overall, our results demonstrate that SO-QDNEVPT2 and SOMF-QDNEVPT2 are promising multireference methods for treating spin-orbit coupling with a relatively low computational cost.
△ Less
Submitted 3 January, 2023; v1 submitted 11 November, 2022;
originally announced November 2022.
-
SPQR: An R Package for Semi-Parametric Density and Quantile Regression
Authors:
Steven G. Xu,
Reetam Majumder,
Brian J. Reich
Abstract:
We develop an R package SPQR that implements the semi-parametric quantile regression (SPQR) method in Xu and Reich (2021). The method begins by fitting a flexible density regression model using monotonic splines whose weights are modeled as data-dependent functions using artificial neural networks. Subsequently, estimates of conditional density and quantile process can all be obtained. Unlike many…
▽ More
We develop an R package SPQR that implements the semi-parametric quantile regression (SPQR) method in Xu and Reich (2021). The method begins by fitting a flexible density regression model using monotonic splines whose weights are modeled as data-dependent functions using artificial neural networks. Subsequently, estimates of conditional density and quantile process can all be obtained. Unlike many approaches to quantile regression that assume a linear model, SPQR allows for virtually any relationship between the covariates and the response distribution including non-linear effects and different effects on different quantile levels. To increase the interpretability and transparency of SPQR, model-agnostic statistics developed by Apley and Zhu (2020) are used to estimate and visualize the covariate effects and their relative importance on the quantile function. In this article, we detail how this framework is implemented in SPQR and illustrate how this package should be used in practice through simulated and real data examples.
△ Less
Submitted 26 October, 2022;
originally announced October 2022.
-
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval
Authors:
Kun Zhou,
Yeyun Gong,
Xiao Liu,
Wayne Xin Zhao,
Yelong Shen,
Anlei Dong,
**gwen Lu,
Rangan Majumder,
Ji-Rong Wen,
Nan Duan,
Weizhu Chen
Abstract:
Sampling proper negatives from a large document pool is vital to effectively train a dense retrieval model. However, existing negative sampling strategies suffer from the uninformative or false negative problem. In this work, we empirically show that according to the measured relevance scores, the negatives ranked around the positives are generally more informative and less likely to be false nega…
▽ More
Sampling proper negatives from a large document pool is vital to effectively train a dense retrieval model. However, existing negative sampling strategies suffer from the uninformative or false negative problem. In this work, we empirically show that according to the measured relevance scores, the negatives ranked around the positives are generally more informative and less likely to be false negatives. Intuitively, these negatives are not too hard (\emph{may be false negatives}) or too easy (\emph{uninformative}). They are the ambiguous negatives and need more attention during training. Thus, we propose a simple ambiguous negatives sampling method, SimANS, which incorporates a new sampling probability distribution to sample more ambiguous negatives. Extensive experiments on four public and one industry datasets show the effectiveness of our approach. We made the code and models publicly available in \url{https://github.com/microsoft/SimXNS}.
△ Less
Submitted 24 October, 2022; v1 submitted 21 October, 2022;
originally announced October 2022.
-
Stochastic Precipitation Generation for the Chesapeake Bay Watershed using Hidden Markov Models with Variational Bayes Parameter Estimation
Authors:
Reetam Majumder,
Nagaraj K. Neerchal,
Amita Mehta
Abstract:
Stochastic precipitation generators (SPGs) are a class of statistical models which generate synthetic data that can simulate dry and wet rainfall stretches for long durations. Generated precipitation time series data are used in climate projections, impact assessment of extreme weather events, and water resource and agricultural management. We construct an SPG for daily precipitation data that is…
▽ More
Stochastic precipitation generators (SPGs) are a class of statistical models which generate synthetic data that can simulate dry and wet rainfall stretches for long durations. Generated precipitation time series data are used in climate projections, impact assessment of extreme weather events, and water resource and agricultural management. We construct an SPG for daily precipitation data that is specified as a semi-continuous distribution at every location, with a point mass at zero for no precipitation and a mixture of two exponential distributions for positive precipitation. Our generators are obtained as hidden Markov models (HMMs) where the underlying climate conditions form the states. We fit a 3-state HMM to daily precipitation data for the Chesapeake Bay watershed in the Eastern coast of the USA for the wet season months of July to September from 2000--2019. Data is obtained from the GPM-IMERG remote sensing dataset, and existing work on variational HMMs is extended to incorporate semi-continuous emission distributions. In light of the high spatial dimension of the data, a stochastic optimization implementation allows for computational speedup. The most likely sequence of underlying states is estimated using the Viterbi algorithm, and we identify the differences in the weather regimes associated with the states of the proposed model. Synthetic data generated from the HMM can reproduce monthly precipitation statistics as well as spatial dependency present in the historical GPM-IMERG data.
△ Less
Submitted 13 December, 2022; v1 submitted 9 October, 2022;
originally announced October 2022.
-
PROD: Progressive Distillation for Dense Retrieval
Authors:
Zhenghao Lin,
Yeyun Gong,
Xiao Liu,
Hang Zhang,
Chen Lin,
Anlei Dong,
Jian Jiao,
**gwen Lu,
Daxin Jiang,
Rangan Majumder,
Nan Duan
Abstract:
Knowledge distillation is an effective way to transfer knowledge from a strong teacher to an efficient student model. Ideally, we expect the better the teacher is, the better the student. However, this expectation does not always come true. It is common that a better teacher model results in a bad student via distillation due to the nonnegligible gap between teacher and student. To bridge the gap,…
▽ More
Knowledge distillation is an effective way to transfer knowledge from a strong teacher to an efficient student model. Ideally, we expect the better the teacher is, the better the student. However, this expectation does not always come true. It is common that a better teacher model results in a bad student via distillation due to the nonnegligible gap between teacher and student. To bridge the gap, we propose PROD, a PROgressive Distillation method, for dense retrieval. PROD consists of a teacher progressive distillation and a data progressive distillation to gradually improve the student. We conduct extensive experiments on five widely-used benchmarks, MS MARCO Passage, TREC Passage 19, TREC Document 19, MS MARCO Document and Natural Questions, where PROD achieves the state-of-the-art within the distillation methods for dense retrieval. The code and models will be released.
△ Less
Submitted 24 June, 2023; v1 submitted 27 September, 2022;
originally announced September 2022.
-
Modeling Extremal Streamflow using Deep Learning Approximations and a Flexible Spatial Process
Authors:
Reetam Majumder,
Brian J. Reich,
Benjamin A. Shaby
Abstract:
Quantifying changes in the probability and magnitude of extreme flooding events is key to mitigating their impacts. While hydrodynamic data are inherently spatially dependent, traditional spatial models such as Gaussian processes are poorly suited for modeling extreme events. Spatial extreme value models with more realistic tail dependence characteristics are under active development. They are the…
▽ More
Quantifying changes in the probability and magnitude of extreme flooding events is key to mitigating their impacts. While hydrodynamic data are inherently spatially dependent, traditional spatial models such as Gaussian processes are poorly suited for modeling extreme events. Spatial extreme value models with more realistic tail dependence characteristics are under active development. They are theoretically justified, but give intractable likelihoods, making computation challenging for small datasets and prohibitive for continental-scale studies. We propose a process mixture model (PMM) which specifies spatial dependence in extreme values as a convex combination of a Gaussian process and a max-stable process, yielding desirable tail dependence properties but intractable likelihoods. To address this, we employ a unique computational strategy where a feed-forward neural network is embedded in a density regression model to approximate the conditional distribution at one spatial location given a set of neighbors. We then use this univariate density function to approximate the joint likelihood for all locations by way of a Vecchia approximation. The PMM is used to analyze changes in annual maximum streamflow within the US over the last 50 years, and is able to detect areas which show increases in extreme streamflow over time.
△ Less
Submitted 27 September, 2023; v1 submitted 5 August, 2022;
originally announced August 2022.
-
Unconventional Collective Resonance as Nonlinear Mechanism of Ectopic Activity in Excitable Media
Authors:
Alexander S. Teplenin,
Nina N. Kudryashova,
Rupamanjari Majumder,
Antoine A. F. de Vries,
Alexander V. Panfilov,
Daniel A. Pijnappels
Abstract:
Many physical, chemical and biological processes rely on intrinsic oscillations to employ resonance responses to external stimuli of certain frequency. Such resonance phenomena in biological systems are typically explained by one of two mechanisms: either a classical linear resonance of harmonic oscillator, or entrainment and phase locking of nonlinear limit cycle oscillators subjected to periodic…
▽ More
Many physical, chemical and biological processes rely on intrinsic oscillations to employ resonance responses to external stimuli of certain frequency. Such resonance phenomena in biological systems are typically explained by one of two mechanisms: either a classical linear resonance of harmonic oscillator, or entrainment and phase locking of nonlinear limit cycle oscillators subjected to periodic forcing. Here, we discover a nonlinear mechanism, which does not require intrinsic oscillations. Instead, the resonant frequency dependence arises from coupling between an excitable and a monostable region of the medium. This composite system is endowed with emergent bistability between a stable steady state and stable spatiotemporal oscillations. The resonant transition from stable state to oscillatory state is induced by waves of particular frequency travelling through the medium. This transition to the spatiotemporal oscillatory state requires accumulation of multiple waves, resulting in the exclusion of lower frequencies. The cutting off of high frequencies is realized by dam** of wave amplitude in the monostable zone and then by activating amplitude sensitive dynamics in the monostable units. We demonstrate this new resonance mechanism in a simplistic reaction-diffusion model. Also, we reveal this collective resonance mechanism in in-vitro experiments and detailed biophysical simulations representing a major type of arrhythmia. We further demonstrate, both experimentally and theoretically, that the ongoing spatiotemporal oscillations, such as ectopic activity in cardiac tissue, can be stopped by travelling waves of high frequency. Overall, we claim the universality of this resonance mechanism in a broad class of nonlinear biophysical systems. Specifically, we hypothesize that such phenomena could be found in neuronal systems as an alternative to traditional resonant processes.
△ Less
Submitted 8 July, 2022;
originally announced July 2022.
-
SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval
Authors:
Liang Wang,
Nan Yang,
Xiaolong Huang,
Binxing Jiao,
Linjun Yang,
Daxin Jiang,
Rangan Majumder,
Furu Wei
Abstract:
In this paper, we propose SimLM (Similarity matching with Language Model pre-training), a simple yet effective pre-training method for dense passage retrieval. It employs a simple bottleneck architecture that learns to compress the passage information into a dense vector through self-supervised pre-training. We use a replaced language modeling objective, which is inspired by ELECTRA, to improve th…
▽ More
In this paper, we propose SimLM (Similarity matching with Language Model pre-training), a simple yet effective pre-training method for dense passage retrieval. It employs a simple bottleneck architecture that learns to compress the passage information into a dense vector through self-supervised pre-training. We use a replaced language modeling objective, which is inspired by ELECTRA, to improve the sample efficiency and reduce the mismatch of the input distribution between pre-training and fine-tuning. SimLM only requires access to unlabeled corpus, and is more broadly applicable when there are no labeled data or queries. We conduct experiments on several large-scale passage retrieval datasets, and show substantial improvements over strong baselines under various settings. Remarkably, SimLM even outperforms multi-vector approaches such as ColBERTv2 which incurs significantly more storage cost. Our code and model check points are available at https://github.com/microsoft/unilm/tree/master/simlm .
△ Less
Submitted 12 May, 2023; v1 submitted 6 July, 2022;
originally announced July 2022.
-
Copula-based Risk Aggregation with Trapped Ion Quantum Computers
Authors:
Daiwei Zhu,
Weiwei Shen,
Annarita Giani,
Saikat Ray Majumder,
Bogdan Neculaes,
Sonika Johri
Abstract:
Copulas are mathematical tools for modeling joint probability distributions. Since copulas enable one to conveniently treat the marginal distribution of each variable and the interdependencies among variables separately, in the past 60 years they have become an essential analysis tool on classical computers in various fields ranging from quantitative finance and civil engineering to signal process…
▽ More
Copulas are mathematical tools for modeling joint probability distributions. Since copulas enable one to conveniently treat the marginal distribution of each variable and the interdependencies among variables separately, in the past 60 years they have become an essential analysis tool on classical computers in various fields ranging from quantitative finance and civil engineering to signal processing and medicine. The recent finding that copulas can be expressed as maximally entangled quantum states has revealed a promising approach to practical quantum advantages: performing tasks faster, requiring less memory, or, as we show, yielding better predictions. Studying the scalability of this quantum approach as both the precision and the number of modeled variables increase is crucial for its adoption in real-world applications. In this paper, we successfully apply a Quantum Circuit Born Machine (QCBM) based approach to modeling 3- and 4-variable copulas on trapped ion quantum computers. We study the training of QCBMs with different levels of precision and circuit design on a simulator and a state-of-the-art trapped ion quantum computer. We observe decreased training efficacy due to the increased complexity in parameter optimization as the models scale up. To address this challenge, we introduce an annealing-inspired strategy that dramatically improves the training results. In our end-to-end tests, various configurations of the quantum models make a comparable or better prediction in risk aggregation tasks than the standard classical models. Our detailed study of the copula paradigm using quantum computing opens opportunities for its deployment in various industries.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
Development of Decision Support System for Effective COVID-19 Management
Authors:
shuvrangshu Jana,
Rudrashis Majumder,
Aashay Bhise,
Nobin Paul,
Stuti Garg,
Debasish Ghose
Abstract:
This paper discusses a Decision Support System (DSS) for cases prediction, allocation of resources, and lockdown management for managing COVID-19 at different levels of a government authority. Algorithms incorporated in the DSS are based on a data-driven modeling approach and independent of physical parameters of the region, and hence the proposed DSS is applicable to any area. Based on predicted…
▽ More
This paper discusses a Decision Support System (DSS) for cases prediction, allocation of resources, and lockdown management for managing COVID-19 at different levels of a government authority. Algorithms incorporated in the DSS are based on a data-driven modeling approach and independent of physical parameters of the region, and hence the proposed DSS is applicable to any area. Based on predicted active cases, the demand of lower-level units and total availability, allocation, and lockdown decision is made. A MATLAB-based GUI is developed based on the proposed DSS and could be implemented by the local authority.
△ Less
Submitted 12 March, 2022;
originally announced March 2022.
-
Cyclone Preparedness, Rescue Operations and Damage Assessment using UAVs
Authors:
Rudrashis Majumder,
Shuvrangshu Jana,
Prathyush P. Menon,
Debasish Ghose,
N. M. Prusty,
Bipasha Mukherjee,
Aditi Ghosh
Abstract:
UAV's capability to access remote and inaccessible areas within a quick time can be utilized for effective cyclone management. This paper presents the possible application of UAVs at different stages of cyclone mitigation. The overall system architecture necessary for preparedness, rescue operation, resource allocation, and damage assessment using UAVs during cyclones is described. Although genera…
▽ More
UAV's capability to access remote and inaccessible areas within a quick time can be utilized for effective cyclone management. This paper presents the possible application of UAVs at different stages of cyclone mitigation. The overall system architecture necessary for preparedness, rescue operation, resource allocation, and damage assessment using UAVs during cyclones is described. Although general commercial UAVs are reported to be used in cyclone operations, UAV systems should be planned specifically for cyclone operations to improve efficiency. Here, the specification required for effective and safe UAV operations in the post-cyclone scenario is presented. Mission planning required for various rescue, relief, and damage assessment missions related to cyclone management is discussed. A case study of deploying UAV in Amphan cyclone operation in West Bengal is also presented. This paper can help disaster management authorities to develop UAV systems specifically to cater to cyclone operations.
△ Less
Submitted 12 March, 2022;
originally announced March 2022.
-
Critical Medical Resource Allocation during COVID-19 Pandemic
Authors:
Shuvrangshu Jana,
Rudrashis Majumder,
Debasish Ghose
Abstract:
In this paper, an optimal resource allocation framework is proposed for the allocation of critical medical resources among different units during a pandemic. The framework is developed by considering the dynamics of Pandemic, hierarchical government structure, and non-uniformity of unit resource requirement among different units. The cost function is designed to minimize the difference between the…
▽ More
In this paper, an optimal resource allocation framework is proposed for the allocation of critical medical resources among different units during a pandemic. The framework is developed by considering the dynamics of Pandemic, hierarchical government structure, and non-uniformity of unit resource requirement among different units. The cost function is designed to minimize the difference between the demand, actual allocation, and ideal allocation, where ideal allocation for a region is considered based on the predicted active cases in a fraction of predicted total active cases of all regions. Different cost functions are used at a different level of organization based on the available information. The model can also accommodate severity of disaster in a region in this framework. A sample allocation case study is presented for the allocation of oxygen for different states of India.
△ Less
Submitted 11 March, 2022;
originally announced March 2022.
-
In situ process quality monitoring and defect detection for direct metal laser melting
Authors:
Sarah Felix,
Saikat Ray Majumder,
H. Kirk Mathews,
Michael Lexa,
Gabriel Lipsa,
Xiaohu **,
Subhrajit Roychowdhury,
Thomas Spears
Abstract:
Quality control and quality assurance are challenges in Direct Metal Laser Melting (DMLM). Intermittent machine diagnostics and downstream part inspections catch problems after undue cost has been incurred processing defective parts. In this paper we demonstrate two methodologies for in-process fault detection and part quality prediction that can be readily deployed on existing commercial DMLM sys…
▽ More
Quality control and quality assurance are challenges in Direct Metal Laser Melting (DMLM). Intermittent machine diagnostics and downstream part inspections catch problems after undue cost has been incurred processing defective parts. In this paper we demonstrate two methodologies for in-process fault detection and part quality prediction that can be readily deployed on existing commercial DMLM systems with minimal hardware modification. Novel features were derived from the time series of common photodiode sensors along with standard machine control signals. A Bayesian approach attributes measurements to one of multiple process states and a least squares regression model predicts severity of certain material defects.
△ Less
Submitted 3 December, 2021;
originally announced December 2021.
-
Game-Theoretic Model Based Resource Allocation During Floods
Authors:
Rudrashis Majumder,
Rakesh R Warier,
Debasish Ghose
Abstract:
For multiple emergencies caused by natural disasters, it is crucial to allocate resources equitably to each emergency location, especially when the availability of resources is limited in quantity. This paper has developed a multi-event crisis management system using a non-cooperative, complete information, strategic form game model. In the proposed system, each emergency event is assumed to occur…
▽ More
For multiple emergencies caused by natural disasters, it is crucial to allocate resources equitably to each emergency location, especially when the availability of resources is limited in quantity. This paper has developed a multi-event crisis management system using a non-cooperative, complete information, strategic form game model. In the proposed system, each emergency event is assumed to occur in different locations simultaneously. These locations are represented as the players in the game, competing with the other players for an optimal allocation of scarce resources available at different resource stations. The players incur a non-monetary cost for obtaining resource units. The objective of the proposed game is to derive optimal strategies for an effective and fair allocation of resources to the respective players.
△ Less
Submitted 2 December, 2021;
originally announced December 2021.
-
Hybrid Classical-Quantum Deep Learning Models for Autonomous Vehicle Traffic Image Classification Under Adversarial Attack
Authors:
Reek Majumder,
Sakib Mahmud Khan,
Fahim Ahmed,
Zadid Khan,
Frank Ngeni,
Gurcan Comert,
Judith Mwakalonge,
Dimitra Michalaka,
Mashrur Chowdhury
Abstract:
Image classification must work for autonomous vehicles (AV) operating on public roads, and actions performed based on image misclassification can have serious consequences. Traffic sign images can be misclassified by an adversarial attack on machine learning models used by AVs for traffic sign recognition. To make classification models resilient against adversarial attacks, we used a hybrid deep-l…
▽ More
Image classification must work for autonomous vehicles (AV) operating on public roads, and actions performed based on image misclassification can have serious consequences. Traffic sign images can be misclassified by an adversarial attack on machine learning models used by AVs for traffic sign recognition. To make classification models resilient against adversarial attacks, we used a hybrid deep-learning model with both the quantum and classical layers. Our goal is to study the hybrid deep-learning architecture for classical-quantum transfer learning models to support the current era of intermediate-scale quantum technology. We have evaluated the impacts of various white box adversarial attacks on these hybrid models. The classical part of hybrid models includes a convolution network from the pre-trained Resnet18 model, which extracts informative features from a high dimensional LISA traffic sign image dataset. The output from the classical processor is processed further through the quantum layer, which is composed of various quantum gates and provides support to various quantum mechanical features like entanglement and superposition. We have tested multiple combinations of quantum circuits to provide better classification accuracy with decreasing training data and found better resiliency for our hybrid classical-quantum deep learning model during attacks compared to the classical-only machine learning models.
△ Less
Submitted 2 August, 2021;
originally announced August 2021.
-
The effects of inhomogeneities on scroll-wave dynamics in an anatomically realistic mathematical model for canine ventricular tissue
Authors:
K. V. Rajany,
Rupamanjari Majumder,
Alok Ranjan Nayak,
Rahul Pandit
Abstract:
Ventricular tachycardia (VT) and ventricular fibrillation (VF) are lethal rhythm disorders, which are associated with the occurrence of abnormal electrical scroll waves in the heart. Given the technical limitations of imaging and probing, the in situ visualization of these waves inside cardiac tissue remains a challenge. Therefore, we must, perforce, rely on in-silico simulations of scroll waves i…
▽ More
Ventricular tachycardia (VT) and ventricular fibrillation (VF) are lethal rhythm disorders, which are associated with the occurrence of abnormal electrical scroll waves in the heart. Given the technical limitations of imaging and probing, the in situ visualization of these waves inside cardiac tissue remains a challenge. Therefore, we must, perforce, rely on in-silico simulations of scroll waves in mathematical models for cardiac tissue to develop an understanding of the dynamics of these waves in mammalian hearts. We use direct numerical simulations of the Hund-Rudy-Dynamic (HRD) model, for canine ventricular tissue, to examine the interplay between electrical scroll-waves and conduction and ionic inhomogeneities, in anatomically realistic canine ventricular geometries with muscle-fiber architecture. We find that millimeter-sized, distributed, conduction inhomogeneities cause a substantial decrease in the scroll wavelength, thereby increasing the probability for wave breaks; by contrast, single, localized, medium-sized ($\simeq $ cm) conduction inhomogeneities, exhibit the potential to suppress wave breaks or enable the self-organization of wave fragments into stable, intact scrolls. We show that ionic inhomogeneities, both distributed or localised, suppress scroll-wave break up. The dynamics of a stable rotating wave is not affected significantly by such inhomogeneities, except at high concentrations of distributed inhomogeneities, which can cause a partial break up of scroll waves. Our results indicate that inhomogeneities in the canine ventricular tissue are less arrhythmogenic than inhomogeneities in porcine ventricular tissue, for which an earlier in silico study has shown that the inhomogeneity-induced suppression of scroll waves is a rare occurrence.
△ Less
Submitted 2 November, 2020;
originally announced November 2020.
-
A mechanism for electric turbulence in cardiac tissue with optogenetic modification
Authors:
Rupamanjari Majumder,
Sayedeh Hussaini,
Vladimir S. Zykov,
Stefan Luther,
Eberhard Bodenschatz
Abstract:
Interruptions in nonlinear wave propagation, commonly referred to as wave breaks, are typical of many complex excitable systems. In the heart they lead to fatal rhythm disorders, the so-called arrhythmias, which are one of the main causes of sudden death in the industrialized world. Progress in the treatment and therapy of cardiac arrhythmias requires a detailed understanding of the triggers and d…
▽ More
Interruptions in nonlinear wave propagation, commonly referred to as wave breaks, are typical of many complex excitable systems. In the heart they lead to fatal rhythm disorders, the so-called arrhythmias, which are one of the main causes of sudden death in the industrialized world. Progress in the treatment and therapy of cardiac arrhythmias requires a detailed understanding of the triggers and dynamics of these wave breaks. In particular, two very important questions are: 1) What determines the potential of a wave break to initiate re-entry? and 2) How do these breaks evolve such that the system is able to maintain spatiotemporally chaotic electrical activity? Here we approach these questions numerically using optogenetics in an in silico model of human atrial tissue that has undergone chronic atrial fibrillation (cAF) remodelling. In the lesser known sub-threshold illumination régime, we discover a new mechanism of wave break initiation in cardiac tissue that occurs for gentle slopes of the restitution characteristics. This mechanism involves "conditioning" or resha** the wave profile from front to back, such that, removal of the external light source causes rapid recovery of cells at the waveback, leading to the creation of vulnerable windows for sustained re-entry in spatially extended systems.
△ Less
Submitted 22 October, 2020;
originally announced October 2020.
-
Spiral- and scroll-wave dynamics in mathematical models for canine and human ventricular tissue with varying Potassium and Calcium currents
Authors:
K. V. Rajany,
Alok Ranjan Nayak,
Rupamanjari Majumder,
Rahul Pandit
Abstract:
We conduct a systematic,direct-numerical-simulation study,in mathematical models for ventricular tissue,of the dependence of spiral-and scroll-wave dynamics on $G_{Kr}$, the maximal conductance of the delayed rectifier Potassium current($I_{Kr}$) channel,and the parameter $γ_{Cao}$,which determines the magnitude and shape of the current $I_{CaL}$ for the L-type calcium-current channel,in both squa…
▽ More
We conduct a systematic,direct-numerical-simulation study,in mathematical models for ventricular tissue,of the dependence of spiral-and scroll-wave dynamics on $G_{Kr}$, the maximal conductance of the delayed rectifier Potassium current($I_{Kr}$) channel,and the parameter $γ_{Cao}$,which determines the magnitude and shape of the current $I_{CaL}$ for the L-type calcium-current channel,in both square and anatomically realistic,whole-ventricle simulation domains using canine and human models. We use ventricular geometry with fiber-orientation details and employ a physiologically realistic model for a canine ventricular myocyte. We restrict ourselves to an HRD-model parameter regime, which does not produce spiral- and scroll-wave instabilities because of other,well-studied causes like a very sharp action-potential-duration-restitution (APDR) curve or early after depolarizations(EADs) at the single-cell level. We find that spiral- or scroll-wave dynamics are affected predominantly by a simultaneous change in $I_{CaL}$ and $I_{Kr}$,rather than by a change in any one of these currents;other currents do not have such a large effect on these wave dynamics in this parameter regime of the HRD model.We obtain stability diagrams in the $G_{Kr} -γ_{Cao}$ plane.In the 3D domain,the geometry of the domain supports the confinement of the scroll waves and makes them more stable compared to their spiral-wave counterparts in 2D domain. We have also carried out a comparison of our HRD results with their counterparts for the human-ventricular TP06 model and have found important differences. In both these models,to make a transition,(from broken-wave to stable-scroll states or vice versa) we must simultaneously increase $I_{Kr}$ and decrease $I_{CaL}$;a modification of only one of these currents is not enough to effect this transition.
△ Less
Submitted 2 November, 2020; v1 submitted 10 October, 2020;
originally announced October 2020.
-
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
Authors:
Yaobo Liang,
Nan Duan,
Yeyun Gong,
Ning Wu,
Fenfei Guo,
Weizhen Qi,
Ming Gong,
Linjun Shou,
Daxin Jiang,
Guihong Cao,
Xiaodong Fan,
Ruofei Zhang,
Rahul Agrawal,
Edward Cui,
Sining Wei,
Taroon Bharti,
Ying Qiao,
Jiun-Hung Chen,
Winnie Wu,
Shuguang Liu,
Fan Yang,
Daniel Campos,
Rangan Majumder,
Ming Zhou
Abstract:
In this paper, we introduce XGLUE, a new benchmark dataset that can be used to train large-scale cross-lingual pre-trained models using multilingual and bilingual corpora and evaluate their performance across a diverse set of cross-lingual tasks. Comparing to GLUE(Wang et al., 2019), which is labeled in English for natural language understanding tasks only, XGLUE has two main advantages: (1) it pr…
▽ More
In this paper, we introduce XGLUE, a new benchmark dataset that can be used to train large-scale cross-lingual pre-trained models using multilingual and bilingual corpora and evaluate their performance across a diverse set of cross-lingual tasks. Comparing to GLUE(Wang et al., 2019), which is labeled in English for natural language understanding tasks only, XGLUE has two main advantages: (1) it provides 11 diversified tasks that cover both natural language understanding and generation scenarios; (2) for each task, it provides labeled data in multiple languages. We extend a recent cross-lingual pre-trained model Unicoder(Huang et al., 2019) to cover both understanding and generation tasks, which is evaluated on XGLUE as a strong baseline. We also evaluate the base versions (12-layer) of Multilingual BERT, XLM and XLM-R for comparison.
△ Less
Submitted 22 May, 2020; v1 submitted 3 April, 2020;
originally announced April 2020.
-
Ionic-heterogeneity-induced spiral- and scroll-wave turbulence in mathematical models of cardiac tissue
Authors:
Soling Zimik,
Rupamanjari Majumder,
Rahul Pandit
Abstract:
Spatial variations in the electrical properties of cardiac tissue can occur because of cardiac diseases. We introduce such gradients into mathematical models for cardiac tissue and then study, by extensive numerical simulations, their effects on reentrant electrical waves and their stability in both two and three dimensions. We explain the mechanism of spiral- and scroll-wave instability, which en…
▽ More
Spatial variations in the electrical properties of cardiac tissue can occur because of cardiac diseases. We introduce such gradients into mathematical models for cardiac tissue and then study, by extensive numerical simulations, their effects on reentrant electrical waves and their stability in both two and three dimensions. We explain the mechanism of spiral- and scroll-wave instability, which entails anisotropic thinning in the wavelength of the waves because of anisotropic variation in its electrical properties.
△ Less
Submitted 12 July, 2018;
originally announced July 2018.
-
MS MARCO: A Human Generated MAchine Reading COmprehension Dataset
Authors:
Payal Bajaj,
Daniel Campos,
Nick Craswell,
Li Deng,
Jianfeng Gao,
Xiaodong Liu,
Rangan Majumder,
Andrew McNamara,
Bhaskar Mitra,
Tri Nguyen,
Mir Rosenberg,
Xia Song,
Alina Stoica,
Saurabh Tiwary,
Tong Wang
Abstract:
We introduce a large scale MAchine Reading COmprehension dataset, which we name MS MARCO. The dataset comprises of 1,010,916 anonymized questions---sampled from Bing's search query logs---each with a human generated answer and 182,669 completely human rewritten generated answers. In addition, the dataset contains 8,841,823 passages---extracted from 3,563,535 web documents retrieved by Bing---that…
▽ More
We introduce a large scale MAchine Reading COmprehension dataset, which we name MS MARCO. The dataset comprises of 1,010,916 anonymized questions---sampled from Bing's search query logs---each with a human generated answer and 182,669 completely human rewritten generated answers. In addition, the dataset contains 8,841,823 passages---extracted from 3,563,535 web documents retrieved by Bing---that provide the information necessary for curating the natural language answers. A question in the MS MARCO dataset may have multiple answers or no answers at all. Using this dataset, we propose three different tasks with varying levels of difficulty: (i) predict if a question is answerable given a set of context passages, and extract and synthesize the answer as a human would (ii) generate a well-formed answer (if possible) based on the context passages that can be understood with the question and passage context, and finally (iii) rank a set of retrieved passages given a question. The size of the dataset and the fact that the questions are derived from real user search queries distinguishes MS MARCO from other well-known publicly available datasets for machine reading comprehension and question-answering. We believe that the scale and the real-world nature of this dataset makes it attractive for benchmarking machine reading comprehension and question-answering models.
△ Less
Submitted 31 October, 2018; v1 submitted 28 November, 2016;
originally announced November 2016.
-
Scroll-wave dynamics in the presence of ionic and conduction inhomogeneities in an anatomically realistic mathematical model for the pig heart
Authors:
R. Majumder,
R Pandit,
A. V. Panfilov
Abstract:
Nonlinear waves of the reaction-diffusion (RD) type occur in many biophysical systems, including the heart, where they initiate cardiac contraction. Such waves can form vortices called scroll waves, which result in the onset of life-threatening cardiac arrhythmias. The dynamics of scroll waves is affected by the presence of inhomogeneities, which, in a very general way, can be of \textit{(i)} ioni…
▽ More
Nonlinear waves of the reaction-diffusion (RD) type occur in many biophysical systems, including the heart, where they initiate cardiac contraction. Such waves can form vortices called scroll waves, which result in the onset of life-threatening cardiac arrhythmias. The dynamics of scroll waves is affected by the presence of inhomogeneities, which, in a very general way, can be of \textit{(i)} ionic type, i.e., they affect the reaction part, or \textit{(ii)} conduction type, i.e., they affect the diffusion part of an RD equation. We demostrate, for the first time, by using a state-of-the-art, anatomically realistic model of the pig heart, how differences in the geometrical and biophysical nature of such inhomogeneities can influence scroll-wave dynamics in different ways. Our study reveals that conduction-type inhomogeneities become increasingly important at small length scales, i.e., in the case of multiple, randomly distributed, obstacles in space at the cellular scale ($0.2-0.4{\rm mm}$). Such configurations can lead to scroll-wave break up. In contrast, ionic inhomogeneities, affect scroll-wave dynamics significantly at large length scales, when these inhomogeneities are localized in space at the tissue level ($5-10$ mm). In such configurations, these inhomogeneities can (a) attract scroll waves, by pinning them to the heterogeneity, or (b) lead to scroll-wave breakup.
△ Less
Submitted 12 October, 2016;
originally announced October 2016.
-
Gravitational wave bursts from cosmic (super)strings: Quantitative analysis and constraints
Authors:
Xavier Siemens,
Jolien Creighton,
Irit Maor,
Saikat Ray Majumder,
Kipp Cannon,
Jocelyn Read
Abstract:
We discuss data analysis techniques that can be used in the search for gravitational wave bursts from cosmic strings. When data from multiple interferometers are available, we describe consistency checks that can be used to greatly reduce the false alarm rates. We construct an expression for the rate of bursts for arbitrary cosmic string loop distributions and apply it to simple known solutions.…
▽ More
We discuss data analysis techniques that can be used in the search for gravitational wave bursts from cosmic strings. When data from multiple interferometers are available, we describe consistency checks that can be used to greatly reduce the false alarm rates. We construct an expression for the rate of bursts for arbitrary cosmic string loop distributions and apply it to simple known solutions. The cosmology is solved exactly and includes the effects of a late-time acceleration. We find substantially lower burst rates than previous estimates suggest and explain the disagreement. Initial LIGO is unlikely to detect field theoretic cosmic strings with the usual loop sizes, though it may detect cosmic superstrings as well as cosmic strings and superstrings with non-standard loop sizes (which may be more realistic). In the absence of a detection, we show how to set upper limits based on the loudest event. Using Initial LIGO sensitivity curves, we show that these upper limits may result in interesting constraints on the parameter space of theories that lead to the production of cosmic strings.
△ Less
Submitted 26 April, 2006; v1 submitted 29 March, 2006;
originally announced March 2006.
-
Plans for the LIGO-TAMA Joint Search for Gravitational Wave Bursts
Authors:
Patrick J. Sutton,
Masaki Ando,
Patrick Brady,
Laura Cadonati,
Alessandra Di Credico,
Stephen Fairhurst,
Lee Samuel Finn,
Nobuyuki Kanda,
Erik Katsavounidis,
Sergey Klimenko,
Albert Lazzarini,
Szabolcs Marka,
John W. C. McNabb,
Saikat Ray Majumder,
Peter R. Saulson,
Hideyuki Tagoshi,
Hirotaka Takahashi,
Ryutaro Takahashi,
Daisuke Tatsumi,
Yoshiki Tsunesada,
S. E. Whitcomb
Abstract:
We describe the plans for a joint search for unmodelled gravitational wave bursts being carried out by the LIGO and TAMA collaborations using data collected during February-April 2003. We take a conservative approach to detection, requiring candidate gravitational wave bursts to be seen in coincidence by all four interferometers. We focus on some of the complications of performing this coinciden…
▽ More
We describe the plans for a joint search for unmodelled gravitational wave bursts being carried out by the LIGO and TAMA collaborations using data collected during February-April 2003. We take a conservative approach to detection, requiring candidate gravitational wave bursts to be seen in coincidence by all four interferometers. We focus on some of the complications of performing this coincidence analysis, in particular the effects of the different alignments and noise spectra of the interferometers.
△ Less
Submitted 28 December, 2004;
originally announced December 2004.