Search | arXiv e-print repository

Mind the Graph When Balancing Data for Fairness or Robustness

Authors: Jessica Schrouff, Alexis Bellot, Amal Rannen-Triki, Alan Malek, Isabela Albuquerque, Arthur Gretton, Alexander D'Amour, Silvia Chiappa

Abstract: Failures of fairness or robustness in machine learning predictive settings can be due to undesired dependencies between covariates, outcomes and auxiliary factors of variation. A common strategy to mitigate these failures is data balancing, which attempts to remove those undesired dependencies. In this work, we define conditions on the training distribution for data balancing to lead to fair or ro… ▽ More Failures of fairness or robustness in machine learning predictive settings can be due to undesired dependencies between covariates, outcomes and auxiliary factors of variation. A common strategy to mitigate these failures is data balancing, which attempts to remove those undesired dependencies. In this work, we define conditions on the training distribution for data balancing to lead to fair or robust models. Our results display that, in many cases, the balanced distribution does not correspond to selectively removing the undesired dependencies in a causal graph of the task, leading to multiple failure modes and even interference with other mitigation techniques such as regularization. Overall, our results highlight the importance of taking the causal graph into account before performing data balancing. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.04824 [pdf, other]

FunBO: Discovering Acquisition Functions for Bayesian Optimization with FunSearch

Authors: Virginia Aglietti, Ira Ktena, Jessica Schrouff, Eleni Sgouritsa, Francisco J. R. Ruiz, Alan Malek, Alexis Bellot, Silvia Chiappa

Abstract: The sample efficiency of Bayesian optimization algorithms depends on carefully crafted acquisition functions (AFs) guiding the sequential collection of function evaluations. The best-performing AF can vary significantly across optimization problems, often requiring ad-hoc and problem-specific choices. This work tackles the challenge of designing novel AFs that perform well across a variety of expe… ▽ More The sample efficiency of Bayesian optimization algorithms depends on carefully crafted acquisition functions (AFs) guiding the sequential collection of function evaluations. The best-performing AF can vary significantly across optimization problems, often requiring ad-hoc and problem-specific choices. This work tackles the challenge of designing novel AFs that perform well across a variety of experimental settings. Based on FunSearch, a recent work using Large Language Models (LLMs) for discovery in mathematical sciences, we propose FunBO, an LLM-based method that can be used to learn new AFs written in computer code by leveraging access to a limited number of evaluations for a set of objective functions. We provide the analytic expression of all discovered AFs and evaluate them on various global optimization benchmarks and hyperparameter optimization tasks. We show how FunBO identifies AFs that generalize well in and out of the training distribution of functions, thus outperforming established general-purpose AFs and achieving competitive performance against AFs that are customized to specific function types and are learned via transfer-learning algorithms. △ Less

Submitted 1 July, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

arXiv:2306.07858 [pdf, other]

Additive Causal Bandits with Unknown Graph

Authors: Alan Malek, Virginia Aglietti, Silvia Chiappa

Abstract: We explore algorithms to select actions in the causal bandit setting where the learner can choose to intervene on a set of random variables related by a causal graph, and the learner sequentially chooses interventions and observes a sample from the interventional distribution. The learner's goal is to quickly find the intervention, among all interventions on observable variables, that maximizes th… ▽ More We explore algorithms to select actions in the causal bandit setting where the learner can choose to intervene on a set of random variables related by a causal graph, and the learner sequentially chooses interventions and observes a sample from the interventional distribution. The learner's goal is to quickly find the intervention, among all interventions on observable variables, that maximizes the expectation of an outcome variable. We depart from previous literature by assuming no knowledge of the causal graph except that latent confounders between the outcome and its ancestors are not present. We first show that the unknown graph problem can be exponentially hard in the parents of the outcome. To remedy this, we adopt an additional additive assumption on the outcome which allows us to solve the problem by casting it as an additive combinatorial linear bandit problem with full-bandit feedback. We propose a novel action-elimination algorithm for this setting, show how to apply this algorithm to the causal bandit problem, provide sample complexity bounds, and empirically validate our findings on a suite of randomly generated causal models, effectively showing that one does not need to explicitly learn the parents of the outcome to identify the best intervention. △ Less

Submitted 13 June, 2023; originally announced June 2023.

Journal ref: International Conference on Machine Learning, 2023

arXiv:2305.20011 [pdf, other]

Constrained Causal Bayesian Optimization

Authors: Virginia Aglietti, Alan Malek, Ira Ktena, Silvia Chiappa

Abstract: We propose constrained causal Bayesian optimization (cCBO), an approach for finding interventions in a known causal graph that optimize a target variable under some constraints. cCBO first reduces the search space by exploiting the graph structure and, if available, an observational dataset; and then solves the restricted optimization problem by modelling target and constraint quantities using Gau… ▽ More We propose constrained causal Bayesian optimization (cCBO), an approach for finding interventions in a known causal graph that optimize a target variable under some constraints. cCBO first reduces the search space by exploiting the graph structure and, if available, an observational dataset; and then solves the restricted optimization problem by modelling target and constraint quantities using Gaussian processes and by sequentially selecting interventions via a constrained expected improvement acquisition function. We propose different surrogate models that enable to integrate observational and interventional data while capturing correlation among effects with increasing levels of sophistication. We evaluate cCBO on artificial and real-world causal graphs showing successful trade off between fast convergence and percentage of feasible interventions. △ Less

Submitted 31 May, 2023; originally announced May 2023.

Journal ref: International Conference on Machine Learning, 2023

arXiv:2301.12278 [pdf, other]

Pragmatic Fairness: Develo** Policies with Outcome Disparity Control

Authors: Limor Gultchin, Siyuan Guo, Alan Malek, Silvia Chiappa, Ricardo Silva

Abstract: We introduce a causal framework for designing optimal policies that satisfy fairness constraints. We take a pragmatic approach asking what we can do with an action space available to us and only with access to historical data. We propose two different fairness constraints: a moderation breaking constraint which aims at blocking moderation paths from the action and sensitive attribute to the outcom… ▽ More We introduce a causal framework for designing optimal policies that satisfy fairness constraints. We take a pragmatic approach asking what we can do with an action space available to us and only with access to historical data. We propose two different fairness constraints: a moderation breaking constraint which aims at blocking moderation paths from the action and sensitive attribute to the outcome, and by that at reducing disparity in outcome levels as much as the provided action space permits; and an equal benefit constraint which aims at distributing gain from the new and maximized policy equally across sensitive attribute levels, and thus at kee** pre-existing preferential treatment in place or avoiding the introduction of new disparity. We introduce practical methods for implementing the constraints and illustrate their uses on experiments with semi-synthetic models. △ Less

Submitted 28 January, 2023; originally announced January 2023.

arXiv:2211.14385 [pdf, other]

Pac-Man Pete: An extensible framework for building AI in VEX Robotics

Authors: Jacob Zietek, Nicholas Wade, Cole Roberts, Aref Malek, Manish Pylla, Will Xu, Sagar Patil

Abstract: This technical report details VEX Robotics team BLRSAI's development of a fully autonomous robot for VEX Robotics' Tip** Point AI Competition. We identify and develop three separate critical components. This includes a Unity simulation and reinforcement learning model training pipeline, a malleable computer vision pipeline, and a data transfer pipeline to offload large computations from the VEX… ▽ More This technical report details VEX Robotics team BLRSAI's development of a fully autonomous robot for VEX Robotics' Tip** Point AI Competition. We identify and develop three separate critical components. This includes a Unity simulation and reinforcement learning model training pipeline, a malleable computer vision pipeline, and a data transfer pipeline to offload large computations from the VEX V5 Brain/micro-controller to an external computer. We give the community access to all of these components in hopes they can reuse and improve upon them in the future, and that it'll spark new ideas for autonomy as well as the necessary infrastructure and programs for AI in educational robotics. △ Less

Submitted 25 November, 2022; originally announced November 2022.

arXiv:2112.09188 [pdf, other]

Effects of direction reversals on patterns of active filaments

Authors: Leila Abbaspour, Ali Malek, Stefan Karpitschka, Stefan Klumpp

Abstract: Active matter systems provide fascinating examples of pattern formation and collective motility without counterparts in equilibrium systems. Here, we employ Brownian dynamics simulations to study the collective motion and self-organization in systems of self-propelled semiflexible filaments, inspired by the gliding motility of \textit{filamentous Cyanobacteria}. Specifically, we investigate the in… ▽ More Active matter systems provide fascinating examples of pattern formation and collective motility without counterparts in equilibrium systems. Here, we employ Brownian dynamics simulations to study the collective motion and self-organization in systems of self-propelled semiflexible filaments, inspired by the gliding motility of \textit{filamentous Cyanobacteria}. Specifically, we investigate the influence of stochastic direction reversals on the patterns. We explore pattern formation and dynamics by modulating three relevant physical parameters, the bending stiffness, the activity, and the reversal rate. In the absence of reversals, our results show rich dynamical behavior including spiral formation and collective motion of aligned clusters of various sizes, depending on the bending stiffness and self-propulsion force. The presence of reversals diminishes spiral formation and reduces the sizes of clusters or suppresses clustering entirely. This homogenizing effect of direction reversals can be understood as reversals providing an additional mechanism to either unwind spirals or to resolve clusters. △ Less

Submitted 16 December, 2021; originally announced December 2021.

Comments: 15 pages, 11 figures

arXiv:2107.05481 [pdf, other]

Prequential MDL for Causal Structure Learning with Neural Networks

Authors: Jorg Bornschein, Silvia Chiappa, Alan Malek, Rosemary Nan Ke

Abstract: Learning the structure of Bayesian networks and causal relationships from observations is a common goal in several areas of science and technology. We show that the prequential minimum description length principle (MDL) can be used to derive a practical scoring function for Bayesian networks when flexible and overparametrized neural networks are used to model the conditional probability distributi… ▽ More Learning the structure of Bayesian networks and causal relationships from observations is a common goal in several areas of science and technology. We show that the prequential minimum description length principle (MDL) can be used to derive a practical scoring function for Bayesian networks when flexible and overparametrized neural networks are used to model the conditional probability distributions between observed variables. MDL represents an embodiment of Occam's Razor and we obtain plausible and parsimonious graph structures without relying on sparsity inducing priors or other regularizers which must be tuned. Empirically we demonstrate competitive results on synthetic and real-world data. The score often recovers the correct structure even in the presence of strongly nonlinear relationships between variables; a scenario were prior approaches struggle and usually fail. Furthermore we discuss how the the prequential score relates to recent work that infers causal structure from the speed of adaptation when the observations come from a source undergoing distributional shift. △ Less

Submitted 2 July, 2021; originally announced July 2021.

arXiv:2106.01476 [pdf, other]

doi 10.1021/jacs.1c04142

Low density interior in supercooled aqueous nanodroplets expels ions to the subsurface

Authors: Shahrazad M. A. Malek, Victor Kwan, Ivan Saika-Voivod, Styliani Consta

Abstract: The interaction between water and ions within droplets plays a key role in the chemical reactivity of atmospheric and man-made aerosols. Here we report direct computational evidence that in supercooled aqueous nanodroplets a lower density core of tetrahedrally coordinated water expels the cosmotropic ions to the denser and more disordered subsurface. In contrast, at room temperature, depending on… ▽ More The interaction between water and ions within droplets plays a key role in the chemical reactivity of atmospheric and man-made aerosols. Here we report direct computational evidence that in supercooled aqueous nanodroplets a lower density core of tetrahedrally coordinated water expels the cosmotropic ions to the denser and more disordered subsurface. In contrast, at room temperature, depending on the nature of the ion the radial distribution in the droplet core is nearly uniform or elevated towards the center. We analyze the spatial distribution of a single ion in terms of a reference electrostatic model. The energy of the system in the analytical model is expressed as the sum of the electrostatic and surface energy of a deformable droplet. The model predicts that the ion is subject to a harmonic potential centered at the droplet's center of mass. We name this effect "electrostatic confinement". The model's predictions are consistent with the simulation findings for a single ion at room temperature but not at supercooling. We anticipate this study to be the starting point for investigating the structure of supercooled (electro)sprayed droplets that are used to preserve the conformations of macromolecules originating from the bulk solution. △ Less

Submitted 10 August, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

arXiv:2011.03567 [pdf, other]

Anytime-Valid Inference for Multinomial Count Data

Authors: Michael Lindon, Alan Malek

Abstract: Many experiments are concerned with the comparison of counts between treatment groups. Examples include the number of successful signups in conversion rate experiments, or the number of errors produced by software versions in canary experiments. Observations typically arrive in data streams and practitioners wish to continuously monitor their experiments, sequentially testing hypotheses while main… ▽ More Many experiments are concerned with the comparison of counts between treatment groups. Examples include the number of successful signups in conversion rate experiments, or the number of errors produced by software versions in canary experiments. Observations typically arrive in data streams and practitioners wish to continuously monitor their experiments, sequentially testing hypotheses while maintaining Type I error probabilities under optional stop** and continuation. These goals are frequently complicated in practice by non-stationary time dynamics. We provide practical solutions through sequential tests of multinomial hypotheses, hypotheses about many inhomogeneous Bernoulli processes and hypotheses about many time-inhomogeneous Poisson counting processes. For estimation, we further provide confidence sequences for multinomial probability vectors, all contrasts among probabilities of inhomogeneous Bernoulli processes and all contrasts among intensities of time-inhomogeneous Poisson counting processes. Together, these provide an "anytime-valid" inference framework for a wide variety of experiments dealing with count outcomes, which we illustrate with a number of industry applications. △ Less

Submitted 28 May, 2022; v1 submitted 6 November, 2020; originally announced November 2020.

Journal ref: 36th Conference on Neural Information Processing Systems (NeurIPS2022)

arXiv:2008.05064 [pdf]

doi 10.1007/s40593-018-0166-3

Effects of Voice-Based Synthetic Assistant on Performance of Emergency Care Provider in Training

Authors: Praveen Damacharla, Parashar Dhakal, Sebastian Stumbo, Ahmad Y. Javaid, Subhashini Ganapathy, David A. Malek, Douglas C. Hodge, Vijay Devabhaktuni

Abstract: As part of a perennial project, our team is actively engaged in develo** new synthetic assistant (SA) technologies to assist in training combat medics and medical first responders. It is critical that medical first responders are well trained to deal with emergencies more effectively. This would require real-time monitoring and feedback for each trainee. Therefore, we introduced a voice-based SA… ▽ More As part of a perennial project, our team is actively engaged in develo** new synthetic assistant (SA) technologies to assist in training combat medics and medical first responders. It is critical that medical first responders are well trained to deal with emergencies more effectively. This would require real-time monitoring and feedback for each trainee. Therefore, we introduced a voice-based SA to augment the training process of medical first responders and enhance their performance in the field. The potential benefits of SAs include a reduction in training costs and enhanced monitoring mechanisms. Despite the increased usage of voice-based personal assistants (PAs) in day-to-day life, the associated effects are commonly neglected for a study of human factors. Therefore, this paper focuses on performance analysis of the developed voice-based SA in emergency care provider training for a selected emergency treatment scenario. The research discussed in this paper follows design science in develo** proposed technology; at length, we discussed architecture and development and presented working results of voice-based SA. The empirical testing was conducted on two groups as user studies using statistical analysis tools, one trained with conventional methods and the other with the help of SA. The statistical results demonstrated the amplification in training efficacy and performance of medical responders powered by SA. Furthermore, the paper also discusses the accuracy and time of task execution (t) and concludes with the guidelines for resolving the identified problems. △ Less

Submitted 11 August, 2020; originally announced August 2020.

ACM Class: H.1.2; I.2.1; I.2.7

Journal ref: Int J Artif Intell Educ, 29, 122-143, 2018

arXiv:2006.03650 [pdf]

doi 10.1007/s11123-019-00555-8

The Effects of Access to Credit on Productivity Among Microenterprises: Separating Technological Changes from Changes in Technical Efficiency

Authors: Nusrat Abedin Jimi, Plamen Nikolov, Mohammad Abdul Malek, Subal Kumbhakar

Abstract: Improving productivity among farm microenterprises is important, especially in low-income countries where market imperfections are pervasive and resources are scarce. Relaxing credit constraints can increase the productivity of farmers. Using a field experiment involving microenterprises in Bangladesh, we estimate the impact of access to credit on the overall productivity of rice farmers, and dise… ▽ More Improving productivity among farm microenterprises is important, especially in low-income countries where market imperfections are pervasive and resources are scarce. Relaxing credit constraints can increase the productivity of farmers. Using a field experiment involving microenterprises in Bangladesh, we estimate the impact of access to credit on the overall productivity of rice farmers, and disentangle the total effect into technological change (frontier shift) and technical efficiency changes. We find that relative to the baseline rice output per decimal, access to credit results in, on average, approximately a 14 percent increase in yield, holding all other inputs constant. After decomposing the total effect into the frontier shift and efficiency improvement, we find that, on average, around 11 percent of the increase in output comes from changes in technology, or frontier shift, while the remaining 3 percent is attributed to improvements in technical efficiency. The efficiency gain is higher for modern hybrid rice varieties, and almost zero for traditional rice varieties. Within the treatment group, the effect is greater among pure tenant and mixed-tenant farm households compared with farmers that only cultivate their own land. △ Less

Submitted 5 June, 2020; originally announced June 2020.

Journal ref: Journal of Productivity Analysis, 52 (2019): 37-55 (2019)

arXiv:1912.03861 [pdf, other]

Daily Data Assimilation of a Hydrologic Model Using the Ensemble Kalman Filter

Authors: Sami A. Malek, Alexandre M. Bayen, Steven D. Glaser

Abstract: Accurate runoff forecasting is crucial for reservoir operators as it allows optimized water management, flood control and hydropower generation. Land surface models in mountainous regions depend on climatic inputs such as precipitation, temperature and solar radiation to model the water and energy dynamics and produce runoff as output. With the rapid development of cheap electronics applied in var… ▽ More Accurate runoff forecasting is crucial for reservoir operators as it allows optimized water management, flood control and hydropower generation. Land surface models in mountainous regions depend on climatic inputs such as precipitation, temperature and solar radiation to model the water and energy dynamics and produce runoff as output. With the rapid development of cheap electronics applied in various systems, such as Wireless Sensor Networks (WSNs), satellite and airborne technologies, the prospect of practically measuring spatial Snow Water Equivalent in a dense temporal scale is increasing. We present a framework for updating the Precipitation Runoff Modeling System (PRMS) with Snow Water Equivalent (SWE) maps and runoff measurements on a daily timescale based on the Ensemble Kalman Filter (ENKF). Results show that by assimilating SWE daily, the modeled SWE gets updated accordingly, however no improvement is observed at the runoff model output. Instead, a deterioration consistently occurs. Augmenting the state space with model parameters and runoff model output allows for filter update with previous day measured runoff using the joint state-parameter method, and showed a considerable improvement in the daily runoff output of up to 60% reduction in RMSE for the wet water year 2011 relative to the no assimilation scenario, and improvement of up to 28% compared to a naive autoregressive AR(1) filter. Additional simulation years showed consistent improvement compared to no assimilation, but varied relative to the previous day autoregressive forecast during the dry year 2014. △ Less

Submitted 9 December, 2019; originally announced December 2019.

Comments: 18 pages, 5 figures, 4 tables and supplement

Journal ref: Published as Masters thesis here: https://www2.eecs.berkeley.edu/Pubs/TechRpts/2019/EECS-2019-101.html

arXiv:1905.13709 [pdf, other]

doi 10.1063/1.5096990

Surface tension of supercooled water nanodroplets from computer simulations

Authors: Shahrazad M. A. Malek, Peter H. Poole, Ivan Saika-Voivod

Abstract: We estimate the liquid-vapour surface tension from simulations of TIP4P/2005 water nanodroplets of size $N$=100 to 2880 molecules over a temperature $T$ range of 180 K to 300 K. We compute the planar surface tension $γ_p$, the curvature-dependent surface tension $γ_s$, and the Tolman length $δ$, via two approaches, one based on the pressure tensor (the "mechanical route") and the other on the Lapl… ▽ More We estimate the liquid-vapour surface tension from simulations of TIP4P/2005 water nanodroplets of size $N$=100 to 2880 molecules over a temperature $T$ range of 180 K to 300 K. We compute the planar surface tension $γ_p$, the curvature-dependent surface tension $γ_s$, and the Tolman length $δ$, via two approaches, one based on the pressure tensor (the "mechanical route") and the other on the Laplace pressure (the "thermodynamic route"). We find that these two routes give different results for $γ_p$, $γ_s$ and $δ$, although in all cases we find that $δ\ge 0$ and is independent of $T$. Nonetheless, the $T$ dependence of $γ_p$ is consistent between the two routes and with that of Vega and de Miguel [J. Chem. Phys. 126, 154707 (2007)] down to the crossing of the Widom line at 230 K for ambient pressure. Below 230 K, $γ_p$ rises more rapidly on cooling than predicted from behavior for $T\ge 300$ K. We show that the increase in $γ_p$ at low $T$ is correlated to the emergence of a well-structured random tetrahedral network in our nanodroplet cores, and thus that the surface tension can be used as a probe to detect behavior associated with the proposed liquid-liquid phase transition in supercooled water. △ Less

Submitted 31 May, 2019; originally announced May 2019.

Comments: 11 pages, 11 figures

arXiv:1901.01992 [pdf, ps, other]

Large-Scale Markov Decision Problems via the Linear Programming Dual

Authors: Yasin Abbasi-Yadkori, Peter L. Bartlett, Xi Chen, Alan Malek

Abstract: We consider the problem of controlling a fully specified Markov decision process (MDP), also known as the planning problem, when the state space is very large and calculating the optimal policy is intractable. Instead, we pursue the more modest goal of optimizing over some small family of policies. Specifically, we show that the family of policies associated with a low-dimensional approximation of… ▽ More We consider the problem of controlling a fully specified Markov decision process (MDP), also known as the planning problem, when the state space is very large and calculating the optimal policy is intractable. Instead, we pursue the more modest goal of optimizing over some small family of policies. Specifically, we show that the family of policies associated with a low-dimensional approximation of occupancy measures yields a tractable optimization. Moreover, we propose an efficient algorithm, scaling with the size of the subspace but not the state space, that is able to find a policy with low excess loss relative to the best policy in this class. To the best of our knowledge, such results did not exist in the literature previously. We bound excess loss in the average cost and discounted cost cases, which are treated separately. Preliminary experiments show the effectiveness of the proposed algorithms in a queueing application. △ Less

Submitted 6 January, 2019; originally announced January 2019.

Comments: 53 pages. arXiv admin note: text overlap with arXiv:1402.6763

arXiv:1805.07514 [pdf]

doi 10.1142/S0217984919500775

Optimization of Neon Soft X-ray Emission in Low Energy Dense Plasma Focus Device

Authors: M. A. Malek, M. K. Islam, M. Salahuddin

Abstract: The Lee model code is used in numerical experiments for characterizing and optimizing neon soft X-ray (Ysxr) yield of UNU/ICTP PFF machine operated at 14 kV and 30 micro F. The neon Ysxr yield of the dense plasma focus device is enhanced by reducing static inductance (L0) and anode length (z0) along with increasing anode radius (a) and cathode radius (b), kee** their ratio (c = b/a) constant at… ▽ More The Lee model code is used in numerical experiments for characterizing and optimizing neon soft X-ray (Ysxr) yield of UNU/ICTP PFF machine operated at 14 kV and 30 micro F. The neon Ysxr yield of the dense plasma focus device is enhanced by reducing static inductance (L0) and anode length (z0) along with increasing anode radius (a) and cathode radius (b), kee** their ratio (c = b/a) constant at 3.368. At the optimum combination of the electrodes geometry and static inductance, the maximum computed value of neon Ysxr yield is 63.61 J at operating pressure 3.3 Torr with corresponding X-ray yield efficiency 2.16%, while the end axial speed becomes 6.42 cm/s. This value of neon Ysxr yield is twelve to thirteen times higher than the measured value (5.4 J) at 3.0 Torr. It is also found that this neon Ysxr yield is improved around seven times from previously computed value (9.5 J) at 3.5 Torr for optimum anode configuration of this machine. Our obtained results of neon Ysxr yield are also compared with the computed results of AECS-PF2 machine operated at 15 kV and 25 micro F and is found that our results are about three times better than that from the optimized AECS-PF2 at L0 = 15 nH. △ Less

Submitted 19 May, 2018; originally announced May 2018.

Comments: Article

arXiv:1805.03233 [pdf]

doi 10.1038/hgv.2016.16

The Qatar Genome: A Population-Specific Tool for Precision Medicine in the Middle East

Authors: Khalid A. Fakhro, Michelle R. Staudt, Monica Denise Ramstetter, Amal Robay, Joel A. Malek, Ramin Badii, Ajayeb Al-Nabet Al-Marri, Charbel Abi Khalil, Alya Al-Shakaki, Omar Chidiac, Dora Stadler, Mahmoud Zirie, Amin Jayyousi, Jacqueline Salit, Jason G. Mezey, Ronald G. Crystal, Juan L. Rodriguez-Flores

Abstract: Reaching the full potential of precision medicine depends on the quality of personalized genome interpretation. In order to facilitate precision medicine in regions of the Middle East and North Africa (MENA), a population-specific reference genome for the indigenous Arab popula-tion of Qatar (QTRG) was constructed by incorporating allele frequency data from sequencing of 1,161 Qataris, representin… ▽ More Reaching the full potential of precision medicine depends on the quality of personalized genome interpretation. In order to facilitate precision medicine in regions of the Middle East and North Africa (MENA), a population-specific reference genome for the indigenous Arab popula-tion of Qatar (QTRG) was constructed by incorporating allele frequency data from sequencing of 1,161 Qataris, representing 0.4% of the population. A total of 20.9 million SNP and 3.1 million indels were observed in Qatar, including an average of 1.79% novel variants per individual ge-nome. Replacement of the GRCh37 standard reference with QTRG in a best practices genome analysis workflow resulted in an average of 7* deeper coverage depth (an improvement of 23%), and 756,671 fewer variants on average, a reduction of 16% that is attributed to common Qatari alleles being present in the QTRG reference. The benefit for using QTRG varies across ances-tries, a factor that should be taken into consideration when selecting an appropriate reference for analysis. △ Less

Submitted 13 May, 2018; v1 submitted 8 May, 2018; originally announced May 2018.

Comments: Includes supplementary figures missing from publisher website

Journal ref: Hum Genome Var. 2016 Jun 30;3:16016

arXiv:1803.00269 [pdf, other]

doi 10.1002/mma.6352

A new approach to solving multi-order fractional equations using BEM and Chebyshev matrix

Authors: Moein Khalighi, Mohammad Amirian Matlob, Alaeddin Malek

Abstract: In this paper, the boundary element method is combined with Chebyshev operational matrix technique to solve two-dimensional multi-order time-fractional partial differential equations; nonlinear and linear in respect to spatial and temporal variables, respectively. Fractional derivatives are estimated by Caputo sense. Boundary element method is used to convert the main problem into a system of a mu… ▽ More In this paper, the boundary element method is combined with Chebyshev operational matrix technique to solve two-dimensional multi-order time-fractional partial differential equations; nonlinear and linear in respect to spatial and temporal variables, respectively. Fractional derivatives are estimated by Caputo sense. Boundary element method is used to convert the main problem into a system of a multi-order fractional ordinary differential equation. Then, the produced system is approximated by Chebyshev operational matrix technique, ans its condition number is analyzed. Accuracy and efficiency of the proposed hybrid scheme are demonstrated by solving three different types of two-dimensional time fractional convection-diffusion equations numerically. The convergent rates are calculated for different meshing within the boundary element technique. Numerical results are given by graphs and tables for solutions and different type of error norms. △ Less

Submitted 17 February, 2020; v1 submitted 1 March, 2018; originally announced March 2018.

MSC Class: 65M38-65M70-35R11

arXiv:1802.09514 [pdf, ps, other]

Best Arm Identification for Contaminated Bandits

Authors: Jason Altschuler, Victor-Emmanuel Brunel, Alan Malek

Abstract: This paper studies active learning in the context of robust statistics. Specifically, we propose a variant of the Best Arm Identification problem for \emph{contaminated bandits}, where each arm pull has probability $\varepsilon$ of generating a sample from an arbitrary contamination distribution instead of the true underlying distribution. The goal is to identify the best (or approximately best) t… ▽ More This paper studies active learning in the context of robust statistics. Specifically, we propose a variant of the Best Arm Identification problem for \emph{contaminated bandits}, where each arm pull has probability $\varepsilon$ of generating a sample from an arbitrary contamination distribution instead of the true underlying distribution. The goal is to identify the best (or approximately best) true distribution with high probability, with a secondary goal of providing guarantees on the quality of this distribution. The primary challenge of the contaminated bandit setting is that the true distributions are only partially identifiable, even with infinite samples. To address this, we develop tight, non-asymptotic sample complexity bounds for high-probability estimation of the first two robust moments (median and median absolute deviation) from contaminated samples. These concentration inequalities are the main technical contributions of the paper and may be of independent interest. Using these results, we adapt several classical Best Arm Identification algorithms to the contaminated bandit setting and derive sample complexity upper bounds for our problem. Finally, we provide matching information-theoretic lower bounds on the sample complexity (up to a small logarithmic factor). △ Less

Submitted 15 May, 2019; v1 submitted 26 February, 2018; originally announced February 2018.

Comments: to appear in Journal of Machine Learning Research (JMLR)

Journal ref: Journal of Machine Learning Research (JMLR), 20(91), 1-39, 2019

arXiv:1802.02015 [pdf, ps, other]

Compact ADI method for solving two-dimensional Riesz space fractional diffusion equation

Authors: Sohrab Valizadeh, Alaeddin Malek, Abdollah Borhanifar

Abstract: In this paper, a compact alternating direction implicit (ADI) method has been developed for solving two-dimensional Riesz space fractional diffusion equation. The precision of the discretization method used in spatial directions is twice the order of the corresponding fractional derivatives. It is proved that the proposed method is unconditionally stable via the matrix analysis method and the maxi… ▽ More In this paper, a compact alternating direction implicit (ADI) method has been developed for solving two-dimensional Riesz space fractional diffusion equation. The precision of the discretization method used in spatial directions is twice the order of the corresponding fractional derivatives. It is proved that the proposed method is unconditionally stable via the matrix analysis method and the maximum error in achieving convergence is discussed. Several numerical examples are considered aiming to demonstrate the validity and applicability of the proposed technique. △ Less

Submitted 18 April, 2020; v1 submitted 6 February, 2018; originally announced February 2018.

Comments: 16 pages, 2 tables, This article has not been published in any journals

MSC Class: 35R11; 34K28; 65M06; 65M12

arXiv:1711.03994 [pdf, other]

doi 10.1088/1361-648X/aab196

Evaluating the Laplace pressure of water nanodroplets from simulations

Authors: Shahrazad M. A. Malek, Francesco Sciortino, Peter H. Poole, Ivan Saika-Voivod

Abstract: We calculate the components of the microscopic pressure tensor as a function of radial distance r from the centre of a spherical water droplet, modelled using the TIP4P/2005 potential. To do so, we modify a coarse-graining method for calculating the microscopic pressure [T. Ikeshoji, B. Hafskjold, and H. Furuholt, Mol. Simul. 29, 101 (2003)] in order to apply it to a rigid molecular model of water… ▽ More We calculate the components of the microscopic pressure tensor as a function of radial distance r from the centre of a spherical water droplet, modelled using the TIP4P/2005 potential. To do so, we modify a coarse-graining method for calculating the microscopic pressure [T. Ikeshoji, B. Hafskjold, and H. Furuholt, Mol. Simul. 29, 101 (2003)] in order to apply it to a rigid molecular model of water. As test cases, we study nanodroplets ranging in size from 776 to 2880 molecules at 220 K. Beneath a surface region comprising approximately two molecular layers, the pressure tensor becomes approximately isotropic and constant with r. We find that the dependence of the pressure on droplet radius is that expected from the Young-Laplace equation, despite the small size of the droplets. △ Less

Submitted 14 March, 2018; v1 submitted 10 November, 2017; originally announced November 2017.

Comments: 10 pages, 6 figures

Journal ref: J. Phys.: Condens. Matter 30, 144005 (2018)

arXiv:1710.10622 [pdf, ps, other]

doi 10.1140/epje/i2017-11588-2

"Swarm relaxation": Equilibrating a large ensemble of computer simulations

Authors: Shahrazad M. A. Malek, Richard K. Bowles, Ivan Saika-Voivod, Francesco Sciortino, Peter H. Poole

Abstract: It is common practice in molecular dynamics and Monte Carlo computer simulations to run multiple, separately-initialized simulations in order to improve the sampling of independent microstates. Here we examine the utility of an extreme case of this strategy, in which we run a large ensemble of $M$ independent simulations (a "swarm"), each of which is relaxed to equilibrium. We show that if $M$ is… ▽ More It is common practice in molecular dynamics and Monte Carlo computer simulations to run multiple, separately-initialized simulations in order to improve the sampling of independent microstates. Here we examine the utility of an extreme case of this strategy, in which we run a large ensemble of $M$ independent simulations (a "swarm"), each of which is relaxed to equilibrium. We show that if $M$ is of order $10^3$, we can monitor the swarm's relaxation to equilibrium, and confirm its attainment, within $\sim 10\barτ$, where $\barτ$ is the equilibrium relaxation time. As soon as a swarm of this size attains equilibrium, the ensemble of $M$ final microstates from each run is sufficient for the evaluation of most equilibrium properties without further sampling. This approach dramatically reduces the wall-clock time required, compared to a single long simulation, by a factor of several hundred, at the cost of an increase in the total computational effort by a small factor. It is also well-suited to modern computing systems having thousands of processors, and is a viable strategy for simulation studies that need to produce high-precision results in a minimum of wall-clock time. We present results obtained by applying this approach to several test cases. △ Less

Submitted 29 October, 2017; originally announced October 2017.

Comments: 12 pages. To appear in Eur. Phy. J. E, 2017

Journal ref: Eur. Phys. J. E 40, 98 (2017)

arXiv:1610.08865 [pdf, other]

Hit-and-Run for Sampling and Planning in Non-Convex Spaces

Authors: Yasin Abbasi-Yadkori, Peter L. Bartlett, Victor Gabillon, Alan Malek

Abstract: We propose the Hit-and-Run algorithm for planning and sampling problems in non-convex spaces. For sampling, we show the first analysis of the Hit-and-Run algorithm in non-convex spaces and show that it mixes fast as long as certain smoothness conditions are satisfied. In particular, our analysis reveals an intriguing connection between fast mixing and the existence of smooth measure-preserving map… ▽ More We propose the Hit-and-Run algorithm for planning and sampling problems in non-convex spaces. For sampling, we show the first analysis of the Hit-and-Run algorithm in non-convex spaces and show that it mixes fast as long as certain smoothness conditions are satisfied. In particular, our analysis reveals an intriguing connection between fast mixing and the existence of smooth measure-preserving map**s from a convex space to the non-convex space. For planning, we show advantages of Hit-and-Run compared to state-of-the-art planning methods such as Rapidly-Exploring Random Trees. △ Less

Submitted 19 October, 2016; originally announced October 2016.

arXiv:1607.06800 [pdf, ps, other]

Dynamical Transitions in a Dragged Growing Polymer Chain

Authors: Ali Malek, Reiner Kree

Abstract: We extend the Rouse model of polymer dynamics to situations of non-stationary chain growth. For a dragged polymer chain of length $N(t) = t^α$, we find two transitions in conformational dynamics. At $α= 1/2$, the propagation of tension and the average shape of the chain change qualitatively, while at $α= 1 $ the average center-of-mass motion stops. These transitions are due to a simple physical me… ▽ More We extend the Rouse model of polymer dynamics to situations of non-stationary chain growth. For a dragged polymer chain of length $N(t) = t^α$, we find two transitions in conformational dynamics. At $α= 1/2$, the propagation of tension and the average shape of the chain change qualitatively, while at $α= 1 $ the average center-of-mass motion stops. These transitions are due to a simple physical mechanism: a race duel between tension propagation and polymer growth. Therefore they should also appear for growing semi-flexible or stiff polymers. The generalized Rouse model inherits much of the versatility of the original Rouse model: it can be efficiently simulated and it is amenable to analytical treatment. △ Less

Submitted 22 July, 2016; originally announced July 2016.

arXiv:1603.04190 [pdf, ps, other]

Online Isotonic Regression

Authors: Wojciech Kotłowski, Wouter M. Koolen, Alan Malek

Abstract: We consider the online version of the isotonic regression problem. Given a set of linearly ordered points (e.g., on the real line), the learner must predict labels sequentially at adversarially chosen positions and is evaluated by her total squared loss compared against the best isotonic (non-decreasing) function in hindsight. We survey several standard online learning algorithms and show that non… ▽ More We consider the online version of the isotonic regression problem. Given a set of linearly ordered points (e.g., on the real line), the learner must predict labels sequentially at adversarially chosen positions and is evaluated by her total squared loss compared against the best isotonic (non-decreasing) function in hindsight. We survey several standard online learning algorithms and show that none of them achieve the optimal regret exponent; in fact, most of them (including Online Gradient Descent, Follow the Leader and Exponential Weights) incur linear regret. We then prove that the Exponential Weights algorithm played over a covering net of isotonic functions has a regret bounded by $O\big(T^{1/3} \log^{2/3}(T)\big)$ and present a matching $Ω(T^{1/3})$ lower bound on regret. We provide a computationally efficient version of this algorithm. We also analyze the noise-free case, in which the revealed labels are isotonic, and show that the bound can be improved to $O(\log T)$ or even to $O(1)$ (when the labels are revealed in isotonic order). Finally, we extend the analysis beyond squared loss and give bounds for entropic loss and absolute loss. △ Less

Submitted 7 October, 2016; v1 submitted 14 March, 2016; originally announced March 2016.

Comments: 25 pages

arXiv:1407.5899 [pdf, ps, other]

doi 10.1063/1.4915917

Crystallization of Lennard-Jones nanodroplets: from near melting to deeply supercooled

Authors: Shahrazad M. A. Malek, Gregory P. Morrow, Ivan Saika-Voivod

Abstract: We carry out molecular dynamics (MD) and Monte Carlo (MC) simulations to characterize nucleation in liquid clusters of 600 Lennard-Jones particles over a broad range of temperatures. We use the formalism of mean first-passage times to determine the rate and find that Classical Nucleation Theory (CNT) predicts the rate quite well, even when employing simple modelling of crystallite shape, chemical… ▽ More We carry out molecular dynamics (MD) and Monte Carlo (MC) simulations to characterize nucleation in liquid clusters of 600 Lennard-Jones particles over a broad range of temperatures. We use the formalism of mean first-passage times to determine the rate and find that Classical Nucleation Theory (CNT) predicts the rate quite well, even when employing simple modelling of crystallite shape, chemical potential, surface tension and particle attachment rate, down to the temperature where the droplet loses metastability and crystallization proceeds through growth-limited nucleation in an unequilibrated liquid. Below this crossover temperature, the nucleation rate is still predicted when MC simulations are used to directly calculate quantities required by CNT. Discrepancy in critical embryo sizes obtained from MD and MC arises when twinned structures with five-fold symmetry provide a competing free energy pathway out of the critical region. We find that crystallization begins with hcp-fcc stacked precritical nuclei and differentiation to various end structures occurs when these embryos become critical. We confirm that using the largest embryo in the system as a reaction coordinate is useful in determining the onset of growth-limited nucleation and show that it gives the same free energy barriers as the full cluster size distribution once the proper reference state is identified. We find that the bulk melting temperature controls the rate, even though the solid-liquid coexistence temperature for the droplet is significantly lower. The value of surface tension that renders close agreement between CNT and direct rate determination is significantly lower than what is expected for the bulk system. △ Less

Submitted 6 March, 2015; v1 submitted 22 July, 2014; originally announced July 2014.

Comments: 17 pages, 14 figures

Journal ref: J. Chem. Phys. 142, 124506 (2015)

arXiv:1405.1069 [pdf]

doi 10.1063/1.4890286

Integrating Atomic Layer Deposition and Ultra-High Vacuum Physical Vapor Deposition for In Situ Fabrication of Tunnel Junctions

Authors: Alan J. Elliot, Gary A. Malek, Rongtao Lu, Siyuan Han, Haifeng Yiu, Shi** Zhao, Judy Z. Wu

Abstract: Atomic Layer Deposition (ALD) is a promising technique for growing ultrathin, pristine dielectrics on metal substrates, which is essential to many electronic devices. Tunnel junctions are an excellent example which require a leak-free, ultrathin dielectric tunnel barrier of typical thickness around 1 nm between two metal electrodes. A challenge in the development of ultrathin dielectric tunnel bar… ▽ More Atomic Layer Deposition (ALD) is a promising technique for growing ultrathin, pristine dielectrics on metal substrates, which is essential to many electronic devices. Tunnel junctions are an excellent example which require a leak-free, ultrathin dielectric tunnel barrier of typical thickness around 1 nm between two metal electrodes. A challenge in the development of ultrathin dielectric tunnel barrier using ALD is controlling the nucleation of dielectrics on metals with minimal formation of native oxides at the metal surface for high-quality interfaces between the tunnel barrier and metal electrodes. This poses a critical need for integrating ALD with ultra-high vacuum (UHV) physical vapor deposition. In order to address these challenges, a viscous-flow ALD chamber was designed and interfaced to an UHV magnetron sputtering chamber via a load lock. A sample transportation system was implemented for in situ sample transfer between the ALD, load lock, and sputtering chambers. Using this integrated ALD-UHV sputtering system, superconductor-insulator-superconductor (SIS) Nb/Al/Al2O3/Nb Josephson tunnel junctions were fabricated with tunnel barriers of thickness varied from sub-nm to ~ 1 nm. The suitability of using an Al wetting layer for initiation of the ALD Al2O3 tunnel barrier was investigated with ellipsometry, atomic force microscopy, and electrical transport measurements. With optimized processing conditions, leak-free SIS tunnel junctions were obtained, demonstrating the viability of this integrated ALD-UHV sputtering system for the fabrication of tunnel junctions and devices comprised of metal-dielectric-metal multilayers. △ Less

Submitted 5 May, 2014; originally announced May 2014.

Comments: 25 pages, 13 figures, 1 table

arXiv:1402.6763 [pdf, ps, other]

Linear Programming for Large-Scale Markov Decision Problems

Authors: Yasin Abbasi-Yadkori, Peter L. Bartlett, Alan Malek

Abstract: We consider the problem of controlling a Markov decision process (MDP) with a large state space, so as to minimize average cost. Since it is intractable to compete with the optimal policy for large scale problems, we pursue the more modest goal of competing with a low-dimensional family of policies. We use the dual linear programming formulation of the MDP average cost problem, in which the variab… ▽ More We consider the problem of controlling a Markov decision process (MDP) with a large state space, so as to minimize average cost. Since it is intractable to compete with the optimal policy for large scale problems, we pursue the more modest goal of competing with a low-dimensional family of policies. We use the dual linear programming formulation of the MDP average cost problem, in which the variable is a stationary distribution over state-action pairs, and we consider a neighborhood of a low-dimensional subset of the set of stationary distributions (defined in terms of state-action features) as the comparison class. We propose two techniques, one based on stochastic convex optimization, and one based on constraint sampling. In both cases, we give bounds that show that the performance of our algorithms approaches the best achievable by any policy in the comparison class. Most importantly, these results depend on the size of the comparison class, but not on the size of the state space. Preliminary experiments show the effectiveness of the proposed algorithms in a queuing application. △ Less

Submitted 26 February, 2014; originally announced February 2014.

Comments: 27 pages, 3 figures

arXiv:1310.6918 [pdf]

Discontinuities of energy derivatives in spin-density functional theory

Authors: Ali M. Malek, Robert Balawender

Abstract: Standard approximations for the exchange-correlation functional are known to deviate from linear dependence of the energy on the electron and spin numbers (in -space). Violation of this flat-plane condition underlies the failure of all known approximate functionals to describe band gaps in strongly correlated systems. Then crucial for further development of functionals is recognition of the behavi… ▽ More Standard approximations for the exchange-correlation functional are known to deviate from linear dependence of the energy on the electron and spin numbers (in -space). Violation of this flat-plane condition underlies the failure of all known approximate functionals to describe band gaps in strongly correlated systems. Then crucial for further development of functionals is recognition of the behavior of the energy as a function of, and its derivatives (derivatives discontinuity pattern at 0K limit). In this Letter, the energy derivatives are analyzed thoroughly. It is shown that apart from the well-known type of discontinuity pattern, three other patterns are possible in the vicinity of the singlet state point. The zero temperature limits of energy derivatives at this point are derived, found different for various directions. Existence of all discontinuity patterns is illustrated on the example of diatomic molecules set. △ Less

Submitted 25 October, 2013; originally announced October 2013.

Comments: Pages 1-15 are a manuscript, pages 16-22 are supplemental material (equations derivation, full details of the calculations), pages 23-50 are numerical data

Showing 1–29 of 29 results for author: Malek, A