Search | arXiv e-print repository

Efficient Wait-Free Linearizable Implementations of Approximate Bounded Counters Using Read-Write Registers

Authors: Colette Johnen, Adnane Khattabi, Alessia Milani, Jennifer L. Welch

Abstract: Relaxing the sequential specification of a shared object is a way to obtain an implementation with better performance compared to implementing the original specification. We apply this approach to the Counter object, under the assumption that the number of times the Counter is incremented in any execution is at most a known bound $m$. We consider the $k$-multiplicative-accurate Counter object, whe… ▽ More Relaxing the sequential specification of a shared object is a way to obtain an implementation with better performance compared to implementing the original specification. We apply this approach to the Counter object, under the assumption that the number of times the Counter is incremented in any execution is at most a known bound $m$. We consider the $k$-multiplicative-accurate Counter object, where each read operation returns an approximate value that is within a multiplicative factor $k$ of the accurate value. More specifically, a read is allowed to return an approximate value $x$ of the number $v$ of increments previously applied to the counter such that $v/k \le x \le vk$. We present three algorithms to implement this object in a wait-free linearizable manner in the shared memory model using read-write registers. All the algorithms have read operations whose worst-case step complexity improves exponentially on that for an exact $m$-bounded counter (which in turn improves exponentially on that for an exact unbounded counter). Two of the algorithms have read step complexity that is asymptotically optimal. The algorithms differ in their requirements on $k$, step complexity of the increment operation, and space complexity. △ Less

Submitted 21 February, 2024; originally announced February 2024.

Comments: 26 pages, to be published in SIROCCO 2024 proceedings

arXiv:2312.13713 [pdf, other]

doi 10.1103/PhysRevB.109.094111

Optical Transmission Enhancement of Ionic Crystals via Superionic Fluoride Transfer: Growing VUV-Transparent Radioactive Crystals

Authors: Kjeld Beeks, Tomas Sikorsky, Fabian Schaden, Martin Pressler, Felix Schneider, Björn N. Koch, Thomas Pronebner, David Werban, Niyusha Hosseini, Georgy Kazakov, Jan Welch, Johannes H. Sterba, Florian Kraus, Thorsten Schumm

Abstract: The 8 eV first nuclear excited state in $^{229}$Th is a candidate for implementing an nuclear clock. Do** $^{229}$Th into ionic crystals such as CaF$_2$ is expected to suppress non-radiative decay, enabling nuclear spectroscopy and the realization of a solid-state optical clock. Yet, the inherent radioactivity of $^{229}$Th prohibits the growth of high-quality single crystals with high $^{229}$T… ▽ More The 8 eV first nuclear excited state in $^{229}$Th is a candidate for implementing an nuclear clock. Do** $^{229}$Th into ionic crystals such as CaF$_2$ is expected to suppress non-radiative decay, enabling nuclear spectroscopy and the realization of a solid-state optical clock. Yet, the inherent radioactivity of $^{229}$Th prohibits the growth of high-quality single crystals with high $^{229}$Th concentration; radiolysis causes fluoride loss, increasing absorption at 8 eV. We overcome this roadblock by annealing $^{229}$Th doped CaF$_2$ at 1250$\unicode{x2103}$ in CF$_4$. The technique presented here allows to adjust the fluoride content without crystal melting, preserving its single-crystal structure. Superionic state annealing ensures rapid fluoride distribution, creating fully transparent and radiation-hard crystals. This approach enables control over the charge state of dopants which can be used in deep UV optics, laser crystals, scintillators, and nuclear clocks. △ Less

Submitted 29 February, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

Comments: 5 pages, 5 figures

arXiv:2312.10100 [pdf, other]

Data-Adaptive Dimensional Analysis for Accurate Interpolation and Extrapolation in Computer Experiments

Authors: G. Alexi Rodriguez-Arelis, William J. Welch

Abstract: Dimensional analysis (DA) pays attention to fundamental physical dimensions such as length and mass when modelling scientific and engineering systems. It goes back at least a century to Buckingham's Pi theorem, which characterizes a scientifically meaningful model in terms of a limited number of dimensionless variables. The methodology has only been exploited relatively recently by statisticians f… ▽ More Dimensional analysis (DA) pays attention to fundamental physical dimensions such as length and mass when modelling scientific and engineering systems. It goes back at least a century to Buckingham's Pi theorem, which characterizes a scientifically meaningful model in terms of a limited number of dimensionless variables. The methodology has only been exploited relatively recently by statisticians for design and analysis of experiments, however, and computer experiments in particular. The basic idea is to build models in terms of new dimensionless quantities derived from the original input and output variables. A scientifically valid formulation has the potential for improved prediction accuracy in principle, but the implementation of DA is far from straightforward. There can be a combinatorial number of possible models satisfying the conditions of the theory. Empirical approaches for finding effective derived variables will be described, and improvements in prediction accuracy will be demonstrated. As DA's dimensionless quantities for a statistical model typically compare the original variables rather than use their absolute magnitudes, DA is less dependent on the choice of experimental ranges in the training data. Hence, we are also able to illustrate sustained accuracy gains even when extrapolating substantially outside the training data. △ Less

Submitted 14 December, 2023; originally announced December 2023.

Comments: 21 pages, 11 figures

arXiv:2308.04646 [pdf, other]

Multi-Valued Connected Consensus: A New Perspective on Crusader Agreement and Adopt-Commit

Authors: Hagit Attiya, Jennifer L. Welch

Abstract: Algorithms to solve fault-tolerant consensus in asynchronous systems often rely on primitives such as crusader agreement, adopt-commit, and graded broadcast, which provide weaker agreement properties than consensus. Although these primitives have a similar flavor, they have been defined and implemented separately in ad hoc ways. We propose a new problem called connected consensus that has as speci… ▽ More Algorithms to solve fault-tolerant consensus in asynchronous systems often rely on primitives such as crusader agreement, adopt-commit, and graded broadcast, which provide weaker agreement properties than consensus. Although these primitives have a similar flavor, they have been defined and implemented separately in ad hoc ways. We propose a new problem called connected consensus that has as special cases crusader agreement, adopt-commit, and graded broadcast, and generalizes them to handle multi-valued inputs. The generalization is accomplished by relating the problem to approximate agreement on graphs. We present three algorithms for multi-valued connected consensus in asynchronous message-passing systems, one tolerating crash failures and two tolerating malicious (unauthenticated Byzantine) failures. We extend the definition of binding, a desirable property recently identified as supporting binary consensus algorithms that are correct against adaptive adversaries, to the multi-valued input case and show that all our algorithms satisfy the property. Our crash-resilient algorithm has failure-resilience and time complexity that we show are optimal. When restricted to the case of binary inputs, the algorithm has improved time complexity over prior algorithms. Our two algorithms for malicious failures trade off failure resilience and time complexity. The first algorithm has time complexity that we prove is optimal but worse failure-resilience, while the second has failure-resilience that we prove is optimal but worse time complexity. When restricted to the case of binary inputs, the time complexity (as well as resilience) of the second algorithm matches that of prior algorithms. △ Less

Submitted 8 August, 2023; originally announced August 2023.

Comments: 38 pages, 5 figures

arXiv:2306.00453 [pdf, other]

A Gaussian Sliding Windows Regression Model for Hydrological Inference

Authors: Stefan Schrunner, Joseph Janssen, Anna Jenul, Jiguo Cao, Ali A. Ameli, William J. Welch

Abstract: Statistical models are an essential tool to model, forecast and understand the hydrological processes in watersheds. In particular, the modeling of time lags associated with the time between rainfall occurrence and subsequent changes in streamflow, is of high practical importance. Since water can take a variety of flowpaths to generate streamflow, a series of distinct runoff pulses from different… ▽ More Statistical models are an essential tool to model, forecast and understand the hydrological processes in watersheds. In particular, the modeling of time lags associated with the time between rainfall occurrence and subsequent changes in streamflow, is of high practical importance. Since water can take a variety of flowpaths to generate streamflow, a series of distinct runoff pulses from different flowpath may combine to create the observed streamflow time series. Current state-of-the-art models are not able to sufficiently confront the problem complexity with interpretable parametrization, which would allow insights into the dynamics of the distinct flow paths for hydrological inference. The proposed Gaussian Sliding Windows Regression Model targets this problem by combining the concept of multiple windows sliding along the time axis with multiple linear regression. The window kernels, which indicate the weights applied to different time lags, are implemented via Gaussian-shaped kernels. As a result, each window can represent one flowpath and, thus, offers the potential for straightforward process inference. Experiments on simulated and real-world scenarios underline that the proposed model achieves accurate parameter estimates and competitive predictive performance, while fostering explainable and interpretable hydrological modeling. △ Less

Submitted 30 September, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

arXiv:2303.06501 [pdf, other]

Learning from limited temporal data: Dynamically sparse historical functional linear models with applications to Earth science

Authors: Joseph Janssen, Shizhe Meng, Asad Haris, Stefan Schrunner, Jiguo Cao, William J. Welch, Nadja Kunz, Ali A. Ameli

Abstract: Scientists and statisticians often want to learn about the complex relationships that connect two time-varying variables. Recent work on sparse functional historical linear models confirms that they are promising for this purpose, but several notable limitations exist. Most importantly, previous works have imposed sparsity on the historical coefficient function, but have not allowed the sparsity,… ▽ More Scientists and statisticians often want to learn about the complex relationships that connect two time-varying variables. Recent work on sparse functional historical linear models confirms that they are promising for this purpose, but several notable limitations exist. Most importantly, previous works have imposed sparsity on the historical coefficient function, but have not allowed the sparsity, hence lag, to vary with time. We simplify the framework of sparse functional historical linear models by using a rectangular coefficient structure along with Whittaker smoothing, then reduce the assumptions of the previous frameworks by estimating the dynamic time lag from a hierarchical coefficient structure. We motivate our study by aiming to extract the physical rainfall-runoff processes hidden within hydrological data. We show the promise and accuracy of our method using eight simulation studies, further justified by two real sets of hydrological data. △ Less

Submitted 3 April, 2024; v1 submitted 11 March, 2023; originally announced March 2023.

arXiv:2302.10525 [pdf, other]

Molecular chaos in dense active systems

Authors: Lu Chen, Kyle J. Welch, Premkumar Leishangthem, Dipanjan Ghosh, Bokai Zhang, Ting-Pi Sun, Josh Klukas, Zhanchun Tu, Xiang Cheng, Xinliang Xu

Abstract: The hypothesis of molecular chaos plays the central role in kinetic theory, which provides a closure leading to the Boltzmann equation for quantitative description of classic fluids. Yet how to properly extend it to active systems is still an open question in nonequilibrium physics. Combining experiment, simulation, and theory, we investigate the emergent collective behaviors of self-propelled par… ▽ More The hypothesis of molecular chaos plays the central role in kinetic theory, which provides a closure leading to the Boltzmann equation for quantitative description of classic fluids. Yet how to properly extend it to active systems is still an open question in nonequilibrium physics. Combining experiment, simulation, and theory, we investigate the emergent collective behaviors of self-propelled particles that exhibit collision avoidance, a moving strategy commonly adopted in natural and engineering active systems. This dense active system shows unusual phase dynamics strongly regulated by many-body interactions, which cannot be explained by theories assuming molecular chaos. To rationalize the interplay between different emergent phases, a simple kinetic model is proposed with a revised molecular chaos hypothesis, which treats the many-body effect implicitly via categorizing different types of particle pair collisions. Our model predicts an optimal growth rate of flocking and illustrates a generic approach for understanding dense active systems. △ Less

Submitted 21 February, 2023; originally announced February 2023.

Comments: 8 pages, 6 figures

arXiv:2211.05445 [pdf, other]

Growth and characterization of thorium-doped calcium fluoride single crystals

Authors: Kjeld Beeks, Tomas Sikorsky, Veronika Rosecker, Martin Pressler, Fabian Schaden, David Werban, Niyusha Hosseini, Lukas Rudischer, Felix Schneider, Patrick Berwian, Jochen Friedrich, Dieter Hainz, Jan Welch, Johannes H. Sterba, Georgy Kazakov, Thorsten Schumm

Abstract: We have grown $^{232}$Th:CaF$_2$ and $^{229}$Th:CaF$_2$ single crystals for investigations on the VUV laser-accessible first nuclear excited state of $^{229}$Th. To reach high do** concentrations despite the extreme scarcity (and radioactivity) of $^{229}$Th, we have scaled down the crystal volume by a factor 100 compared to established commercial or scientific growth processes. We use the verti… ▽ More We have grown $^{232}$Th:CaF$_2$ and $^{229}$Th:CaF$_2$ single crystals for investigations on the VUV laser-accessible first nuclear excited state of $^{229}$Th. To reach high do** concentrations despite the extreme scarcity (and radioactivity) of $^{229}$Th, we have scaled down the crystal volume by a factor 100 compared to established commercial or scientific growth processes. We use the vertical gradient freeze method on 3.2 mm diameter seed single crystals with a 2 mm drilled pocket, filled with a co-precipitated CaF$_2$:ThF$_4$:PbF$_2$ powder in order to grow single crystals. Concentrations of $4\cdot10^{19}$ cm$^{-3}$ have been realized with $^{232}$Th with good ($>$10%) VUV transmission. However, the intrinsic radioactivity of $^{229}$Th drives radio-induced dissociation during growth and radiation damage after solidification. Both lead to a degradation of VUV transmission, limiting the $^{229}$Th concentration to $<5\cdot10^{17}$ cm$^{-3}$. △ Less

Submitted 10 November, 2022; originally announced November 2022.

Comments: 12 pages, 18 figures

arXiv:2210.07501 [pdf, other]

Thermocapillary Convection in Superimposed Layers of Self-Rewetting Fluids: Analytical and Lattice Boltzmann Computational Study

Authors: Bashir Elbousefi, William Schupbach, Kannan N. Premnath, Samuel W. J. Welch

Abstract: Self-rewetting fluids (SRFs), such as aqueous solutions of long-chain alcohols, exhibit anomalous quadratic dependence of surface tension on temperature having a minimum and with a positive gradient. When compared to the normal fluids (NFs), the SRFs can be associated with significantly modified interfacial dynamics, which have recently been exploited to enhance flow and thermal transport in vario… ▽ More Self-rewetting fluids (SRFs), such as aqueous solutions of long-chain alcohols, exhibit anomalous quadratic dependence of surface tension on temperature having a minimum and with a positive gradient. When compared to the normal fluids (NFs), the SRFs can be associated with significantly modified interfacial dynamics, which have recently been exploited to enhance flow and thermal transport in various applications (e.g., microgravity and microscale transport systems). In this work, first, we develop a new analytical solution of thermocapillary convection in superimposed two SRF layers confined within a microchannel that is sinusoidally heated on one side and maintained at a uniform temperature on the other side under the cree** flow regime. Then, a robust central moment lattice Boltzmann method using the Allen-Cahn equation for interface tracking, two-fluid motion, and the energy transport for numerical simulations of SRFs is constructed. The analytical and computational techniques are generally shown to be in good quantitative agreement with one another. Moreover, the effect of the various characteristic parameters on the magnitude and the distribution thermocapillary-driven motion is studied. The thermocapillary flow patterns in SRFs are shown to be strikingly different when compared to the NFs: For otherwise the same conditions, the SRFs result in eight periodic counterrotating thermocapillary convection rolls, while the NFs exhibit only four such vortices. Moreover, the direction of the circulating fluid motion in such vortical structures for the SRFs is found to be towards the hotter zones on the interfaces, which is opposite to that in NFs. By tuning the sensitivity coefficients of the surface tension on temperature, it is shown that the magnitude as well the overall thermocapillary flow patterns can be significantly manipulated. △ Less

Submitted 13 October, 2022; originally announced October 2022.

Comments: 55 pages, 29 figures

arXiv:2209.00545 [pdf, other]

A Bilevel Optimization Method for Tensor Recovery Under Metric Learning Constraints

Authors: Maryam Bagherian, Davoud A. Tarzanagh, Ivo Dinov, Joshua D. Welch

Abstract: Tensor completion and tensor decomposition are important problems in many domains. In this work, we leverage the connection between these problems to learn a distance metric that improves both decomposition and completion. We show that the optimal Mahalanobis distance metric for the completion task is closely related to the Tucker decomposition of the completed tensor. Then, we formulate a bilevel… ▽ More Tensor completion and tensor decomposition are important problems in many domains. In this work, we leverage the connection between these problems to learn a distance metric that improves both decomposition and completion. We show that the optimal Mahalanobis distance metric for the completion task is closely related to the Tucker decomposition of the completed tensor. Then, we formulate a bilevel optimization problem to perform joint tensor completion and decomposition, subject to metric learning constraints. The metric learning constraints also allow us to flexibly incorporate similarity side information and coupled matrices, when available, into the tensor recovery process. We derive an algorithm to solve the bilevel optimization problem and prove its global convergence. When evaluated on real data, our approach performs significantly better compared to previous methods. △ Less

Submitted 1 September, 2022; originally announced September 2022.

arXiv:2207.04166 [pdf, other]

Variational Mixtures of ODEs for Inferring Cellular Gene Expression Dynamics

Authors: Yichen Gu, David Blaauw, Joshua Welch

Abstract: A key problem in computational biology is discovering the gene expression changes that regulate cell fate transitions, in which one cell type turns into another. However, each individual cell cannot be tracked longitudinally, and cells at the same point in real time may be at different stages of the transition process. This can be viewed as a problem of learning the behavior of a dynamical system… ▽ More A key problem in computational biology is discovering the gene expression changes that regulate cell fate transitions, in which one cell type turns into another. However, each individual cell cannot be tracked longitudinally, and cells at the same point in real time may be at different stages of the transition process. This can be viewed as a problem of learning the behavior of a dynamical system from observations whose times are unknown. Additionally, a single progenitor cell type often bifurcates into multiple child cell types, further complicating the problem of modeling the dynamics. To address this problem, we developed an approach called variational mixtures of ordinary differential equations. By using a simple family of ODEs informed by the biochemistry of gene expression to constrain the likelihood of a deep generative model, we can simultaneously infer the latent time and latent state of each cell and predict its future gene expression state. The model can be interpreted as a mixture of ODEs whose parameters vary continuously across a latent space of cell states. Our approach dramatically improves data fit, latent time inference, and future cell state estimation of single-cell gene expression data compared to previous approaches. △ Less

Submitted 8 July, 2022; originally announced July 2022.

Journal ref: Proceedings of the 39th International Conference on Machine Learning, 2022

arXiv:2202.04198 [pdf, other]

Multivariate cluster point process to quantify and explore multi-entity configurations: Application to biofilm image data

Authors: Suman Majumder, Brent A. Coull, Jessica L. Mark Welch, Patrick J. La Riviere, Floyd E. Dewhirst, Jacqueline R. Starr, Kyu Ha Lee

Abstract: Clusters of similar or dissimilar objects are encountered in many fields. Frequently used approaches treat the central object of each cluster as latent. Yet, often objects of one or more types cluster around objects of another type. Such arrangements are common in biomedical images of cells, in which nearby cell types likely interact. Quantifying spatial relationships may elucidate biological mech… ▽ More Clusters of similar or dissimilar objects are encountered in many fields. Frequently used approaches treat the central object of each cluster as latent. Yet, often objects of one or more types cluster around objects of another type. Such arrangements are common in biomedical images of cells, in which nearby cell types likely interact. Quantifying spatial relationships may elucidate biological mechanisms. Parent-offspring statistical frameworks can be usefully applied even when central objects (parents) differ from peripheral ones (offspring). We propose the novel multivariate cluster point process (MCPP) to quantify multi-object (e.g., multi-cellular) arrangements. Unlike commonly used approaches, the MCPP exploits locations of the central parent object in clusters. It accounts for possibly multilayered, multivariate clustering. The model formulation requires specification of which object types function as cluster centers and which reside peripherally. If such information is unknown, the relative roles of object types may be explored by comparing fit of different models via the deviance information criterion (DIC). In simulated data, we compared DIC of a series of models; the MCPP correctly identified simulated relationships. It also produced more accurate and precise parameter estimates than the classical univariate Neyman-Scott process model. We also used the MCPP to quantify proposed configurations and explore new ones in human dental plaque biofilm image data. MCPP models quantified simultaneous clustering of Streptococcus and Porphyromonas around Corynebacterium and of Pasteurellaceae around Streptococcus and successfully captured hypothesized structures for all taxa. Further exploration suggested the presence of clustering between Fusobacterium and Leptotrichia, a previously unreported relationship. △ Less

Submitted 31 January, 2024; v1 submitted 8 February, 2022; originally announced February 2022.

MSC Class: 62

arXiv:2109.11611 [pdf, other]

doi 10.1063/5.0072262

Faraday rotation study of plasma bubbles in GeV wakefield accelerators

Authors: Y. Y. Chang, X. Cheng, A. Hannasch, M. LaBerge, J. M. Shaw, K. Weichman, J. Welch, A. Bernstein, W. Henderson, R. Zgadzaj, M. C. Downer

Abstract: We visualize plasma bubbles driven by 0.67 PW laser pulses in plasma of density $n_e \approx 5\times10^{17}$ ${\rm cm}^{-3}$ by imaging Faraday rotation patterns imprinted on linearly-polarized probe pulses of wavelength $λ_{pr} = 1.05 μ$m and duration $τ_{pr} = 2$ ps or $1$ ps that cross the bubble's path at right angles. When the bubble captures and accelerates tens to hundreds of pC of electron… ▽ More We visualize plasma bubbles driven by 0.67 PW laser pulses in plasma of density $n_e \approx 5\times10^{17}$ ${\rm cm}^{-3}$ by imaging Faraday rotation patterns imprinted on linearly-polarized probe pulses of wavelength $λ_{pr} = 1.05 μ$m and duration $τ_{pr} = 2$ ps or $1$ ps that cross the bubble's path at right angles. When the bubble captures and accelerates tens to hundreds of pC of electron charge, we observe two parallel streaks of length $cτ_{pr}$ straddling the drive pulse propagation axis, separated by $\sim45$ $μ$m, in which probe polarization rotates by $0.3^\circ$ to more than $5^\circ$ in opposite directions. Accompanying simulations show that they result from Faraday rotation within portions of dense bubble side walls that are pervaded by the azimuthal magnetic field of accelerating electrons during the probe transit across the bubble. Analysis of the width of the streaks shows that quasi-monoenergetic high-energy electrons and trailing lower energy electrons inside the bubble contribute distinguishable portions of the observed signals, and that relativistic flow of sheath electrons suppresses Faraday rotation from the rear of the bubble. The results demonstrate favorable scaling of Faraday rotation diagnostics to $40\times$ lower plasma density than previously demonstrated. △ Less

Submitted 23 September, 2021; originally announced September 2021.

Comments: 7 pages, 5 figures

arXiv:2106.15554 [pdf, other]

Blunting an Adversary Against Randomized Concurrent Programs with Linearizable Implementations

Authors: Hagit Attiya, Constantin Enea, Jennifer L. Welch

Abstract: Atomic shared objects, whose operations take place instantaneously, are a powerful abstraction for designing complex concurrent programs. Since they are not always available, they are typically substituted with software implementations. A prominent condition relating these implementations to their atomic specifications is linearizability, which preserves safety properties of the programs using the… ▽ More Atomic shared objects, whose operations take place instantaneously, are a powerful abstraction for designing complex concurrent programs. Since they are not always available, they are typically substituted with software implementations. A prominent condition relating these implementations to their atomic specifications is linearizability, which preserves safety properties of the programs using them. However linearizability does not preserve hyper-properties, which include probabilistic guarantees of randomized programs: an adversary can greatly amplify the probability of a bad outcome. This unwelcome behavior prevents modular reasoning, which is the key benefit provided by the use of linearizable object implementations. A more restrictive property, strong linearizability, does preserve hyper-properties but it is impossible to achieve in many situations. This paper suggests a novel approach to blunting the adversary's additional power that works even in cases where strong linearizability is not achievable. We show that a wide class of linearizable implementations, including well-known ones for registers and snapshots, can be modified to approximate the probabilistic guarantees of randomized programs when using atomic objects. The technical approach is to transform the algorithm of each method of an existing linearizable implementation by repeating a carefully chosen prefix of the method several times and then randomly choosing which repetition to use subsequently. We prove that the probability of a bad outcome decreases with the number of repetitions, approaching the probability attained when using atomic objects. The class of implementations to which our transformation applies includes the ABD implementation of a shared register using message-passing and the Afek et al. implementation of an atomic snapshot using single-writer registers. △ Less

Submitted 1 March, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

Comments: 22 pages Revised version generalizes the class of implementations to which the transformation applies

arXiv:2105.06614 [pdf, other]

Impossibility of Strongly-Linearizable Message-Passing Objects via Simulation by Single-Writer Registers

Authors: Hagit Attiya, Constantin Enea, Jennifer Welch

Abstract: A key way to construct complex distributed systems is through modular composition of linearizable concurrent objects. A prominent example is shared registers, which have crash-tolerant implementations on top of message-passing systems, allowing the advantages of shared memory to carry over to message-passing. Yet linearizable registers do not always behave properly when used inside randomized prog… ▽ More A key way to construct complex distributed systems is through modular composition of linearizable concurrent objects. A prominent example is shared registers, which have crash-tolerant implementations on top of message-passing systems, allowing the advantages of shared memory to carry over to message-passing. Yet linearizable registers do not always behave properly when used inside randomized programs. A strengthening of linearizability, called strong linearizability, has been shown to preserve probabilistic behavior, as well as other hypersafety properties. In order to exploit composition and abstraction in message-passing systems, it is crucial to know whether there exist strongly-linearizable implementations of registers in message-passing. This paper answers the question in the negative: there are no strongly-linearizable fault-tolerant message-passing implementations of multi-writer registers, max-registers, snapshots or counters. This result is proved by reduction from the corresponding result by Helmi et al. The reduction is a novel extension of the BG simulation that connects shared-memory and message-passing, supports long-lived objects, and preserves strong linearizability. The main technical challenge arises from the discrepancy between the potentially minuscule fraction of failures to be tolerated in the simulated message-passing algorithm and the large fraction of failures that can afflict the simulating shared-memory system. The reduction is general and can be viewed as the inverse of the ABD simulation of shared memory in message-passing. △ Less

Submitted 27 August, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

Comments: 18 pages. To appear in International Symposium on Distributed Computing (DISC), Oct. 2021

arXiv:2104.03164 [pdf, other]

doi 10.1016/j.eswa.2022.119060

Distilling and Transferring Knowledge via cGAN-generated Samples for Image Classification and Regression

Authors: Xin Ding, Yongwei Wang, Zuheng Xu, Z. Jane Wang, William J. Welch

Abstract: Knowledge distillation (KD) has been actively studied for image classification tasks in deep learning, aiming to improve the performance of a student based on the knowledge from a teacher. However, applying KD in image regression with a scalar response variable has been rarely studied, and there exists no KD method applicable to both classification and regression tasks yet. Moreover, existing KD m… ▽ More Knowledge distillation (KD) has been actively studied for image classification tasks in deep learning, aiming to improve the performance of a student based on the knowledge from a teacher. However, applying KD in image regression with a scalar response variable has been rarely studied, and there exists no KD method applicable to both classification and regression tasks yet. Moreover, existing KD methods often require a practitioner to carefully select or adjust the teacher and student architectures, making these methods less flexible in practice. To address the above problems in a unified way, we propose a comprehensive KD framework based on cGANs, termed cGAN-KD. Fundamentally different from existing KD methods, cGAN-KD distills and transfers knowledge from a teacher model to a student model via cGAN-generated samples. This novel mechanism makes cGAN-KD suitable for both classification and regression tasks, compatible with other KD methods, and insensitive to the teacher and student architectures. An error bound for a student model trained in the cGAN-KD framework is derived in this work, providing a theory for why cGAN-KD is effective as well as guiding the practical implementation of cGAN-KD. Extensive experiments on CIFAR-100 and ImageNet-100 show that we can combine state of the art KD methods with the cGAN-KD framework to yield a new state of the art. Moreover, experiments on Steering Angle and UTKFace demonstrate the effectiveness of cGAN-KD in image regression tasks, where existing KD methods are inapplicable. △ Less

Submitted 26 December, 2022; v1 submitted 7 April, 2021; originally announced April 2021.

arXiv:2103.11166 [pdf, other]

Efficient Subsampling of Realistic Images From GANs Conditional on a Class or a Continuous Variable

Authors: Xin Ding, Yongwei Wang, Z. Jane Wang, William J. Welch

Abstract: Recently, subsampling or refining images generated from unconditional GANs has been actively studied to improve the overall image quality. Unfortunately, these methods are often observed less effective or inefficient in handling conditional GANs (cGANs) -- conditioning on a class (aka class-conditional GANs) or a continuous variable (aka continuous cGANs or CcGANs). In this work, we introduce an e… ▽ More Recently, subsampling or refining images generated from unconditional GANs has been actively studied to improve the overall image quality. Unfortunately, these methods are often observed less effective or inefficient in handling conditional GANs (cGANs) -- conditioning on a class (aka class-conditional GANs) or a continuous variable (aka continuous cGANs or CcGANs). In this work, we introduce an effective and efficient subsampling scheme, named conditional density ratio-guided rejection sampling (cDR-RS), to sample high-quality images from cGANs. Specifically, we first develop a novel conditional density ratio estimation method, termed cDRE-F-cSP, by proposing the conditional Softplus (cSP) loss and an improved feature extraction mechanism. We then derive the error bound of a density ratio model trained with the cSP loss. Finally, we accept or reject a fake image in terms of its estimated conditional density ratio. A filtering scheme is also developed to increase fake images' label consistency without losing diversity when sampling from CcGANs. We extensively test the effectiveness and efficiency of cDR-RS in sampling from both class-conditional GANs and CcGANs on five benchmark datasets. When sampling from class-conditional GANs, cDR-RS outperforms modern state-of-the-art methods by a large margin (except DRE-F-SP+RS) in terms of effectiveness. Although the effectiveness of cDR-RS is often comparable to that of DRE-F-SP+RS, cDR-RS is substantially more efficient. When sampling from CcGANs, the superiority of cDR-RS is even more noticeable in terms of both effectiveness and efficiency. Notably, with the consumption of reasonable computational resources, cDR-RS can substantially reduce Label Score without decreasing the diversity of CcGAN-generated images, while other methods often need to trade much diversity for slightly improved Label Score. △ Less

Submitted 20 April, 2022; v1 submitted 20 March, 2021; originally announced March 2021.

arXiv:2011.07466 [pdf, other]

Continuous Conditional Generative Adversarial Networks: Novel Empirical Losses and Label Input Mechanisms

Authors: Xin Ding, Yongwei Wang, Zuheng Xu, William J. Welch, Z. Jane Wang

Abstract: This work proposes the continuous conditional generative adversarial network (CcGAN), the first generative model for image generation conditional on continuous, scalar conditions (termed regression labels). Existing conditional GANs (cGANs) are mainly designed for categorical conditions (eg, class labels); conditioning on regression labels is mathematically distinct and raises two fundamental prob… ▽ More This work proposes the continuous conditional generative adversarial network (CcGAN), the first generative model for image generation conditional on continuous, scalar conditions (termed regression labels). Existing conditional GANs (cGANs) are mainly designed for categorical conditions (eg, class labels); conditioning on regression labels is mathematically distinct and raises two fundamental problems:(P1) Since there may be very few (even zero) real images for some regression labels, minimizing existing empirical versions of cGAN losses (aka empirical cGAN losses) often fails in practice;(P2) Since regression labels are scalar and infinitely many, conventional label input methods are not applicable. The proposed CcGAN solves the above problems, respectively, by (S1) reformulating existing empirical cGAN losses to be appropriate for the continuous scenario; and (S2) proposing a naive label input (NLI) method and an improved label input (ILI) method to incorporate regression labels into the generator and the discriminator. The reformulation in (S1) leads to two novel empirical discriminator losses, termed the hard vicinal discriminator loss (HVDL) and the soft vicinal discriminator loss (SVDL) respectively, and a novel empirical generator loss. The error bounds of a discriminator trained with HVDL and SVDL are derived under mild assumptions in this work. Two new benchmark datasets (RC-49 and Cell-200) and a novel evaluation metric (Sliding Fréchet Inception Distance) are also proposed for this continuous scenario. Our experiments on the Circular 2-D Gaussians, RC-49, UTKFace, Cell-200, and Steering Angle datasets show that CcGAN is able to generate diverse, high-quality samples from the image distribution conditional on a given regression label. Moreover, in these experiments, CcGAN substantially outperforms cGAN both visually and quantitatively. △ Less

Submitted 30 October, 2023; v1 submitted 15 November, 2020; originally announced November 2020.

Comments: Accepted by IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

arXiv:2010.14782 [pdf, other]

Classification Beats Regression: Counting of Cells from Greyscale Microscopic Images based on Annotation-free Training Samples

Authors: Xin Ding, Qiong Zhang, William J. Welch

Abstract: Modern methods often formulate the counting of cells from microscopic images as a regression problem and more or less rely on expensive, manually annotated training images (e.g., dot annotations indicating the centroids of cells or segmentation masks identifying the contours of cells). This work proposes a supervised learning framework based on classification-oriented convolutional neural networks… ▽ More Modern methods often formulate the counting of cells from microscopic images as a regression problem and more or less rely on expensive, manually annotated training images (e.g., dot annotations indicating the centroids of cells or segmentation masks identifying the contours of cells). This work proposes a supervised learning framework based on classification-oriented convolutional neural networks (CNNs) to count cells from greyscale microscopic images without using annotated training images. In this framework, we formulate the cell counting task as an image classification problem, where the cell counts are taken as class labels. This formulation has its limitation when some cell counts in the test stage do not appear in the training data. Moreover, the ordinal relation among cell counts is not utilized. To deal with these limitations, we propose a simple but effective data augmentation (DA) method to synthesize images for the unseen cell counts. We also introduce an ensemble method, which can not only moderate the influence of unseen cell counts but also utilize the ordinal information to improve the prediction accuracy. This framework outperforms many modern cell counting methods and won the data analysis competition (Case Study 1: Counting Cells From Microscopic Images https://ssc.ca/en/case-study/case-study-1-counting-cells-microscopic-images) of the 47th Annual Meeting of the Statistical Society of Canada (SSC). Our code is available at https://github.com/anno2020/CellCount_TinyBBBC005. △ Less

Submitted 29 October, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

Journal ref: The CAAI International Conference on Artificial Intelligence (CICAI 2021)

arXiv:2003.07787 [pdf, ps, other]

Store-Collect in the Presence of Continuous Churn with Application to Snapshots and Lattice Agreement

Authors: Hagit Attiya, Sweta Kumari, Archit Somani, Jennifer L. Welch

Abstract: We present an algorithm for implementing a store-collect object in an asynchronous crash-prone message-passing dynamic system, where nodes continually enter and leave. The algorithm is very simple and efficient, requiring just one round trip for a store operation and two for a collect. We then show the versatility of the store-collect object for implementing churn-tolerant versions of useful data… ▽ More We present an algorithm for implementing a store-collect object in an asynchronous crash-prone message-passing dynamic system, where nodes continually enter and leave. The algorithm is very simple and efficient, requiring just one round trip for a store operation and two for a collect. We then show the versatility of the store-collect object for implementing churn-tolerant versions of useful data structures, while shielding the user from the complications of the underlying churn. In particular, we present elegant and efficient implementations of atomic snapshot and generalized lattice agreement objects that use store-collect. △ Less

Submitted 5 November, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

Comments: 30 pages

arXiv:1910.06716 [pdf, other]

Byzantine-Tolerant Register in a System with Continuous Churn

Authors: Saptaparni Kumar, Jennifer L. Welch

Abstract: A shared read/write register emulation provides the illusion of shared-memory on top of message-passing models. The main hurdle with such emulations is dealing with server faults in the system. Several crash-tolerant register emulations in static systems require algorithms to replicate the value of the shared register onto a majority of servers. Majority correctness is necessary for such emulation… ▽ More A shared read/write register emulation provides the illusion of shared-memory on top of message-passing models. The main hurdle with such emulations is dealing with server faults in the system. Several crash-tolerant register emulations in static systems require algorithms to replicate the value of the shared register onto a majority of servers. Majority correctness is necessary for such emulations. Byzantine faults are considered to be the worst kind of faults that can happen in any distributed system. Emulating a Byzantine-tolerant register requires replicating the register value on to more than two-thirds of the servers. Emulating a register in a dynamic system where servers and clients can enter and leave the system and be faulty is harder than in static systems. There are several crash-tolerant register emulations for dynamic systems. This paper presents the first emulation of a multi-reader multi-writer atomic register in a system that can withstand nodes continually entering and leaving, imposes no upper bound on the system size and can tolerate Byzantine servers. The algorithm works as long as the number of servers entering and leaving during a fixed time interval is at most a constant fraction of the system size at the beginning of the interval, and as long as the number of Byzantine servers in the system is at most f. Although our algorithm requires that there be a constant known upper bound on the number of Byzantine servers, this restriction is unavoidable, as we show that it is impossible to emulate an atomic register if the system size and maximum number of servers that can be Byzantine in the system is unknown to the nodes. △ Less

Submitted 13 October, 2019; originally announced October 2019.

Comments: arXiv admin note: text overlap with arXiv:1708.03274

arXiv:1909.10670 [pdf, other]

doi 10.1109/TSP.2020.2979601

Subsampling Generative Adversarial Networks: Density Ratio Estimation in Feature Space with Softplus Loss

Authors: Xin Ding, Z. Jane Wang, William J. Welch

Abstract: Filtering out unrealistic images from trained generative adversarial networks (GANs) has attracted considerable attention recently. Two density ratio based subsampling methods---Discriminator Rejection Sampling (DRS) and Metropolis-Hastings GAN (MH-GAN)---were recently proposed, and their effectiveness in improving GANs was demonstrated on multiple datasets. However, DRS and MH-GAN are based on di… ▽ More Filtering out unrealistic images from trained generative adversarial networks (GANs) has attracted considerable attention recently. Two density ratio based subsampling methods---Discriminator Rejection Sampling (DRS) and Metropolis-Hastings GAN (MH-GAN)---were recently proposed, and their effectiveness in improving GANs was demonstrated on multiple datasets. However, DRS and MH-GAN are based on discriminator based density ratio estimation (DRE) methods, so they may not work well if the discriminator in the trained GAN is far from optimal. Moreover, they do not apply to some GANs (e.g., MMD-GAN). In this paper, we propose a novel Softplus (SP) loss for DRE. Based on it, we develop a sample-based DRE method in a feature space learned by a specially designed and pre-trained ResNet-34 (DRE-F-SP). We derive the rate of convergence of a density ratio model trained under the SP loss. Then, we propose three different density ratio subsampling methods (DRE-F-SP+RS, DRE-F-SP+MH, and DRE-F-SP+SIR) for GANs based on DRE-F-SP. Our subsampling methods do not rely on the optimality of the discriminator and are suitable for all types of GANs. We empirically show our subsampling approach can substantially outperform DRS and MH-GAN on a synthetic dataset and the CIFAR-10 dataset, using multiple GANs. △ Less

Submitted 20 February, 2020; v1 submitted 23 September, 2019; originally announced September 2019.

arXiv:1909.04166 [pdf, other]

Central Moment Lattice Boltzmann Method using a Pressure-based Formulation for Multiphase Flows at High Density Ratios and including Effects of Surface Tension and Marangoni Stresses

Authors: Farzaneh Hajabdollahi, Kannan Premnath, Samuel W. J. Welch

Abstract: Simulation of multiphase flows require coupled capturing or tracking of the interfaces in conjunction with the solution of fluid motion often occurring at multiple scales. We will present unified cascaded LB methods based on central moments for the solution of the incompressible two-phase flows at high density ratios and for capturing of the interfacial dynamics. Based on a modified continuous Bol… ▽ More Simulation of multiphase flows require coupled capturing or tracking of the interfaces in conjunction with the solution of fluid motion often occurring at multiple scales. We will present unified cascaded LB methods based on central moments for the solution of the incompressible two-phase flows at high density ratios and for capturing of the interfacial dynamics. Based on a modified continuous Boltzmann equation (MCBE) for two-phase flows, where a kinetic transformation to the distribution function involving the pressure field is introduced to reduce the associated numerical stiffness at high density gradients, a central moment cascaded LB formulation for computing the fluid motion will be constructed. In this LB scheme, the collision step is prescribed by the relaxation of various central moments to their equilibria that are reformulated in terms of the pressure field obtained via matching to the continuous equilibria based on the transformed Maxwell distribution. Furthermore, the differential treatments for the effects of the source term representing the change due to the pressure field and of the source term due to the interfacial tension force and body forces appearing in the MCBE on different moments are consistently accounted for in this cascaded LB solver that computes the pressure and velocity fields. In addition, another cascaded LB scheme via modified equilibria will be developed to solve for the interfacial dynamics represented by a phase field model based on the conservative Allen-Cahn equation. Based on numerical simulations of a variety of two-phase flow benchmark problems at high density ratios and involving the effects of surface tension and its tangential gradients (Marangoni stresses), we will validate our unified cascaded LB approach and also demonstrate improvements in numerical stability. △ Less

Submitted 9 September, 2019; originally announced September 2019.

Comments: This work was presented at the 71st Annual Meeting of the APS Division of Fluid Dynamics (DFD), Atlanta, Georgia, Nov. 2018 (http://meetings.aps.org/link/BAPS.2018.DFD.L31.7) with travel support from NSF. The first author's Ph.D. dissertation [52] is based, in part, on the research contribution presented in this work

arXiv:1908.05357 [pdf, other]

Sequential Computer Experimental Design for Estimating an Extreme Probability or Quantile

Authors: Hao Chen, William J. Welch

Abstract: A computer code can simulate a system's propagation of variation from random inputs to output measures of quality. Our aim here is to estimate a critical output tail probability or quantile without a large Monte Carlo experiment. Instead, we build a statistical surrogate for the input-output relationship with a modest number of evaluations and then sequentially add further runs, guided by a criter… ▽ More A computer code can simulate a system's propagation of variation from random inputs to output measures of quality. Our aim here is to estimate a critical output tail probability or quantile without a large Monte Carlo experiment. Instead, we build a statistical surrogate for the input-output relationship with a modest number of evaluations and then sequentially add further runs, guided by a criterion to improve the estimate. We compare two criteria in the literature. Moreover, we investigate two practical questions: how to design the initial code runs and how to model the input distribution. Hence, we close the gap between the theory of sequential design and its application. △ Less

Submitted 14 August, 2019; originally announced August 2019.

arXiv:1907.01181 [pdf, other]

Adaptive Partitioning Design and Analysis for Emulation of a Complex Computer Code

Authors: Sonja Surjanovic, William J. Welch

Abstract: Computer models are used as replacements for physical experiments in a large variety of applications. Nevertheless, direct use of the computer model for the ultimate scientific objective is often limited by the complexity and cost of the model. Historically, Gaussian process regression has proven to be the almost ubiquitous choice for a fast statistical emulator for such a computer model, due to i… ▽ More Computer models are used as replacements for physical experiments in a large variety of applications. Nevertheless, direct use of the computer model for the ultimate scientific objective is often limited by the complexity and cost of the model. Historically, Gaussian process regression has proven to be the almost ubiquitous choice for a fast statistical emulator for such a computer model, due to its flexible form and analytical expressions for measures of predictive uncertainty. However, even this statistical emulator can be computationally intractable for large designs, due to computing time increasing with the cube of the design size. Multiple methods have been proposed for addressing this problem. We discuss several of them, and compare their predictive and computational performance in several scenarios. We then propose solving this problem using an adaptive partitioning emulator (APE). The new approach is motivated by the idea that most computer models are only complex in particular regions of the input space. By taking a data-adaptive approach to the development of a design, and choosing to partition the space in the regions of highest variability, we obtain a higher density of points in these regions and hence accurate prediction. △ Less

Submitted 2 July, 2019; originally announced July 2019.

arXiv:1906.10649 [pdf, other]

doi 10.1038/s41566-019-0549-5

Tunable Isolated Attosecond X-ray Pulses with Gigawatt Peak Power from a Free-Electron Laser

Authors: Joseph Duris, Siqi Li, Taran Driver, Elio G. Champenois, James P. MacArthur, Alberto A. Lutman, Zhen Zhang, Philipp Rosenberger, Jeff W. Aldrich, Ryan Coffee, Giacomo Coslovich, Franz-Josef Decker, James M. Glownia, Gregor Hartmann, Wolfram Helml, Andrei Kamalov, Jonas Knurr, Jacek Krzywinski, Ming-Fu Lin, Megan Nantel, Adi Natan, Jordan O'Neal, Niranjan Shivaram, Peter Walter, Anna Wang , et al. (9 additional authors not shown)

Abstract: The quantum mechanical motion of electrons in molecules and solids occurs on the sub-femtosecond timescale. Consequently, the study of ultrafast electronic phenomena requires the generation of laser pulses shorter than 1 fs and of sufficient intensity to interact with their target with high probability. Probing these dynamics with atomic-site specificity requires the extension of sub-femtosecond p… ▽ More The quantum mechanical motion of electrons in molecules and solids occurs on the sub-femtosecond timescale. Consequently, the study of ultrafast electronic phenomena requires the generation of laser pulses shorter than 1 fs and of sufficient intensity to interact with their target with high probability. Probing these dynamics with atomic-site specificity requires the extension of sub-femtosecond pulses to the soft X-ray spectral region. Here we report the generation of isolated GW-scale soft X-ray attosecond pulses with an X-ray free-electron laser. Our source has a pulse energy that is six orders of magnitude larger than any other source of isolated attosecond pulses in the soft X-ray spectral region, with a peak power in the tens of gigawatts. This unique combination of high intensity, high photon energy and short pulse duration enables the investigation of electron dynamics with X-ray non-linear spectroscopy and single-particle imaging. △ Less

Submitted 25 June, 2019; originally announced June 2019.

Journal ref: Duris, J., Li, S., Driver, T. et al. Nat. Photonics 14, 30-36 (2020)

arXiv:1807.05139 [pdf, ps, other]

A Tight Lower Bound for Clock Synchronization in Odd-Ary M-Toroids

Authors: Reginald Frank, Jennifer L. Welch

Abstract: Synchronizing clocks in a distributed system in which processes communicate through messages with uncertain delays is subject to inherent errors. Prior work has shown upper and lower bounds on the best synchronization achievable in a variety of network topologies and assumptions about the uncertainty on the message delays. However, until now there has not been a tight closed-form expression for th… ▽ More Synchronizing clocks in a distributed system in which processes communicate through messages with uncertain delays is subject to inherent errors. Prior work has shown upper and lower bounds on the best synchronization achievable in a variety of network topologies and assumptions about the uncertainty on the message delays. However, until now there has not been a tight closed-form expression for the optimal synchronization in $k$-ary $m$-cubes with wraparound, where $k$ is odd. In this paper, we prove a lower bound of $\frac{1}{4}um\left(k-\frac{1}{k}\right)$, where $k$ is the (odd) number of processes in the each of the $m$ dimensions, and $u$ is the uncertainty in delay on every link. Our lower bound matches the previously known upper bound. △ Less

Submitted 13 July, 2018; originally announced July 2018.

Comments: 5 pages, 4 figures, to appear as a brief announcement at 2018 International Symposium on Distributed Computing (2018)

arXiv:1806.10241 [pdf, other]

Cascaded Lattice Boltzmann Method based on Central Moments for Axisymmetric Thermal Flows Including Swirling Effects

Authors: Farzaneh Hajabdollahi, Kannan N. Premnath, Samuel W. J. Welch

Abstract: A cascaded lattice Boltzmann (LB) approach based on central moments and multiple relaxation times to simulate thermal convective flows, which are driven by buoyancy forces and/or swirling effects, in the cylindrical coordinate system with axial symmetry is presented. In this regard, the dynamics of the axial and radial momentum components along with the pressure are represented by means of the 2D… ▽ More A cascaded lattice Boltzmann (LB) approach based on central moments and multiple relaxation times to simulate thermal convective flows, which are driven by buoyancy forces and/or swirling effects, in the cylindrical coordinate system with axial symmetry is presented. In this regard, the dynamics of the axial and radial momentum components along with the pressure are represented by means of the 2D Navier-Stokes equations with geometric mass and momentum source terms in the pseudo Cartesian form, while the evolutions of the azimuthal momentum and the temperature field are each modeled by an advection-diffusion type equation with appropriate local source terms. Based on these, cascaded LB schemes involving three distribution functions are formulated to solve for the fluid motion in the meridian plane using a D2Q9 lattice, and to solve for the azimuthal momentum and the temperature field each using a D2Q5 lattice. The geometric mass and momentum source terms for the flow fields and the energy source term for the temperature field are included using a new symmetric operator splitting technique, via pre-collision and post-collision source steps around the cascaded collision step for each distribution function. These result in a particularly simple and compact formulation to directly represent the effect of various geometric source terms consistently in terms of changes in the appropriate zeroth and first order moments. Simulations of several complex buoyancy-driven thermal flows and including rotational effects in cylindrical geometries using the new axisymmetric cascaded LB schemes show good agreement with prior benchmark results for the structures of the velocity and thermal fields as well as the heat transfer rates given in terms of the Nusselt numbers. △ Less

Submitted 26 June, 2018; originally announced June 2018.

Comments: 49 pages,12 figures

arXiv:1802.03532 [pdf, other]

Bayesian Optimization Using Monotonicity Information and Its Application in Machine Learning Hyperparameter

Authors: Wenyi Wang, William J. Welch

Abstract: We propose an algorithm for a family of optimization problems where the objective can be decomposed as a sum of functions with monotonicity properties. The motivating problem is optimization of hyperparameters of machine learning algorithms, where we argue that the objective, validation error, can be decomposed as monotonic functions of the hyperparameters. Our proposed algorithm adapts Bayesian o… ▽ More We propose an algorithm for a family of optimization problems where the objective can be decomposed as a sum of functions with monotonicity properties. The motivating problem is optimization of hyperparameters of machine learning algorithms, where we argue that the objective, validation error, can be decomposed as monotonic functions of the hyperparameters. Our proposed algorithm adapts Bayesian optimization methods to incorporate the monotonicity constraints. We illustrate the advantages of exploiting monotonicity using illustrative examples and demonstrate the improvements in optimization efficiency for some machine learning hyperparameter tuning applications. △ Less

Submitted 16 February, 2018; v1 submitted 10 February, 2018; originally announced February 2018.

Comments: Citation style errors fixed

arXiv:1710.01698

Time-Resolved Pulse Propagation in Glass in Single-Shot

Authors: Yen-Yu Chang, Zhengyan Li, James Welch, Rafal Zgadzaj, Aaron Bernstein, Michael C. Downer

Abstract: We report time-resolved pulse self-steepening and temporal splitting in flint glass (SF11) in single-shot using broadband frequency-domain streak camera (B-FDSC). The broadband ($60$ nm) probe beam generated through a compact coverslip array provides $\sim 40$ fs temporal resolution. The experimental results support the theoretical model of pulse self-steepening and indicate that multiphoton ioniz… ▽ More We report time-resolved pulse self-steepening and temporal splitting in flint glass (SF11) in single-shot using broadband frequency-domain streak camera (B-FDSC). The broadband ($60$ nm) probe beam generated through a compact coverslip array provides $\sim 40$ fs temporal resolution. The experimental results support the theoretical model of pulse self-steepening and indicate that multiphoton ionization (MPI) initiates the pulse splitting process in glass. We perform a three-dimensional simulation to verify the experimental results. △ Less

Submitted 5 November, 2019; v1 submitted 4 October, 2017; originally announced October 2017.

Comments: Require major revision

arXiv:1710.01454

Observation of Plasma Bubble Structures in a GeV Laser-Plasma Accelerator

Authors: Yen-Yu Chang, Kathleen Weichman, Xiantao Cheng, Joseph M. Shaw, James Welch, Maxwell LaBerge, Andrea Hannasch, Rafal Zgadzaj, Aaron Bernstein, Watson Henderson, Michael C. Downer

Abstract: We measure characteristics of plasma bubbles in GeV-class laser-plasma accelerators (LPAs) using Faraday rotation diagnostics. We extend these techniques, previously demonstrated for LPAs in atmospheric density plasmas (electron density $n_e >10^{19}$ cm$^{-3}$), to LPAs in low-density plasmas ($n_e \approx 5\times10^{17}$ cm$^{-3}$), in which plasma bubbles are $\sim 5$ times larger, and correspo… ▽ More We measure characteristics of plasma bubbles in GeV-class laser-plasma accelerators (LPAs) using Faraday rotation diagnostics. We extend these techniques, previously demonstrated for LPAs in atmospheric density plasmas (electron density $n_e >10^{19}$ cm$^{-3}$), to LPAs in low-density plasmas ($n_e \approx 5\times10^{17}$ cm$^{-3}$), in which plasma bubbles are $\sim 5$ times larger, and correspondingly easier to visualize in detail. The signals show $\approx 0.5^\circ$ rotation streaks of opposite sign separated by $\sim50$ $μ$m, consistent with bubble diameter; no on-axis rotation; streaks length consistent with transverse probe pulse duration ($180$ $μ$m for $500$ fs pulse length, and $600$ $μ$m for $2$ ps pulse length). We utilized an anamorphic imaging system to obtain a wide longitudinal field of view ($>1$ cm) and a high transverse resolution ($<9$ $μ$m). We also demonstrated that Faraday rotation signals are sensitive to the stages of acceleration processes using extended 2D Finite Difference Time Domain (FDTD) simulation. △ Less

Submitted 5 November, 2019; v1 submitted 3 October, 2017; originally announced October 2017.

Comments: Require major revision

arXiv:1708.07013 [pdf, ps, other]

doi 10.1021/acs.nanolett.7b03441

Characterization of the sub-micrometer hierarchy levels in the twist-bend nematic phase with nanometric helices via photopolymerization. Explanation for the sign reversal in the polar response

Authors: Vitaly P. Panov, Sithara P. Sreenilayam, Yuri P. Panarin, Jagdish K. Vij, Chris J. Welch, Georg H. Mehl

Abstract: Photo-polymerization of a reactive mesogen mixed with a mesogenic dimer, shown to exhibit the twist-bend nematic phase ($N_{TB}$), reveals the complex structure of the self-deformation patterns observed in planar cells. The polymerized reactive mesogen retains the structure formed by liquid crystalline molecules in the twist bend phase, thus enabling observation by Scanning Electron Microscope (SE… ▽ More Photo-polymerization of a reactive mesogen mixed with a mesogenic dimer, shown to exhibit the twist-bend nematic phase ($N_{TB}$), reveals the complex structure of the self-deformation patterns observed in planar cells. The polymerized reactive mesogen retains the structure formed by liquid crystalline molecules in the twist bend phase, thus enabling observation by Scanning Electron Microscope (SEM). Hierarchical ordering scales from tens of nanometers to micrometers are imaged in detail. Submicron features, anticipated from earlier X-ray experiments, are visualized directly. In the self-deformation stripes formed in the $N_{TB}$ phase, the average director field is found tilted in the cell plane by an angle of up to 45$^{\circ}$ from the cell rubbing direction. This tilting explains the sign inversion being observed in the electro-optical studies. △ Less

Submitted 23 August, 2017; originally announced August 2017.

arXiv:1708.03274 [pdf, ps, other]

Simulating a Shared Register in a System that Never Stops Changing

Authors: Hagit Attiya, Hyun Chul Chung, Faith Ellen, Saptaparni Kumar, Jennifer L. Welch

Abstract: Simulating a shared register can mask the intricacies of designing algorithms for asynchronous message-passing systems subject to crash failures, since it allows them to run algorithms designed for the simpler shared-memory model. Typically such simulations replicate the value of the register in multiple servers and require readers and writers to communicate with a majority of servers. The success… ▽ More Simulating a shared register can mask the intricacies of designing algorithms for asynchronous message-passing systems subject to crash failures, since it allows them to run algorithms designed for the simpler shared-memory model. Typically such simulations replicate the value of the register in multiple servers and require readers and writers to communicate with a majority of servers. The success of this approach for static systems, where the set of nodes (readers, writers, and servers) is fixed, has motivated several similar simulations for dynamic systems, where nodes may enter and leave. However, existing simulations need to assume that the system eventually stops changing for a long enough period or that the system size is bounded. This paper presents the first simulation of an atomic read/write register in a crash-prone asynchronous system that can change size and withstand nodes continually entering and leaving. The simulation allows the system to keep changing, provided that the number of nodes entering and leaving during a fixed time interval is at most a constant fraction of the current system size. The simulation also tolerates node crashes as long as the number of failed nodes in the system is at most a constant fraction of the current system size. △ Less

Submitted 10 August, 2017; originally announced August 2017.

arXiv:1708.02906 [pdf, other]

Implementing $\Diamond P$ with Bounded Messages on a Network of ADD Channels

Authors: Saptaparni Kumar, Jennifer Welch

Abstract: We present an implementation of the eventually perfect failure detector ($\Diamond P$) from the original hierarchy of the Chandra-Toueg oracles on an arbitrary partitionable network composed of unreliable channels that can lose and reorder messages. Prior implementations of $\Diamond P$ have assumed different partially synchronous models ranging from bounded point-to-point message delay and reliab… ▽ More We present an implementation of the eventually perfect failure detector ($\Diamond P$) from the original hierarchy of the Chandra-Toueg oracles on an arbitrary partitionable network composed of unreliable channels that can lose and reorder messages. Prior implementations of $\Diamond P$ have assumed different partially synchronous models ranging from bounded point-to-point message delay and reliable communication to unbounded message size and known network topologies. We implement $\Diamond P$ under very weak assumptions on an arbitrary, partitionable network composed of Average Delayed/Dropped (ADD) channels to model unreliable communication. Unlike older implementations, our failure detection algorithm uses bounded-sized messages to eventually detect all nodes that are unreachable (crashed or disconnected) from it. △ Less

Submitted 9 August, 2017; originally announced August 2017.

arXiv:1708.02187 [pdf]

doi 10.1038/s41467-018-05968-x

Hot Carrier Dynamics in Photoexcited Gold Nanostructures: Role of Interband Excitations and Evidence for Ballistic Transport

Authors: Giulia Tagliabue, Adam S Jermyn, Ravishankar Sundararaman, Alex J Welch, Joseph S DuChene, Artur R Davoyan, Prineha Narang, Harry A Atwater

Abstract: Harnessing short-lived photoexcited electron-hole pairs in metal nanostructures has the potential to define a new phase of optoelectronics, enabling control of athermal mechanisms for light harvesting, photodetection and photocatalysis. To date, however, the spatiotemporal dynamics and transport of these photoexcited carriers have been only qualitatively characterized. Plasmon excitation has been… ▽ More Harnessing short-lived photoexcited electron-hole pairs in metal nanostructures has the potential to define a new phase of optoelectronics, enabling control of athermal mechanisms for light harvesting, photodetection and photocatalysis. To date, however, the spatiotemporal dynamics and transport of these photoexcited carriers have been only qualitatively characterized. Plasmon excitation has been widely viewed as an efficient mechanism for generating non-thermal hot carriers. Despite numerous experiments, conclusive evidence elucidating and quantifying the full dynamics of hot carrier generation, transport, and injection has not been reported. Here, we combine experimental measurements with coupled first-principles electronic structure theory and Boltzmann transport calculations to provide unprecedented insight into the internal quantum efficiency, and hence internal physics, of hot carriers in photoexcited gold (Au)-gallium nitride (GaN) nanostructures. Our results indicate that photoexcited electrons generated in 20 nm-thick Au nanostructures im**e ballistically on the Au-GaN interface. This discovery suggests that the energy of hot carriers could be harnessed from metal nanostructures without substantial losses via thermalization. Measurements and calculations also reveal the important role of metal band structure in hot carrier generation at energies above the interband threshold of the plasmonic nanoantenna. Taken together, our results advance the understanding of excited carrier dynamics in realistically-scaled metallic nanostructures and lay the foundations for the design of new optoelectronic devices that operate in the ballistic regime. △ Less

Submitted 7 August, 2017; originally announced August 2017.

Comments: 9 pages, 4 figures

arXiv:1707.00727 [pdf, other]

Regression Phalanxes

Authors: Hongyang Zhang, William J. Welch, Ruben H. Zamar

Abstract: Tomal et al. (2015) introduced the notion of "phalanxes" in the context of rare-class detection in two-class classification problems. A phalanx is a subset of features that work well for classification tasks. In this paper, we propose a different class of phalanxes for application in regression settings. We define a "Regression Phalanx" - a subset of features that work well together for prediction… ▽ More Tomal et al. (2015) introduced the notion of "phalanxes" in the context of rare-class detection in two-class classification problems. A phalanx is a subset of features that work well for classification tasks. In this paper, we propose a different class of phalanxes for application in regression settings. We define a "Regression Phalanx" - a subset of features that work well together for prediction. We propose a novel algorithm which automatically chooses Regression Phalanxes from high-dimensional data sets using hierarchical clustering and builds a prediction model for each phalanx for further ensembling. Through extensive simulation studies and several real-life applications in various areas (including drug discovery, chemical analysis of spectra data, microarray analysis and climate projections) we show that an ensemble of Regression Phalanxes improves prediction accuracy when combined with effective prediction methods like Lasso or Random Forests. △ Less

Submitted 3 July, 2017; originally announced July 2017.

arXiv:1706.06971 [pdf, ps, other]

Ensembles of phalanxes across assessment metrics for robust ranking of homologous proteins

Authors: Jabed H Tomal, William J Welch, Ruben H Zamar

Abstract: Two proteins are homologous if they have a common evolutionary origin, and the binary classification problem is to identify proteins in a candidate set that are homologous to a particular native protein. The feature (explanatory) variables available for classification are various measures of similarity of proteins. There are multiple classification problems of this type for different native protei… ▽ More Two proteins are homologous if they have a common evolutionary origin, and the binary classification problem is to identify proteins in a candidate set that are homologous to a particular native protein. The feature (explanatory) variables available for classification are various measures of similarity of proteins. There are multiple classification problems of this type for different native proteins and their respective candidate sets. Homologous proteins are rare in a single candidate set, giving a highly unbalanced two-class problem. The goal is to rank proteins in a candidate set according to the probability of being homologous to the set's native protein. An ideal classifier will place all the homologous proteins at the head of such a list. Our approach uses an ensemble of models in a classifier and an ensemble of assessment metrics. For a given metric a classifier combines models, each based on a subset of the available feature variables which we call phalanxes. The proposed ensemble of phalanxes identifies strong and diverse subsets of feature variables. A second phase of ensembling aggregates classifiers based on diverse evaluation metrics. The overall result is called an ensemble of phalanxes and metrics. It provide robustness against both close and distant homologues. △ Less

Submitted 9 September, 2019; v1 submitted 21 June, 2017; originally announced June 2017.

Comments: 29 pages, 4 figures, 8 tables and 2 algorithms

arXiv:1705.08637 [pdf, other]

Bright 5 - 85 MeV Compton gamma-ray pulses from GeV laser-plasma accelerator and plasma mirror

Authors: J. M. Shaw, A. C. Bernstein, R. Zgadzaj, A. Hannasch, M. LaBerge, Y. Y. Chang, K. Weichman, J. Welch, W. Henderson, H. -E. Tsai, N. Fazel, X. Wang, T. Ditmire, M. Donovan, G. Dyer, E. Gaul, J. Gordon, M. Martinez, M. Spinks, T. Toncian, C. Wagner, M. C. Downer

Abstract: We convert a GeV laser-plasma electron accelerator into a compact femtosecond-pulsed $γ$-ray source by inserting a $100 μ$m-thick glass plate $\sim3$ cm after the accelerator exit. With near-unity reliability, and requiring only crude alignment, this glass plasma mirror retro-reflected spent drive laser pulses (photon energy $\hbarω_L = 1.17$ eV) with $>50\%$ efficiency back onto trailing electron… ▽ More We convert a GeV laser-plasma electron accelerator into a compact femtosecond-pulsed $γ$-ray source by inserting a $100 μ$m-thick glass plate $\sim3$ cm after the accelerator exit. With near-unity reliability, and requiring only crude alignment, this glass plasma mirror retro-reflected spent drive laser pulses (photon energy $\hbarω_L = 1.17$ eV) with $>50\%$ efficiency back onto trailing electrons (peak Lorentz factor $1000 < γ_e < 4400$), creating an optical undulator that generated $\sim10^8 γ$-ray photons with sub-mrad divergence, estimated peak brilliance $\sim10^{21}$ photons/s/mm$^2$/mrad$^2$/$0.1\%$ bandwidth and negligible bremsstrahlung background. The $γ$-ray photon energy $E_γ= 4γ_e^2 \hbarω_L$, inferred from the measured $γ_e$ on each shot, peaked from 5 to 85 MeV, spanning a range otherwise available with comparable brilliance only from large-scale GeV-linac-based high-intensity $γ$-ray sources. △ Less

Submitted 24 May, 2017; originally announced May 2017.

arXiv:1702.02252 [pdf]

doi 10.1088/1538-3873/aa5d4f

New Cooled Feeds for the Allen Telescope Array

Authors: Wm. J. Welch, Matthew Fleming, Chris Munson, Jill Tarter, G. R. Harp, Robert Spencer, Niklas Wadefalk

Abstract: We have developed a new generation of low noise, broadband feeds for the Allen Telescope Array at the Hat Creek Observatory in Northern California. The new feeds operate over the frequency range 0.9 to 14 GHz. The noise temperatures of the feeds have been substantially improved by cooling the entire feed structure as well as the low noise amplifiers to 70 K. To achieve this improved performance, t… ▽ More We have developed a new generation of low noise, broadband feeds for the Allen Telescope Array at the Hat Creek Observatory in Northern California. The new feeds operate over the frequency range 0.9 to 14 GHz. The noise temperatures of the feeds have been substantially improved by cooling the entire feed structure as well as the low noise amplifiers to 70 K. To achieve this improved performance, the new feeds are mounted in glass vacuum bottles with plastic lenses that maximize the microwave transmission through the bottles. Both the cooled feeds and their low noise amplifiers produce total system temperatures that are in the range 25-30 K from 1 GHz to 5 GHz and 40-50 K up to 12.5 GHz. △ Less

Submitted 7 February, 2017; originally announced February 2017.

Comments: 16 pages, 11 figures, accepted to Publications of the Astronomical Society of the Pacific

Journal ref: Publications of the Astronomical Society of the Pacific 129.974 (2017): 045002

arXiv:1506.03679 [pdf, other]

doi 10.1088/0004-637X/808/1/102

Resolving Protoplanetary Disks at Millimeter Wavelengths by CARMA

Authors: Woo** Kwon, Leslie W. Looney, Lee G. Mundy, William J. Welch

Abstract: We present continuum observations at 1.3 and 2.7 mm using the Combined Array for Research in Millimeter-wave Astronomy (CARMA) toward six protoplanetary disks in the Taurus molecular cloud: CI Tau, DL Tau, DO Tau, FT Tau, Haro 6-13, and HL Tau. We constrain physical properties of the disks with Bayesian inference using two disk models; flared power-law disk model and flared accretion disk model. C… ▽ More We present continuum observations at 1.3 and 2.7 mm using the Combined Array for Research in Millimeter-wave Astronomy (CARMA) toward six protoplanetary disks in the Taurus molecular cloud: CI Tau, DL Tau, DO Tau, FT Tau, Haro 6-13, and HL Tau. We constrain physical properties of the disks with Bayesian inference using two disk models; flared power-law disk model and flared accretion disk model. Comparing the physical properties, we find that the more extended disks are less flared and that the dust opacity spectral index (beta) is smaller in the less massive disks. In addition, disks with a steeper mid-plane density gradient have a smaller beta, which suggests that grains grow and radially move. Furthermore, we compare the two disk models quantitatively and find that the accretion disk model provides a better fit overall. We also discuss the possibilities of substructures on three extended protoplanetary disks. △ Less

Submitted 11 June, 2015; originally announced June 2015.

Comments: 45 pages, 10 figures, 7 tables, to be published in ApJ

arXiv:1412.5608 [pdf, other]

Efficient Approximation of Diagonal Unitaries over the Clifford+T Basis

Authors: Jonathan Welch, Alex Bocharov, Krysta M. Svore

Abstract: We present an algorithm for the approximate decomposition of diagonal operators, focusing specifically on decompositions over the Clifford+$T$ basis, that minimize the number of phase-rotation gates in the synthesized approximation circuit. The equivalent $T$-count of the synthesized circuit is bounded by $k \, C_0 \log_2(1/\varepsilon) + E(n,k)$, where $k$ is the number of distinct phases in the… ▽ More We present an algorithm for the approximate decomposition of diagonal operators, focusing specifically on decompositions over the Clifford+$T$ basis, that minimize the number of phase-rotation gates in the synthesized approximation circuit. The equivalent $T$-count of the synthesized circuit is bounded by $k \, C_0 \log_2(1/\varepsilon) + E(n,k)$, where $k$ is the number of distinct phases in the diagonal $n$-qubit unitary, $\varepsilon$ is the desired precision, $C_0$ is a quality factor of the implementation method ($1<C_0<4$), and $E(n,k)$ is the total entanglement cost (in $T$ gates). We determine an optimal decision boundary in $(k,n,\varepsilon)$-space where our decomposition algorithm achieves lower entanglement cost than previous state-of-the-art techniques. Our method outperforms state-of-the-art techniques for a practical range of $\varepsilon$ values and diagonal operators and can reduce the number of $T$ gates exponentially in $n$ when $k << 2^n$. △ Less

Submitted 18 November, 2015; v1 submitted 17 December, 2014; originally announced December 2014.

Comments: 18 pages, 8 figures; introduction improved for readability, references added (in particular to Dawson & Nielsen)

ACM Class: D.3.4

Journal ref: Quantum Information and Computation, vol. 16, no. 1,2, pp. 87-104, Rinton Press, January 2016

arXiv:1411.1449 [pdf, other]

doi 10.1103/PhysRevE.91.022603

Atomistic study of macroscopic analogs to short chain molecules

Authors: Kyle J. Welch, Clayton S. G. Kilmer, Eric I. Corwin

Abstract: We use a bath of chaotic surface waves in water to mechanically and macroscopically mimic the thermal behavior of a short articulated chain with only nearest-neighbor interactions. The chaotic waves provide isotropic and random agitation to which a temperature can be ascribed, allowing the chain to passively explore its degrees of freedom in analogy to thermal motion. We track the chain in real ti… ▽ More We use a bath of chaotic surface waves in water to mechanically and macroscopically mimic the thermal behavior of a short articulated chain with only nearest-neighbor interactions. The chaotic waves provide isotropic and random agitation to which a temperature can be ascribed, allowing the chain to passively explore its degrees of freedom in analogy to thermal motion. We track the chain in real time and infer end-to-end potentials using Boltzmann statistics. We extrapolate our results, by using Monte Carlo simulations of self-avoiding polymers, to lengths not accessible in our system. In the long chain limit we demonstrate universal scaling of the statistical parameters of all chains in agreement with well-known predictions for self-avoiding walks. However, we find that the behavior of chains below a characteristic length scale is fundamentally different. We find that short chains have much greater compressional stiffness than would be expected. However, chains rapidly soften as length increases to meet with expected scalings. △ Less

Submitted 17 February, 2015; v1 submitted 5 November, 2014; originally announced November 2014.

Comments: 6 pages, 5 figures, 3 supplemental figures

Journal ref: Phys. Rev. E 91, 022603 (2015)

arXiv:1306.3991 [pdf, ps, other]

doi 10.1088/1367-2630/16/3/033040

Efficient Quantum Circuits for Diagonal Unitaries Without Ancillas

Authors: Jonathan Welch, Daniel Greenbaum, Sarah Mostame, Alán Aspuru-Guzik

Abstract: The accurate evaluation of diagonal unitary operators is often the most resource-intensive element of quantum algorithms such as real-space quantum simulation and Grover search. Efficient circuits have been demonstrated in some cases but generally require ancilla registers, which can dominate the qubit resources. In this paper, we point out a correspondence between Walsh functions and a basis for… ▽ More The accurate evaluation of diagonal unitary operators is often the most resource-intensive element of quantum algorithms such as real-space quantum simulation and Grover search. Efficient circuits have been demonstrated in some cases but generally require ancilla registers, which can dominate the qubit resources. In this paper, we point out a correspondence between Walsh functions and a basis for diagonal operators that gives a simple way to construct efficient circuits for diagonal unitaries without ancillas. This correspondence reduces the problem of constructing the minimal-depth circuit within a given error tolerance, for an arbitrary diagonal unitary $e^{if(\hat{x})}$ in the $|x>$ basis, to that of finding the minimal-length Walsh-series approximation to the function $f(x)$. We apply this approach to the quantum simulation of the classical Eckart barrier problem of quantum chemistry, demonstrating that high-fidelity quantum simulations can be achieved with few qubits and low depth. △ Less

Submitted 17 June, 2013; originally announced June 2013.

Comments: 8 pages, 7 figures

arXiv:1303.4805 [pdf, ps, other]

doi 10.1214/14-AOAS778

Ensembling classification models based on phalanxes of variables with applications in drug discovery

Authors: Jabed H. Tomal, William J. Welch, Ruben H. Zamar

Abstract: Statistical detection of a rare class of objects in a two-class classification problem can pose several challenges. Because the class of interest is rare in the training data, there is relatively little information in the known class response labels for model building. At the same time the available explanatory variables are often moderately high dimensional. In the four assays of our drug-discove… ▽ More Statistical detection of a rare class of objects in a two-class classification problem can pose several challenges. Because the class of interest is rare in the training data, there is relatively little information in the known class response labels for model building. At the same time the available explanatory variables are often moderately high dimensional. In the four assays of our drug-discovery application, compounds are active or not against a specific biological target, such as lung cancer tumor cells, and active compounds are rare. Several sets of chemical descriptor variables from computational chemistry are available to classify the active versus inactive class; each can have up to thousands of variables characterizing molecular structure of the compounds. The statistical challenge is to make use of the richness of the explanatory variables in the presence of scant response information. Our algorithm divides the explanatory variables into subsets adaptively and passes each subset to a base classifier. The various base classifiers are then ensembled to produce one model to rank new objects by their estimated probabilities of belonging to the rare class of interest. The essence of the algorithm is to choose the subsets such that variables in the same group work well together; we call such groups phalanxes. △ Less

Submitted 15 May, 2015; v1 submitted 19 March, 2013; originally announced March 2013.

Comments: Published at http://dx.doi.org/10.1214/14-AOAS778 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS778

Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 1, 69-93

arXiv:1210.8246 [pdf]

doi 10.1109/TAP.2011.2122214

Primary Beam and Dish Surface Characterization at the Allen Telescope Array by Radio Holography

Authors: ATA GROUP, Shannon Atkinson, D. C. Backer, P. R. Backus, William Barott, Amber Bauermeister, Leo Blitz, D. C. -J. Bock, Geoffrey C. Bower, Tucker Bradford, Calvin Cheng, Steve Croft, Matt Dexter, John Dreher, Greg Engargiola, Ed Fields, Carl Heiles, Tamara Helfer, Jane Jordan, Susan Jorgensen, Tom Kilsdonk, Colby Gutierrez-Kraybill, Garrett Keating, Casey Law, John Lugten , et al. (24 additional authors not shown)

Abstract: The Allen Telescope Array (ATA) is a cm-wave interferometer in California, comprising 42 antenna elements with 6-m diameter dishes. We characterize the antenna optical accuracy using two-antenna interferometry and radio holography. The distortion of each telescope relative to the average is small, with RMS differences of 1 percent of beam peak value. Holography provides images of dish illumination… ▽ More The Allen Telescope Array (ATA) is a cm-wave interferometer in California, comprising 42 antenna elements with 6-m diameter dishes. We characterize the antenna optical accuracy using two-antenna interferometry and radio holography. The distortion of each telescope relative to the average is small, with RMS differences of 1 percent of beam peak value. Holography provides images of dish illumination pattern, allowing characterization of as-built mirror surfaces. The ATA dishes can experience mm-scale distortions across -2 meter lengths due to mounting stresses or solar radiation. Experimental RMS errors are 0.7 mm at night and 3 mm under worst case solar illumination. For frequencies 4, 10, and 15 GHz, the nighttime values indicate sensitivity losses of 1, 10 and 20 percent, respectively. The ATA.s exceptional wide-bandwidth permits observations over a continuous range 0.5 to 11.2 GHz, and future retrofits may increase this range to 15 GHz. Beam patterns show a slowly varying focus frequency dependence. We probe the antenna optical gain and beam pattern stability as a function of focus and observation frequency, concluding that ATA can produce high fidelity images over a decade of simultaneous observation frequencies. In the day, the antenna sensitivity and pointing accuracy are affected. We find that at frequencies greater than 5 GHz, daytime observations greater than 5 GHz will suffer some sensitivity loss and it may be necessary to make antenna pointing corrections on a 1 to 2 hourly basis. △ Less

Submitted 31 October, 2012; originally announced October 2012.

Comments: 19 pages, 23 figures, 3 tables, Authors indicated by an double dagger (‡) are affiliated with the SETI Institute, Mountain View, CA 95070. Authors indicated by a section break (§) are affiliated with the Hat Creek Radio Observatory and/or the Radio Astronomy Laboratory, both affiliated with the University of California Berkeley, Berkeley CA

Journal ref: IEEE Transactions on Antennas and Propagation, 59 (2011) p. 2004

arXiv:1205.5829 [pdf]

doi 10.1371/journal.pgen.1003129

Population genomics of the Wolbachia endosymbiont in Drosophila melanogaster

Authors: Mark F. Richardson, Lucy A. Weinert, John J. Welch, Raquel S. Linheiro, Michael M. Magwire, Francis M. Jiggins, Casey M. Bergman

Abstract: Wolbachia are maternally-inherited symbiotic bacteria commonly found in arthropods, which are able to manipulate the reproduction of their host in order to maximise their transmission. Here we use whole genome resequencing data from 290 lines of Drosophila melanogaster from North America, Europe and Africa to predict Wolbachia infection status, estimate cytoplasmic genome copy number, and reconstr… ▽ More Wolbachia are maternally-inherited symbiotic bacteria commonly found in arthropods, which are able to manipulate the reproduction of their host in order to maximise their transmission. Here we use whole genome resequencing data from 290 lines of Drosophila melanogaster from North America, Europe and Africa to predict Wolbachia infection status, estimate cytoplasmic genome copy number, and reconstruct Wolbachia and mtDNA genome sequences. Complete Wolbachia and mitochondrial genomes show congruent phylogenies, consistent with strict vertical transmission through the maternal cytoplasm and imperfect transmission of Wolbachia. Bayesian phylogenetic analysis reveals that the most recent common ancestor of all Wolbachia and mitochondrial genomes in D. melanogaster dates to around 8,000 years ago. We find evidence for a recent incomplete global replacement of ancestral Wolbachia and mtDNA lineages, which is likely to be one of several similar incomplete replacement events that have occurred since the out-of-Africa migration that allowed D. melanogaster to colonize worldwide habitats. △ Less

Submitted 6 November, 2012; v1 submitted 25 May, 2012; originally announced May 2012.

Comments: 41 pages, 5 figures

Journal ref: Richardson MF, Weinert LA, Welch JJ, Linheiro RS, Magwire MM, et al. (2012) Population Genomics of the Wolbachia Endosymbiont in Drosophila melanogaster. PLoS Genet 8(12): e1003129

arXiv:1202.6536 [pdf, ps, other]

doi 10.1214/11-AOAS491

Efficient, adaptive cross-validation for tuning and comparing models, with application to drug discovery

Authors: Hui Shen, William J. Welch, Jacqueline M. Hughes-Oliver

Abstract: Cross-validation (CV) is widely used for tuning a model with respect to user-selected parameters and for selecting a "best" model. For example, the method of $k$-nearest neighbors requires the user to choose $k$, the number of neighbors, and a neural network has several tuning parameters controlling the network complexity. Once such parameters are optimized for a particular data set, the next step… ▽ More Cross-validation (CV) is widely used for tuning a model with respect to user-selected parameters and for selecting a "best" model. For example, the method of $k$-nearest neighbors requires the user to choose $k$, the number of neighbors, and a neural network has several tuning parameters controlling the network complexity. Once such parameters are optimized for a particular data set, the next step is often to compare the various optimized models and choose the method with the best predictive performance. Both tuning and model selection boil down to comparing models, either across different values of the tuning parameters or across different classes of statistical models and/or sets of explanatory variables. For multiple large sets of data, like the PubChem drug discovery cheminformatics data which motivated this work, reliable CV comparisons are computationally demanding, or even infeasible. In this paper we develop an efficient sequential methodology for model comparison based on CV. It also takes into account the randomness in CV. The number of models is reduced via an adaptive, multiplicity-adjusted sequential algorithm, where poor performers are quickly eliminated. By exploiting matching of individual observations, it is sometimes even possible to establish the statistically significant inferiority of some models with just one execution of CV. △ Less

Submitted 29 February, 2012; originally announced February 2012.

Comments: Published in at http://dx.doi.org/10.1214/11-AOAS491 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS491

Journal ref: Annals of Applied Statistics 2011, Vol. 5, No. 4, 2668-2687

arXiv:1109.2522 [pdf, other]

Ion-sensitive phase transitions driven by Debye-Hückel non-ideality

Authors: Kyle J. Welch, Fred Gittes

Abstract: We find that the Debye-Hückel nonideality of dilute aqueous electrolytes is sufficient to drive volume phase transitions and criticality, even in the absence of a self-attracting or elastic network. Our result follows from a Landau mean-field theory for a system of confined ions in an external solution of mixed-valence counterions, where the ratio of squared monovalent to divalent ion concentratio… ▽ More We find that the Debye-Hückel nonideality of dilute aqueous electrolytes is sufficient to drive volume phase transitions and criticality, even in the absence of a self-attracting or elastic network. Our result follows from a Landau mean-field theory for a system of confined ions in an external solution of mixed-valence counterions, where the ratio of squared monovalent to divalent ion concentration provides a temperature-like variable for the phase transition. Our analysis was motivated by long-studied volume phase transitions via ion exchange in ionic gels, but our findings agree with existing theory for volume-temperature phase transitions in charged hard-sphere models and other systems by Fisher and Levin, and McGahay and Tomozawa. Our mean-field model predicts a continuous line of gas-liquid-type critical points connecting a purely monovalent, divalent-sensitive critical point at one extreme with a divalent, monovalent-sensitive critical point at the other; an alternative representation of the Landau functional handles this second limit. It follows that critical sensitivity to ion valence is tunable to any desired valence ratio. The critical or discontinuous dependent variable can be the confinement volume; alternatively the internal electrical potential may be more convenient in applications. Our simplified conditions for ionic phase transitions to occur, together with our relatively simple theory to describe them, may facilitate exploration of tunable critical sensitivity in areas such as ion detection technology, biological switches and osmotic control. △ Less

Submitted 12 September, 2011; originally announced September 2011.

Comments: arXiv admin note: significant text overlap with arXiv:1102.3337

arXiv:1102.3337 [pdf, other]

Ionic phase transitions in non-ideal systems

Authors: Kyle J. Welch, Fred Gittes

Abstract: We construct an explicitly solvable Landau mean-field theory for volume phase transitions of confined or fixed ions driven by relative concentrations of divalent and monovalent counterions. Such phase transitions have been widely studied in ionic gels, where the mechanism relies on self-attraction or elasticity of a network. We find here that non-ideal behavior of ions in aqueous solution can in t… ▽ More We construct an explicitly solvable Landau mean-field theory for volume phase transitions of confined or fixed ions driven by relative concentrations of divalent and monovalent counterions. Such phase transitions have been widely studied in ionic gels, where the mechanism relies on self-attraction or elasticity of a network. We find here that non-ideal behavior of ions in aqueous solution can in theory drive phase transitions without a self-attracting or elastic network. We represent non-ideality by a Debye-Hückel-like power-law activity, or correlation free energy, and retain a mechanical self-repulsion to avoid runaway collapse due to the non-ideal term. Within this model we find a continuous line of gas-liquid-type critical points, connecting a purely monovalent, divalent-sensitive critical point at one extreme with a divalent, monovalent-sensitive critical point at the other. An alternative representation of the Landau functional handles the second case. We include a formula for electrical potential, which may be a convenient proxy for critically varying volume. Our relatively simple mean-field formulation may facilitate explorations of tunable critical sensitivity in areas such as ion detection technology and biological osmotic control. △ Less

Submitted 16 February, 2011; originally announced February 2011.

Comments: 4 pages. Version of Feb 14 2011

arXiv:1009.4443 [pdf, ps, other]

doi 10.1088/0004-637X/725/2/1792

The Allen Telescope Array Pi GHz Sky Survey I. Survey Description and Static Catalog Results for the Bootes Field

Authors: Geoffrey C. Bower, Steve Croft, Garrett Keating, David Whysong, Rob Ackermann, Shannon Atkinson, Don Backer, Peter Backus, Billy Barott, Amber Bauermeister, Leo Blitz, Douglas Bock, Tucker Bradford, Calvin Cheng, Chris Cork, Mike Davis, Dave DeBoer, Matt Dexter, John Dreher, Greg Engargiola, Ed Fields, Matt Fleming, R. James Forster, Colby Gutierrez-Kraybill, G. R. Harp , et al. (28 additional authors not shown)

Abstract: The Pi GHz Sky Survey (PiGSS) is a key project of the Allen Telescope Array. PiGSS is a 3.1 GHz survey of radio continuum emission in the extragalactic sky with an emphasis on synoptic observations that measure the static and time-variable properties of the sky. During the 2.5-year campaign, PiGSS will twice observe ~250,000 radio sources in the 10,000 deg^2 region of the sky with b > 30 deg to an… ▽ More The Pi GHz Sky Survey (PiGSS) is a key project of the Allen Telescope Array. PiGSS is a 3.1 GHz survey of radio continuum emission in the extragalactic sky with an emphasis on synoptic observations that measure the static and time-variable properties of the sky. During the 2.5-year campaign, PiGSS will twice observe ~250,000 radio sources in the 10,000 deg^2 region of the sky with b > 30 deg to an rms sensitivity of ~1 mJy. Additionally, sub-regions of the sky will be observed multiple times to characterize variability on time scales of days to years. We present here observations of a 10 deg^2 region in the Bootes constellation overlap** the NOAO Deep Wide Field Survey field. The PiGSS image was constructed from 75 daily observations distributed over a 4-month period and has an rms flux density between 200 and 250 microJy. This represents a deeper image by a factor of 4 to 8 than we will achieve over the entire 10,000 deg^2. We provide flux densities, source sizes, and spectral indices for the 425 sources detected in the image. We identify ~100$ new flat spectrum radio sources; we project that when completed PiGSS will identify 10^4 flat spectrum sources. We identify one source that is a possible transient radio source. This survey provides new limits on faint radio transients and variables with characteristic durations of months. △ Less

Submitted 27 September, 2010; v1 submitted 22 September, 2010; originally announced September 2010.

Comments: Accepted for publication in ApJ; revision submitted with extraneous figure removed

Showing 1–50 of 67 results for author: Welch, J