
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels

Authors: Dan Haramati, Tal Daniel, Aviv Tamar

Abstract: Manipulating objects is a hallmark of human intelligence, and an important task in domains such as robotics. In principle, Reinforcement Learning (RL) offers a general approach to learn object manipulation. In practice, however, domains with more than a few objects are difficult for RL agents due to the curse of dimensionality, especially when learning from raw image observations. In this work we propose a structured approach for visual RL that is suitable for representing multiple objects and their interaction, and use it to learn goal-conditioned manipulation of several objects. Key to our method is the ability to handle goals with dependencies between the objects (e.g., moving objects in a certain order). We further relate our architecture to the generalization capability of the trained agent, based on a theoretical result for compositional generalization, and demonstrate agents that learn with 3 objects but generalize to similar tasks with over 10 objects. Videos and code are available on the project website: https://sites.google.com/view/entity-centric-rl

Submitted 1 April, 2024; originally announced April 2024.

Comments: ICLR 2024 Spotlight. Videos and code are available on the project website: https://sites.google.com/view/entity-centric-rl

arXiv:2403.09373 [pdf, other]

doi 10.1103/PhysRevD.109.124012

Gravitational Waves in Chern-Simons-Gauss-Bonnet Gravity

Authors: Tatsuya Daniel, Leah Jenks

Abstract: It is known that the four-dimensional effective field theory arising from heterotic string theory is general relativity with both a Chern-Simons and Gauss-Bonnet term. We study the propagation of gravitational waves in this combination of Chern-Simons and Gauss-Bonnet gravity, both of which have an associated scalar field, the axion and the dilaton respectively, that are kinetically coupled. We review how the combination of dynamical Chern-Simons and Gauss-Bonnet gravities can arise from string theory as corrections to general relativity and show how the gravitational wave waveform is modified in such a theory. We compare our results to a novel framework recently introduced for parametrizing the parity-violating sector (Chern-Simons), and use that to guide our construction of a similar parametrization for the parity-conserving (Gauss-Bonnet) sector. In general, we find that the contributions from the parity-violating and parity-conserving sectors are similar. Moreover, the kinetic coupling between the axion and dilaton introduces an extra contribution to the parity-violating sector of the gravitational waves. Using our parametrization, we are able to comment on initial constraints for the theory parameters, including the time variations of the axion and dilaton.

Submitted 14 March, 2024; originally announced March 2024.

Comments: 15 pages, 1 figure

Journal ref: Phys. Rev. D 109 (2024) 12, 124012

arXiv:2403.00974 [pdf, ps, other]

Motif distribution and function of sparse deep neural networks

Authors: Olivia T. Zahn, Thomas L. Daniel, J. Nathan Kutz

Abstract: We characterize the connectivity structure of feed-forward, deep neural networks (DNNs) using network motif theory. To address whether a particular motif distribution is characteristic of the training task, or function of the DNN, we compare the connectivity structure of 350 DNNs trained to simulate a bio-mechanical flight control system with different randomly initialized parameters. We develop and implement algorithms for counting second- and third-order motifs and calculate their significance using their Z-score. The DNNs are trained to solve the inverse problem of the flight dynamics model in Bustamante, et al. (2022) (i.e., predict the controls necessary for controlled flight from the initial and final state-space inputs) and are sparsified through an iterative pruning and retraining algorithm Zahn, et al. (2022). We show that, despite random initialization of network parameters, enforced sparsity causes DNNs to converge to similar connectivity patterns as characterized by their motif distributions. The results suggest how neural network function can be encoded in motif distributions, suggesting a variety of experiments for informing function and control.

Submitted 1 March, 2024; originally announced March 2024.

arXiv:2310.15723 [pdf, other]

Data Processing Engine (DPE): Data Analysis Tool for Particle Tracking and Mixed Radiation Field Characterization with Pixel Detectors Timepix

Authors: Marek Lukas, Granja Carlos, Jakubek Jan, Ingerle Jan, Turecek Daniel, Vuolo Marco, Oancea Cristina

Abstract: Hybrid semiconductor pixelated detectors from the Timepix family are advanced detectors for online particle tracking, offering energy measurement and precise time stamping capabilities for particles of various types and energies. This inherent capability makes them highly suitable for various applications, including imaging, medical fields such as radiotherapy and particle therapy, space-based applications aboard satellites and the International Space Station, and industrial applications. The data generated by these detectors is complex, necessitating the development and deployment of various analytical techniques to extract essential information. For this purpose, and to aid the Timepix user community, the "Data Processing Engine" (DPE) was designed and developed as an advanced data processing tool built explicitly for Timepix detectors. The functionality of the DPE is structured into three distinct processing levels: i) Pre-processing: This phase involves clusterization and the application of necessary calibrations and corrections. ii) Processing: This stage includes particle classification, employing machine learning algorithms, and the recognition of radiation fields. iii) Post-processing: This stage involves various analyses, such as directional analysis, coincidence analysis, frame analysis, and Compton directional analysis, as well as the generation of physics products.
The core of the DPE is supported by an extensive experimental database containing calibrations and referential radiation fields of typical environments, including protons, ions, electrons, gamma rays and X-rays, as well as thermal and fast neutrons. To enhance accessibility, the DPE is available through several user interfaces: a command-line tool, an application programming interface, and a graphical user interface in the form of a web portal.

Submitted 24 October, 2023; originally announced October 2023.

Comments: 9 pages, proceedings IWORID

arXiv:2310.00127 [pdf, other]

Sensor Placement for Flapping Wing Model Using Stochastic Observability Gramians

Authors: Burak Boyacıoğlu, Mahnoush Babaei, Amanuel H. Mamo, Sarah Bergbreiter, Thomas L. Daniel, Kristi A. Morgansen

Abstract: Systems in nature are stochastic as well as nonlinear. In traditional applications, engineered filters aim to minimize the stochastic effects caused by process and measurement noise. Conversely, a previous study showed that the process noise can reveal the observability of a system that was initially categorized as unobservable when deterministic tools were used. In this paper, we develop a stochastic framework to explore observability analysis and sensor placement. This framework allows for direct studies of the effects of stochasticity on optimal sensor placement and selection to improve filter error covariance. Numerical results are presented for sensor selection that optimizes stochastic empirical observability in a bioinspired setting.

Submitted 10 July, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

Comments: 12 pages, 5 figures, 2 tables, to be published in the proceedings of the 2024 American Control Conference

MSC Class: 93B07

arXiv:2308.00111 [pdf, other]

doi 10.1088/1475-7516/2023/12/041

An SZ-Like Effect on Cosmological Gravitational Wave Backgrounds

Authors: Tatsuya Daniel, Marcell Howard, Morgane König

Abstract: Cosmological gravitational wave backgrounds (CGWBs) are the conglomeration of unresolved gravitational wave signals from early Universe sources, which make them a promising tool for cosmologists. Because gravitons decouple from the cosmic plasma early on, one can consider interactions between gravitons and any particle species that were present in the very early Universe. We show that analogous to the cosmic microwave background, elastic scattering on any cosmological background will induce small distortions in its energy density spectrum. We then quantify the magnitude of these spin-dependent spectral distortions when attributed to the dark matter in the early Universe. Lastly, we give estimates for potentially measurable distortions on CGWBs due to gravitational scattering by primordial black holes.

Submitted 22 December, 2023; v1 submitted 31 July, 2023; originally announced August 2023.

Comments: 29 pages, 3 figures, Submitted to JCAP; Updated draft commensurate with published version: 30 pages, 3 figures

Journal ref: JCAP12(2023)041

arXiv:2306.05957 [pdf, other]

DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles

Authors: Tal Daniel, Aviv Tamar

Abstract: We propose a new object-centric video prediction algorithm based on the deep latent particle (DLP) representation. In comparison to existing slot- or patch-based representations, DLPs model the scene using a set of keypoints with learned parameters for properties such as position and size, and are both efficient and interpretable. Our method, deep dynamic latent particles (DDLP), yields state-of-the-art object-centric video prediction results on several challenging datasets. The interpretable nature of DDLP allows us to perform ``what-if'' generation -- predict the consequence of changing properties of objects in the initial frames, and DLP's compact structure enables efficient diffusion-based unconditional video generation. Videos, code and pre-trained models are available: https://taldatech.github.io/ddlp-web

Submitted 8 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

Comments: TMLR 2024. Project site: https://taldatech.github.io/ddlp-web

arXiv:2305.11448 [pdf, ps, other]

Spacetime geometry of acoustics and electromagnetism

Authors: Lucas Burns, Tatsuya Daniel, Stephon Alexander, Justin Dressel

Abstract: Both acoustics and electromagnetism represent measurable fields in terms of dynamical potential fields. Electromagnetic force-fields form a spacetime bivector that is represented by a dynamical energy-momentum 4-vector potential field. Acoustic pressure and velocity fields form an energy-momentum density 4-vector field that is represented by a dynamical action scalar potential field. Surprisingly, standard field theory analyses of spin angular momentum based on these traditional potential representations contradict recent experiments, which motivates a careful reassessment of both theories. We analyze extensions of both theories that use the full geometric structure of spacetime to respect essential symmetries enforced by vacuum wave propagation. The resulting extensions are geometrically complete and phase-invariant (i.e., dual-symmetric) formulations that span all five grades of spacetime, with dynamical potentials and measurable fields spanning complementary grades that are related by a spacetime vector derivative (i.e., the quantum Dirac operator). These complete representations correct the equations of motion, energy-momentum tensors, forces experienced by probes, Lagrangian densities, and allowed gauge freedoms, while making manifest the deep structural connections to relativistic quantum field theories. Finally, we discuss the implications of these corrections to experimental tests.

Submitted 19 May, 2023; originally announced May 2023.

Comments: 26 pages, 4 tables, for the Advances in Operator Theory with Applications to Mathematical Physics Conference, November 2022

arXiv:2207.11856 [pdf, other]

doi 10.1103/PhysRevD.106.106012

An Exact Fermionic Chern-Simons-Kodama State in Quantum Gravity

Authors: Stephon Alexander, Tatsuya Daniel, Marcell Howard, Morgane Konig

Abstract: The Chern-Simons-Kodama (CSK) state is an exact, non-perturbative wave function in the Ashtekar formulation of classical General Relativity. In this work, we find a generalized fermionic CSK state by solving the extended gravitational and fermionic Hamiltonian constraints of the Wheeler-DeWitt equation exactly. We show that this new state reduces to the original Kodama state upon symmetry reduction to FRW coordinates with perturbative fermionic corrections, making contact with the Hartle-Hawking and Vilenkin wave functions of the universe in cosmology. We also find that when both torsion and fermions are non-vanishing, the wave function possesses a finite amplitude to evade the Big Bang curvature singularity.

Submitted 11 August, 2022; v1 submitted 24 July, 2022; originally announced July 2022.

Journal ref: Phys. Rev. D 106, 106012 (2022)

arXiv:2207.08885 [pdf, other]

The Ashtekar Variables and a Varying Cosmological Constant from Dynamical Chern-Simons Gravity

Authors: Stephon Alexander, Tatsuya Daniel, Joao Magueijo

Abstract: We revisit the Kodama state by quantizing the theory of General Relativity (GR) with dynamical Chern-Simons (dCS) gravity. We find a new exact solution to the Wheeler-DeWitt equation where the Pontryagin term induces a modification in the Kodama state from quantizing GR alone. The dCS modification directly encodes the variation of the cosmological constant.

Submitted 31 March, 2024; v1 submitted 18 July, 2022; originally announced July 2022.

Comments: 8 pages, 1 figure

arXiv:2205.15821 [pdf, other]

Unsupervised Image Representation Learning with Deep Latent Particles

Authors: Tal Daniel, Aviv Tamar

Abstract: We propose a new representation of visual data that disentangles object position from appearance. Our method, termed Deep Latent Particles (DLP), decomposes the visual input into low-dimensional latent ``particles'', where each particle is described by its spatial location and features of its surrounding region. To drive learning of such representations, we follow a VAE-based approach and introduce a prior for particle positions based on a spatial-softmax architecture, and a modification of the evidence lower bound loss inspired by the Chamfer distance between particles. We demonstrate that our DLP representations are useful for downstream tasks such as unsupervised keypoint (KP) detection, image manipulation, and video prediction for scenes composed of multiple dynamic objects. In addition, we show that our probabilistic interpretation of the problem naturally provides uncertainty estimates for particle locations, which can be used for model selection, among other tasks. Videos and code are available: https://taldatech.github.io/deep-latent-particles-web/

Submitted 26 July, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

Comments: ICML 2022. Project webpage and code: https://taldatech.github.io/deep-latent-particles-web/

Journal ref: Proceedings of the 39th International Conference on Machine Learning, in Proceedings of Machine Learning Research 162:4644-4665 (2022)

arXiv:2201.01852 [pdf, other]

doi 10.1371/journal.pcbi.1010512

Pruning deep neural networks generates a sparse, bio-inspired nonlinear controller for insect flight

Authors: Olivia Zahn, Jorge Bustamante Jr., Callin Switzer, Thomas Daniel, J. Nathan Kutz

Abstract: Insect flight is a strongly nonlinear and actuated dynamical system. As such, strategies for understanding its control have typically relied on either model-based methods or linearizations thereof. Here we develop a framework that combines model predictive control on an established flight dynamics model and deep neural networks (DNN) to create an efficient method for solving the inverse problem of flight control. We turn to natural systems for inspiration since they inherently demonstrate network pruning with the consequence of yielding more efficient networks for a specific set of tasks. This bio-inspired approach allows us to leverage network pruning to optimally sparsify a DNN architecture in order to perform flight tasks with as few neural connections as possible; however, there are limits to sparsification. Specifically, as the number of connections falls below a critical threshold, flight performance drops considerably. We develop sparsification paradigms and explore their limits for control tasks. Monte Carlo simulations also quantify the statistical distribution of network weights during pruning given initial random weights of the DNNs. We demonstrate that on average, the network can be pruned to retain approximately 7% of the original network weights, with statistical distributions quantified at each layer of the network. Overall, this work shows that sparsely connected DNNs are capable of predicting the forces required to follow flight trajectories. Additionally, sparsification has sharp performance limits.

Submitted 5 January, 2022; originally announced January 2022.

arXiv:2108.12291 [pdf, ps, other]

Optimal piecewise linear data compression for solutions of parametrized partial differential equations

Authors: Thomas Daniel, Fabien Casenave, Nissrine Akkari, David Ryckelynck

Abstract: Model order reduction has been extensively studied over the last two decades. Projection-based methods such as the Proper Orthogonal Decomposition and the Reduced Basis Method enjoy the important advantages of Galerkin methods in the derivation of the reduced problem, but are limited to linear data compression for which the reduced solution is sought as a linear combination of spatial modes. Nonlinear data compression must be used when the solution manifold is not embedded in a low-dimensional subspace. Early methods involve piecewise linear data compression, by constructing a dictionary of reduced-order models tailored to a partition of the solution manifold. In this work, we introduce the concept of optimal partition of the solution manifold in terms of normalized Kolmogorov widths, and prove that the optimal partitions can be found by means of a representative-based clustering algorithm using the sine dissimilarity measure on the solution manifold.

Submitted 27 August, 2021; originally announced August 2021.

arXiv:2108.04012 [pdf, other]

doi 10.1051/meca/2022001

Uncertainty quantification for industrial design using dictionaries of reduced order models

Authors: Thomas Daniel, Fabien Casenave, Nissrine Akkari, David Ryckelynck, Christian Rey

Abstract: We consider the dictionary-based ROM-net (Reduced Order Model) framework [T. Daniel, F. Casenave, N. Akkari, D. Ryckelynck, Model order reduction assisted by deep neural networks (ROM-net), Advanced modeling and Simulation in Engineering Sciences 7 (16), 2020] and summarize the underlying methodologies and their recent improvements. The main contribution of this work is the application of the complete workflow to a real-life industrial model of an elastoviscoplastic high-pressure turbine blade subjected to thermal, centrifugal and pressure loadings, for the quantification of the uncertainty on dual quantities (such as the accumulated plastic strain and the stress tensor), generated by the uncertainty on the temperature loading field. The dictionary-based ROM-net computes predictions of dual quantities of interest for 1008 Monte Carlo draws of the temperature loading field in 2 hours and 48 minutes, which corresponds to a speedup greater than 600 with respect to a reference parallel solver using domain decomposition, with a relative error in the order of 2%. Another contribution of this work consists in the derivation of a meta-model to reconstruct the dual quantities of interest over the complete mesh from their values on the reduced integration points.

Submitted 9 August, 2021; originally announced August 2021.

Journal ref: Mech. Ind., 23, (2022)

arXiv:2106.08476 [pdf, ps, other]

doi 10.1016/j.abb.2021.108923

Fluid flow in the sarcomere

Authors: Sage A Malingen, Kaitlyn Hood, Eric Lauga, Anette Hosoi, Thomas L Daniel

Abstract: A highly organized and densely packed lattice of molecular machinery within the sarcomeres of muscle cells powers contraction. Although many of the proteins that drive contraction have been studied extensively, the mechanical impact of fluid shearing within the lattice of molecular machinery has received minimal attention. It was recently proposed that fluid flow augments substrate transport in the sarcomere; however, this analysis used analytical models of fluid flow in the molecular machinery that could not capture its full complexity. By building a finite element model of the sarcomere, we estimate the explicit flow field, and contrast it with analytical models. Our results demonstrate that viscous drag forces on sliding filaments are surprisingly small in contrast to the forces generated by single myosin molecular motors. This model also indicates that the energetic cost of fluid flow through viscous shearing with lattice proteins is likely minimal. The model also highlights a steep velocity gradient between sliding filaments and demonstrates that the maximal radial fluid velocity occurs near the tips of the filaments. To our knowledge, this is the first computational analysis of fluid flow within the highly structured sarcomere.

Submitted 15 June, 2021; originally announced June 2021.

Journal ref: Archives of Biochemistry and Biophysics, 108923 (2021)

arXiv:2103.16660 [pdf, other]

doi 10.3847/1538-4365/abf73d

Stellar Metallicities from SkyMapper Photometry II: Precise photometric metallicities of $\sim$280,000 giant stars with [Fe/H] $< -0.75$ in the Milky Way

Authors: Anirudh Chiti, Anna Frebel, Mohammad K. Mardini, Tatsuya W. Daniel, Xiaowei Ou, Anastasiia V. Uvarova

Abstract: The Milky Way's metal-poor stars are nearby ancient objects that are used to study early chemical evolution and the assembly and structure of the Milky Way. Here we present reliable metallicities of $\sim280,000$ stars with $-3.75 \lesssim$ [Fe/H] $\lesssim -0.75$ down to $g=17$ derived using metallicity-sensitive photometry from the second data release (DR2) of the SkyMapper Southern Survey. We use the dependency of the flux through the SkyMapper $v$ filter on the strength of the Ca II K absorption features, in tandem with SkyMapper $u,g,i$ photometry, to derive photometric metallicities for these stars. We find that metallicities derived in this way compare well to metallicities derived in large-scale spectroscopic surveys, and use such comparisons to calibrate and quantify systematics as a function of location, reddening, and color. We find good agreement with metallicities from the APOGEE, LAMOST, and GALAH surveys, based on a standard deviation of $σ\sim0.25$dex of the residuals of our photometric metallicities with respect to metallicities from those surveys. We also compare our derived photometric metallicities to metallicities presented in a number of high-resolution spectroscopic studies to validate the low metallicity end ([Fe/H] $< -2.5$) of our photometric metallicity determinations.
In such comparisons, we find the metallicities of stars with photometric [Fe/H] $< -2.5$ in our catalog show no significant offset and a scatter of $σ\sim$0.31dex level relative to those in high-resolution work when considering the cooler stars ($g-i > 0.65$) in our sample. We also present an expanded catalog containing photometric metallicities of $\sim720,000$ stars as a data table for further exploration of the metal-poor Milky Way.

Submitted 30 March, 2021; originally announced March 2021.

Comments: 15 pages, 9 figures, 2 tables; submitted to ApJS and revised after one round of referee feedback. Full version of Table 2 in source

arXiv:2103.16642 [pdf, other]

doi 10.3847/2041-8213/abd629

The Metal-Poor Metallicity Distribution of the Ancient Milky Way

Authors: Anirudh Chiti, Mohammad K. Mardini, Anna Frebel, Tatsuya Daniel

Abstract: We present a low metallicity map of the Milky Way consisting of $\sim$111,000 giants with $-3.5 \lesssim$ [Fe/H] $\lesssim -$0.75, based on public photometry from the second data release of the SkyMapper survey. These stars extend out to $\sim$7kpc from the solar neighborhood and cover the main Galactic stellar populations, including the thick disk and the inner halo. Notably, this map can reliably differentiate metallicities down to [Fe/H] $\sim -3.0$, and thus provides an unprecedented view into the ancient, metal-poor Milky Way. Among the more metal-rich stars in our sample ([Fe/H] $> -2.0$), we recover a clear spatial dependence of decreasing mean metallicity as a function of scale height that maps onto the thick disk component of the Milky Way. When only considering the very metal-poor stars in our sample ([Fe/H] $< -$2), we recover no such spatial dependence in their mean metallicity out to a scale height of $|Z|\sim7$ kpc. We find that the metallicity distribution function (MDF) of the most metal-poor stars in our sample ($-3.0 <$ [Fe/H] $< -2.3$) is well fit with an exponential profile with a slope of $Δ\log(N)/Δ$[Fe/H] = 1.52$\pm$0.05, and shifts to $Δ\log(N)/Δ$[Fe/H] = 1.53$\pm$0.10 after accounting for target selection effects. For [Fe/H] $< -2.3$, the MDF is largely insensitive to scale height $|Z|$ out to $\sim5$kpc, showing that very and extremely metal-poor stars are in every galactic component.

Submitted 30 March, 2021; originally announced March 2021.

Comments: 9 pages, 5 figures; accepted for publication in ApJL. Minor corrections after acceptance addressing referee report for Chiti et al. ApJS submitted

arXiv:2103.13683 [pdf, other]

doi 10.1016/j.jcp.2022.111120

Physics-informed cluster analysis and a priori efficiency criterion for the construction of local reduced-order bases

Authors: Thomas Daniel, Fabien Casenave, Nissrine Akkari, Ali Ketata, David Ryckelynck

Abstract: Nonlinear model order reduction has opened the door to parameter optimization and uncertainty quantification in complex physics problems governed by nonlinear equations. In particular, the computational cost of solving these equations can be reduced by means of local reduced-order bases. This article examines the benefits of a physics-informed cluster analysis for the construction of cluster-specific reduced-order bases. We illustrate that the choice of the dissimilarity measure for clustering is fundamental and highly affects the performances of the local reduced-order bases. It is shown that clustering with an angle-based dissimilarity on simulation data efficiently decreases the intra-cluster Kolmogorov $N$-width. Additionally, an a priori efficiency criterion is introduced to assess the relevance of a ROM-net, a methodology for the reduction of nonlinear physics problems introduced in our previous work in [T. Daniel, F. Casenave, N. Akkari, D. Ryckelynck, Model order reduction assisted by deep neural networks (ROM-net), Advanced Modeling and Simulation in Engineering Sciences 7 (16), 2020]. This criterion also provides engineers with a very practical method for ROM-nets' hyperparameters calibration under constrained computational costs for the training phase. On five different physics problems, our physics-informed clustering strategy significantly outperforms classic strategies for the construction of local reduced-order bases in terms of projection errors.

Submitted 3 December, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

Journal ref: J. Comput. Phys., 458, 111120 (2022)

arXiv:2101.04530 [pdf, other]

doi 10.3390/mca26010017

Data augmentation and feature selection for automatic model recommendation in computational physics

Authors: Thomas Daniel, Fabien Casenave, Nissrine Akkari, David Ryckelynck

Abstract: Classification algorithms have recently found applications in computational physics for the selection of numerical methods or models adapted to the environment and the state of the physical system. For such classification tasks, labeled training data come from numerical simulations and generally correspond to physical fields discretized on a mesh. Three challenging difficulties arise: the lack of training data, their high dimensionality, and the non-applicability of common data augmentation techniques to physics data. This article introduces two algorithms to address these issues, one for dimensionality reduction via feature selection, and one for data augmentation. These algorithms are combined with a wide variety of classifiers for their evaluation. When combined with a stacking ensemble made of six multilayer perceptrons and a ridge logistic regression, they enable reaching an accuracy of 90% on our classification problem for nonlinear structural mechanics.

Submitted 12 January, 2021; originally announced January 2021.

Journal ref: Math. Comput. Appl. 26(1), 17, (2021)

arXiv:2012.13253 [pdf, other]

Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder

Authors: Tal Daniel, Aviv Tamar

Abstract: The recently introduced introspective variational autoencoder (IntroVAE) exhibits outstanding image generations, and allows for amortized inference using an image encoder. The main idea in IntroVAE is to train a VAE adversarially, using the VAE encoder to discriminate between generated and real data samples. However, the original IntroVAE loss function relied on a particular hinge-loss formulation that is very hard to stabilize in practice, and its theoretical convergence analysis ignored important terms in the loss. In this work, we take a step towards better understanding of the IntroVAE model, its practical implementation, and its applications. We propose the Soft-IntroVAE, a modified IntroVAE that replaces the hinge-loss terms with a smooth exponential loss on generated samples. This change significantly improves training stability, and also enables theoretical analysis of the complete algorithm. Interestingly, we show that the IntroVAE converges to a distribution that minimizes a sum of KL distance from the data distribution and an entropy term. We discuss the implications of this result, and demonstrate that it induces competitive image generation and reconstruction. Finally, we describe two applications of Soft-IntroVAE to unsupervised image translation and out-of-distribution detection, and demonstrate compelling results. Code and additional information is available on the project website -- https://taldatech.github.io/soft-intro-vae-web

Submitted 25 March, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

Comments: CVPR 2021, Extended version. Code and additional information is available on the project website - https://taldatech.github.io/soft-intro-vae-web

arXiv:2011.00054 [pdf]

Waymo's Safety Methodologies and Safety Readiness Determinations

Authors: Nick Webb, Dan Smith, Christopher Ludwick, Trent Victor, Qi Hommes, Francesca Favaro, George Ivanov, Tom Daniel

Abstract: Waymo's safety methodologies, which draw on well established engineering processes and address new safety challenges specific to Automated Vehicle technology, provide a firm foundation for safe deployment of Waymo's Level 4 ADS, which Waymo also refers to as the Waymo Driver. Waymo's determination of its readiness to deploy its AVs safely in different settings rests on that firm foundation and on a thorough analysis of risks specific to a particular Operational Design Domain. Waymo's process for making these readiness determinations entails an ordered examination of the relevant outputs from all of its safety methodologies combined with careful safety and engineering judgment focused on the specific facts relevant for a particular determination. Waymo will approve when it determines the ADS is ready for the new conditions without creating any unreasonable risks to safety. This paper explains Waymo's methodologies as applied to the three layers of its technology: hardware, ADS behavior, and operations, and also explains Waymo's safety governance. Waymo will continue to apply and adapt those methodologies, and to learn from the important contributions of others in the AV industry, as Waymo continues to build an ever safer and more able ADS.

Submitted 30 October, 2020; originally announced November 2020.

Comments: 28 pages, 1 figure

ACM Class: I.2.9

arXiv:2011.00038 [pdf]

Waymo Public Road Safety Performance Data

Authors: Matthew Schwall, Tom Daniel, Trent Victor, Francesca Favaro, Henning Hohnhold

Abstract: Waymo's mission to reduce traffic injuries and fatalities and improve mobility for all has led us to expand deployment of automated vehicles on public roads without a human driver behind the wheel. As part of this process, Waymo is committed to providing the public with informative and relevant data regarding the demonstrated safety of Waymo's automated driving system, which we call the Waymo Driver. The data presented in this paper represents more than 6.1 million miles of automated driving in the Phoenix, Arizona metropolitan area, including operations with a trained operator behind the steering wheel from calendar year 2019 and 65,000 miles of driverless operation without a human behind the steering wheel from 2019 and the first nine months of 2020. The paper includes every collision and minor contact experienced during these operations as well as every predicted contact identified using Waymo's counterfactual, what if, simulation of events had the vehicle's trained operator not disengaged automated driving. There were 47 contact events that occurred over this time period, consisting of 18 actual and 29 simulated contact events, none of which would be expected to result in severe or life threatening injuries. This paper presents the collision typology and severity for each actual and simulated event, along with diagrams depicting each of the most significant events.
Nearly all the events involved one or more road rule violations or other errors by a human driver or road user, including all eight of the most severe events, which we define as involving actual or expected airbag deployment in any involved vehicle. When compared to national collision statistics, the Waymo Driver completely avoided certain collision modes that human driven vehicles are frequently involved in, including road departure and collisions with fixed objects.

Submitted 30 October, 2020; originally announced November 2020.

Comments: 15 pages, 9 figures, 1 table

ACM Class: I.2.9

arXiv:1912.04922 [pdf, other]

Learning Precisely Timed Feedforward Control of the Sensor-Denied Inverted Pendulum

Authors: Thomas L. Mohren, Thomas L. Daniel, Steven L. Brunton

Abstract: Time delays due to signal latency, computational complexity, and sensor-denied environments, pose a critical challenge in both engineered and biological control systems. In this work, we investigate biologically inspired strategies to develop precisely timed feedforward control laws for engineered systems with large time delays. We demonstrate this approach on the nonlinear pendulum with partially denied observations, so that it is only possible to measure the state of the system near the upright position. Given a large disturbance that overwhelms the local feedback controller, it is necessary to add or remove energy from the pendulum so that it returns to the upright position after one full revolution. The partial observation near the upright position introduces a significant delay between observations and the region where actuation is most effective. Thus, we develop a learning algorithm that integrates sensor information into a precisely timed feedforward control signal to overcome this delay with minimal computation, training data, and set of control decisions. This simple controller can serve as a model for many biological systems, and can be implemented in engineered systems with time delays.

Submitted 10 December, 2019; originally announced December 2019.

Comments: 6 pages, 5 figures

arXiv:1911.04971 [pdf, other]

Deep Variational Semi-Supervised Novelty Detection

Authors: Tal Daniel, Thanard Kurutach, Aviv Tamar

Abstract: In anomaly detection (AD), one seeks to identify whether a test sample is abnormal, given a data set of normal samples. A recent and promising approach to AD relies on deep generative models, such as variational autoencoders (VAEs), for unsupervised learning of the normal data distribution. In semi-supervised AD (SSAD), the data also includes a small sample of labeled anomalies. In this work, we propose two variational methods for training VAEs for SSAD. The intuitive idea in both methods is to train the encoder to `separate' between latent vectors for normal and outlier data. We show that this idea can be derived from principled probabilistic formulations of the problem, and propose simple and effective algorithms. Our methods can be applied to various data types, as we demonstrate on SSAD datasets ranging from natural images to astronomy and medicine, can be combined with any VAE model architecture, and are naturally compatible with ensembling. When comparing to state-of-the-art SSAD methods that are not specific to particular data types, we obtain marked improvement in outlier detection.

Submitted 4 November, 2021; v1 submitted 12 November, 2019; originally announced November 2019.

Comments: NeurIPS 2021 Workshop on DGMs and Downstream Applications

arXiv:1904.12420 [pdf, other]

A mechanism for sarcomere breathing: volume change and advective flow within the myofilament lattice

Authors: Julie A Cass, C. Dave Williams, Tom C Irving, Eric Lauga, Sage Malingen, Tom L. Daniel, Simon N. Sponberg

Abstract: During muscle contraction, myosin motors anchored to thick filaments bind to and slide actin thin filaments. These motors rely on energy derived from ATP, supplied, in part, by diffusion from the sarcoplasm to the interior of the lattice of actin and myosin filaments. The radial spacing of filaments in this lattice may change or remain constant during contraction. If the lattice is isovolumetric, it must expand when the muscle shortens. If, however, the spacing is constant or has a different pattern of axial and radial motion, then the lattice changes volume during contraction, driving fluid motion and assisting in the transport of molecules between the contractile lattice and the surrounding intracellular space. We first create an advective-diffusive-reaction flow model and show that the flow into and out of the sarcomere lattice would be significant in the absence of lattice expansion. Advective transport coupled to diffusion has the potential to substantially enhance metabolite exchange within the crowded sarcomere. Using time-resolved x-ray diffraction of contracting muscle, we next show that the contractile lattice is neither isovolumetric nor constant in spacing. Instead, lattice spacing is time-varying, depends on activation, and can manifest as an effective time-varying Poisson ratio. The resulting fluid flow in the sarcomere lattice of synchronous insect flight muscles is greater than expected for constant lattice spacing conditions.
Lattice spacing depends on a variety of factors that produce radial force, including crossbridges, titin-like molecules, and other structural proteins. Volume change and advective transport varies with the phase of muscle stimulation but remains significant at all conditions. Akin to "breathing," advective-diffusive transport in sarcomeres is sufficient to promote metabolite exchange and may play a role in the regulation of contraction itself.

Submitted 24 June, 2021; v1 submitted 28 April, 2019; originally announced April 2019.

Comments: 4 figs, 3 supplemental movies, 1 supplemental figure

arXiv:1804.07884 [pdf, other]

doi 10.1073/pnas.1808909115

Neural-inspired sensors enable sparse, efficient classification of spatiotemporal data

Authors: Thomas L. Mohren, Thomas L. Daniel, Steven L. Brunton, Bingni W. Brunton

Abstract: Sparse sensor placement is a central challenge in the efficient characterization of complex systems when the cost of acquiring and processing data is high. Leading sparse sensing methods typically exploit either spatial or temporal correlations, but rarely both. This work introduces a new sparse sensor optimization that is designed to leverage the rich spatiotemporal coherence exhibited by many systems. Our approach is inspired by the remarkable performance of flying insects, which use a few embedded strain-sensitive neurons to achieve rapid and robust flight control despite large gust disturbances. Specifically, we draw on nature to identify targeted neural-inspired sensors on a flapping wing to detect body rotation. This task is particularly challenging as the rotational twisting mode is three orders-of-magnitude smaller than the flapping modes. We show that nonlinear filtering in time, built to mimic strain-sensitive neurons, is essential to detect rotation, whereas instantaneous measurements fail. Optimized sparse sensor placement results in efficient classification with approximately ten sensors, achieving the same accuracy and noise robustness as full measurements consisting of hundreds of sensors. Sparse sensing with neural inspired encoding establishes a new paradigm in hyper-efficient, embodied sensing of spatiotemporal data and sheds light on principles of biological sensing for agile flight control.

Submitted 20 April, 2018; originally announced April 2018.

Comments: 21 pages, 19 figures

arXiv:1503.00330 [pdf, other]

GPU Based Path Integral Control with Learned Dynamics

Authors: Grady Williams, Eric Rombokas, Tom Daniel

Abstract: We present an algorithm which combines recent advances in model based path integral control with machine learning approaches to learning forward dynamics models. We take advantage of the parallel computing power of a GPU to quickly take a massive number of samples from a learned probabilistic dynamics model, which we use to approximate the path integral form of the optimal control. The resulting algorithm runs in a receding-horizon fashion in realtime, and is subject to no restrictive assumptions about costs, constraints, or dynamics. A simple change to the path integral control formulation allows the algorithm to take model uncertainty into account during planning, and we demonstrate its performance on a quadrotor navigation task. In addition to this novel adaptation of path integral control, this is the first time that a receding-horizon implementation of iterative path integral control has been run on a real system.

Submitted 1 March, 2015; originally announced March 2015.

Comments: 6 pages, NIPS 2014 - Autonomously Learning Robots Workshop

arXiv:1402.5702 [pdf, other]

Feedback Control as a Framework for Understanding Tradeoffs in Biology

Authors: Noah J. Cowan, Mustafa Mert Ankarali, Jonathan P. Dyhr, Manu S. Madhav, Eatai Roth, Shahin Sefati, Simon Sponberg, Sarah A. Stamper, Eric S. Fortune, Thomas L. Daniel

Abstract: Control theory arose from a need to control synthetic systems. From regulating steam engines to tuning radios to devices capable of autonomous movement, it provided a formal mathematical basis for understanding the role of feedback in the stability (or change) of dynamical systems. It provides a framework for understanding any system with feedback regulation, including biological ones such as regulatory gene networks, cellular metabolic systems, sensorimotor dynamics of moving animals, and even ecological or evolutionary dynamics of organisms and populations. Here we focus on four case studies of the sensorimotor dynamics of animals, each of which involves the application of principles from control theory to probe stability and feedback in an organism's response to perturbations. We use examples from aquatic (electric fish station keeping and jamming avoidance), terrestrial (cockroach wall following) and aerial environments (flight control in moths) to highlight how one can use control theory to understand how feedback mechanisms interact with the physical dynamics of animals to determine their stability and response to sensory inputs and perturbations. Each case study is cast as a control problem with sensory input, neural processing, and motor dynamics, the output of which feeds back to the sensory inputs. Collectively, the interaction of these systems in a closed loop determines the behavior of the entire system.

Submitted 23 February, 2014; originally announced February 2014.

Comments: Submitted to Integr Comp Biol

Showing 1–28 of 28 results for author: Daniel, T