Search | arXiv e-print repository

doi 10.1051/0004-6361/202347732

Runaway BN supergiant star HD 93840: Progenitor of an imminent core-collapse supernova above the Galactic plane

Authors: D. Weßmayer, M. A. Urbaneja, K. Butler, N. Przybilla

Abstract: We present a quantitative spectral analysis of the extreme nitrogen-enhanced supergiant HD 93840 (BN1 Ib) at an intermediate galactic latitude. Based on an optical high-resolution spectrum and complementary ultraviolet and infrared (spectro-)photometry, in addition to Gaia data, we carried out a full characterisation of the star's properties. We used both hydrostatic and unified (photosphere+wind)… ▽ More We present a quantitative spectral analysis of the extreme nitrogen-enhanced supergiant HD 93840 (BN1 Ib) at an intermediate galactic latitude. Based on an optical high-resolution spectrum and complementary ultraviolet and infrared (spectro-)photometry, in addition to Gaia data, we carried out a full characterisation of the star's properties. We used both hydrostatic and unified (photosphere+wind) model atmospheres that account for deviations from local thermodynamic equilibrium. A highly unusual surface CNO-mixing signature and a marked stellar overluminosity compared to the mass imply a binary channel for the star's past evolution. The kinematics shows that it has reached its current position above the Galactic plane as a runaway star, likely ejected by the supernova explosion of its former companion star. Its current bulk composition, with a notably increased mean molecular weight due to core He- and progressed shell H-burning, suggests an advanced evolutionary stage. It is poised to yield a rare core-collapse supernova of a blue supergiant about ten OB star population scale heights above the Galactic disk relatively soon, contributing to the metal enrichment of the circumgalactic medium. △ Less

Submitted 1 July, 2024; originally announced July 2024.

Comments: 19 pages, 14 figures, published in Astronomy & Astrophysics

Journal ref: A&A, 687, L7 (2024)

arXiv:2406.19573 [pdf, other]

On Counterfactual Interventions in Vector Autoregressive Models

Authors: Kurt Butler, Marija Iloska, Petar M. Djuric

Abstract: Counterfactual reasoning allows us to explore hypothetical scenarios in order to explain the impacts of our decisions. However, addressing such inquires is impossible without establishing the appropriate mathematical framework. In this work, we introduce the problem of counterfactual reasoning in the context of vector autoregressive (VAR) processes. We also formulate the inference of a causal mode… ▽ More Counterfactual reasoning allows us to explore hypothetical scenarios in order to explain the impacts of our decisions. However, addressing such inquires is impossible without establishing the appropriate mathematical framework. In this work, we introduce the problem of counterfactual reasoning in the context of vector autoregressive (VAR) processes. We also formulate the inference of a causal model as a joint regression task where for inference we use both data with and without interventions. After learning the model, we exploit linearity of the VAR model to make exact predictions about the effects of counterfactual interventions. Furthermore, we quantify the total causal effects of past counterfactual interventions. The source code for this project is freely available at https://github.com/KurtButler/counterfactual_interventions. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.13142 [pdf, other]

Optimal pre-train/fine-tune strategies for accurate material property predictions

Authors: Reshma Devi, Keith T. Butler, Gopalakrishnan Sai Gautam

Abstract: Overcoming the challenge of limited data availability within materials science is crucial for the broad-based applicability of machine learning within materials science. One pathway to overcome this limited data availability is to use the framework of transfer learning (TL), where a pre-trained (PT) machine learning model (on a larger dataset) can be fine-tuned (FT) on a target (typically smaller)… ▽ More Overcoming the challenge of limited data availability within materials science is crucial for the broad-based applicability of machine learning within materials science. One pathway to overcome this limited data availability is to use the framework of transfer learning (TL), where a pre-trained (PT) machine learning model (on a larger dataset) can be fine-tuned (FT) on a target (typically smaller) dataset. Our study systematically explores the effectiveness of various PT/FT strategies to learn and predict material properties with limited data. Specifically, we leverage graph neural networks (GNNs) to PT/FT on seven diverse curated materials datasets, encompassing sizes ranging from 941 to 132,752 datapoints. We consider datasets that cover a spectrum of material properties, ranging from band gaps (electronic) to formation energies (thermodynamic) and shear moduli (mechanical). We study the influence of PT and FT dataset sizes, strategies that can be employed for FT, and other hyperparameters on pair-wise TL among the datasets considered. We find our pair-wise PT-FT models to consistently outperform models trained from scratch on the target datasets. Importantly, we develop a GNN framework that is simultaneously PT on multiple properties (MPT), enabling the construction of generalized GNN models. Our MPT models outperform pair-wise PT-FT models on several datasets considered, and more significantly, on a 2D material band gap dataset that is completely out-of-distribution from the PT datasets. Finally, we expect our PT/FT and MPT frameworks to be generalizable to other GNNs and materials properties, which can accelerate materials design and discovery for various applications. △ Less

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.10318 [pdf, other]

Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding

Authors: Tuo Zhang, Tiantian Feng, Yibin Ni, Mengqin Cao, Ruying Liu, Katharine Butler, Yanjun Weng, Mi Zhang, Shrikanth S. Narayanan, Salman Avestimehr

Abstract: Large vision-language models (VLMs) have demonstrated remarkable abilities in understanding everyday content. However, their performance in the domain of art, particularly culturally rich art forms, remains less explored. As a pearl of human wisdom and creativity, art encapsulates complex cultural narratives and symbolism. In this paper, we offer the Pun Rebus Art Dataset, a multimodal dataset for… ▽ More Large vision-language models (VLMs) have demonstrated remarkable abilities in understanding everyday content. However, their performance in the domain of art, particularly culturally rich art forms, remains less explored. As a pearl of human wisdom and creativity, art encapsulates complex cultural narratives and symbolism. In this paper, we offer the Pun Rebus Art Dataset, a multimodal dataset for art understanding deeply rooted in traditional Chinese culture. We focus on three primary tasks: identifying salient visual elements, matching elements with their symbolic meanings, and explanations for the conveyed messages. Our evaluation reveals that state-of-the-art VLMs struggle with these tasks, often providing biased and hallucinated explanations and showing limited improvement through in-context learning. By releasing the Pun Rebus Art Dataset, we aim to facilitate the development of VLMs that can better understand and interpret culturally specific content, promoting greater inclusiveness beyond English-based corpora. △ Less

Submitted 14 June, 2024; originally announced June 2024.

arXiv:2404.18991 [pdf, other]

A dusty proto-cluster surrounding the binary galaxy HerBS-70 at $z = 2.3$

Authors: Tom J. L. C. Bakx, S. Berta, H. Dannerbauer, P. Cox, K. M. Butler, M. Hagimoto, D. H. Hughes, D. A. Riechers, P. P. van der Werf, C. Yang, A. J. Baker, A. Beelen, G. J. Bendo, E. Borsato, V. Buat, A. R. Cooray, L. Dunne, S. Dye, S. Eales, R. Gavazzi, A. I. Harris, D. Ismail, R. J. Ivison, B. Jones, M. Krips , et al. (16 additional authors not shown)

Abstract: We report on deep SCUBA-2 observations at 850$μ$m and NOEMA spectroscopic measurements at 2 mm of the environment surrounding the luminous, massive ($M_{*} \approx 2 \times 10^{11}$ M$_{\odot}$) Herschel-selected source HerBS-70. This source was revealed by previous NOEMA observations to be a binary system of dusty star-forming galaxies at $z= 2.3$, with the East component (HerBS-70E) hosting an A… ▽ More We report on deep SCUBA-2 observations at 850$μ$m and NOEMA spectroscopic measurements at 2 mm of the environment surrounding the luminous, massive ($M_{*} \approx 2 \times 10^{11}$ M$_{\odot}$) Herschel-selected source HerBS-70. This source was revealed by previous NOEMA observations to be a binary system of dusty star-forming galaxies at $z= 2.3$, with the East component (HerBS-70E) hosting an Active Galactic Nucleus (AGN). The SCUBA-2 observations detected, in addition to the binary system, twenty-one sources at $> 3.5 σ$ over an area of $\sim 25$ square comoving Mpc with a sensitivity of $σ_{850} = 0.75$ mJy. The surface density of continuum sources around HerBS-70 is three times higher than for field galaxies. The NOEMA spectroscopic measurements confirm the protocluster membership of three of the nine brightest sources through their CO(4 - 3) line emission, yielding a volume density 36 times higher than for field galaxies. All five confirmed sub-mm galaxies in the HerBS-70 system have relatively short gas depletion times ($80 - 500$ Myr), indicating the onset of quenching for this protocluster core due to the depletion of gas. The dark matter halo mass of the HerBS-70 system is estimated around $5 \times{} 10^{13}$ M$_{\odot}$, with a projected current-day mass of $10^{15}$ M$_{\odot}$, similar to the local Virgo and Coma clusters. These observations support the claim that DSFGs, in particular the ones with observed multiplicity, can trace cosmic overdensities. △ Less

Submitted 29 April, 2024; originally announced April 2024.

Comments: 19 pages, 13 figures, accepted for publication in MNRAS

arXiv:2404.15143 [pdf, other]

Every Breath You Don't Take: Deepfake Speech Detection Using Breath

Authors: Seth Layton, Thiago De Andrade, Daniel Olszewski, Kevin Warren, Kevin Butler, Patrick Traynor

Abstract: Deepfake speech represents a real and growing threat to systems and society. Many detectors have been created to aid in defense against speech deepfakes. While these detectors implement myriad methodologies, many rely on low-level fragments of the speech generation process. We hypothesize that breath, a higher-level part of speech, is a key component of natural speech and thus improper generation… ▽ More Deepfake speech represents a real and growing threat to systems and society. Many detectors have been created to aid in defense against speech deepfakes. While these detectors implement myriad methodologies, many rely on low-level fragments of the speech generation process. We hypothesize that breath, a higher-level part of speech, is a key component of natural speech and thus improper generation in deepfake speech is a performant discriminator. To evaluate this, we create a breath detector and leverage this against a custom dataset of online news article audio to discriminate between real/deepfake speech. Additionally, we make this custom dataset publicly available to facilitate comparison for future work. Applying our simple breath detector as a deepfake speech discriminator on in-the-wild samples allows for accurate classification (perfect 1.0 AUPRC and 0.0 EER on test data) across 33.6 hours of audio. We compare our model with the state-of-the-art SSL-wav2vec model and show that this complex deep learning model completely fails to classify the same in-the-wild samples (0.72 AUPRC and 0.99 EER). △ Less

Submitted 26 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

Comments: Submitted to ACM journal -- Digital Threats: Research and Practice

arXiv:2404.11815 [pdf, other]

AquaSonic: Acoustic Manipulation of Underwater Data Center Operations and Resource Management

Authors: Jennifer Sheldon, Weidong Zhu, Adnan Abdullah, Sri Hrushikesh Varma Bhupathiraju, Takeshi Sugawara, Kevin R. B. Butler, Md Jahidul Islam, Sara Rampazzi

Abstract: Underwater datacenters (UDCs) hold promise as next-generation data storage due to their energy efficiency and environmental sustainability benefits. While the natural cooling properties of water save power, the isolated aquatic environment and long-range sound propagation in water create unique vulnerabilities which differ from those of on-land data centers. Our research discovers the unique vulne… ▽ More Underwater datacenters (UDCs) hold promise as next-generation data storage due to their energy efficiency and environmental sustainability benefits. While the natural cooling properties of water save power, the isolated aquatic environment and long-range sound propagation in water create unique vulnerabilities which differ from those of on-land data centers. Our research discovers the unique vulnerabilities of fault-tolerant storage devices, resource allocation software, and distributed file systems to acoustic injection attacks in UDCs. With a realistic testbed approximating UDC server operations, we empirically characterize the capabilities of acoustic injection underwater and find that an attacker can reduce fault-tolerant RAID 5 storage system throughput by 17% up to 100%. Our closed-water analyses reveal that attackers can (i) cause unresponsiveness and automatic node removal in a distributed filesystem with only 2.4 minutes of sustained acoustic injection, (ii) induce a distributed database's latency to increase by up to 92.7% to reduce system reliability, and (iii) induce load-balance managers to redirect up to 74% of resources to a target server to cause overload or force resource colocation. Furthermore, we perform open-water experiments in a lake and find that an attacker can cause controlled throughput degradation at a maximum allowable distance of 6.35 m using a commercial speaker. We also investigate and discuss the effectiveness of standard defenses against acoustic injection attacks. Finally, we formulate a novel machine learning-based detection system that reaches 0% False Positive Rate and 98.2% True Positive Rate trained on our dataset of profiled hard disk drives under 30-second FIO benchmark execution. With this work, we aim to help manufacturers proactively protect UDCs against acoustic injection attacks and ensure the security of subsea computing infrastructures. △ Less

Submitted 7 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

Comments: Accepted to IEEE S&P 2024

arXiv:2403.07072 [pdf, other]

Explainable Learning with Gaussian Processes

Authors: Kurt Butler, Guanchao Feng, Petar M. Djuric

Abstract: The field of explainable artificial intelligence (XAI) attempts to develop methods that provide insight into how complicated machine learning methods make predictions. Many methods of explanation have focused on the concept of feature attribution, a decomposition of the model's prediction into individual contributions corresponding to each input feature. In this work, we explore the problem of fea… ▽ More The field of explainable artificial intelligence (XAI) attempts to develop methods that provide insight into how complicated machine learning methods make predictions. Many methods of explanation have focused on the concept of feature attribution, a decomposition of the model's prediction into individual contributions corresponding to each input feature. In this work, we explore the problem of feature attribution in the context of Gaussian process regression (GPR). We take a principled approach to defining attributions under model uncertainty, extending the existing literature. We show that although GPR is a highly flexible and non-parametric approach, we can derive interpretable, closed-form expressions for the feature attributions. When using integrated gradients as an attribution method, we show that the attributions of a GPR model also follow a Gaussian process distribution, which quantifies the uncertainty in attribution arising from uncertainty in the model. We demonstrate, both through theory and experimentation, the versatility and robustness of this approach. We also show that, when applicable, the exact expressions for GPR attributions are both more accurate and less computationally expensive than the approximations currently used in practice. The source code for this project is freely available under MIT license at https://github.com/KurtButler/2024_attributions_paper. △ Less

Submitted 11 March, 2024; originally announced March 2024.

Comments: 38 pages, 7 figures

MSC Class: 60G15

arXiv:2402.07687 [pdf, other]

Privacy-Preserving Gaze Data Streaming in Immersive Interactive Virtual Reality: Robustness and User Experience

Authors: Ethan Wilson, Azim Ibragimov, Michael J. Proulx, Sai Deep Tetali, Kevin Butler, Eakta Jain

Abstract: Eye tracking is routinely being incorporated into virtual reality (VR) systems. Prior research has shown that eye tracking data, if exposed, can be used for re-identification attacks. The state of our knowledge about currently existing privacy mechanisms is limited to privacy-utility trade-off curves based on data-centric metrics of utility, such as prediction error, and black-box threat models. W… ▽ More Eye tracking is routinely being incorporated into virtual reality (VR) systems. Prior research has shown that eye tracking data, if exposed, can be used for re-identification attacks. The state of our knowledge about currently existing privacy mechanisms is limited to privacy-utility trade-off curves based on data-centric metrics of utility, such as prediction error, and black-box threat models. We propose that for interactive VR applications, it is essential to consider user-centric notions of utility and a variety of threat models. We develop a methodology to evaluate real-time privacy mechanisms for interactive VR applications that incorporate subjective user experience and task performance metrics. We evaluate selected privacy mechanisms using this methodology and find that re-identification accuracy can be decreased to as low as 14% while maintaining a high usability score and reasonable task performance. Finally, we elucidate three threat scenarios (black-box, black-box with exemplars, and white-box) and assess how well the different privacy mechanisms hold up to these adversarial scenarios. This work advances the state of the art in VR privacy by providing a methodology for end-to-end assessment of the risk of re-identification attacks and potential mitigating solutions. △ Less

Submitted 21 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

Comments: To appear in IEEE Transactions on Visualization and Computer Graphics

arXiv:2401.02930 [pdf, other]

doi 10.1109/OJSP.2024.3351593

Dagma-DCE: Interpretable, Non-Parametric Differentiable Causal Discovery

Authors: Daniel Waxman, Kurt Butler, Petar M. Djuric

Abstract: We introduce Dagma-DCE, an interpretable and model-agnostic scheme for differentiable causal discovery. Current non- or over-parametric methods in differentiable causal discovery use opaque proxies of ``independence'' to justify the inclusion or exclusion of a causal relationship. We show theoretically and empirically that these proxies may be arbitrarily different than the actual causal strength.… ▽ More We introduce Dagma-DCE, an interpretable and model-agnostic scheme for differentiable causal discovery. Current non- or over-parametric methods in differentiable causal discovery use opaque proxies of ``independence'' to justify the inclusion or exclusion of a causal relationship. We show theoretically and empirically that these proxies may be arbitrarily different than the actual causal strength. Juxtaposed to existing differentiable causal discovery algorithms, \textsc{Dagma-DCE} uses an interpretable measure of causal strength to define weighted adjacency matrices. In a number of simulated datasets, we show our method achieves state-of-the-art level performance. We additionally show that \textsc{Dagma-DCE} allows for principled thresholding and sparsity penalties by domain-experts. The code for our method is available open-source at https://github.com/DanWaxman/DAGMA-DCE, and can easily be adapted to arbitrary differentiable models. △ Less

Submitted 5 January, 2024; originally announced January 2024.

Comments: 9 pages, 2 figures. Accepted to the IEEE Open Journal of Signal Processing

Journal ref: IEEE Open Journal of Signal Processing, vol. 5, pp. 393-401, 2024

arXiv:2312.05294 [pdf, other]

doi 10.1002/aenm.202304230

Effects of Grain Boundaries and Surfaces on Electronic and Mechanical Properties of Solid Electrolytes

Authors: Weihang Xie, Zeyu Deng, Zhengyu Liu, Theodosios Famprikis, Keith T. Butler, Pieremanuele Canepa

Abstract: Extended defects, including exposed surfaces and grain boundaries, are critical to the properties of polycrystalline solid electrolytes in all-solid-state batteries (ASSBs). These defects can significantly alter the mechanical and electronic properties of solid electrolytes, with direct manifestations on the performance of ASSBs. Here, by building a library of 590 surfaces and grain boundaries of… ▽ More Extended defects, including exposed surfaces and grain boundaries, are critical to the properties of polycrystalline solid electrolytes in all-solid-state batteries (ASSBs). These defects can significantly alter the mechanical and electronic properties of solid electrolytes, with direct manifestations on the performance of ASSBs. Here, by building a library of 590 surfaces and grain boundaries of 11 relevant solid electrolytes $-$including halides, oxides, and sulfides$-$ their electronic, mechanical, and thermodynamic characteristics are linked to the functional properties of polycrystalline solid electrolytes. It is found that the energy required to mechanically ``separate'' grain boundaries can be significantly lower than in the bulk region of materials, which can trigger preferential cracking of solid electrolyte particles in the grain boundary regions. The brittleness of ceramic solid electrolytes, inferred from the predicted low fracture toughnesses at the grain boundaries, contributes to their cracking under local pressure imparted by Lithium or Sodium penetration in the grain boundaries. Extended defects of solid electrolytes introduce new electronic ``interfacial'' states within bandgaps of solid electrolytes. These interfacial states alter and possibly increase locally the availability of free electrons and holes in solid electrolytes. Factoring effects arising from extended defects appear crucial to explain electrochemical and $-$mechanical observations in ASSBs. △ Less

Submitted 4 January, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

Journal ref: Adv. Energy Mater., 2304230 (2024)

arXiv:2308.06164 [pdf, other]

doi 10.1051/0004-6361/202347253

The blue supergiant Sher 25 revisited in the Gaia era

Authors: D. Weßmayer, N. Przybilla, A. Ebenbichler, P. Aschenbrenner, K. Butler

Abstract: Aims. The evolutionary status of the blue supergiant Sher 25 and its membership to the massive cluster NGC 3603 are investigated. Methods. A hybrid non-LTE (local thermodynamic equilibrium) spectrum synthesis approach is employed to analyse a high-resolution optical spectrum of Sher 25 and five similar early B-type comparison stars in order to derive atmospheric parameters and elemental abundances… ▽ More Aims. The evolutionary status of the blue supergiant Sher 25 and its membership to the massive cluster NGC 3603 are investigated. Methods. A hybrid non-LTE (local thermodynamic equilibrium) spectrum synthesis approach is employed to analyse a high-resolution optical spectrum of Sher 25 and five similar early B-type comparison stars in order to derive atmospheric parameters and elemental abundances. Fundamental stellar parameters are determined by considering stellar evolution tracks, Gaia Data Release 3 (DR3) data and complementary distance information. Interstellar reddening and the reddening law along the sight line towards Sher 25 are constrained employing UV photometry for the first time in addition to optical and infrared data. The distance to NGC 3603 is reevaluated based on Gaia DR3 data of the innermost cluster O-stars. Results. The spectroscopic distance derived from the quantitative analysis implies that Sher 25 lies in the foreground of NGC 3603, which is found to have a distance of $d_\mathrm{NGC 3603}$ = 6250$\pm$150 pc. A cluster membership is also excluded as the hourglass nebula is unaffected by the vigorous stellar winds of the cluster stars and from the different excitation signatures of the hourglass nebula and the nebula around NGC 3603. Sher 25 turns out to have a luminosity of log L/L$_\odot$ = 5.48$\pm$0.14, equivalent to that of a $\sim$27 $M_\odot$ supergiant in a single-star scenario, which is about half of the mass assumed so far, bringing it much closer in its characteristics to Sk-69°202, the progenitor of SN 1987A. Sher 25 is significantly older than NGC 3603. Further arguments for a binary (merger) evolutionary scenario of Sher 25 are discussed. △ Less

Submitted 11 August, 2023; originally announced August 2023.

Comments: 27 pages, 22 figures, Accepted for publication in Astronomy & Astrophysics, Data: https://doi.org/10.5281/zenodo.8230158

Journal ref: A&A 677, A175 (2023)

arXiv:2307.15748 [pdf, ps, other]

doi 10.1051/0004-6361/202346803

z-GAL -- A NOEMA spectroscopic redshift survey of bright Herschel galaxies: [III] Physical properties

Authors: S. Berta, F. Stanley, D. Ismail, P. Cox, R. Neri, C. Yang, A. J. Young, S. **, H. Dannerbauer, T. J. Bakx, A. Beelen, A. Weiss, A. Nanni, A. Omont, P. van der Werf, M. Krips, A. J. Baker, G. Bendo, E. Borsato, V. Buat, K. M. Butler, N. Chartab, A. Cooray, S. Dye, S. Eales , et al. (13 additional authors not shown)

Abstract: The z-GAL survey observed 137 bright Herschel-selected targets with the IRAM NOrthern Extended Millimeter Array, with the aim to measure their redshift and study their properties. Several of them have been resolved into multiple sources. Consequently, robust spectroscopic redshifts have been measured for 165 individual galaxies in the range 0.8<z<6.5. In this paper we analyse the millimetre spectr… ▽ More The z-GAL survey observed 137 bright Herschel-selected targets with the IRAM NOrthern Extended Millimeter Array, with the aim to measure their redshift and study their properties. Several of them have been resolved into multiple sources. Consequently, robust spectroscopic redshifts have been measured for 165 individual galaxies in the range 0.8<z<6.5. In this paper we analyse the millimetre spectra of the z-GAL sources, using both their continuum and line emission to derive their physical properties. At least two spectral lines are detected for each source, including transitions of 12CO, [CI], and H2O. The observed 12CO line ratios and spectral line energy distributions of individual sources resemble those of local starbursts. In seven sources the para-H2O(2_11-2_02) transition is detected and follows the IR versus H2O luminosity relation of sub-millimetre galaxies. The molecular gas mass of the z-GAL sources is derived from their 12CO, [CI], and sub-millimetre dust continuum emission. The three tracers lead to consistent results, with the dust continuum showing the largest scatter when compared to 12CO. The gas-to-dust mass ratio of these sources was computed by combining the information derived from 12CO and the dust continuum and has a median value of 107, similar to star-forming galaxies of near-solar metallicity. The same combined analysis leads to depletion timescales in the range between 0.1 and 1.0 Gyr, which place the z-GAL sources between the `main sequence' of star formation and the locus of starbursts. Finally, we derived a first estimate of stellar masses - modulo possible gravitational magnification - by inverting known gas scaling relations: the z-GAL sample is confirmed to be mostly composed by starbursts, whereas ~25% of its members lie on the main sequence of star-forming galaxies (within +/- 0.5 dex). △ Less

Submitted 28 July, 2023; originally announced July 2023.

Comments: Accepted for publication on A&A; 26 pages; 12 figures

Journal ref: A&A 678, A28 (2023)

arXiv:2307.15747 [pdf, other]

doi 10.1051/0004-6361/202346804

z-GAL -- A NOEMA spectroscopic redshift survey of bright Herschel galaxies: [II] Dust properties

Authors: D. Ismail, A. Beelen, V. Buat, S. Berta, P. Cox, F. Stanley, A. Young, S. **, R. Neri, T. Bakx, H. Dannerbauer, K. Butler, A. Cooray, A. Nanni, A. Omont, S. Serjeant, P. van der Werf, C. Vlahakis, A. Weiss, C. Yang, A. J. Baker, G. Bendo, E. Borsato, N. Chartab, S. Dye , et al. (12 additional authors not shown)

Abstract: (Abridged) We present the dust properties of 125 bright Herschel galaxies selected from the z-GAL survey. The large instantaneous bandwidth of NOEMA provides an exquisite sampling of the underlying dust continuum emission at 2 and 3 mm in the observed frame, with flux densities in at least four side bands for each source. Together with the available Herschel 250, 350, and 500 micron and SCUBA-2 85… ▽ More (Abridged) We present the dust properties of 125 bright Herschel galaxies selected from the z-GAL survey. The large instantaneous bandwidth of NOEMA provides an exquisite sampling of the underlying dust continuum emission at 2 and 3 mm in the observed frame, with flux densities in at least four side bands for each source. Together with the available Herschel 250, 350, and 500 micron and SCUBA-2 850 micron flux densities, the spectral energy distribution of each source can be analyzed from the far-infrared to the millimeter, with a fine sampling of the Rayleigh-Jeans tail. This wealth of data provides a solid basis to derive robust dust properties, in particular the dust emissivity index, beta, and the dust temperature, T(dust). In order to demonstrate our ability to constrain the dust properties, we used a flux-generated mock catalog and analyzed the results under the assumption of an optically thin and optically thick modified black body emission. For the z-GAL sources, we report a range of dust emissivities with beta ~ 1.5 - 3 estimated up to high precision with relative uncertainties that vary in the range 7% - 15%, and an average of 2.2 +/- 0.3. We find dust temperatures varying from 20 to 50 K with an average of T(dust) ~ 30 K for the optically thin case and ~38 K in the optically thick case. For all the sources, we estimate the dust masses and apparent infrared luminosities (based on the optically thin approach). An inverse correlation is found between T(dust) and beta, which is similar to what is seen in the local Universe. Finally, we report an increasing trend in the dust temperature as a function of redshift at a rate of 6.5 +/- 0.5 K/z for this 500 micron-selected sample. Based on this study, future prospects are outlined to further explore the evolution of dust temperature across cosmic time. △ Less

Submitted 28 July, 2023; originally announced July 2023.

Comments: Accepted for publication on A&A; 32 pages; 26 figures

Journal ref: A&A 678, A27 (2023)

arXiv:2307.15732 [pdf, other]

doi 10.1051/0004-6361/202346801

z-GAL -- A NOEMA spectroscopic redshift survey of bright Herschel galaxies: [I] Overview

Authors: P. Cox, R. Neri, S. Berta, D. Ismail, F. Stanley, A. Young, S. **, T. Bakx, A. Beelen, H. Dannerbauer, M. Krips, M. Lehnert, A. Omont, D. A. Riechers, A. J. Baker, G. Bendo, E. Borsato, V. Buat, K. Butler, N. Chartab, A. Cooray, S. Dye, S. Eales, R. Gavazzi, D. Hughes , et al. (13 additional authors not shown)

Abstract: (Abridged) Using the IRAM NOEMA interferometer, we measures the redshifts of 126 bright galaxies detected in the Herschel H-ATLAS, HeLMS, and HerS surveys. We report reliable spectroscopic redshifts for a total of 124 of the Herschel-selected galaxies. The redshifts are estimated from scans of the 3 and 2-mm bands (and, in one case, the 1-mm band) and are based on the detection of at least two emi… ▽ More (Abridged) Using the IRAM NOEMA interferometer, we measures the redshifts of 126 bright galaxies detected in the Herschel H-ATLAS, HeLMS, and HerS surveys. We report reliable spectroscopic redshifts for a total of 124 of the Herschel-selected galaxies. The redshifts are estimated from scans of the 3 and 2-mm bands (and, in one case, the 1-mm band) and are based on the detection of at least two emission lines. Together with the Pilot Programme (Neri et al. 2020), including spectroscopic redshifts of 11 sources, our survey has derived precise redshifts for 135 bright Herschel-selected galaxies, making it the largest sample of high-z galaxies with robust redshifts to date. Most emission lines detected are from 12CO (mainly from J=2-1 to 5-4), with some sources seen in [CI] and H2O emission lines. The spectroscopic redshifts are in the range 0.8<z<6.55 with a median value of z=2.56 +/- 0.10. The line widths of the sources are large, with a mean value for the full width at half maximum Delta(V) of 590 +/- 25 km/s and with 35% of the sources having widths of 700 km/s < Delta(V) < 1800 km/s. Most of the sources are unresolved or barely resolved on scales of 2 to 3 arcsec (or linear sizes of 15-25 kpc, unlensed). Some fields reveal double or multiple sources and, in some cases, sources at different redshifts. Taking these sources into account, there are, in total, 165 individual sources with robust spectroscopic redshifts, including lensed galaxies, binary systems, and over-densities. We present an overview of the z-GAL survey and provide the observed properties of the emission lines, the derived spectroscopic redshifts, and an atlas of the entire sample. The data presented here will serve as a foundation for the other z-GAL papers in this series reporting on the dust emission, the molecular and atomic gas properties, and a detailed analysis of the nature of the sources. △ Less

Submitted 28 July, 2023; originally announced July 2023.

Comments: Accepted for publication on A&A; 63 pages

Journal ref: A&A 678, A26 (2023)

arXiv:2307.04340 [pdf, other]

Crystal Structure Generation with Autoregressive Large Language Modeling

Authors: Luis M. Antunes, Keith T. Butler, Ricardo Grau-Crespo

Abstract: The generation of plausible crystal structures is often the first step in predicting the structure and properties of a material from its chemical composition. Quickly generating and predicting inorganic crystal structures is important for the discovery of new materials, which can target applications such as energy or electronic devices. However, most current methods for crystal structure predictio… ▽ More The generation of plausible crystal structures is often the first step in predicting the structure and properties of a material from its chemical composition. Quickly generating and predicting inorganic crystal structures is important for the discovery of new materials, which can target applications such as energy or electronic devices. However, most current methods for crystal structure prediction are computationally expensive, slowing the pace of innovation. Seeding structure prediction algorithms with quality generated candidates can overcome a major bottleneck. Here, we introduce CrystaLLM, a methodology for the versatile generation of crystal structures, based on the autoregressive large language modeling (LLM) of the Crystallographic Information File (CIF) format. Trained on millions of CIF files, CrystaLLM focuses on modeling crystal structures through text. CrystaLLM can produce plausible crystal structures for a wide range of inorganic compounds unseen in training, as demonstrated by ab initio simulations. The integration with predictors of formation energy permits the use of a Monte Carlo Tree Search algorithm to improve the generation of meaningful structures. Our approach challenges conventional representations of crystals, and demonstrates the potential of LLMs for learning effective 'world models' of crystal chemistry, which will lead to accelerated discovery and innovation in materials science. △ Less

Submitted 12 February, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

Comments: Added new results and supplementary information

arXiv:2307.00784 [pdf, other]

Element similarity in high-dimensional materials representations

Authors: Anthony Onwuli, Ashish V. Hegde, Kevin Nguyen, Keith T. Butler, Aron Walsh

Abstract: The traditional display of elements in the periodic table is convenient for the study of chemistry and physics. However, the atomic number alone is insufficient for training statistical machine learning models to describe and extract composition-structure-property relationships. Here, we assess the similarity and correlations contained within high-dimensional local and distributed representations… ▽ More The traditional display of elements in the periodic table is convenient for the study of chemistry and physics. However, the atomic number alone is insufficient for training statistical machine learning models to describe and extract composition-structure-property relationships. Here, we assess the similarity and correlations contained within high-dimensional local and distributed representations of the chemical elements, as implemented in an open-source Python package ElementEmbeddings. These include element vectors of up to 200 dimensions derived from known physical properties, crystal structure analysis, natural language processing, and deep learning models. A range of distance measures are compared and a clustering of elements into familiar groups is found using dimensionality reduction techniques. The cosine similarity is used to assess the utility of these metrics for crystal structure prediction, showing that they can outperform the traditional radius ratio rules for the structural classification of AB binary solids. △ Less

Submitted 24 August, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

Comments: 7 pages, 8 figures

arXiv:2305.14080 [pdf, other]

Eye-tracked Virtual Reality: A Comprehensive Survey on Methods and Privacy Challenges

Authors: Efe Bozkir, Süleyman Özdel, Mengdi Wang, Brendan David-John, Hong Gao, Kevin Butler, Eakta Jain, Enkelejda Kasneci

Abstract: Latest developments in computer hardware, sensor technologies, and artificial intelligence can make virtual reality (VR) and virtual spaces an important part of human everyday life. Eye tracking offers not only a hands-free way of interaction but also the possibility of a deeper understanding of human visual attention and cognitive processes in VR. Despite these possibilities, eye-tracking data al… ▽ More Latest developments in computer hardware, sensor technologies, and artificial intelligence can make virtual reality (VR) and virtual spaces an important part of human everyday life. Eye tracking offers not only a hands-free way of interaction but also the possibility of a deeper understanding of human visual attention and cognitive processes in VR. Despite these possibilities, eye-tracking data also reveal privacy-sensitive attributes of users when it is combined with the information about the presented stimulus. To address these possibilities and potential privacy issues, in this survey, we first cover major works in eye tracking, VR, and privacy areas between the years 2012 and 2022. While eye tracking in the VR part covers the complete pipeline of eye-tracking methodology from pupil detection and gaze estimation to offline use and analyses, as for privacy and security, we focus on eye-based authentication as well as computational methods to preserve the privacy of individuals and their eye-tracking data in VR. Later, taking all into consideration, we draw three main directions for the research community by mainly focusing on privacy challenges. In summary, this survey provides an extensive literature review of the utmost possibilities with eye tracking in VR and the privacy implications of those possibilities. △ Less

Submitted 23 May, 2023; originally announced May 2023.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2305.04098 [pdf, other]

doi 10.1051/0004-6361/202346271

Neutral outflows in high-z QSOs

Authors: Kirsty M. Butler, Paul P. van der Werf, Alain Omont, Pierre Cox

Abstract: OH+ absorption is a powerful tracer of inflowing and outflowing gas in the predominantly atomic diffuse and turbulent halo surrounding galaxies. In this letter, we present observations of OH+(1_1-1_0), CO(9-8) and the underlying dust continuum in 5 strongly lensed z~2-4 QSOs, using ALMA to detect outflowing neutral gas. Blue-shifted OH+ absorption is detected in 3/5 QSOs and tentatively detected i… ▽ More OH+ absorption is a powerful tracer of inflowing and outflowing gas in the predominantly atomic diffuse and turbulent halo surrounding galaxies. In this letter, we present observations of OH+(1_1-1_0), CO(9-8) and the underlying dust continuum in 5 strongly lensed z~2-4 QSOs, using ALMA to detect outflowing neutral gas. Blue-shifted OH+ absorption is detected in 3/5 QSOs and tentatively detected in a 4th. Absorption at systemic velocities is also detected in one. OH+ emission is observed in 3/5 QSOs at systemic velocities and CO(9-8) is detected in all 5 QSOs at high S/N, providing information on the dense molecular gas within the host galaxy. We compare our sample to high-z far-infrared (FIR) luminous star-forming and active galaxies from the literature. We find no difference in OH+ absorption line properties between active and star-forming galaxies with both samples following the same optical depth-dust temperature relation, suggesting that these observables are driven by the same mechanism in both samples. Similarly, star-forming and active galaxies both follow the same OH+ emission-FIR relation. Obscured QSOs display broader (>800 km/s) emission than the unobscured QSOs and all but one of the high-z star-forming galaxies, likely caused by the warm molecular gas reservoir obscuring the accreting nucleus. Broader CO(9-8) emission (>500 km/s) is found in obscured versus unobscured QSOs, but overall cover a similar range in line widths as the star-forming galaxies and follow the CO(9-8)-FIR luminosity relation found in low-z galaxies. We find that outflows traced by OH+ are only detected in extreme star-forming galaxies (broad CO emission) and in both types of QSOs, which, in turn, display no red-shifted absorption. This suggests that diffuse neutral outflows in galaxy halos may be associated with the most energetic evolutionary phases leading up to and following the obscured QSO phase. △ Less

Submitted 6 May, 2023; originally announced May 2023.

Comments: 8 pages, 3 figures, 4 tables, accepted to A&A letters

Journal ref: A&A 674, L5 (2023)

arXiv:2303.04830 [pdf, other]

doi 10.1093/mnras/stad784

Bright Extragalactic ALMA Redshift Survey (BEARS) III: Detailed study of emission lines from 71 Herschel targets

Authors: M. Hagimoto, T. J. L. C. Bakx, S. Serjeant, G. J. Bendo, S. A. Urquhart, S. Eales, K. C. Harrington, Y. Tamura, H. Umehata, S. Berta, A. R. Cooray, P. Cox, G. De Zotti, M. D. Lehnert, D. A. Riechers, D. Scott, P. Temi, P. P. van der Werf, C. Yang, A. Amvrosiadis, P. M. Andreani, A. J. Baker, A. Beelen, E. Borsato, V. Buat , et al. (33 additional authors not shown)

Abstract: We analyse the molecular and atomic emission lines of 71 bright Herschel-selected galaxies between redshifts 1.4 to 4.6 detected by the Atacama Large Millimetre/submillimetre Array. These lines include a total of 156 CO, [C I], and H2O emission lines. For 46 galaxies, we detect two transitions of CO lines, and for these galaxies we find gas properties similar to those of other dusty star-forming g… ▽ More We analyse the molecular and atomic emission lines of 71 bright Herschel-selected galaxies between redshifts 1.4 to 4.6 detected by the Atacama Large Millimetre/submillimetre Array. These lines include a total of 156 CO, [C I], and H2O emission lines. For 46 galaxies, we detect two transitions of CO lines, and for these galaxies we find gas properties similar to those of other dusty star-forming galaxy (DSFG) samples. A comparison to photo-dissociation models suggests that most of Herschel-selected galaxies have similar interstellar medium conditions as local infrared-luminous galaxies and high-redshift DSFGs, although with denser gas and more intense far-ultraviolet radiation fields than normal star-forming galaxies. The line luminosities agree with the luminosity scaling relations across five orders of magnitude, although the star-formation and gas surface density distributions (i.e., Schmidt-Kennicutt relation) suggest a different star-formation phase in our galaxies (and other DSFGs) compared to local and low-redshift gas-rich, normal star-forming systems. The gas-to-dust ratios of these galaxies are similar to Milky Way values, with no apparent redshift evolution. Four of 46 sources appear to have CO line ratios in excess of the expected maximum (thermalized) profile, suggesting a rare phase in the evolution of DSFGs. Finally, we create a deep stacked spectrum over a wide rest-frame frequency (220-890 GHz) that reveals faint transitions from HCN and CH, in line with previous stacking experiments. △ Less

Submitted 8 March, 2023; originally announced March 2023.

Comments: 30 pages, 17 figures, accepted for publication in Monthly Notices of the Royal Astronomical Society Main Journal. Comments are warmly welcomed

arXiv:2303.04201 [pdf, other]

DR-VIDAL -- Doubly Robust Variational Information-theoretic Deep Adversarial Learning for Counterfactual Prediction and Treatment Effect Estimation on Real World Data

Authors: Shantanu Ghosh, Zheng Feng, Jiang Bian, Kevin Butler, Mattia Prosperi

Abstract: Determining causal effects of interventions onto outcomes from real-world, observational (non-randomized) data, e.g., treatment repurposing using electronic health records, is challenging due to underlying bias. Causal deep learning has improved over traditional techniques for estimating individualized treatment effects (ITE). We present the Doubly Robust Variational Information-theoretic Deep Adv… ▽ More Determining causal effects of interventions onto outcomes from real-world, observational (non-randomized) data, e.g., treatment repurposing using electronic health records, is challenging due to underlying bias. Causal deep learning has improved over traditional techniques for estimating individualized treatment effects (ITE). We present the Doubly Robust Variational Information-theoretic Deep Adversarial Learning (DR-VIDAL), a novel generative framework that combines two joint models of treatment and outcome, ensuring an unbiased ITE estimation even when one of the two is misspecified. DR-VIDAL integrates: (i) a variational autoencoder (VAE) to factorize confounders into latent variables according to causal assumptions; (ii) an information-theoretic generative adversarial network (Info-GAN) to generate counterfactuals; (iii) a doubly robust block incorporating treatment propensities for outcome predictions. On synthetic and real-world datasets (Infant Health and Development Program, Twin Birth Registry, and National Supported Work Program), DR-VIDAL achieves better performance than other non-generative and generative methods. In conclusion, DR-VIDAL uniquely fuses causal assumptions, VAE, Info-GAN, and doubly robustness into a comprehensive, performant framework. Code is available at: https://github.com/Shantanu48114860/DR-VIDAL-AMIA-22 under MIT license. △ Less

Submitted 7 May, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

Comments: AMIA Annual Symposium, 2022 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10148269/)

Journal ref: Journal="AMIA Annu Symp Proc", Year="2022", Volume="2022",Pages="485--494"

arXiv:2301.10056 [pdf]

doi 10.1109/SP46215.2023.00059

Side Eye: Characterizing the Limits of POV Acoustic Eavesdrop** from Smartphone Cameras with Rolling Shutters and Movable Lenses

Authors: Yan Long, Pirouz Naghavi, Blas Kojusner, Kevin Butler, Sara Rampazzi, Kevin Fu

Abstract: Our research discovers how the rolling shutter and movable lens structures widely found in smartphone cameras modulate structure-borne sounds onto camera images, creating a point-of-view (POV) optical-acoustic side channel for acoustic eavesdrop**. The movement of smartphone camera hardware leaks acoustic information because images unwittingly modulate ambient sound as imperceptible distortions.… ▽ More Our research discovers how the rolling shutter and movable lens structures widely found in smartphone cameras modulate structure-borne sounds onto camera images, creating a point-of-view (POV) optical-acoustic side channel for acoustic eavesdrop**. The movement of smartphone camera hardware leaks acoustic information because images unwittingly modulate ambient sound as imperceptible distortions. Our experiments find that the side channel is further amplified by intrinsic behaviors of Complementary metal-oxide-semiconductor (CMOS) rolling shutters and movable lenses such as in Optical Image Stabilization (OIS) and Auto Focus (AF). Our paper characterizes the limits of acoustic information leakage caused by structure-borne sound that perturbs the POV of smartphone cameras. In contrast with traditional optical-acoustic eavesdrop** on vibrating objects, this side channel requires no line of sight and no object within the camera's field of view (images of a ceiling suffice). Our experiments test the limits of this side channel with a novel signal processing pipeline that extracts and recognizes the leaked acoustic information. Our evaluation with 10 smartphones on a spoken digit dataset reports 80.66%, 91.28%, and 99.67% accuracies on recognizing 10 spoken digits, 20 speakers, and 2 genders respectively. We further systematically discuss the possible defense strategies and implementations. By modeling, measuring, and demonstrating the limits of acoustic eavesdrop** from smartphone camera image streams, our contributions explain the physics-based causality and possible ways to reduce the threat on current and future devices. △ Less

Submitted 26 January, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

Journal ref: 2023 IEEE Symposium on Security and Privacy

arXiv:2301.09462 [pdf, other]

doi 10.1051/0004-6361/202244906

Quantitative spectroscopy of late O-type main-sequence stars with a hybrid non-LTE method

Authors: Patrick Aschenbrenner, Norbert Przybilla, Keith Butler

Abstract: Context. Late O-type stars at luminosities $\log L/L_\odot \lesssim 5.2$ show weak winds with mass-loss rates lower than 10$^{-8} M_\odot$ yr$^{-1}$. This implies that their photospheric layers are not strongly affected by the stellar wind. Aims. A hybrid non-local thermodynamic equilibrium (non-LTE) approach is tested for analyses of late O-type stars. A sample of 20 mostly sharp-lined Galactic O… ▽ More Context. Late O-type stars at luminosities $\log L/L_\odot \lesssim 5.2$ show weak winds with mass-loss rates lower than 10$^{-8} M_\odot$ yr$^{-1}$. This implies that their photospheric layers are not strongly affected by the stellar wind. Aims. A hybrid non-local thermodynamic equilibrium (non-LTE) approach is tested for analyses of late O-type stars. A sample of 20 mostly sharp-lined Galactic O stars of spectral types O8 to O9.7 and luminosity classes V and IV, previously studied in the literature using full non-LTE model atmospheres, is investigated. Methods. Hydrostatic plane-parallel atmospheric structures and synthetic spectra computed with Kurucz's Atlas12 code together with non-LTE line-formation codes Detail and Surface, which account for the effects of turbulent pressure on the atmosphere, were employed. High-resolution spectra were analysed to derive atmospheric parameters and elemental abundances. Fundamental stellar parameters were derived by considering stellar evolution tracks and Gaia EDR3 parallaxes. Interstellar reddening was characterised by fitting spectral energy distributions from the UV to the mid-IR. Results. A high precision and accuracy is achieved for all derived parameters for 16 sample stars. Turbulent pressure effects turn out have significant effects. Effective temperatures are determined to 1-3% uncertainty levels, surface gravities to 0.05 to 0.10 dex, masses to better than 8%, radii to better than 10%, and luminosities to better than 20% uncertainty typically. Abundances for C, N, O, Ne, Mg, Al, Si are derived with uncertainties of 0.05 to 0.10 dex and for helium within 0.03 to 0.05 dex (1$σ$ standard deviations) in general. Distances to the Lac OB1b association and to the open clusters NGC 2244, IC 1805, NGC 457, and IC 1396 are determined as a byproduct. △ Less

Submitted 23 January, 2023; originally announced January 2023.

Comments: 31 pages, 23 figures, Accepted for publication in Astronomy & Astrophysics

Journal ref: A&A 671, A36 (2023)

arXiv:2301.02584 [pdf, ps, other]

doi 10.1093/mnras/stac3771

The Bright Extragalactic ALMA Redshift Survey (BEARS) II: Millimetre photometry of gravitational lens candidates

Authors: G. J. Bendo, S. A. Urquhart, S. Serjeant, T. Bakx, M. Hagimoto, P. Cox, R. Neri, M. D. Lehnert, H. Dannerbauer, A. Amvrosiadis, P. Andreani, A. J. Baker, A. Beelen, S. Berta, E. Borsato, V. Buat, K. M. Butler, A. Cooray, G. De Zotti, L. Dunne, S. Dye, S. Eales, A. Enia, L. Fan, R. Gavazzi , et al. (27 additional authors not shown)

Abstract: We present 101 and 151 GHz ALMA continuum images for 85 fields selected from Herschel observations that have 500 micron flux densities >80 mJy and 250-500 micron colours consistent with z > 2, most of which are expected to be gravitationally lensed or hyperluminous infrared galaxies. Approximately half of the Herschel 500 micron sources were resolved into multiple ALMA sources, but 11 of the 15 br… ▽ More We present 101 and 151 GHz ALMA continuum images for 85 fields selected from Herschel observations that have 500 micron flux densities >80 mJy and 250-500 micron colours consistent with z > 2, most of which are expected to be gravitationally lensed or hyperluminous infrared galaxies. Approximately half of the Herschel 500 micron sources were resolved into multiple ALMA sources, but 11 of the 15 brightest 500 micron Herschel sources correspond to individual ALMA sources. For the 37 fields containing either a single source with a spectroscopic redshift or two sources with the same spectroscopic redshift, we examined the colour temperatures and dust emissivity indices. The colour temperatures only vary weakly with redshift and are statistically consistent with no redshift-dependent temperature variations, which generally corresponds to results from other samples selected in far-infrared, submillimetre, or millimetre bands but not to results from samples selected in optical or near-infrared bands. The dust emissivity indices, with very few exceptions, are largely consistent with a value of 2. We also compared spectroscopic redshifts to photometric redshifts based on spectral energy distribution templates designed for infrared-bright high-redshift galaxies. While the templates systematically underestimate the redshifts by ~15%, the inclusion of ALMA data decreases the scatter in the predicted redshifts by a factor of ~2, illustrating the potential usefulness of these millimetre data for estimating photometric redshifts. △ Less

Submitted 6 January, 2023; originally announced January 2023.

Comments: Accepted for publication in Monthly Notices of the Royal Astronomical Society

arXiv:2212.09675 [pdf, other]

doi 10.3847/1538-4357/acad03

Molecular Outflows in z > 6 QSO Hosts Driven by Star Formation

Authors: Kirsty M. Butler, Paul P. van der Werf, Theodoros Topkaras, Matus Rybak, Bram P. Venemans, Fabian Walter, Roberto Decarli

Abstract: Feedback and outflows in galaxies that are associated with a quasar phase are expected to be pivotal in quenching the most massive galaxies. However, observations targeting the molecular outflow phase, which dominates both the mass and momentum and removes the immediate fuel for star formation, are limited in high-z QSO hosts. Massive quiescent galaxies found at z ~ 4 are predicted to have already… ▽ More Feedback and outflows in galaxies that are associated with a quasar phase are expected to be pivotal in quenching the most massive galaxies. However, observations targeting the molecular outflow phase, which dominates both the mass and momentum and removes the immediate fuel for star formation, are limited in high-z QSO hosts. Massive quiescent galaxies found at z ~ 4 are predicted to have already quenched star formation by z ~ 5 and undergone their most intense growth at z > 6. Here, we present two ALMA detections of molecular outflows, traced by blue-shifted absorption of the OH 119 micron doublet, from a sample of three z > 6 infrared luminous QSO hosts: J2310+1855 and P183+05. OH 119 micron is also detected in emission in P183+05, and tentatively in the third source: P036+03. Using similar assumptions as for high-z Dusty Star-Forming Galaxy outflows, we find that our QSOs drive molecular outflows with comparable mass outflow rates, and that are comparably energetic except for J2310+1855's significantly lower outflow energy flux. We do not find evidence, nor require additional input from the central AGN to drive the molecular outflow in J2310+1855 but can not rule out an AGN contribution in P183+05 if a significant AGN contribution to L_FIR is assumed and/or if the outflow covering fraction is high (> 53%), which evidence from the literature suggests is unlikely in these sources. Differences observed in the blue-shifted absorption spectral properties may instead be caused by the QSO hosts' more compact dust continuum, limiting observations to lower altitude and more central regions of the outflow. △ Less

Submitted 24 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

Comments: Erratum: 4 pages, 3 figures, 1 table Original: 19 pages, 7 figures, 3 tables

arXiv:2212.06444 [pdf, other]

Predicting Thermoelectric Transport Properties from Composition with Attention-based Deep Learning

Authors: Luis M. Antunes, Keith T. Butler, Ricardo Grau-Crespo

Abstract: Thermoelectric materials can be used to construct devices which recycle waste heat into electricity. However, the best known thermoelectrics are based on rare, expensive or even toxic elements, which limits their widespread adoption. To enable deployment on global scales, new classes of effective thermoelectrics are thus required. $\textit{Ab initio}$ models of transport properties can help in the… ▽ More Thermoelectric materials can be used to construct devices which recycle waste heat into electricity. However, the best known thermoelectrics are based on rare, expensive or even toxic elements, which limits their widespread adoption. To enable deployment on global scales, new classes of effective thermoelectrics are thus required. $\textit{Ab initio}$ models of transport properties can help in the design of new thermoelectrics, but they are still too computationally expensive to be solely relied upon for high-throughput screening in the vast chemical space of all possible candidates. Here, we use models constructed with modern machine learning techniques to scan very large areas of inorganic materials space for novel thermoelectrics, using composition as an input. We employ an attention-based deep learning model, trained on data derived from $\textit{ab initio}$ calculations, to predict a material's Seebeck coefficient, electrical conductivity, and power factor over a range of temperatures and $\textit{n}$- or $\textit{p}$-type do** levels, with surprisingly good performance given the simplicity of the input, and with significantly lower computational cost. The results of applying the model to a space of known and hypothetical binary and ternary selenides reveal several materials that may represent promising thermoelectrics. Our study establishes a protocol for composition-based prediction of thermoelectric behaviour that can be easily enhanced as more accurate theoretical or experimental databases become available. △ Less

Submitted 13 December, 2022; originally announced December 2022.

arXiv:2209.05554 [pdf, other]

doi 10.1039/D2DD00096B

Unified Graph Neural Network Force-field for the Periodic Table

Authors: Kamal Choudhary, Brian DeCost, Lily Major, Keith Butler, Jeyan Thiyagalingam, Francesca Tavazza

Abstract: Classical force fields (FF) based on machine learning (ML) methods show great potential for large scale simulations of materials. MLFFs have hitherto largely been designed and fitted for specific systems and are not usually transferable to chemistries beyond the specific training set. We develop a unified atomisitic line graph neural network-based FF (ALIGNN-FF) that can model both structurally an… ▽ More Classical force fields (FF) based on machine learning (ML) methods show great potential for large scale simulations of materials. MLFFs have hitherto largely been designed and fitted for specific systems and are not usually transferable to chemistries beyond the specific training set. We develop a unified atomisitic line graph neural network-based FF (ALIGNN-FF) that can model both structurally and chemically diverse materials with any combination of 89 elements from the periodic table. To train the ALIGNN-FF model, we use the JARVIS-DFT dataset which contains around 75000 materials and 4 million energy-force entries, out of which 307113 are used in the training. We demonstrate the applicability of this method for fast optimization of atomic structures in the crystallography open database and by predicting accurate crystal structures using genetic algorithm for alloys. △ Less

Submitted 16 September, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

arXiv:2208.02692 [pdf, other]

doi 10.1051/0004-6361/202243973

Quantitative spectroscopy of B-type supergiants

Authors: D. Weßmayer, N. Przybilla, K. Butler

Abstract: Context. B-type supergiants are versatile tools to address various astrophysical topics, ranging from stellar atmospheres over stellar and galactic evolution to the cosmic distance scale. Aims. A hybrid non-LTE approach - line-blanketed model atmospheres computed under the assumption of local thermodynamic equilibrium (LTE) in combination with line formation calculations that account for deviation… ▽ More Context. B-type supergiants are versatile tools to address various astrophysical topics, ranging from stellar atmospheres over stellar and galactic evolution to the cosmic distance scale. Aims. A hybrid non-LTE approach - line-blanketed model atmospheres computed under the assumption of local thermodynamic equilibrium (LTE) in combination with line formation calculations that account for deviations from LTE - is tested for quantitative analyses of B-type supergiants with masses $M<30 M_{\odot}$, characterising a sample of 14 Galactic objects. Methods. Hydrostatic plane-parallel atmospheric structures and synthetic spectra computed with Kurucz's Atlas12 code together with the non-LTE line-formation codes Detail/Surface are compared to results from full non-LTE calculations with Tlusty, and the effects of turbulent pressure on the models are investigated. High-resolution spectra are analysed for atmospheric parameters, using Stark-broadened hydrogen lines and multiple metal ionisation equilibria, and for elemental abundances. Fundamental stellar parameters are derived by considering stellar evolution tracks and Gaia EDR3 parallaxes. Interstellar reddening towards the target stars is determined by matching model spectral energy distributions to observed ones. Results. Our hybrid non-LTE approach turns out to be equivalent to hydrostatic full non-LTE modelling for the deeper photospheric layers of the B-type supergiants considered. Turbulent pressure can become relevant for microturbulent velocities larger than 10 km s$^{-1}$. High precision and accuracy is achieved for all derived parameters by bringing multiple indicators to agreement simultaneously. Abundances for chemical species (He, C, N, O, Ne, Mg, Al, Si, S, Ar, Fe) are derived with uncertainties of 0.05 to 0.10 dex. The derived ratios N/C vs. N/O tightly follow the predictions from Geneva stellar evolution models. △ Less

Submitted 9 January, 2023; v1 submitted 4 August, 2022; originally announced August 2022.

Comments: 32 pages, 24 figures, Accepted for publication in Astronomy & Astrophysics, Data: https://doi.org/10.5281/zenodo.6802567

Journal ref: A&A 668, A92 (2022)

arXiv:2207.13389 [pdf, other]

Versatile Domain Map** Of Scanning Electron Nanobeam Diffraction Datasets Utilising Variational AutoEncoders and Decoder-Assisted Latent-Space Clustering

Authors: Andy Bridger, William I. F. David, Thomas J. Wood, Mohsen Danaie, Keith T. Butler

Abstract: Advancements in fast electron detectors have enabled the statistically significant sampling of crystal structures on the nanometre scale by means of Scanning Electron Nanobeam Diffraction (SEND). Characterisation of structural similarity across this length scale is key to bridging the gap between local atomic structure (using atomic resolution techniques such as High Resolution Scanning Transmissi… ▽ More Advancements in fast electron detectors have enabled the statistically significant sampling of crystal structures on the nanometre scale by means of Scanning Electron Nanobeam Diffraction (SEND). Characterisation of structural similarity across this length scale is key to bridging the gap between local atomic structure (using atomic resolution techniques such as High Resolution Scanning Transmission Electron Microscopy (HR-STEM)) and the macro-scale (using bulk techniques such as powder X-ray and neutron diffraction). The use of SEND technique allows for structural investigation of a broad range of samples, due to the techniques ability to operate with low electron dosage and its tolerance for sample thickness, relative to HR-STEM. This, coupled with the capacity for data collection over a wide areas and the automation of this collection, allows for statistically representative sampling of the microstructure. Also due to these factors, SEND generates large datasets and as a result automated/ semi-automated data processing workflows are required to aid in maximal extraction of useful information. As such, this paper outlines a versatile, data-driven approach for producing domain maps, as well as a statistical approach for assessing their applicability. The production of such domain maps for a dataset can help highlight nuance in the microstructure, as well as improve the manageability of that dataset for further investigation. The workflow outlined utilises a Variational AutoEncoder to identify and learn the sources of variance in the diffraction signal and this, in combination with clustering techniques, is used to produce domain maps for a set of varied example cases. This approach: is agnostic to domain crystallinity; requires no prior knowledge of crystal structure; and does not require the, potentially prohibitive, simulation of a library of appropriate diffraction patterns. △ Less

Submitted 27 July, 2022; originally announced July 2022.

arXiv:2206.10746 [pdf]

A Practical Methodology for ML-Based EM Side Channel Disassemblers

Authors: Cesar N. Arguello, Hunter Searle, Sara Rampazzi, Kevin R. B. Butler

Abstract: Providing security guarantees for embedded devices with limited interface capabilities is an increasingly crucial task. Although these devices don't have traditional interfaces, they still generate unintentional electromagnetic signals that correlate with the instructions being executed. By collecting these traces using our methodology and leveraging a random forest algorithm to develop a machine… ▽ More Providing security guarantees for embedded devices with limited interface capabilities is an increasingly crucial task. Although these devices don't have traditional interfaces, they still generate unintentional electromagnetic signals that correlate with the instructions being executed. By collecting these traces using our methodology and leveraging a random forest algorithm to develop a machine learning model, we built an EM side channel based instruction level disassembler. The disassembler was tested on an Arduino UNO board, yielding an accuracy of 88.69% instruction recognition for traces from twelve instructions captured at a single location in the device; this is an improvement compared to the 75.6% (for twenty instructions) reported in previous similar work. △ Less

Submitted 20 July, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

Comments: Accepted to the poster section of the 7th IEEE European Symposium on Security and Privacy 2022

arXiv:2205.10084 [pdf]

Spinel nitride solid solutions: charting properties in the configurational space with explainable machine learning

Authors: Pablo Sánchez-Palencia, Said Hamad, Pablo Palacios, Ricardo Grau-Crespo, Keith T. Butler

Abstract: Ab initio prediction of the variation of properties in the configurational space of solid solutions is computationally very demanding. We present an approach to accelerate these predictions via a combination of density functional theory and machine learning, using the cubic spinel nitride GeSn$_2$N$_4$ as a case study, exploring how formation energy and electronic bandgap are affected by configura… ▽ More Ab initio prediction of the variation of properties in the configurational space of solid solutions is computationally very demanding. We present an approach to accelerate these predictions via a combination of density functional theory and machine learning, using the cubic spinel nitride GeSn$_2$N$_4$ as a case study, exploring how formation energy and electronic bandgap are affected by configurational variations. Furthermore, we demonstrate the utility of applying explainable machine learning to understand the crystal chemistry origins of the trends that we observe. Different configuration descriptors (Coulomb matrix eigenspectrum, many-body tensor representation, and cluster correlation function vectors) are combined with different models (linear regression, gradient-boosted decision tree, and multi-layer perceptron) to extrapolate the calculation of ab initio properties from a small set of configurations to the full space with thousands of configurations. We discuss the performance of different descriptors and models. SHAP (SHapley Additive exPlanations) analysis of the machine learning models highlights how values of formation energy are dominated by variations in local crystal structure (single polyhedral environments), while values of electronic bandgap are dominated by variations in more extended structural motifs. Finally, we demonstrate the usefulness of this approach by constructing structure-property maps, identifying important configurations of GeSn$_2$N$_4$ with extremal properties, as well as by calculating accurate equilibrium properties using configurational averaging. △ Less

Submitted 20 May, 2022; originally announced May 2022.

arXiv:2204.01516 [pdf, other]

SAUSAGE: Security Analysis of Unix domain Socket Usage in Android

Authors: Mounir Elgharabawy, Blas Kojusner, Mohammad Mannan, Kevin R. B. Butler, Byron Williams, Amr Youssef

Abstract: The Android operating system is currently the most popular mobile operating system in the world. Android is based on Linux and therefore inherits its features including its Inter-Process Communication (IPC) mechanisms. These mechanisms are used by processes to communicate with one another and are extensively used in Android. While Android-specific IPC mechanisms have been studied extensively, Unix… ▽ More The Android operating system is currently the most popular mobile operating system in the world. Android is based on Linux and therefore inherits its features including its Inter-Process Communication (IPC) mechanisms. These mechanisms are used by processes to communicate with one another and are extensively used in Android. While Android-specific IPC mechanisms have been studied extensively, Unix domain sockets have not been examined comprehensively, despite playing a crucial role in the IPC of highly privileged system daemons. In this paper, we propose SAUSAGE, an efficient novel static analysis framework to study the security properties of these sockets. SAUSAGE considers access control policies implemented in the Android security model, as well as authentication checks implemented by the daemon binaries. It is a fully static analysis framework, specifically designed to analyze Unix domain socket usage in Android system daemons, at scale. We use this framework to analyze 200 Android images across eight popular smartphone vendors spanning Android versions 7-9. As a result, we uncover multiple access control misconfigurations and insecure authentication checks. Our notable findings include a permission bypass in highly privileged Qualcomm system daemons and an unprotected socket that allows an untrusted app to set the scheduling priority of other processes running on the system, despite the implementation of mandatory SELinux policies. Ultimately, the results of our analysis are worrisome; all vendors except the Android Open Source Project (AOSP) have access control issues, allowing an untrusted app to communicate to highly privileged daemons through Unix domain sockets introduced by hardware manufacturer or vendor customization. △ Less

Submitted 4 April, 2022; originally announced April 2022.

Comments: Accepted to EuroS&P 2022

arXiv:2203.12562 [pdf, other]

doi 10.1051/0004-6361/202142990

CRIRES high-resolution near-infrared spectroscopy of diffuse interstellar band profiles. Detection of 12 new DIBs in the YJ band and the introduction of a combined ISM sight line and stellar analysis approach

Authors: A. Ebenbichler, A. Postel, N. Przybilla, A. Seifahrt, D. Weßmayer, W. Kausch, M. Firnstein, K. Butler, A. Kaufer, H. Linnartz

Abstract: A high spectral resolution investigation of diffuse interstellar bands (DIBs) in the near-infrared ($YJ$ band) is conducted to test new methods, to confirm and improve existing parameters, and to search for new DIBs. Methods: The CRyogenic high-resolution InfraRed Echelle Spectrograph (CRIRES) on the European Southern Observatory's Very Large Telescope was employed to obtain spectra of four redd… ▽ More A high spectral resolution investigation of diffuse interstellar bands (DIBs) in the near-infrared ($YJ$ band) is conducted to test new methods, to confirm and improve existing parameters, and to search for new DIBs. Methods: The CRyogenic high-resolution InfraRed Echelle Spectrograph (CRIRES) on the European Southern Observatory's Very Large Telescope was employed to obtain spectra of four reddened background supergiant stars (HD 183143, HD 165784, HD 92207, HD 111613) and an unreddened comparison star (HD 87737) at the highest resolution of $R \approx 100000$ currently achievable at near-infrared wavelengths. The correction for telluric absorption was performed by a modelling approach. Non-local thermodynamic equilibrium spectral modelling of available optical and the new near-infrared spectra facilitated a comprehensive characterisation of the atmospheric properties of the background stars. A more precise and accurate determination of the reddening law along the sight lines could be achieved than feasible before by comparison of the observed and model spectral energy distributions. For DIBs that overlap with stellar lines the DIB profile shapes could be recovered. Results: Seventeen known near-infrared DIBs were confirmed, and 12 previously unknown and generally weaker DIBs were identified in the $YJ$ band. Three DIBs that show uniform profiles along all sight lines were identified, possibly connected to transitions from a common lower state of the same carrier. The divergent extinction curve towards the frequently discussed DIB standard star HD 183143 could be reproduced for the first time, requiring extra absorption by $\sim$3.5 mag due to polycyclic aromatic hydrocarbons (PAHs) to match the ultraviolet extinction bump. This extra absorption probably stems from a circumstellar bubble lying in front of the star which is intersected tangentially by the line of sight. △ Less

Submitted 15 April, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

Comments: 17 pages, 14 figures, Accepted for publication in Astronomy & Astrophysics, Data: https://doi.org/10.5281/zenodo.6461585

Journal ref: A&A 662, A81 (2022)

arXiv:2201.11161 [pdf]

Co-substituted BiFeO3: electronic, ferroelectric, and thermodynamic properties from first principles

Authors: Shivani Grover, Keith T. Butler, Umesh V Waghmare, Ricardo Grau-Crespo

Abstract: Bismuth ferrite, BiFeO3, is a multiferroic solid that is attracting increasing attention as a potential photocatalytic material, because the ferroelectric polarisation enhances the separation of photogenerated carriers. With the motivation of finding routes to engineer the band gap and the band alignment, while conserving or enhancing the ferroelectric properties, we have investigated the thermody… ▽ More Bismuth ferrite, BiFeO3, is a multiferroic solid that is attracting increasing attention as a potential photocatalytic material, because the ferroelectric polarisation enhances the separation of photogenerated carriers. With the motivation of finding routes to engineer the band gap and the band alignment, while conserving or enhancing the ferroelectric properties, we have investigated the thermodynamic, electronic and ferroelectric properties of BiCoxFe1 xO3 solid solutions, with 0 < x < 0.13, using density functional theory. We show that the band gap can be reduced from 2.9 eV to 2.1 eV by cobalt substitution, while simultaneously increasing the spontaneous polarisation, which is associated with a notably larger Born effective charge of Co compared to Fe cations. We discuss the interaction between Co impurities, which is strongly attractive and would drive the aggregation of Co, as evidenced by Monte Carlo simulations. Phase separation into a Co-rich phase is therefore predicted to be thermodynamically preferred, and the homogeneous solid solution can only exist in metastable form, protected by slow cation diffusion kinetics. Finally, we discuss the band alignment of pure and Co-substituted BiFeO3 with relevant redox potentials, in the context of its applicability in photocatalysis. △ Less

Submitted 4 August, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

Comments: Biblography expanded; typos corrected; improved discussion of photocatalytic applications

arXiv:2201.07815 [pdf, other]

doi 10.1093/mnras/stac150

The Bright Extragalactic ALMA Redshift Survey (BEARS) I: redshifts of bright gravitationally-lensed galaxies from the Herschel ATLAS

Authors: S. A. Urquhart, G. J. Bendo, S. Serjeant, T. Bakx, M. Hagimoto, P. Cox, R. Neri, M. Lehnert, C. Sedgwick, C. Weiner, H. Dannerbauer, A. Amvrosiadis, P. Andreani, A. J. Baker, A. Beelen, S. Berta, E. Borsato, V. Buat, K. M. Butler, A. Cooray, G. De Zotti, L. Dunne, S. Dye, S. Eales, A. Enia , et al. (31 additional authors not shown)

Abstract: We present spectroscopic measurements for 71 galaxies associated with 62 of the brightest high-redshift submillimeter sources from the Southern fields of the Herschel Astrophysical Terahertz Large Area Survey (H-ATLAS), while targeting 85 sources which resolved into 142. We have obtained robust redshift measurements for all sources using the 12-m Array and an efficient tuning of ALMA to optimise i… ▽ More We present spectroscopic measurements for 71 galaxies associated with 62 of the brightest high-redshift submillimeter sources from the Southern fields of the Herschel Astrophysical Terahertz Large Area Survey (H-ATLAS), while targeting 85 sources which resolved into 142. We have obtained robust redshift measurements for all sources using the 12-m Array and an efficient tuning of ALMA to optimise its use as a redshift hunter, with 73 per cent of the sources having a robust redshift identification. Nine of these redshift identifications also rely on observations from the Atacama Compact Array. The spectroscopic redshifts span a range $1.41<z<4.53$ with a mean value of 2.75, and the CO emission line full-width at half-maxima range between $\rm 110\,km\,s^{-1} < FWHM < 1290\,km\,s^{-1}$ with a mean value of $\sim$ 500kms$^{-1}$, in line with other high-$z$ samples. The derived CO(1-0) luminosity is significantly elevated relative to line-width to CO(1-0) luminosity scaling relation, which is suggestive of lensing magnification across our sources. In fact, the distribution of magnification factors inferred from the CO equivalent widths is consistent with expectations from galaxy-galaxy lensing models, though there is a hint of an excess at large magnifications that may be attributable to the additional lensing optical depth from galaxy groups or clusters. △ Less

Submitted 19 January, 2022; originally announced January 2022.

Comments: 21 pages, 8 figures

arXiv:2112.09795 [pdf]

doi 10.1039/D1TA10860C

Mixed-anion mixed-cation perovskite (FAPbI$_3$)$_{0.875}$(MAPbBr$_3$)$_{0.125}$: an ab-initio molecular dynamics study

Authors: Eduardo Menéndez-Proupin, Shivani Grover, Ana L. Montero-Alejo, Scott D. Midgley, Keith T. Butler, Ricardo Grau-Crespo

Abstract: Mixed-anion mixed-cation perovskites with (FAPbI$_3$)$_{1-x}$(MAPbBr$_3$)$_x$ composition have allowed record efficiencies in photovoltaic solar cells, but their atomic-scale behaviour is not well understood yet, in part because their theoretical modelling requires consideration of complex and interrelated dynamic and disordering effects. We present here an ab initio molecular dynamics investigati… ▽ More Mixed-anion mixed-cation perovskites with (FAPbI$_3$)$_{1-x}$(MAPbBr$_3$)$_x$ composition have allowed record efficiencies in photovoltaic solar cells, but their atomic-scale behaviour is not well understood yet, in part because their theoretical modelling requires consideration of complex and interrelated dynamic and disordering effects. We present here an ab initio molecular dynamics investigation of the structural, thermodynamic, and electronic properties of the (FAPbI$_3$)$_{0.875}$(MAPbBr$_3$)$_{0.125}$ perovskite. A special quasi-random structure is proposed to mimic the disorder of both the molecular cations and the halide anions, in a stoichiometry that is close to that of one of today's most efficient perovskite solar cells. We show that the rotation of the organic cations is more strongly hindered in the mixed structure in comparison with the pure compounds. Our analysis suggests that this mixed perovskite is thermodynamically stable against phase separation despite the endothermic mixing enthalpy, due to the large configurational entropy. The electronic properties are investigated by hybrid density functional calculations including spin-orbit coupling in carefully selected representative configurations extracted from the molecular dynamics. Our model, that is validated here against experimental information, provides a more sophisticated understanding of the interplay between dynamic and disordering effects in this important family of photovoltaic materials. △ Less

Submitted 17 December, 2021; originally announced December 2021.

Comments: 10 pages, 7 figures

Journal ref: Journal of Materials Chemistry A, 2022

arXiv:2111.01037 [pdf, other]

doi 10.1021/accountsmr.1c00244

Interpretable and Explainable Machine Learning for Materials Science and Chemistry

Authors: Felipe Oviedo, Juan Lavista Ferres, Tonio Buonassisi, Keith Butler

Abstract: While the uptake of data-driven approaches for materials science and chemistry is at an exciting, early stage, to realise the true potential of machine learning models for successful scientific discovery, they must have qualities beyond purely predictive power. The predictions and inner workings of models should provide a certain degree of explainability by human experts, permitting the identifica… ▽ More While the uptake of data-driven approaches for materials science and chemistry is at an exciting, early stage, to realise the true potential of machine learning models for successful scientific discovery, they must have qualities beyond purely predictive power. The predictions and inner workings of models should provide a certain degree of explainability by human experts, permitting the identification of potential model issues or limitations, building trust on model predictions and unveiling unexpected correlations that may lead to scientific insights. In this work, we summarize applications of interpretability and explainability techniques for materials science and chemistry and discuss how these techniques can improve the outcome of scientific studies. We discuss various challenges for interpretable machine learning in materials science and, more broadly, in scientific settings. In particular, we emphasize the risks of inferring causation or reaching generalization by purely interpreting machine learning models and the need of uncertainty estimates for model explanations. Finally, we showcase a number of exciting developments in other fields that could benefit interpretability in material science and chemistry problems. △ Less

Submitted 3 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

Comments: Under review Accounts of Material Research

Journal ref: 2022 Account of Materials Research

arXiv:2108.12865 [pdf]

doi 10.1039/D1CP05623A

Ultralow Work Function of the Electride Sr$_3$CrN$_3$

Authors: Cuicui Wang, Miaoting Xu, Keith T. Butler, Lee A. Burton

Abstract: Electrides have valence electrons that occupy free space in the crystal structure, making them easier to extract. This feature can be used in catalysis for important reactions that usually requires a high-temperature and high-pressure environments, such as ammonia synthesis. In this paper, we use density functional theory to investigate the behaviour of interstitial electrons of the 1-dimensional… ▽ More Electrides have valence electrons that occupy free space in the crystal structure, making them easier to extract. This feature can be used in catalysis for important reactions that usually requires a high-temperature and high-pressure environments, such as ammonia synthesis. In this paper, we use density functional theory to investigate the behaviour of interstitial electrons of the 1-dimensional electride Sr$_3$CrN$_3$. We find that the bulk excess electron density persists on introduction of surface terminations, that the crystal termination perpendicular to the 1D free-electron channel is highly stable and we confirm an extremely low work function with hybrid functional methods. Our results indicate that Sr$_3$CrN$_3$ is a potentially important novel catalyst, with accessible, directional and extractable free electron density. △ Less

Submitted 29 August, 2021; originally announced August 2021.

Comments: 4 pages, 4 figures

arXiv:2108.02077 [pdf, other]

doi 10.1063/5.0065694

Entropy-based Active Learning of Graph Neural Network Surrogate Models for Materials Properties

Authors: Johannes Allotey, Keith T. Butler, Jeyan Thiyagalingam

Abstract: Graph neural networks, trained on experimental or calculated data are becoming an increasingly important tool in computational materials science. Networks, once trained, are able to make highly accurate predictions at a fraction of the cost of experiments or first-principles calculations of comparable accuracy. However these networks typically rely on large databases of labelled experiments to tra… ▽ More Graph neural networks, trained on experimental or calculated data are becoming an increasingly important tool in computational materials science. Networks, once trained, are able to make highly accurate predictions at a fraction of the cost of experiments or first-principles calculations of comparable accuracy. However these networks typically rely on large databases of labelled experiments to train the model. In scenarios where data is scarce or expensive to obtain this can be prohibitive. By building a neural network that provides a confidence on the predicted properties, we are able to develop an active learning scheme that can reduce the amount of labelled data required, by identifying the areas of chemical space where the model is most uncertain. We present a scheme for coupling a graph neural network with a Gaussian process to featurise solid-state materials and predict properties \textit{including} a measure of confidence in the prediction. We then demonstrate that this scheme can be used in an active learning context to speed up the training of the model, by selecting the optimal next experiment for obtaining a data label. Our active learning scheme can double the rate at which the performance of the model on a test data set improves with additional data compared to choosing the next sample at random. This type of uncertainty quantification and active learning has the potential to open up new areas of materials science, where data are scarce and expensive to obtain, to the transformative power of graph neural networks. △ Less

Submitted 13 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

arXiv:2107.14664 [pdf, other]

Distributed Representations of Atoms and Materials for Machine Learning

Authors: Luis M. Antunes, Ricardo Grau-Crespo, Keith T. Butler

Abstract: The use of machine learning is becoming increasingly common in computational materials science. To build effective models of the chemistry of materials, useful machine-based representations of atoms and their compounds are required. We derive distributed representations of compounds from their chemical formulas only, via pooling operations of distributed representations of atoms. These compound re… ▽ More The use of machine learning is becoming increasingly common in computational materials science. To build effective models of the chemistry of materials, useful machine-based representations of atoms and their compounds are required. We derive distributed representations of compounds from their chemical formulas only, via pooling operations of distributed representations of atoms. These compound representations are evaluated on ten different tasks, such as the prediction of formation energy and band gap, and are found to be competitive with existing benchmarks that make use of structure, and even superior in cases where only composition is available. Finally, we introduce a new approach for learning distributed representations of atoms, named SkipAtom, which makes use of the growing information in materials structure databases. △ Less

Submitted 30 July, 2021; originally announced July 2021.

arXiv:2104.10077 [pdf, other]

doi 10.3847/1538-4357/ac0c7a

Resolved Neutral Outflow from a Lensed Dusty Star Forming Galaxy at z=2.09

Authors: Kirsty M. Butler, Paul P. van der Werf, Matus Rybak, Tiago Costa, Pierre Cox, Axel Weiß, Michał J. Michałowski, Dominik A. Riechers, Dimitra Rigopoulou, Lucia Marchetti, Stephen Eales, Ivan Valtchanov

Abstract: We report the detection of a massive neutral gas outflow in the z=2.09 gravitationally lensed Dusty Star-Forming Galaxy HATLASJ085358.9+015537 (G09v1.40), seen in absorption with the OH+(1_1-1_0) transition using spatially resolved (0.5"x0.4") Atacama Large Millimeter/submillimeter Array (ALMA) observations. The blueshifted OH+ line is observed simultaneously with the CO(9-8) emission line and und… ▽ More We report the detection of a massive neutral gas outflow in the z=2.09 gravitationally lensed Dusty Star-Forming Galaxy HATLASJ085358.9+015537 (G09v1.40), seen in absorption with the OH+(1_1-1_0) transition using spatially resolved (0.5"x0.4") Atacama Large Millimeter/submillimeter Array (ALMA) observations. The blueshifted OH+ line is observed simultaneously with the CO(9-8) emission line and underlying dust continuum. These data are complemented by high angular resolution (0.17"x0.13") ALMA observations of CH+(1-0) and underlying dust continuum, and Keck 2.2 micron imaging tracing the stellar emission. The neutral outflow, dust, dense molecular gas and stars all show spatial offsets from each other. The total atomic gas mass of the observed outflow is 6.7x10^9 M_sun, >25% as massive as the gas mass of the galaxy. We find that a conical outflow geometry best describes the OH+ kinematics and morphology and derive deprojected outflow properties as functions of possible inclination (0.38 deg-64 deg). The neutral gas mass outflow rate is between 83-25400 M_sun/yr, exceeding the star formation rate (788+/-300 M_sun/yr) if the inclination is >3.6 deg (mass-loading factor = 0.3-4.7). Kinetic energy and momentum fluxes span 4.4-290x10^9 L_sun and 0.1-3.7x10^37 dyne, respectively (energy-loading factor = 0.013-16), indicating that the feedback mechanisms required to drive the outflow depend on the inclination assumed. We derive a gas depletion time between 29 and 1 Myr, but find that the neutral outflow is likely to remain bound to the galaxy, unless the inclination is small, and may be re-accreted if additional feedback processes do not occur. △ Less

Submitted 2 July, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

Comments: 33 pages, 20 figures

arXiv:2103.08973 [pdf, other]

doi 10.1107/S160057672100563X

Determining the maximum information gain and optimising experimental design in neutron reflectometry using the Fisher information

Authors: James H. Durant, Lucas Wilkins, Keith Butler, Joshaniel F. K. Cooper

Abstract: An approach based on the Fisher information (FI) is developed to quantify the maximum information gain and optimal experimental design in neutron reflectometry experiments. In these experiments, the FI can be analytically calculated and used to provide sub-second predictions of parameter uncertainties. This approach can be used to influence real-time decisions about measurement angle, measurement… ▽ More An approach based on the Fisher information (FI) is developed to quantify the maximum information gain and optimal experimental design in neutron reflectometry experiments. In these experiments, the FI can be analytically calculated and used to provide sub-second predictions of parameter uncertainties. This approach can be used to influence real-time decisions about measurement angle, measurement time, contrast choice and other experimental conditions based on parameters of interest. The FI provides a lower bound on parameter estimation uncertainties and these are shown to decrease with the square root of measurement time, providing useful information for the planning and scheduling of experimental work. As the FI is computationally inexpensive to calculate, it can be computed repeatedly during the course of an experiment, saving costly beam time by signalling that sufficient data has been obtained; or saving experimental datasets by signalling that an experiment needs to continue. The approach's predictions are validated through the introduction of an experiment simulation framework that incorporates instrument-specific incident flux profiles, and through the investigation of measuring the structural properties of a phospholipid bilayer. △ Less

Submitted 1 June, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

Comments: Revised submission to the Journal of Applied Crystallography

Journal ref: J. Appl. Cryst. (2021). 54, 1100-1110

arXiv:2103.03325 [pdf, other]

Hard-label Manifolds: Unexpected Advantages of Query Efficiency for Finding On-manifold Adversarial Examples

Authors: Washington Garcia, Pin-Yu Chen, Somesh Jha, Scott Clouse, Kevin R. B. Butler

Abstract: Designing deep networks robust to adversarial examples remains an open problem. Likewise, recent zeroth order hard-label attacks on image classification models have shown comparable performance to their first-order, gradient-level alternatives. It was recently shown in the gradient-level setting that regular adversarial examples leave the data manifold, while their on-manifold counterparts are in… ▽ More Designing deep networks robust to adversarial examples remains an open problem. Likewise, recent zeroth order hard-label attacks on image classification models have shown comparable performance to their first-order, gradient-level alternatives. It was recently shown in the gradient-level setting that regular adversarial examples leave the data manifold, while their on-manifold counterparts are in fact generalization errors. In this paper, we argue that query efficiency in the zeroth-order setting is connected to an adversary's traversal through the data manifold. To explain this behavior, we propose an information-theoretic argument based on a noisy manifold distance oracle, which leaks manifold information through the adversary's gradient estimate. Through numerical experiments of manifold-gradient mutual information, we show this behavior acts as a function of the effective problem dimensionality and number of training points. On real-world datasets and multiple zeroth-order attacks using dimension-reduction, we observe the same universal behavior to produce samples closer to the data manifold. This results in up to two-fold decrease in the manifold distance measure, regardless of the model robustness. Our results suggest that taking the manifold-gradient mutual information into account can thus inform better robust model design in the future, and avoid leakage of the sensitive data manifold. △ Less

Submitted 4 March, 2021; originally announced March 2021.

Comments: Preprint

arXiv:2102.01770 [pdf, other]

doi 10.1109/TVCG.2021.3067787

A privacy-preserving approach to streaming eye-tracking data

Authors: Brendan David-John, Diane Hosfelt, Kevin Butler, Eakta Jain

Abstract: Eye-tracking technology is being increasingly integrated into mixed reality devices. Although critical applications are being enabled, there are significant possibilities for violating user privacy expectations. We show that there is an appreciable risk of unique user identification even under natural viewing conditions in virtual reality. This identification would allow an app to connect a user's… ▽ More Eye-tracking technology is being increasingly integrated into mixed reality devices. Although critical applications are being enabled, there are significant possibilities for violating user privacy expectations. We show that there is an appreciable risk of unique user identification even under natural viewing conditions in virtual reality. This identification would allow an app to connect a user's personal ID with their work ID without needing their consent, for example. To mitigate such risks we propose a framework that incorporates gatekee** via the design of the application programming interface and via software-implemented privacy mechanisms. Our results indicate that these mechanisms can reduce the rate of identification from as much as 85% to as low as 30%. The impact of introducing these mechanisms is less than 1.5$^\circ$ error in gaze position for gaze prediction. Gaze data streams can thus be made private while still allowing for gaze prediction, for example, during foveated rendering. Our approach is the first to support privacy-by-design in the flow of eye-tracking data within mixed reality use cases. △ Less

Submitted 19 March, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

Comments: 12 pages, 4 figures, to appear in IEEE TVCG Special Issue on IEEE VR 2021

arXiv:2011.04584 [pdf, other]

doi 10.1088/1361-648X/abea1c

Interpretable, calibrated neural networks for analysis and understanding of inelastic neutron scattering data

Authors: Keith T. Butler, Manh Duc Le, Jeyarajan Thiyagalingam, Toby G. Perring

Abstract: Deep neural networks provide flexible frameworks for learning data representations and functions relating data to other properties and are often claimed to achieve 'super-human' performance in inferring relationships between input data and desired property. In the context of inelastic neutron scattering experiments, however, as in many other scientific scenarios, a number of issues arise: (i) scar… ▽ More Deep neural networks provide flexible frameworks for learning data representations and functions relating data to other properties and are often claimed to achieve 'super-human' performance in inferring relationships between input data and desired property. In the context of inelastic neutron scattering experiments, however, as in many other scientific scenarios, a number of issues arise: (i) scarcity of labelled experimental data, (ii) lack of uncertainty quantification on results, and (iii) lack of interpretability of the deep neural networks. In this work we examine approaches to all three issues. We use simulated data to train a deep neural network to distinguish between two possible magnetic exchange models of a half-doped manganite. We apply the recently developed deterministic uncertainty quantification method to provide error estimates for the classification, demonstrating in the process how important realistic representations of instrument resolution in the training data are for reliable estimates on experimental data. Finally we use class activation maps to determine which regions of the spectra are most important for the final classification result reached by the network. △ Less

Submitted 20 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

arXiv:2008.01623 [pdf]

Semantic based model of Conceptual Work Products for formal verification of complex interactive systems

Authors: Mohcine Madkour, Keith Butler, Eric Mercer, Ali Bahrami, Cui Tao

Abstract: Many clinical workflows depend on interactive computer systems for highly technical, conceptual work products, such as diagnoses, treatment plans, care coordination, and case management. We describe an automatic logic reasoner to verify objective specifications for these highly technical, but abstract, work products that are essential to care. The conceptual work products specifications serve as a… ▽ More Many clinical workflows depend on interactive computer systems for highly technical, conceptual work products, such as diagnoses, treatment plans, care coordination, and case management. We describe an automatic logic reasoner to verify objective specifications for these highly technical, but abstract, work products that are essential to care. The conceptual work products specifications serve as a fundamental output requirement, which must be clearly stated, correct and solvable. There is strategic importance for such specifications because, in turn, they enable system model checking to verify that machine functions taken with user procedures are actually able to achieve these abstract products. We chose case management of Multiple Sclerosis (MS) outpatients as our use case for its challenging complexity. As a first step, we illustrate how graphical class and state diagrams from UML can be developed and critiqued with subject matter experts to serve as specifications of the conceptual work product of case management. A key feature is that the specification must be declarative and thus independent of any process or technology. Our Work Domain Ontology with tools from Semantic Web is needed to translate UML class and state diagrams for verification of solvability with automatic reasoning. The solvable model will then be ready for subsequent use with model checking on the system of human procedures and machine functions. We used the expressive rule language SPARQL Inferencing Notation (SPIN) to develop formal representations of the UML class diagram, the state machine, and their interactions. Using SPIN, we proved the consistency of the interactions of static and dynamic concepts. We discussed how the new SPIN rule engine could be incorporated in the Object Management Group (OMG) Ontology Definition Metamodel (ODM) △ Less

Submitted 4 August, 2020; originally announced August 2020.

arXiv:2006.10117 [pdf, other]

doi 10.1103/PhysRevE.102.013310

Heterogeneous partition of cellular blood-borne nanoparticles through microvascular bifurcations

Authors: Zixiang L. Liu, Jonathan R. Clausen, Justin L. Wagner, Kimberly S. Butler, Dan S. Bolintineanu, Jeremy B. Lechman, Rekha R. Rao, Cyrus K. Aidun

Abstract: Blood flowing through microvascular bifurcations has been an active research topic for many decades, while the partitioning pattern of nanoscale solutes in the blood remains relatively unexplored. Here, we demonstrate a multiscale computational framework for direct numerical simulation of the nanoparticle (NP) partitioning through physiologically-relevant vascular bifurcations in the presence of r… ▽ More Blood flowing through microvascular bifurcations has been an active research topic for many decades, while the partitioning pattern of nanoscale solutes in the blood remains relatively unexplored. Here, we demonstrate a multiscale computational framework for direct numerical simulation of the nanoparticle (NP) partitioning through physiologically-relevant vascular bifurcations in the presence of red blood cells (RBCs). The computational framework is established by embedding a newly-developed particulate suspension inflow/outflow boundary condition into a multiscale blood flow solver. The computational framework is verified by recovering a tubular blood flow without a bifurcation and validated against the experimental measurement of an intravital bifurcation flow. The classic Zweifach-Fung (ZF) effect is shown to be well captured by the method. Moreover, we observe that NPs exhibit a ZF-like heterogeneous partition in response to the heterogeneous partition of the RBC phase. The NP partitioning prioritizes the high-flow-rate daughter branch except for extreme (large or small) suspension flow partition ratios under which the complete phase separation tends to occur. By analyzing the flow field and the particle trajectories, we show that the ZF-like heterogeneity in NP partition can be explained by the RBC-entrainment effect caused by the deviation of the flow separatrix preceded by the tank-treading of RBCs near the bifurcation junction. The recovery of homogeneity in the NP partition under extreme flow partition ratios is due to the plasma skimming of NPs in the cell-free layer. These findings, based on the multiscale computational framework, provide biophysical insights to the heterogeneous distribution of NPs in microvascular beds that are observed pathophysiologically. △ Less

Submitted 17 June, 2020; originally announced June 2020.

Journal ref: Phys. Rev. E 102, 013310 (2020)

arXiv:2005.05831 [pdf, other]

doi 10.1063/5.0013136

Modelling the dielectric constants of crystals using machine learning

Authors: Kazuki Morita, Daniel W. Davies, Keith T. Butler, Aron Walsh

Abstract: The relative permittivity of a crystal is a fundamental property that links microscopic chemical bonding to macroscopic electromagnetic response. Multiple models, including analytical, numerical and statistical descriptions, have been made to understand and predict dielectric behaviour. Analytical models are often limited to a particular type of compounds, whereas machine learning (ML) models ofte… ▽ More The relative permittivity of a crystal is a fundamental property that links microscopic chemical bonding to macroscopic electromagnetic response. Multiple models, including analytical, numerical and statistical descriptions, have been made to understand and predict dielectric behaviour. Analytical models are often limited to a particular type of compounds, whereas machine learning (ML) models often lack interpretability. Here, we combine supervised ML, density functional perturbation theory, and analysis based on game theory to predict and explain the physical trends in optical dielectric constants of crystals. Two ML models, support vector regression and deep neural networks, were trained on a dataset of 1,364 dielectric constants. Shapley additive explanations (SHAP) analysis of the ML models reveals that they recover correlations described by textbook Clausius-Mossotti and Penn models, which gives confidence in their ability to describe physical behavior, while providing superior predictive power. △ Less

Submitted 12 May, 2020; originally announced May 2020.

Comments: 14 pages, 4 figures, 4 tables

arXiv:1910.07631 [pdf]

doi 10.1098/rsta.2019.0054

Machine Learning and Big Scientific Data

Authors: Tony Hey, Keith Butler, Sam Jackson, Jeyarajan Thiyagalingam

Abstract: This paper reviews some of the challenges posed by the huge growth of experimental data generated by the new generation of large-scale experiments at UK national facilities at the Rutherford Appleton Laboratory site at Harwell near Oxford. Such "Big Scientific Data" comes from the Diamond Light Source and Electron Microscopy Facilities, the ISIS Neutron and Muon Facility, and the UK's Central Lase… ▽ More This paper reviews some of the challenges posed by the huge growth of experimental data generated by the new generation of large-scale experiments at UK national facilities at the Rutherford Appleton Laboratory site at Harwell near Oxford. Such "Big Scientific Data" comes from the Diamond Light Source and Electron Microscopy Facilities, the ISIS Neutron and Muon Facility, and the UK's Central Laser Facility. Increasingly, scientists are now needing to use advanced machine learning and other AI technologies both to automate parts of the data pipeline and also to help find new scientific discoveries in the analysis of their data. For commercially important applications, such as object recognition, natural language processing and automatic translation, deep learning has made dramatic breakthroughs. Google's DeepMind has now also used deep learning technology to develop their AlphaFold tool to make predictions for protein folding. Remarkably, they have been able to achieve some spectacular results for this specific scientific problem. Can deep learning be similarly transformative for other scientific problems? After a brief review of some initial applications of machine learning at the Rutherford Appleton Laboratory, we focus on challenges and opportunities for AI in advancing materials science. Finally, we discuss the importance of develo** some realistic machine learning benchmarks using Big Scientific Data coming from a number of different scientific domains. We conclude with some initial examples of our "SciML" benchmark suite and of the research challenges these benchmarks will enable. △ Less

Submitted 12 October, 2019; originally announced October 2019.

Comments: 42 Pages with full colour images

arXiv:1908.08070 [pdf, other]

doi 10.1103/PhysRevApplied.15.054030

Quantum-statistical transport phenomena in memristive computing architectures

Authors: Christopher N. Singh, Brian A. Crafton, Mathew P. West, Alex S. Weidenbach, Keith T. Butler, Allan H. MacDonald, Arjit Raychowdury, Eric M. Vogel, W. Alan Doolittle, L. F. J. Piper, Wei-Cheng Lee

Abstract: The advent of reliable, nanoscale memristive components is promising for next generation compute-in-memory paradigms, however, the intrinsic variability in these devices has prevented widespread adoption. Here we show coherent electron wave functions play a pivotal role in the nanoscale transport properties of these emerging, non-volatile memories. By characterizing both filamentary and non-filame… ▽ More The advent of reliable, nanoscale memristive components is promising for next generation compute-in-memory paradigms, however, the intrinsic variability in these devices has prevented widespread adoption. Here we show coherent electron wave functions play a pivotal role in the nanoscale transport properties of these emerging, non-volatile memories. By characterizing both filamentary and non-filamentary memristive devices as disordered Anderson systems, the switching characteristics and intrinsic variability arise directly from the universality of electron transport in disordered media. Our framework suggests localization phenomena in nanoscale, solid-state memristive systems are directly linked to circuit level performance. We discuss how quantum conductance fluctuations in the active layer set a lower bound on device variability. This finding implies there is a fundamental quantum limit on the reliability of memristive devices, and electron coherence will play a decisive role in surpassing or maintaining Moore's Law with these systems. △ Less

Submitted 31 May, 2021; v1 submitted 21 August, 2019; originally announced August 2019.

Comments: 13 pages, 6 figures

Journal ref: Phys. Rev. Applied 15, 054030 (2021)

Showing 1–50 of 115 results for author: Butler, K