Search | arXiv e-print repository

Quantum computing for corrosion-resistant materials and anti-corrosive coatings design

Authors: Nam Nguyen, Thomas W. Watts, Benjamin Link, Kristen S. Williams, Yuval R. Sanders, Samuel J. Elman, Maria Kieferova, Michael J. Bremner, Kaitlyn J. Morrell, Justin Elenewski, Eric B. Isaacs, Samuel D. Johnson, Luke Mathieson, Kevin M. Obenland, Matthew Otten, Rashmi Sundareswara, Adam Holmes

Abstract: Recent estimates indicate that the U.S. Department of Defense spends over \$20 billion USD annually on corrosion-related maintenance. This expenditure is accompanied by a substantial loss in asset readiness, ranging from 10% to 30%. Moreover, the global costs associated with corrosion damage have been estimated at an astonishing \$2.5 trillion USD per year, or approximately 3.4% of global GDP in 2… ▽ More Recent estimates indicate that the U.S. Department of Defense spends over \$20 billion USD annually on corrosion-related maintenance. This expenditure is accompanied by a substantial loss in asset readiness, ranging from 10% to 30%. Moreover, the global costs associated with corrosion damage have been estimated at an astonishing \$2.5 trillion USD per year, or approximately 3.4% of global GDP in 2016. This project aims to describe how quantum computers might be leveraged to fundamentally change the way material-environment interactions are modeled for material discovery, selection, and design. This project also seeks to understand the plausibility and utility of replacing portions of classical computing workflows with algorithms optimized for quantum computing hardware. The utility of quantum computers is explored through the lens of two industrially relevant problems: (1) characterizing magnesium alloy corrosion properties in aqueous environments and (2) identifying stable niobium-rich alloys with corrosion resistance at temperatures above 1500K. This paper presents an end-to-end analysis of the complexity of both classical and quantum algorithms used in application workflows. Resource estimates are produced using a custom software package, pyLIQTR, based on the qubitized Quantum Phase Estimation (QPE) algorithm. Estimates for the two aforementioned applications show that industrially-relevant computational models that have the potential to deliver commercial utility require quantum computers with thousands to hundreds of thousands of logical qubits and the ability to execute $10^{13}$ to $10^{19}$ T-gates. These estimates represent an upper bound and motivate continued research into improved quantum algorithms and resource reduction techniques. △ Less

Submitted 26 June, 2024; originally announced June 2024.

arXiv:2406.08549 [pdf, other]

Investigating Mutual Coupling in the Hydrogen Epoch of Reionization Array and Mitigating its Effects on the 21-cm Power Spectrum

Authors: E. Rath, R. Pascua, A. T. Josaitis, A. Ewall-Wice, N. Fagnoni, E. de Lera Acedo, Z. E. Martinot, Z. Abdurashidova, T. Adams, J. E. Aguirre, R. Baartman, A. P. Beardsley, L. M. Berkhout, G. Bernardi, T. S. Billings, J. D. Bowman, P. Bull, J. Burba, R. Byrne, S. Carey, K. -F. Chen, S. Choudhuri, T. Cox, D. R. DeBoer, M. Dexter , et al. (56 additional authors not shown)

Abstract: Interferometric experiments designed to detect the highly redshifted 21-cm signal from neutral hydrogen are producing increasingly stringent constraints on the 21-cm power spectrum, but some k-modes remain systematics-dominated. Mutual coupling is a major systematic that must be overcome in order to detect the 21-cm signal, and simulations that reproduce effects seen in the data can guide strategi… ▽ More Interferometric experiments designed to detect the highly redshifted 21-cm signal from neutral hydrogen are producing increasingly stringent constraints on the 21-cm power spectrum, but some k-modes remain systematics-dominated. Mutual coupling is a major systematic that must be overcome in order to detect the 21-cm signal, and simulations that reproduce effects seen in the data can guide strategies for mitigating mutual coupling. In this paper, we analyse 12 nights of data from the Hydrogen Epoch of Reionization Array and compare the data against simulations that include a computationally efficient and physically motivated semi-analytic treatment of mutual coupling. We find that simulated coupling features qualitatively agree with coupling features in the data; however, coupling features in the data are brighter than the simulated features, indicating the presence of additional coupling mechanisms not captured by our model. We explore the use of fringe-rate filters as mutual coupling mitigation tools and use our simulations to investigate the effects of mutual coupling on a simulated cosmological 21-cm power spectrum in a "worst case" scenario where the foregrounds are particularly bright. We find that mutual coupling contaminates a large portion of the "EoR Window", and the contamination is several orders-of-magnitude larger than our simulated cosmic signal across a wide range of cosmological Fourier modes. While our fiducial fringe-rate filtering strategy reduces mutual coupling by roughly a factor of 100 in power, a non-negligible amount of coupling cannot be excised with fringe-rate filters, so more sophisticated mitigation strategies are required. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 19 pages, 12 figures, submitted to MNRAS

arXiv:2405.14577 [pdf, other]

Representation noising effectively prevents harmful fine-tuning on LLMs

Authors: Domenic Rosati, Jan Wehner, Kai Williams, Łukasz Bartoszcze, David Atanasov, Robie Gonzales, Subhabrata Majumdar, Carsten Maple, Hassan Sajjad, Frank Rudzicz

Abstract: Releasing open-source large language models (LLMs) presents a dual-use risk since bad actors can easily fine-tune these models for harmful purposes. Even without the open release of weights, weight stealing and fine-tuning APIs make closed models vulnerable to harmful fine-tuning attacks (HFAs). While safety measures like preventing jailbreaks and improving safety guardrails are important, such me… ▽ More Releasing open-source large language models (LLMs) presents a dual-use risk since bad actors can easily fine-tune these models for harmful purposes. Even without the open release of weights, weight stealing and fine-tuning APIs make closed models vulnerable to harmful fine-tuning attacks (HFAs). While safety measures like preventing jailbreaks and improving safety guardrails are important, such measures can easily be reversed through fine-tuning. In this work, we propose Representation Noising (RepNoise), a defence mechanism that is effective even when attackers have access to the weights and the defender no longer has any control. RepNoise works by removing information about harmful representations such that it is difficult to recover them during fine-tuning. Importantly, our defence is also able to generalize across different subsets of harm that have not been seen during the defence process. Our method does not degrade the general capability of LLMs and retains the ability to train the model on harmless tasks. We provide empirical evidence that the effectiveness of our defence lies in its "depth": the degree to which information about harmful representations is removed across all layers of the LLM. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.00784 [pdf, other]

A Radio Study of Persistent Radio Sources in Nearby Dwarf Galaxies: Implications for Fast Radio Bursts

Authors: Y. Dong, T. Eftekhari, W. Fong, S. Bhandari, E. Berger, O. S. Ould-Boukattine, J. W. T. Hessels, N. Sridhar, A. Reines, B. Margalit, J. Darling, A. C. Gordon, J. E. Greene, C. D. Kilpatrick, B. Marcote, B. D. Metzger, K. Nimmo, A. E. Nugent, Z. Paragi, P. K. G. Williams

Abstract: We present 1 - 12 GHz Karl G. Jansky Very Large Array observations of 9 off-nuclear persistent radio sources (PRSs) in nearby (z < 0.055) dwarf galaxies, along with high-resolution European very-long baseline interferometry (VLBI) Network (EVN) observations for one of them at 1.7GHz. We explore the plausibility that these PRSs are associated with fast radio burst (FRB) sources by examining their p… ▽ More We present 1 - 12 GHz Karl G. Jansky Very Large Array observations of 9 off-nuclear persistent radio sources (PRSs) in nearby (z < 0.055) dwarf galaxies, along with high-resolution European very-long baseline interferometry (VLBI) Network (EVN) observations for one of them at 1.7GHz. We explore the plausibility that these PRSs are associated with fast radio burst (FRB) sources by examining their properties, physical sizes, host-normalized offsets, spectral energy distributions (SEDs), radio luminosities, and light curves, and compare them to those of the PRSs associated with FRBs 20121102A and 20190520B, two known active galactic nuclei (AGN), and one likely AGN in our sample with comparable data, as well as other radio transients exhibiting characteristics analogous to FRB-PRSs. We identify a single source in our sample, J1136+2643, as the most promising FRB- PRS, based on its compact physical size and host-normalized offset. We further identify two sources, J0019+1507 and J0909+5955, with physical sizes comparable to FRB-PRSs, but which exhibit large offsets and flat spectral indices potentially indicative of a background AGN origin. We test the viability of neutron star wind nebulae and hypernebulae models for J1136+2643, and find that the physical size, luminosity, and SED of J1136+2643 are broadly consistent with these models. Finally, we discuss the alternative interpretation that the radio sources are instead powered by accreting massive black holes and outline future prospects and follow-up observations for differentiating between these scenarios. △ Less

Submitted 1 May, 2024; originally announced May 2024.

Comments: 24 pages, 7 figures, 3 tables

arXiv:2404.18237 [pdf, ps, other]

Torus Queen Independence

Authors: Kada Williams

Abstract: Define a queen on $\mathbb{Z}_n^d$ with admissible moves parallel to $\mathbf{x}\in\{-1,0,1\}^d$ at arbitrary length. How many queens can be placed on $\mathbb{Z}_n^d$ without any two in conflict? We improve a 1918 result by Pólya on the torus board $\mathbf{Z}_n^2$ while deducing an exact answer whenever $n\le 15$. Moreover, we give an example of $(n-c)n^{d-2}$ independent queens on… ▽ More Define a queen on $\mathbb{Z}_n^d$ with admissible moves parallel to $\mathbf{x}\in\{-1,0,1\}^d$ at arbitrary length. How many queens can be placed on $\mathbb{Z}_n^d$ without any two in conflict? We improve a 1918 result by Pólya on the torus board $\mathbf{Z}_n^2$ while deducing an exact answer whenever $n\le 15$. Moreover, we give an example of $(n-c)n^{d-2}$ independent queens on $\mathbf{Z}_n^d$, where $c(2)=4$. △ Less

Submitted 28 April, 2024; originally announced April 2024.

arXiv:2404.18190 [pdf, other]

Naive Bayes Classifiers and One-hot Encoding of Categorical Variables

Authors: Christopher K. I. Williams

Abstract: This paper investigates the consequences of encoding a $K$-valued categorical variable incorrectly as $K$ bits via one-hot encoding, when using a Naïve Bayes classifier. This gives rise to a product-of-Bernoullis (PoB) assumption, rather than the correct categorical Naïve Bayes classifier. The differences between the two classifiers are analysed mathematically and experimentally. In our experiment… ▽ More This paper investigates the consequences of encoding a $K$-valued categorical variable incorrectly as $K$ bits via one-hot encoding, when using a Naïve Bayes classifier. This gives rise to a product-of-Bernoullis (PoB) assumption, rather than the correct categorical Naïve Bayes classifier. The differences between the two classifiers are analysed mathematically and experimentally. In our experiments using probability vectors drawn from a Dirichlet distribution, the two classifiers are found to agree on the maximum a posteriori class label for most cases, although the posterior probabilities are usually greater for the PoB case. △ Less

Submitted 28 April, 2024; originally announced April 2024.

Comments: 7 pages, 3 figures

arXiv:2404.18014 [pdf, other]

Layered subgraphs of the hypercube

Authors: Natalie Behague, Imre Leader, Natasha Morrison, Kada Williams

Abstract: A subgraph of the $n$-dimensional hypercube is called 'layered' if it is a subgraph of a layer of some hypercube. In this paper we show that there exist subgraphs of the cube of arbitrarily large girth that are not layered. This answers a question of Axenovich, Martin and Winter. Perhaps surprisingly, these subgraphs may even be taken to be induced. A subgraph of the $n$-dimensional hypercube is called 'layered' if it is a subgraph of a layer of some hypercube. In this paper we show that there exist subgraphs of the cube of arbitrarily large girth that are not layered. This answers a question of Axenovich, Martin and Winter. Perhaps surprisingly, these subgraphs may even be taken to be induced. △ Less

Submitted 27 April, 2024; originally announced April 2024.

Comments: 11 pages

arXiv:2404.16940 [pdf, other]

A Volume-Limited Radio Search for Magnetic Activity in 140 Exoplanets with the Very Large Array

Authors: Kevin N. Ortiz Ceballos, Yvette Cendes, Edo Berger, Peter K. G. Williams

Abstract: We present results from a search for radio emission in 77 stellar systems hosting 140 exoplanets, predominantly within 17.5 pc using the Very Large Array (VLA) at $4-8$ GHz. This is the largest and most sensitive search to date for radio emission in exoplanetary systems in the GHz frequency range. We obtained new observations of 58 systems, and analyzed archival observations of an additional 19 sy… ▽ More We present results from a search for radio emission in 77 stellar systems hosting 140 exoplanets, predominantly within 17.5 pc using the Very Large Array (VLA) at $4-8$ GHz. This is the largest and most sensitive search to date for radio emission in exoplanetary systems in the GHz frequency range. We obtained new observations of 58 systems, and analyzed archival observations of an additional 19 systems. Our choice of frequency and volume limit are motivated by radio detections of ultracool dwarfs (UCDs), including T dwarfs with masses at the exoplanet threshold of $\sim\!13\,M_J$. Our surveyed exoplanets span a mass range of $\approx\,10^{-3}-10\,M_J$ and semi-major axes of $\approx\,10^{-2}-10\,$AU. We detect a single target - GJ 3323 (M4) hosting two exoplanets with minimum masses of 2 and 2.3$\,M_\oplus$ - with a circular polarization fraction of $\approx\,40\%$; the radio luminosity agrees with its known X-ray luminosity and the Güdel-Benz relation for stellar activity suggesting a likely stellar origin, but the high circular polarization fraction may also be indicative of star-planet interaction. For the remaining sources our $3σ$ upper limits are generally $L_ν\lesssim\,10^{12.5}\,\mathrm{erg}\,\mathrm{s}^{-1}\,\mathrm{Hz}^{-1}$, comparable to the lowest radio luminosities in UCDs. Our results are consistent with previous targeted searches of individual systems at GHz frequencies while greatly expanding the sample size. Our sensitivity is comparable to predicted fluxes for some systems considered candidates for detectable star-planet interaction. Observations with future instruments such as the Square Kilometer Array and Next Generation Very Large Array will be necessary to further constrain emission mechanisms from exoplanet systems at GHz frequencies. △ Less

Submitted 25 April, 2024; originally announced April 2024.

Comments: Submitted to ApJ, 18 pages, 8 figures

arXiv:2404.12771 [pdf]

Phase-space analysis of a two-section InP laser as an all-optical spiking neuron: dependency on control and design parameters

Authors: Lukas Puts, Daan Lenstra, Kevin Williams, Weiming Yao

Abstract: Using a rate-equation model we numerically evaluate the carrier concentration and photon number in an integrated two-section semiconductor laser, and analyse its dynamics in three-dimensional phase space. The simulation comprises compact model descriptions extracted from a commercially-available generic InP technology platform, allowing us to model an applied reverse-bias voltage to the saturable… ▽ More Using a rate-equation model we numerically evaluate the carrier concentration and photon number in an integrated two-section semiconductor laser, and analyse its dynamics in three-dimensional phase space. The simulation comprises compact model descriptions extracted from a commercially-available generic InP technology platform, allowing us to model an applied reverse-bias voltage to the saturable absorber. We use the model to study the influence of the injected gain current, reverse-bias voltage, and cavity mirror reflectivity on the excitable operation state, which is the operation mode desired for the laser to act as an all-optical integrated neuron. We show in phase-space that our model is capable of demonstrating four different operation modes, i.e. cw, self-pulsating and an on-set and excitable mode under optical pulse injection. In addition, we show that lowering the reflectivity of one of the cavity mirrors greatly enhances the control parameter space for excitable operation, enabling more relaxed operation parameter control and lower power consumption of an integrated two-section laser neuron. △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: 11 pages, 10 figures

arXiv:2404.08591 [pdf, other]

QCD bounds on leading-order hadronic vacuum polarization contributions to the muon anomalous magnetic moment

Authors: Siyuan Li, T. G. Steele, J. Ho, R. Raza, K. Williams, R. T. Kleiv

Abstract: QCD bounds on the leading-order (LO) hadronic vacuum polarization (HVP) contribution to the anomalous magnetic moment of the muon ($a_μ^{\mathrm{HVP,LO}}$, $a_μ=\left(g-2\right)_μ/2$) are determined by imposing Hölder inequalities and related inequality constraints on systems of Finite-Energy QCD sum-rules. This novel methodology is complementary to lattice QCD and data-driven approaches to determ… ▽ More QCD bounds on the leading-order (LO) hadronic vacuum polarization (HVP) contribution to the anomalous magnetic moment of the muon ($a_μ^{\mathrm{HVP,LO}}$, $a_μ=\left(g-2\right)_μ/2$) are determined by imposing Hölder inequalities and related inequality constraints on systems of Finite-Energy QCD sum-rules. This novel methodology is complementary to lattice QCD and data-driven approaches to determining $a_μ^{\mathrm{HVP,LO}}$. For the light-quark ($u,d,s$) contributions up to five-loop order in perturbation theory in the chiral limit, LO in light-quark mass corrections, next-to-leading order in dimension-four QCD condensates, and to LO in dimension-six QCD condensates, we find that $\left(673.0\pm 40.0\right)\times 10^{-10}\leq a_μ^{\mathrm{HVP,LO}} \leq \left(807.5\pm 48.0\right)\times10^{-10}\,$, bridging the range between lattice QCD and data-driven values. △ Less

Submitted 12 April, 2024; originally announced April 2024.

Comments: 16 pages, 4 figures

arXiv:2403.19688 [pdf, other]

Non-Euclidean Cross-Ratios and Carnot's Theorem for Conics

Authors: Michael Perez Palapa, Kai Williams

Abstract: When considering geometry, one might think of working with lines and circles on a flat plane as in Euclidean geometry. However, doing geometry in other spaces is possible, as the existence of spherical and hyperbolic geometry demonstrates. Despite the differences between these three geometries, striking connections appear among the three. In this paper, we illuminate one such connection by general… ▽ More When considering geometry, one might think of working with lines and circles on a flat plane as in Euclidean geometry. However, doing geometry in other spaces is possible, as the existence of spherical and hyperbolic geometry demonstrates. Despite the differences between these three geometries, striking connections appear among the three. In this paper, we illuminate one such connection by generalizing the cross-ratio, a powerful invariant associating a number to four points on a line, into non-Euclidean geometry. Along the way, we see how projections between these geometries can allow us to directly export results from one geometry into the others. The paper culminates by generalizing Carnot's Theorem for Conics - a classical result relating when six points on a triangle lie on a conic - into spherical and hyperbolic geometry. These same techniques are then applied to Carnot's Theorem for higher degree curves. △ Less

Submitted 2 March, 2024; originally announced March 2024.

arXiv:2403.09943 [pdf, ps, other]

The Width of a Ball in a Hypercube

Authors: Kada Williams

Abstract: Various authors have calculated how many pairwise incomparable points can be selected from a partially ordered set. We tackle this question for the family of subsets of a finite set obtained by removing or adding a bounded number of elements from a given subset. Our versatile approach is proven valid under the condition of the set size exceeding the cube of the radius. Various authors have calculated how many pairwise incomparable points can be selected from a partially ordered set. We tackle this question for the family of subsets of a finite set obtained by removing or adding a bounded number of elements from a given subset. Our versatile approach is proven valid under the condition of the set size exceeding the cube of the radius. △ Less

Submitted 14 March, 2024; originally announced March 2024.

arXiv:2402.16382 [pdf, other]

Immunization against harmful fine-tuning attacks

Authors: Domenic Rosati, Jan Wehner, Kai Williams, Łukasz Bartoszcze, Jan Batzner, Hassan Sajjad, Frank Rudzicz

Abstract: Approaches to aligning large language models (LLMs) with human values has focused on correcting misalignment that emerges from pretraining. However, this focus overlooks another source of misalignment: bad actors might purposely fine-tune LLMs to achieve harmful goals. In this paper, we present an emerging threat model that has arisen from alignment circumvention and fine-tuning attacks. However,… ▽ More Approaches to aligning large language models (LLMs) with human values has focused on correcting misalignment that emerges from pretraining. However, this focus overlooks another source of misalignment: bad actors might purposely fine-tune LLMs to achieve harmful goals. In this paper, we present an emerging threat model that has arisen from alignment circumvention and fine-tuning attacks. However, lacking in previous works is a clear presentation of the conditions for effective defence. We propose a set of conditions for effective defence against harmful fine-tuning in LLMs called "Immunization conditions," which help us understand how we would construct and measure future defences. Using this formal framework for defence, we offer a synthesis of different research directions that might be persued to prevent harmful fine-tuning attacks and provide a demonstration of how to use these conditions experimentally showing early results of using an adversarial loss to immunize LLama2-7b-chat. △ Less

Submitted 26 February, 2024; originally announced February 2024.

arXiv:2402.08659 [pdf, other]

A demonstration of the effect of fringe-rate filtering in the Hydrogen Epoch of Reionization Array delay power spectrum pipeline

Authors: Hugh Garsden, Philip Bull, Mike Wilensky, Zuhra Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Lindsay M. Berkhout, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Jacob Burba, Steven Carey, Chris L. Carilli, Kai-Feng Chen, Carina Cheng, Samir Choudhuri, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter , et al. (72 additional authors not shown)

Abstract: Radio interferometers targeting the 21cm brightness temperature fluctuations at high redshift are subject to systematic effects that operate over a range of different timescales. These can be isolated by designing appropriate Fourier filters that operate in fringe-rate (FR) space, the Fourier pair of local sidereal time (LST). Applications of FR filtering include separating effects that are correl… ▽ More Radio interferometers targeting the 21cm brightness temperature fluctuations at high redshift are subject to systematic effects that operate over a range of different timescales. These can be isolated by designing appropriate Fourier filters that operate in fringe-rate (FR) space, the Fourier pair of local sidereal time (LST). Applications of FR filtering include separating effects that are correlated with the rotating sky vs. those relative to the ground, down-weighting emission in the primary beam sidelobes, and suppressing noise. FR filtering causes the noise contributions to the visibility data to become correlated in time however, making interpretation of subsequent averaging and error estimation steps more subtle. In this paper, we describe fringe rate filters that are implemented using discrete prolate spheroidal sequences, and designed for two different purposes -- beam sidelobe/horizon suppression (the `mainlobe' filter), and ground-locked systematics removal (the `notch' filter). We apply these to simulated data, and study how their properties affect visibilities and power spectra generated from the simulations. Included is an introduction to fringe-rate filtering and a demonstration of fringe-rate filters applied to simple situations to aid understanding. △ Less

Submitted 13 February, 2024; originally announced February 2024.

Comments: 21 pages, 18 figures, submitted to Monthly Notices of the Royal Astronomical Society

arXiv:2401.04304 [pdf, other]

doi 10.1088/1538-3873/ad3122

Hydrogen Epoch of Reionization Array (HERA) Phase II Deployment and Commissioning

Authors: Lindsay M. Berkhout, Daniel C. Jacobs, Zuhra Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Philip Bull, Jacob Burba, Steven Carey, Chris L. Carilli, Kai-Feng Chen, Carina Cheng, Samir Choudhuri, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter, Joshua S. Dillon , et al. (71 additional authors not shown)

Abstract: This paper presents the design and deployment of the Hydrogen Epoch of Reionization Array (HERA) phase II system. HERA is designed as a staged experiment targeting 21 cm emission measurements of the Epoch of Reionization. First results from the phase I array are published as of early 2022, and deployment of the phase II system is nearing completion. We describe the design of the phase II system an… ▽ More This paper presents the design and deployment of the Hydrogen Epoch of Reionization Array (HERA) phase II system. HERA is designed as a staged experiment targeting 21 cm emission measurements of the Epoch of Reionization. First results from the phase I array are published as of early 2022, and deployment of the phase II system is nearing completion. We describe the design of the phase II system and discuss progress on commissioning and future upgrades. As HERA is a designated Square Kilometer Array (SKA) pathfinder instrument, we also show a number of "case studies" that investigate systematics seen while commissioning the phase II system, which may be of use in the design and operation of future arrays. Common pathologies are likely to manifest in similar ways across instruments, and many of these sources of contamination can be mitigated once the source is identified. △ Less

Submitted 8 January, 2024; originally announced January 2024.

Journal ref: PASP 2024 136 045002

arXiv:2312.14098 [pdf, other]

Comparing indirect methods for black hole masses in AGN: the good, the bad, and the ugly

Authors: M. Gliozzi, J. K. Williams, A. Akylas, I. E. Papadakis, O. I. Shuvo, A. Halavatkar, A. Alt

Abstract: The black hole mass MBH is crucial in constraining the growth of supermassive BHs within their host galaxies. Since direct measurements of MBH with dynamical methods are restricted to a limited number of nearly quiescent nearby galaxies and a small minority of active galactic nuclei (AGN), we must rely on indirect methods. In this work, we utilize an unbiased, volume-limited, hard X-ray selected s… ▽ More The black hole mass MBH is crucial in constraining the growth of supermassive BHs within their host galaxies. Since direct measurements of MBH with dynamical methods are restricted to a limited number of nearly quiescent nearby galaxies and a small minority of active galactic nuclei (AGN), we must rely on indirect methods. In this work, we utilize an unbiased, volume-limited, hard X-ray selected sample of AGN to compare the reliability of some commonly used indirect methods, emphasising those that can be applied to obscured AGN. Based on a subsample of AGN with MBH determined via dynamical methods, our study suggests that X-ray based techniques, such as the scaling method and the one based on the variability measured through the excess variance, are in good agreement with the dynamical methods. On the other hand, the M-sigma correlation based on inactive galaxies tends to systematically overestimate MBH, regardless of the level of obscuration. We provide a correcting factor that produces an acceptable agreement with dynamical values and can be used to quickly correct the MBH computed with this method. We also derive an alternative M-sigma correlation based on this unbiased sample of AGN with a slope considerably shallower than the ones obtained using inactive galaxies, suggesting that the latter correlation may not be appropriate to compute the MBH in AGN. Finally, we find that no quick fix can be applied to correct the MBH obtained from the fundamental plane of black hole activity, casting doubts on the reliability of this method. △ Less

Submitted 21 December, 2023; originally announced December 2023.

Comments: 15 pages, 9 figures, 3 tables, Accepted for publication in MNRAS

arXiv:2312.09763 [pdf, other]

matvis: A matrix-based visibility simulator for fast forward modelling of many-element 21 cm arrays

Authors: Piyanat Kittiwisit, Steven G. Murray, Hugh Garsden, Philip Bull, Christopher Cain, Aaron R. Parsons, Jackson Sipple, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Lindsay M. Berkhout, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Jacob Burba, Steven Carey, Chris L. Carilli, Kai-Feng Chen, Carina Cheng , et al. (73 additional authors not shown)

Abstract: Detection of the faint 21 cm line emission from the Cosmic Dawn and Epoch of Reionisation will require not only exquisite control over instrumental calibration and systematics to achieve the necessary dynamic range of observations but also validation of analysis techniques to demonstrate their statistical properties and signal loss characteristics. A key ingredient in achieving this is the ability… ▽ More Detection of the faint 21 cm line emission from the Cosmic Dawn and Epoch of Reionisation will require not only exquisite control over instrumental calibration and systematics to achieve the necessary dynamic range of observations but also validation of analysis techniques to demonstrate their statistical properties and signal loss characteristics. A key ingredient in achieving this is the ability to perform high-fidelity simulations of the kinds of data that are produced by the large, many-element, radio interferometric arrays that have been purpose-built for these studies. The large scale of these arrays presents a computational challenge, as one must simulate a detailed sky and instrumental model across many hundreds of frequency channels, thousands of time samples, and tens of thousands of baselines for arrays with hundreds of antennas. In this paper, we present a fast matrix-based method for simulating radio interferometric measurements (visibilities) at the necessary scale. We achieve this through judicious use of primary beam interpolation, fast approximations for coordinate transforms, and a vectorised outer product to expand per-antenna quantities to per-baseline visibilities, coupled with standard parallelisation techniques. We validate the results of this method, implemented in the publicly-available matvis code, against a high-precision reference simulator, and explore its computational scaling on a variety of problems. △ Less

Submitted 15 December, 2023; originally announced December 2023.

Comments: 25 pages, 20 figures, submitted to RAS Techniques and Instruments, matvis is publicly available at https://github.com/HERA-Team/matvis

arXiv:2312.03697 [pdf, other]

Bayesian estimation of cross-coupling and reflection systematics in 21cm array visibility data

Authors: Geoff G. Murphy, Philip Bull, Mario G. Santos, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee Billings, Judd D. Bowman, Richard F. Bradley, Jacob Burba, Christopher Cain, Steven Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter, Joshua S. Dillon, Nico Eksteen , et al. (54 additional authors not shown)

Abstract: Observations with radio arrays that target the 21-cm signal originating from the early Universe suffer from a variety of systematic effects. An important class of these are reflections and spurious couplings between antennas. We apply a Hamiltonian Monte Carlo sampler to the modelling and mitigation of these systematics in simulated Hydrogen Epoch of Reionisation Array (HERA) data. This method all… ▽ More Observations with radio arrays that target the 21-cm signal originating from the early Universe suffer from a variety of systematic effects. An important class of these are reflections and spurious couplings between antennas. We apply a Hamiltonian Monte Carlo sampler to the modelling and mitigation of these systematics in simulated Hydrogen Epoch of Reionisation Array (HERA) data. This method allows us to form statistical uncertainty estimates for both our models and the recovered visibilities, which is an important ingredient in establishing robust upper limits on the Epoch of Reionisation (EoR) power spectrum. In cases where the noise is large compared to the EoR signal, this approach can constrain the systematics well enough to mitigate them down to the noise level for both systematics studied. Where the noise is smaller than the EoR, our modelling can mitigate the majority of the reflections with there being only a minor level of residual systematics, while cross-coupling sees essentially complete mitigation. Our approach performs similarly to existing filtering/fitting techniques used in the HERA pipeline, but with the added benefit of rigorously propagating uncertainties. In all cases it does not significantly attenuate the underlying signal. △ Less

Submitted 6 December, 2023; originally announced December 2023.

Comments: 19 pages, 14 figures, submitted to MNRAS

arXiv:2311.10711 [pdf, other]

Direct Optimal Map** Image Power Spectrum and its Window Functions

Authors: Zhilei Xu, Honggeun Kim, Jacqueline N. Hewitt, Kai-Feng Chen, Nicholas S. Kern, Elizabeth Rath, Ruby Byrne, Adélie Gorce, Zachary E. Martinot, Joshua S. Dillon, Bryna J. Hazelton, Adrian Liu, Miguel F. Morales, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley , et al. (56 additional authors not shown)

Abstract: The key to detecting neutral hydrogen during the epoch of reionization (EoR) is to separate the cosmological signal from the dominating foreground radiation. We developed direct optimal map** (Xu et al. 2022) to map interferometric visibilities; it contains only linear operations, with full knowledge of point spread functions from visibilities to images. Here we present an FFT-based image power… ▽ More The key to detecting neutral hydrogen during the epoch of reionization (EoR) is to separate the cosmological signal from the dominating foreground radiation. We developed direct optimal map** (Xu et al. 2022) to map interferometric visibilities; it contains only linear operations, with full knowledge of point spread functions from visibilities to images. Here we present an FFT-based image power spectrum and its window functions based on direct optimal map**. We use noiseless simulation, based on the Hydrogen Epoch of Reionization Array (HERA) Phase I configuration, to study the image power spectrum properties. The window functions show $<10^{-11}$ power leakage from the foreground-dominated region into the EoR window; the 2D and 1D power spectra also verify the separation between the foregrounds and the EoR. Furthermore, we simulated visibilities from a $uv$-complete array and calculated its image power spectrum. The result shows that the foreground--EoR leakage is further suppressed below $10^{-12}$, dominated by the tapering function sidelobes; the 2D power spectrum does not show signs of the horizon wedge. The $uv$-complete result provides a reference case for future 21cm cosmology array designs. △ Less

Submitted 17 November, 2023; originally announced November 2023.

Comments: Submitted to ApJ

arXiv:2309.11549 [pdf, other]

Large Synthetic Data from the arXiv for OCR Post Correction of Historic Scientific Articles

Authors: Jill P. Naiman, Morgan G. Cosillo, Peter K. G. Williams, Alyssa Goodman

Abstract: Scientific articles published prior to the "age of digitization" (~1997) require Optical Character Recognition (OCR) to transform scanned documents into machine-readable text, a process that often produces errors. We develop a pipeline for the generation of a synthetic ground truth/OCR dataset to correct the OCR results of the astrophysics literature holdings of the NASA Astrophysics Data System (… ▽ More Scientific articles published prior to the "age of digitization" (~1997) require Optical Character Recognition (OCR) to transform scanned documents into machine-readable text, a process that often produces errors. We develop a pipeline for the generation of a synthetic ground truth/OCR dataset to correct the OCR results of the astrophysics literature holdings of the NASA Astrophysics Data System (ADS). By mining the arXiv we create, to the authors' knowledge, the largest scientific synthetic ground truth/OCR post correction dataset of 203,354,393 character pairs. We provide baseline models trained with this dataset and find the mean improvement in character and word error rates of 7.71% and 18.82% for historical OCR text, respectively. When used to classify parts of sentences as inline math, we find a classification F1 score of 77.82%. Interactive dashboards to explore the dataset are available online: https://readingtimemachine.github.io/projects/1-ocr-groundtruth-may2023, and data and code, within the limitations of our agreement with the arXiv, are hosted on GitHub: https://github.com/ReadingTimeMachine/ocr_post_correction. △ Less

Submitted 20 September, 2023; originally announced September 2023.

Comments: 6 pages, 1 figure, 1 table; training/validation/test datasets and all model weights to be linked on Zenodo on publication

arXiv:2308.04392 [pdf, other]

Let's Get Vysical: Perceptual Accuracy In Visual and Tactile Encodings

Authors: Zhongzheng Xu, Kristin Williams, Emily Wall

Abstract: In this paper, we explore the effectiveness of tactile data encodings using swell paper in comparison to visual encodings displayed with SVGs for data perception tasks. By replicating and adapting Cleveland and McGill's graphical perception study for the tactile modality, we establish a novel tactile encoding hierarchy. In a study with 12 university students, we found that participants perceived v… ▽ More In this paper, we explore the effectiveness of tactile data encodings using swell paper in comparison to visual encodings displayed with SVGs for data perception tasks. By replicating and adapting Cleveland and McGill's graphical perception study for the tactile modality, we establish a novel tactile encoding hierarchy. In a study with 12 university students, we found that participants perceived visual encodings more accurately when comparing values, judging their ratios with lower cognitive load, and better self-evaluated performance than tactile encodings. However, tactile encodings differed from their visual counterparts in terms of how accurately values could be decoded from them. This suggests that data physicalizations will require different design guidance than that developed for visual encodings. By providing empirical evidence for the perceptual accuracy of tactile encodings, our work contributes to foundational research on forms of data representation that prioritize tactile perception such as tactile graphics. △ Less

Submitted 8 August, 2023; originally announced August 2023.

Comments: 4 pages, 3 figures

arXiv:2306.09877 [pdf]

Revealing the impact of social circumstances on the selection of cancer therapy through natural language processing of social work notes

Authors: Shenghuan Sun, Travis Zack, Christopher Y. K. Williams, Atul J. Butte, Madhumita Sushil

Abstract: We aimed to investigate the impact of social circumstances on cancer therapy selection using natural language processing to derive insights from social worker documentation. We developed and employed a Bidirectional Encoder Representations from Transformers (BERT) based approach, using a hierarchical multi-step BERT model (BERT-MS) to predict the prescription of targeted cancer therapy to patients… ▽ More We aimed to investigate the impact of social circumstances on cancer therapy selection using natural language processing to derive insights from social worker documentation. We developed and employed a Bidirectional Encoder Representations from Transformers (BERT) based approach, using a hierarchical multi-step BERT model (BERT-MS) to predict the prescription of targeted cancer therapy to patients based solely on documentation by clinical social workers. Our corpus included free-text clinical social work notes, combined with medication prescription information, for all patients treated for breast cancer. We conducted a feature importance analysis to pinpoint the specific social circumstances that impact cancer therapy selection. Using only social work notes, we consistently predicted the administration of targeted therapies, suggesting systematic differences in treatment selection exist due to non-clinical factors. The UCSF-BERT model, pretrained on clinical text at UCSF, outperformed other publicly available language models with an AUROC of 0.675 and a Macro F1 score of 0.599. The UCSF BERT-MS model, capable of leveraging multiple pieces of notes, surpassed the UCSF-BERT model in both AUROC and Macro-F1. Our feature importance analysis identified several clinically intuitive social determinants of health (SDOH) that potentially contribute to disparities in treatment. Our findings indicate that significant disparities exist among breast cancer patients receiving different types of therapies based on social determinants of health. Social work reports play a crucial role in understanding these disparities in clinical decision-making. △ Less

Submitted 16 June, 2023; originally announced June 2023.

Comments: 18 pages, 4 figures, 2 Tables

arXiv:2306.09311 [pdf, other]

doi 10.3847/2041-8213/ace0c4

Millimeter Observations of the Type II SN2023ixf: Constraints on the Proximate Circumstellar Medium

Authors: Edo Berger, Garrett K. Keating, Raffaella Margutti, Keiichi Maeda, Kate D. Alexander, Yvette Cendes, Tarraneh Eftekhari, Mark Gurwell, Daichi Hiramatsu, Anna Y. Q. Ho, Tanmoy Laskar, Ramprasad Rao, Peter K. G. Williams

Abstract: We present 1.3 mm (230 GHz) observations of the recent and nearby Type II supernova, SN2023ixf, obtained with the Submillimeter Array (SMA) at 2.6-18.6 days after explosion. The observations were obtained as part the SMA Large Program POETS (Pursuit of Extragalactic Transients with the SMA). We do not detect any emission at the location of SN2023ixf, with the deepest limits of… ▽ More We present 1.3 mm (230 GHz) observations of the recent and nearby Type II supernova, SN2023ixf, obtained with the Submillimeter Array (SMA) at 2.6-18.6 days after explosion. The observations were obtained as part the SMA Large Program POETS (Pursuit of Extragalactic Transients with the SMA). We do not detect any emission at the location of SN2023ixf, with the deepest limits of $L_ν(230\,{\rm GHz})\lesssim 8.6\times 10^{25}$ erg s$^{-1}$ Hz$^{-1}$ at 2.7 and 7.7 days, and $L_ν(230\,{\rm GHz})\lesssim 3.4\times 10^{25}$ erg s$^{-1}$ Hz$^{-1}$ at 18.6 days. These limits are about a factor of 2 times dimmer than the mm emission from SN2011dh (IIb), about an order of magnitude dimmer compared to SN1993J (IIb) and SN2018ivc (IIL), and about 30 times dimmer than the most luminous non-relativistic SNe in the mm-band (Type IIb/Ib/Ic). Using these limits in the context of analytical models that include synchrotron self-absorption and free-free absorption we place constraints on the proximate circumstellar medium around the progenitor star, to a scale of $\sim 2\times 10^{15}$ cm, excluding the range $\dot{M}\sim {\rm few}\times 10^{-6}-10^{-2}$ M$_\odot$ yr$^{-1}$ (for a wind velocity, $v_w=115$ km s$^{-1}$, and ejecta velocity, $v_{\rm eje}\sim (1-2)\times 10^4$ km s$^{-1}$). These results are consistent with an inference of the mass loss rate based on optical spectroscopy ($\sim 2\times 10^{-2}$ M$_\odot$ yr$^{-1}$ for $v_w=115$ km s$^{-1}$), but are in tension with the inference from hard X-rays ($\sim 7\times 10^{-4}$ M$_\odot$ yr$^{-1}$ for $v_w=115$ km s$^{-1}$). This tension may be alleviated by a non-homogeneous and confined CSM, consistent with results from high-resolution optical spectroscopy. △ Less

Submitted 15 June, 2023; originally announced June 2023.

Comments: Submitted

arXiv:2306.03066 [pdf, other]

doi 10.1007/s11263-024-02118-3

Of Mice and Mates: Automated Classification and Modelling of Mouse Behaviour in Groups using a Single Model across Cages

Authors: Michael P. J. Camilleri, Rasneer S. Bains, Christopher K. I. Williams

Abstract: Behavioural experiments often happen in specialised arenas, but this may confound the analysis. To address this issue, we provide tools to study mice in the home-cage environment, equip** biologists with the possibility to capture the temporal aspect of the individual's behaviour and model the interaction and interdependence between cage-mates with minimal human intervention. Our main contributi… ▽ More Behavioural experiments often happen in specialised arenas, but this may confound the analysis. To address this issue, we provide tools to study mice in the home-cage environment, equip** biologists with the possibility to capture the temporal aspect of the individual's behaviour and model the interaction and interdependence between cage-mates with minimal human intervention. Our main contribution is the novel Group Behaviour Model (GBM) which summarises the joint behaviour of groups of mice across cages, using a permutation matrix to match the mouse identities in each cage to the model. In support of the above, we also (a) developed the Activity Labelling Module (ALM) to automatically classify mouse behaviour from video, and (b) released two datasets, ABODe for training behaviour classifiers and IMADGE for modelling behaviour. △ Less

Submitted 24 June, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

Comments: International Journal of Computer Vision (2024)

arXiv:2305.05687 [pdf, other]

doi 10.3847/1538-4357/accc89

Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies

Authors: James Paul Mason, Alexandra Werth, Colin G. West, Allison A. Youngblood, Donald L. Woodraska, Courtney Peck, Kevin Lacjak, Florian G. Frick, Moutamen Gabir, Reema A. Alsinan, Thomas Jacobsen, Mohammad Alrubaie, Kayla M. Chizmar, Benjamin P. Lau, Lizbeth Montoya Dominguez, David Price, Dylan R. Butler, Connor J. Biron, Nikita Feoktistov, Kai Dewey, N. E. Loomis, Michal Bodzianowski, Connor Kuybus, Henry Dietrick, Aubrey M. Wolfe , et al. (977 additional authors not shown)

Abstract: Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms th… ▽ More Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms that could explain it: nanoflares or Alfvén waves. To date, neither can be directly observed. Nanoflares are, by definition, extremely small, but their aggregate energy release could represent a substantial heating mechanism, presuming they are sufficiently abundant. One way to test this presumption is via the flare frequency distribution, which describes how often flares of various energies occur. If the slope of the power law fitting the flare frequency distribution is above a critical threshold, $α=2$ as established in prior literature, then there should be a sufficient abundance of nanoflares to explain coronal heating. We performed $>$600 case studies of solar flares, made possible by an unprecedented number of data analysts via three semesters of an undergraduate physics laboratory course. This allowed us to include two crucial, but nontrivial, analysis methods: pre-flare baseline subtraction and computation of the flare energy, which requires determining flare start and stop times. We aggregated the results of these analyses into a statistical study to determine that $α= 1.63 \pm 0.03$. This is below the critical threshold, suggesting that Alfvén waves are an important driver of coronal heating. △ Less

Submitted 9 May, 2023; originally announced May 2023.

Comments: 1,002 authors, 14 pages, 4 figures, 3 tables, published by The Astrophysical Journal on 2023-05-09, volume 948, page 71

arXiv:2303.11208 [pdf, ps, other]

On Intersecting Polygons

Authors: Kada Williams

Abstract: Consider two regions in the plane, bounded by an $n$-gon and an $m$-gon, respectively. At most how many connected components can there be in their intersection? This question was asked by Croft. We answer this asymptotically, proving the bounds… ▽ More Consider two regions in the plane, bounded by an $n$-gon and an $m$-gon, respectively. At most how many connected components can there be in their intersection? This question was asked by Croft. We answer this asymptotically, proving the bounds $$\left\lfloor \frac{m}{2}\right\rfloor \cdot \left\lfloor \frac{n}{2}\right\rfloor\le f(n,m)\le \left\lfloor \frac{m}{2}\right\rfloor \cdot \frac{n}{2} + \frac{m}{2} $$ where $f(n,m)$ denotes the maximal number of components and $m\le n$. Furthermore, we give an exact answer to the related question of finding the maximal number of components if the $m$-gon is required to be convex: $\left \lfloor \frac{m+n-2}{2}\right\rfloor$ if $n\ge m+2$ and $n-2$ otherwise. △ Less

Submitted 20 March, 2023; originally announced March 2023.

arXiv:2303.07485 [pdf]

Efficiency-boosted semiconductor optical amplifiers via mode-division multiplexing

Authors: Yi Wang, Yihui Wei, Victor Dolores-Calzadilla, Daoxin Dai, Kevin Williams, Meint Smit, Yuqing Jiao

Abstract: Semiconductor optical amplifiers (SOA) are a fundamental building block for many photonic systems. However, their power inefficiency has been setting back operational cost reduction, and the resulting thermal losses constrain miniaturization, and the realization of more complex photonic functions such as large-scale switches and optical phased arrays. In this work, we demonstrate significant gain… ▽ More Semiconductor optical amplifiers (SOA) are a fundamental building block for many photonic systems. However, their power inefficiency has been setting back operational cost reduction, and the resulting thermal losses constrain miniaturization, and the realization of more complex photonic functions such as large-scale switches and optical phased arrays. In this work, we demonstrate significant gain and efficiency enhancement using an extra degree of freedom of light - the mode space. This is done without changing the SOA's material design, and therefore high versatility and compatibility can be obtained. Light is multiplexed in different guided modes and is reinjected into the same gain section twice without introducing resonance, doubling the interaction length in a broadband manner. Up to 87% higher gain and 300% higher wall-plug efficiency are obtained in a double-pass SOA compared to a conventional single-pass SOA, at the same operating current, in the wavelength range of 1560 - 1580 nm. △ Less

Submitted 18 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

Comments: Added references

arXiv:2303.03968 [pdf, ps, other]

doi 10.1093/mnras/stad718

Assessing indirect methods to determine black hole masses using NGC 4151

Authors: James K. Williams, Mario Gliozzi, Kyle A. Bockwoldt, Onic I. Shuvo

Abstract: Accurately determining the black hole mass ($M_\mathrm{BH}$) in active galactic nuclei (AGN) is crucial to constraining their properties and to studying their evolution. While direct methods yield reliable measurements of $M_\mathrm{BH}$ in unobscured type 1 AGN, where the dynamics of stellar or gas components can be directly observed, only indirect methods can be applied to the vast majority of h… ▽ More Accurately determining the black hole mass ($M_\mathrm{BH}$) in active galactic nuclei (AGN) is crucial to constraining their properties and to studying their evolution. While direct methods yield reliable measurements of $M_\mathrm{BH}$ in unobscured type 1 AGN, where the dynamics of stellar or gas components can be directly observed, only indirect methods can be applied to the vast majority of heavily absorbed type 2 AGN, which represent most of the AGN population. Since it is difficult to evaluate the accuracy and precision of these indirect methods, we utilize the nearby X-ray bright Seyfert galaxy NGC 4151, whose $M_\mathrm{BH}$ has been tightly constrained with several independent direct methods, as a laboratory to assess the reliability of three indirect methods that have been applied to obscured AGN. All three, the X-ray scaling method, the fundamental plane of black hole activity, and the M-$σ$ correlation, yield $M_\mathrm{BH}$ values consistent with those inferred from direct methods and can therefore be considered accurate. However, only the X-ray scaling method and the M-$σ$ correlation are precise because the substantial scatter in the fundamental plane of BH activity allows only for crude estimates. Of the four M-$σ$ correlations we used, only the one from Kormendy and Ho yields a value consistent with the dynamical estimates. This study suggests that the best approach to estimating the black hole mass in systems where direct dynamical methods cannot be applied is to utilize a combination of indirect methods, taking into account their different ranges of applicability. △ Less

Submitted 7 March, 2023; originally announced March 2023.

Comments: Accepted for publication in the Monthly Notices of the Royal Astronomical Society

arXiv:2302.11583 [pdf, other]

The Digitization of Historical Astrophysical Literature with Highly-Localized Figures and Figure Captions

Authors: Jill P. Naiman, Peter K. G. Williams, Alyssa Goodman

Abstract: Scientific articles published prior to the "age of digitization" in the late 1990s contain figures which are "trapped" within their scanned pages. While progress to extract figures and their captions has been made, there is currently no robust method for this process. We present a YOLO-based method for use on scanned pages, after they have been processed with Optical Character Recognition (OCR), w… ▽ More Scientific articles published prior to the "age of digitization" in the late 1990s contain figures which are "trapped" within their scanned pages. While progress to extract figures and their captions has been made, there is currently no robust method for this process. We present a YOLO-based method for use on scanned pages, after they have been processed with Optical Character Recognition (OCR), which uses both grayscale and OCR-features. We focus our efforts on translating the intersection-over-union (IOU) metric from the field of object detection to document layout analysis and quantify "high localization" levels as an IOU of 0.9. When applied to the astrophysics literature holdings of the NASA Astrophysics Data System (ADS), we find F1 scores of 90.9% (92.2%) for figures (figure captions) with the IOU cut-off of 0.9 which is a significant improvement over other state-of-the-art methods. △ Less

Submitted 22 February, 2023; originally announced February 2023.

Comments: 29 pages, 10 figures, accepted for publication in the International Journal on Digital Libraries, special issue follow up to TPDL 2022 conference. arXiv admin note: substantial text overlap with arXiv:2209.04460

arXiv:2302.07969 [pdf, other]

doi 10.1093/mnras/stad371

Search for the Epoch of Reionisation with HERA: Upper Limits on the Closure Phase Delay Power Spectrum

Authors: Pascal M. Keller, Bojan Nikolic, Nithyanandan Thyagarajan, Chris L. Carilli, Gianni Bernardi, Ntsikelelo Charles, Landman Bester, Oleg M. Smirnov, Nicholas S. Kern, Joshua S. Dillon, Bryna J. Hazelton, Miguel F. Morales, Daniel C. Jacobs, Aaron R. Parsons, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley , et al. (58 additional authors not shown)

Abstract: Radio interferometers aiming to measure the power spectrum of the redshifted 21 cm line during the Epoch of Reionisation (EoR) need to achieve an unprecedented dynamic range to separate the weak signal from overwhelming foreground emissions. Calibration inaccuracies can compromise the sensitivity of these measurements to the effect that a detection of the EoR is precluded. An alternative to standa… ▽ More Radio interferometers aiming to measure the power spectrum of the redshifted 21 cm line during the Epoch of Reionisation (EoR) need to achieve an unprecedented dynamic range to separate the weak signal from overwhelming foreground emissions. Calibration inaccuracies can compromise the sensitivity of these measurements to the effect that a detection of the EoR is precluded. An alternative to standard analysis techniques makes use of the closure phase, which allows one to bypass antenna-based direction-independent calibration. Similarly to standard approaches, we use a delay spectrum technique to search for the EoR signal. Using 94 nights of data observed with Phase I of the Hydrogen Epoch of Reionization Array (HERA), we place approximate constraints on the 21 cm power spectrum at $z=7.7$. We find at 95% confidence that the 21 cm EoR brightness temperature is $\le$(372)$^2$ "pseudo" mK$^2$ at 1.14 "pseudo" $h$ Mpc$^{-1}$, where the "pseudo" emphasises that these limits are to be interpreted as approximations to the actual distance scales and brightness temperatures. Using a fiducial EoR model, we demonstrate the feasibility of detecting the EoR with the full array. Compared to standard methods, the closure phase processing is relatively simple, thereby providing an important independent check on results derived using visibility intensities, or related. △ Less

Submitted 15 February, 2023; originally announced February 2023.

Comments: 16 pages, 14 figures, accepted for publication by MNRAS

arXiv:2302.07309 [pdf, other]

doi 10.1145/3544548.3580694

Augmenting Pathologists with NaviPath: Design and Evaluation of a Human-AI Collaborative Navigation System

Authors: Hongyan Gu, Chunxu Yang, Mohammad Haeri, **g Wang, Shirley Tang, Wenzhong Yan, Shu** He, Christopher Kazu Williams, Shino Magaki, Xiang 'Anthony' Chen

Abstract: Artificial Intelligence (AI) brings advancements to support pathologists in navigating high-resolution tumor images to search for pathology patterns of interest. However, existing AI-assisted tools have not realized this promised potential due to a lack of insight into pathology and HCI considerations for pathologists' navigation workflows in practice. We first conducted a formative study with six… ▽ More Artificial Intelligence (AI) brings advancements to support pathologists in navigating high-resolution tumor images to search for pathology patterns of interest. However, existing AI-assisted tools have not realized this promised potential due to a lack of insight into pathology and HCI considerations for pathologists' navigation workflows in practice. We first conducted a formative study with six medical professionals in pathology to capture their navigation strategies. By incorporating our observations along with the pathologists' domain knowledge, we designed NaviPath -- a human-AI collaborative navigation system. An evaluation study with 15 medical professionals in pathology indicated that: (i) compared to the manual navigation, participants saw more than twice the number of pathological patterns in unit time with NaviPath, and (ii) participants achieved higher precision and recall against the AI and the manual navigation on average. Further qualitative analysis revealed that navigation was more consistent with NaviPath, which can improve the overall examination quality. △ Less

Submitted 14 February, 2023; originally announced February 2023.

Comments: Accepted ACM CHI Conference on Human Factors in Computing Systems (CHI '23)

arXiv:2302.04388 [pdf, other]

doi 10.3847/2041-8213/acbfad

The Radio to GeV Afterglow of GRB 221009A

Authors: Tanmoy Laskar, Kate D. Alexander, Raffaella Margutti, Tarraneh Eftekhari, Ryan Chornock, Edo Berger, Yvette Cendes, Anne Duerr, Daniel A. Perley, Maria Edvige Ravasio, Ryo Yamazaki, Eliot H. Ayache, Thomas Barclay, Rodolfo Barniol Duran, Shivani Bhandari, Daniel Brethauer, Collin T. Christy, Deanne L. Coppejans, Paul Duffell, Wen-fai Fong, Andreja Gomboc, Cristiano Guidorzi, Jamie A. Kennea, Shiho Kobayashi, Andrew Levan , et al. (5 additional authors not shown)

Abstract: GRB 221009A ($z=0.151$) is one of the closest known long $γ$-ray bursts (GRBs). Its extreme brightness across all electromagnetic wavelengths provides an unprecedented opportunity to study a member of this still-mysterious class of transients in exquisite detail. We present multi-wavelength observations of this extraordinary event, spanning 15 orders of magnitude in photon energy from radio to… ▽ More GRB 221009A ($z=0.151$) is one of the closest known long $γ$-ray bursts (GRBs). Its extreme brightness across all electromagnetic wavelengths provides an unprecedented opportunity to study a member of this still-mysterious class of transients in exquisite detail. We present multi-wavelength observations of this extraordinary event, spanning 15 orders of magnitude in photon energy from radio to $γ$-rays. We find that the data can be partially explained by a forward shock (FS) from a highly-collimated relativistic jet interacting with a low-density wind-like medium. Under this model, the jet's beaming-corrected kinetic energy ($E_K \sim 4\times10^{50}$ erg) is typical for the GRB population. The radio and mm data provide strong limiting constraints on the FS model, but require the presence of an additional emission component. From equipartition arguments, we find that the radio emission is likely produced by a small amount of mass ($\lesssim6\times10^{-7} M_\odot$) moving relativistically ($Γ\gtrsim9$) with a large kinetic energy ($\gtrsim10^{49}$ erg). However, the temporal evolution of this component does not follow prescriptions for synchrotron radiation from a single power-law distribution of electrons (e.g. in a reverse shock or two-component jet), or a thermal electron population, perhaps suggesting that one of the standard assumptions of afterglow theory is violated. GRB 221009A will likely remain detectable with radio telescopes for years to come, providing a valuable opportunity to track the full lifecycle of a powerful relativistic jet. △ Less

Submitted 22 February, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

Comments: Accepted for publication in the Astrophysical Journal Letters

arXiv:2302.03531 [pdf, other]

Structured Generative Models for Scene Understanding

Authors: Christopher K. I. Williams

Abstract: This position paper argues for the use of \emph{structured generative models} (SGMs) for scene understanding. This requires the reconstruction of a 3D scene from an input image, whereby the contents of the image are causally explained in terms of models of instantiated objects, each with their own type, shape, appearance and pose, along with global variables like scene lighting and camera paramete… ▽ More This position paper argues for the use of \emph{structured generative models} (SGMs) for scene understanding. This requires the reconstruction of a 3D scene from an input image, whereby the contents of the image are causally explained in terms of models of instantiated objects, each with their own type, shape, appearance and pose, along with global variables like scene lighting and camera parameters. This approach also requires scene models which account for the co-occurrences and inter-relationships of objects in a scene. The SGM approach has the merits that it is compositional and generative, which lead to interpretability. To pursue the SGM agenda, we need models for objects and scenes, and approaches to carry out inference. We first review models for objects, which include ``things'' (object categories that have a well defined shape), and ``stuff'' (categories which have amorphous spatial extent). We then move on to review \emph{scene models} which describe the inter-relationships of objects. Perhaps the most challenging problem for SGMs is \emph{inference} of the objects, lighting and camera parameters, and scene inter-relationships from input consisting of a single or multiple images. We conclude with a discussion of issues that need addressing to advance the SGM agenda. △ Less

Submitted 7 February, 2023; originally announced February 2023.

Comments: 33 pages, 10 figures

arXiv:2301.03373 [pdf]

Chatbots As Fluent Polyglots: Revisiting Breakthrough Code Snippets

Authors: David Noever, Kevin Williams

Abstract: The research applies AI-driven code assistants to analyze a selection of influential computer code that has shaped modern technology, including email, internet browsing, robotics, and malicious software. The original contribution of this study was to examine half of the most significant code advances in the last 50 years and, in some cases, to provide notable improvements in clarity or performance… ▽ More The research applies AI-driven code assistants to analyze a selection of influential computer code that has shaped modern technology, including email, internet browsing, robotics, and malicious software. The original contribution of this study was to examine half of the most significant code advances in the last 50 years and, in some cases, to provide notable improvements in clarity or performance. The AI-driven code assistant could provide insights into obfuscated code or software lacking explanatory commentary in all cases examined. We generated additional sample problems based on bug corrections and code optimizations requiring much deeper reasoning than a traditional Google search might provide. Future work focuses on adding automated documentation and code commentary and translating select large code bases into more modern versions with multiple new application programming interfaces (APIs) and chained multi-tasks. The AI-driven code assistant offers a valuable tool for software engineering, particularly in its ability to provide human-level expertise and assist in refactoring legacy code or simplifying the explanation or functionality of high-value repositories. △ Less

Submitted 5 January, 2023; originally announced January 2023.

arXiv:2212.05913 [pdf, other]

The architectural application of shells whose boundaries subtend a constant solid angle

Authors: Emil Adiels, Mats Ander, Chris J. K. Williams

Abstract: Surface geometry plays a central role in the design of bridges, vaults and shells, using various techniques for generating a geometry which aims to balance structural, spatial, aesthetic and construction requirements. In this paper we propose the use of surfaces defined such that given closed curves subtend a constant solid angle at all points on the surface and form its boundary. Constant solid… ▽ More Surface geometry plays a central role in the design of bridges, vaults and shells, using various techniques for generating a geometry which aims to balance structural, spatial, aesthetic and construction requirements. In this paper we propose the use of surfaces defined such that given closed curves subtend a constant solid angle at all points on the surface and form its boundary. Constant solid angle surfaces enable one to control the boundary slope and hence achieve an approximately constant span-to-height ratio as the span varies, making them structurally viable for shell structures. In addition, when the entire surface boundary is in the same plane, the slope of the surface around the boundary is constant and thus follows a principal curvature direction. Such surfaces are suitable for surface grids where planar quadrilaterals meet the surface boundaries. They can also be used as the Airy stress function in the form finding of shells having forces concentrated at the corners. Our technique employs the Gauss-Bonnet theorem to calculate the solid angle of a point in space and Newton's method to move the point onto the constant solid angle surface. We use the Biot-Savart law to find the gradient of the solid angle. The technique can be applied in parallel to each surface point without an initial mesh, opening up for future studies and other applications when boundary curves are known but the initial topology is unknown. We show the geometrical properties, possibilities and limitations of surfaces of constant solid angle using examples in three dimensions. △ Less

Submitted 6 December, 2022; originally announced December 2022.

arXiv:2212.04445 [pdf, ps, other]

Non-tightness in class theory and second-order arithmetic

Authors: Alfredo Roque Freire, Kameryn J. Williams

Abstract: A theory T is tight if different deductively closed extensions of T (in the same language) cannot be bi-interpretable. Many well-studied foundational theories are tight, including PA [Visser2006], ZF, Z2, and KM [enayat2017]. In this article we extend Enayat's investigations to subsystems of these latter two theories. We prove that restricting the Comprehension schema of Z2 and KM gives non-tight… ▽ More A theory T is tight if different deductively closed extensions of T (in the same language) cannot be bi-interpretable. Many well-studied foundational theories are tight, including PA [Visser2006], ZF, Z2, and KM [enayat2017]. In this article we extend Enayat's investigations to subsystems of these latter two theories. We prove that restricting the Comprehension schema of Z2 and KM gives non-tight theories. Specifically, we show that GB and ACA0 each admit different bi-interpretable extensions, and the same holds for their extensions by adding Sigma^1_k-Comprehension, for k <= 1. These results provide evidence that tightness characterizes Z2 and KM in a minimal way. △ Less

Submitted 14 May, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

MSC Class: 03E70; 03C62; 03H15

arXiv:2212.03907 [pdf, other]

A Novel JupyterLab User Experience for Interactive Data Visualization

Authors: Peter K. G. Williams, Jonathan Carifio, Henrik Norman, A. David Weigel

Abstract: In the Jupyter ecosystem, data visualization is usually done with "widgets" created as notebook cell outputs. While this mechanism works well in some circumstances, it is not well-suited to presenting interfaces that are long-lived, interactive, and visually rich. Unlike the traditional Jupyter notebook system, the newer JupyterLab application provides a sophisticated extension infrastructure that… ▽ More In the Jupyter ecosystem, data visualization is usually done with "widgets" created as notebook cell outputs. While this mechanism works well in some circumstances, it is not well-suited to presenting interfaces that are long-lived, interactive, and visually rich. Unlike the traditional Jupyter notebook system, the newer JupyterLab application provides a sophisticated extension infrastructure that raises new design possibilities. Here we present a novel user experience (UX) for interactive data visualization in JupyterLab that is based on an "app" that runs alongside the user's notebooks, rather than widgets that are bound inside them. We have implemented this UX for the AAS WorldWide Telescope (WWT) visualization tool. JupyterLab's messaging APIs allow the app to smoothly exchange data with multiple computational kernels, allowing users to accomplish tasks that are not possible using the widget framework. A new Jupyter server extension allows the frontend to request data from kernels asynchronously over HTTP, enabling interactive exploration of gigapixel-scale imagery in WWT. While we have developed this UX for WWT, the overall design and the server extension are portable to other applications and have the potential to unlock a variety of new user activities that aren't currently possible in "science platform" interfaces. △ Less

Submitted 7 December, 2022; originally announced December 2022.

Comments: Submitted to proceedings of ADASS32; 8 pages, 3 figures. Try the WWT app at https://bit.ly/pywwt-notebooks

arXiv:2212.01482 [pdf, other]

doi 10.1063/5.0139024

PyQMC: an all-Python real-space quantum Monte Carlo module in PySCF

Authors: William A. Wheeler, Shivesh Pathak, Kevin Kleiner, Shunyue Yuan, João N. B. Rodrigues, Cooper Lorsung, Kittithat Krongchon, Yueqing Chang, Yiqing Zhou, Brian Busemeyer, Kiel T. Williams, Alexander Muñoz, Chun Yu Chow, Lucas K. Wagner

Abstract: We describe a new open-source Python-based package for high accuracy correlated electron calculations using quantum Monte Carlo (QMC) in real space: PyQMC. PyQMC implements modern versions of QMC algorithms in an accessible format, enabling algorithmic development and easy implementation of complex workflows. Tight integration with the PySCF environment allows for simple comparison between QMC cal… ▽ More We describe a new open-source Python-based package for high accuracy correlated electron calculations using quantum Monte Carlo (QMC) in real space: PyQMC. PyQMC implements modern versions of QMC algorithms in an accessible format, enabling algorithmic development and easy implementation of complex workflows. Tight integration with the PySCF environment allows for simple comparison between QMC calculations and other many-body wave function techniques, as well as access to high accuracy trial wave functions. △ Less

Submitted 2 December, 2022; originally announced December 2022.

arXiv:2211.05938 [pdf, other]

doi 10.1093/mnras/stac3182

The Merger Fraction of Ultramassive White Dwarfs

Authors: Mukremin Kilic, Adam G. Moss, Alekzander Kosakowski, P. Bergeron, Annamarie A. Conly, Warren R. Brown, Silvia Toonen, Kurtis A. Williams, P. Dufour

Abstract: We search for merger products among the 25 most massive white dwarfs in the Montreal White Dwarf Database 100 pc sample through follow-up spectroscopy and high-cadence photometry. We find an unusually high fraction, 40%, of magnetic white dwarfs among this population. In addition, we identify four outliers in transverse velocity and detect rapid rotation in five objects. Our results show that… ▽ More We search for merger products among the 25 most massive white dwarfs in the Montreal White Dwarf Database 100 pc sample through follow-up spectroscopy and high-cadence photometry. We find an unusually high fraction, 40%, of magnetic white dwarfs among this population. In addition, we identify four outliers in transverse velocity and detect rapid rotation in five objects. Our results show that $56^{+9}_{-10}$\% of the $M\approx1.3~M_{\odot}$ ultramassive white dwarfs form through mergers. This fraction is significantly higher than expected from the default binary population synthesis calculations using the $α$-prescription (with $αλ= 2$), and provides further support for efficient orbital shrinkage, such as with low values of the common envelope efficiency. △ Less

Submitted 10 November, 2022; originally announced November 2022.

Comments: MNRAS, in press

arXiv:2211.01232 [pdf, other]

doi 10.3847/1538-4357/acf765

The Luminosity Phase Space of Galactic and Extragalactic X-ray Transients Out to Intermediate Redshifts

Authors: Ava Polzin, Raffaella Margutti, Deanne Coppejans, Katie Auchettl, Kim L. Page, Georgios Vasilopoulos, Joe S. Bright, Paolo Esposito, Peter K. G. Williams, Koji Mukai, Edo Berger

Abstract: We present a detailed compilation and analysis of the X-ray phase space of low- to intermediate-redshift ($ 0\le z \le 1$) transients that consolidates observed light curves (and theory where necessary) for a large variety of classes of transient/variable phenomena in the 0.3--10 keV energy band. We include gamma-ray burst afterglows, supernovae, supernova shock breakouts and shocks interacting wi… ▽ More We present a detailed compilation and analysis of the X-ray phase space of low- to intermediate-redshift ($ 0\le z \le 1$) transients that consolidates observed light curves (and theory where necessary) for a large variety of classes of transient/variable phenomena in the 0.3--10 keV energy band. We include gamma-ray burst afterglows, supernovae, supernova shock breakouts and shocks interacting with the environment, tidal disruption events and active galactic nuclei, fast blue optical transients, cataclysmic variables, magnetar flares/outbursts and fast radio bursts, cool stellar flares, X-ray binary outbursts, and ultraluminous X-ray sources. Our overarching goal is to offer a comprehensive resource for the examination of these ephemeral events, extending the X-ray duration-luminosity phase space (DLPS) to show luminosity evolution. We use existing observations (both targeted and serendipitous) to characterize the behavior of various transient/variable populations. Contextualizing transient signals in the larger DLPS serves two primary purposes: to identify areas of interest (i.e., regions in the parameter space where one would expect detections, but in which observations have historically been lacking) and to provide initial qualitative guidance in classifying newly discovered transient signals. We find that while the most luminous (largely extragalactic) and least luminous (largely Galactic) part of the phase space is well-populated at $t > 0.1$ days, intermediate luminosity phenomena (L$_x = 10^{34} - 10^{42}$ erg s$^{-1}$) represent a gap in the phase space. We thus identify L$_x = 10^{34} - 10^{42}$ erg s$^{-1}$ and $t = 10^{-4} - 0.1$ days as a key discovery phase space in transient X-ray astronomy. △ Less

Submitted 5 September, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

Comments: 12 figures, 13 tables; version accepted to ApJ

arXiv:2211.00192 [pdf, other]

AI Assistants: A Framework for Semi-Automated Data Wrangling

Authors: Tomas Petricek, Gerrit J. J. van den Burg, Alfredo Nazábal, Taha Ceritli, Ernesto Jiménez-Ruiz, Christopher K. I. Williams

Abstract: Data wrangling tasks such as obtaining and linking data from various sources, transforming data formats, and correcting erroneous records, can constitute up to 80% of typical data engineering work. Despite the rise of machine learning and artificial intelligence, data wrangling remains a tedious and manual task. We introduce AI assistants, a class of semi-automatic interactive tools to streamline… ▽ More Data wrangling tasks such as obtaining and linking data from various sources, transforming data formats, and correcting erroneous records, can constitute up to 80% of typical data engineering work. Despite the rise of machine learning and artificial intelligence, data wrangling remains a tedious and manual task. We introduce AI assistants, a class of semi-automatic interactive tools to streamline data wrangling. An AI assistant guides the analyst through a specific data wrangling task by recommending a suitable data transformation that respects the constraints obtained through interaction with the analyst. We formally define the structure of AI assistants and describe how existing tools that treat data cleaning as an optimization problem fit the definition. We implement AI assistants for four common data wrangling tasks and make AI assistants easily accessible to data analysts in an open-source notebook environment for data science, by leveraging the common structure they follow. We evaluate our AI assistants both quantitatively and qualitatively through three example scenarios. We show that the unified and interactive design makes it easy to perform tasks that would be difficult to do manually or with a fully automatic tool. △ Less

Submitted 31 October, 2022; originally announced November 2022.

Comments: Accepted for publication in IEEE Transactions on Knowledge and Data Engineering

arXiv:2210.14927 [pdf, other]

doi 10.1093/mnras/stad441

Characterization Of Inpaint Residuals In Interferometric Measurements of the Epoch Of Reionization

Authors: Michael Pagano, **g Liu, Adrian Liu, Nicholas S. Kern, Aaron Ewall-Wice, Philip Bull, Robert Pascua, Siamak Ravanbakhsh, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Jacob Burba, Steven Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer , et al. (53 additional authors not shown)

Abstract: Radio Frequency Interference (RFI) is one of the systematic challenges preventing 21cm interferometric instruments from detecting the Epoch of Reionization. To mitigate the effects of RFI on data analysis pipelines, numerous inpaint techniques have been developed to restore RFI corrupted data. We examine the qualitative and quantitative errors introduced into the visibilities and power spectrum du… ▽ More Radio Frequency Interference (RFI) is one of the systematic challenges preventing 21cm interferometric instruments from detecting the Epoch of Reionization. To mitigate the effects of RFI on data analysis pipelines, numerous inpaint techniques have been developed to restore RFI corrupted data. We examine the qualitative and quantitative errors introduced into the visibilities and power spectrum due to inpainting. We perform our analysis on simulated data as well as real data from the Hydrogen Epoch of Reionization Array (HERA) Phase 1 upper limits. We also introduce a convolutional neural network that capable of inpainting RFI corrupted data in interferometric instruments. We train our network on simulated data and show that our network is capable at inpainting real data without requiring to be retrained. We find that techniques that incorporate high wavenumbers in delay space in their modeling are best suited for inpainting over narrowband RFI. We also show that with our fiducial parameters Discrete Prolate Spheroidal Sequences (DPSS) and CLEAN provide the best performance for intermittent ``narrowband'' RFI while Gaussian Progress Regression (GPR) and Least Squares Spectral Analysis (LSSA) provide the best performance for larger RFI gaps. However we caution that these qualitative conclusions are sensitive to the chosen hyperparameters of each inpainting technique. We find these results to be consistent in both simulated and real visibilities. We show that all inpainting techniques reliably reproduce foreground dominated modes in the power spectrum. Since the inpainting techniques should not be capable of reproducing noise realizations, we find that the largest errors occur in the noise dominated delay modes. We show that in the future, as the noise level of the data comes down, CLEAN and DPSS are most capable of reproducing the fine frequency structure in the visibilities of HERA data. △ Less

Submitted 20 February, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

Comments: 21 pages, 13 figures

arXiv:2210.04912 [pdf, other]

doi 10.3847/1538-4357/acaf50

Improved Constraints on the 21 cm EoR Power Spectrum and the X-Ray Heating of the IGM with HERA Phase I Observations

Authors: The HERA Collaboration, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Rennan Barkana, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Daniela Breitman, Philip Bull, Jacob Burba, Steve Carey, Chris L. Carilli, Carina Cheng, Samir Choudhuri, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter, Joshua S. Dillon , et al. (70 additional authors not shown)

Abstract: We report the most sensitive upper limits to date on the 21 cm epoch of reionization power spectrum using 94 nights of observing with Phase I of the Hydrogen Epoch of Reionization Array (HERA). Using similar analysis techniques as in previously reported limits (HERA Collaboration 2022a), we find at 95% confidence that $Δ^2(k = 0.34$ $h$ Mpc$^{-1}$) $\leq 457$ mK$^2$ at $z = 7.9$ and that… ▽ More We report the most sensitive upper limits to date on the 21 cm epoch of reionization power spectrum using 94 nights of observing with Phase I of the Hydrogen Epoch of Reionization Array (HERA). Using similar analysis techniques as in previously reported limits (HERA Collaboration 2022a), we find at 95% confidence that $Δ^2(k = 0.34$ $h$ Mpc$^{-1}$) $\leq 457$ mK$^2$ at $z = 7.9$ and that $Δ^2 (k = 0.36$ $h$ Mpc$^{-1}) \leq 3,496$ mK$^2$ at $z = 10.4$, an improvement by a factor of 2.1 and 2.6 respectively. These limits are mostly consistent with thermal noise over a wide range of $k$ after our data quality cuts, despite performing a relatively conservative analysis designed to minimize signal loss. Our results are validated with both statistical tests on the data and end-to-end pipeline simulations. We also report updated constraints on the astrophysics of reionization and the cosmic dawn. Using multiple independent modeling and inference techniques previously employed by HERA Collaboration (2022b), we find that the intergalactic medium must have been heated above the adiabatic cooling limit at least as early as $z = 10.4$, ruling out a broad set of so-called "cold reionization" scenarios. If this heating is due to high-mass X-ray binaries during the cosmic dawn, as is generally believed, our result's 99% credible interval excludes the local relationship between soft X-ray luminosity and star formation and thus requires heating driven by evolved low-metallicity stars. △ Less

Submitted 19 January, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

Comments: 57 pages, 37 figures. Updated to match the accepted ApJ version. Corresponding author: Joshua S. Dillon

Journal ref: 2023 ApJ 945 124

arXiv:2210.04221 [pdf, other]

The Elliptical Quartic Exponential Distribution: An Annular Distribution Obtained via Maximum Entropy

Authors: Christopher K I Williams

Abstract: This paper describes the Elliptical Quartic Exponential distribution in $\mathbb{R}^D$, obtained via a maximum entropy construction by imposing second and fourth moment constraints. I discuss relationships to related work, analytical expressions for the normalization constant and the entropy, and the conditional and marginal distributions. This paper describes the Elliptical Quartic Exponential distribution in $\mathbb{R}^D$, obtained via a maximum entropy construction by imposing second and fourth moment constraints. I discuss relationships to related work, analytical expressions for the normalization constant and the entropy, and the conditional and marginal distributions. △ Less

Submitted 9 October, 2022; originally announced October 2022.

Comments: 6 pages, 1 figure

arXiv:2210.04023 [pdf, other]

Multi-Task Dynamical Systems

Authors: Alex Bird, Christopher K. I. Williams, Christopher Hawthorne

Abstract: Time series datasets are often composed of a variety of sequences from the same domain, but from different entities, such as individuals, products, or organizations. We are interested in how time series models can be specialized to individual sequences (capturing the specific characteristics) while still retaining statistical power by sharing commonalities across the sequences. This paper describe… ▽ More Time series datasets are often composed of a variety of sequences from the same domain, but from different entities, such as individuals, products, or organizations. We are interested in how time series models can be specialized to individual sequences (capturing the specific characteristics) while still retaining statistical power by sharing commonalities across the sequences. This paper describes the multi-task dynamical system (MTDS); a general methodology for extending multi-task learning (MTL) to time series models. Our approach endows dynamical systems with a set of hierarchical latent variables which can modulate all model parameters. To our knowledge, this is a novel development of MTL, and applies to time series both with and without control inputs. We apply the MTDS to motion-capture data of people walking in various styles using a multi-task recurrent neural network (RNN), and to patient drug-response data using a multi-task pharmacodynamic model. △ Less

Submitted 8 October, 2022; originally announced October 2022.

Comments: 52 pages, 17 figures

Journal ref: Journal of Machine Learning Research 23 (2022)

arXiv:2210.03721 [pdf, other]

doi 10.1093/mnras/stad090

Impact of instrument and data characteristics in the interferometric reconstruction of the 21 cm power spectrum

Authors: Adélie Gorce, Samskruthi Ganjam, Adrian Liu, Steven G. Murray, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Philip Bull, Jacob Burba, Steven Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter, Joshua S. Dillon , et al. (53 additional authors not shown)

Abstract: Combining the visibilities measured by an interferometer to form a cosmological power spectrum is a complicated process. In a delay-based analysis, the map** between instrumental and cosmological space is not a one-to-one relation. Instead, neighbouring modes contribute to the power measured at one point, with their respective contributions encoded in the window functions. To better understand t… ▽ More Combining the visibilities measured by an interferometer to form a cosmological power spectrum is a complicated process. In a delay-based analysis, the map** between instrumental and cosmological space is not a one-to-one relation. Instead, neighbouring modes contribute to the power measured at one point, with their respective contributions encoded in the window functions. To better understand the power measured by an interferometer, we assess the impact of instrument characteristics and analysis choices on these window functions. Focusing on the Hydrogen Epoch of Reionization Array (HERA) as a case study, we find that long-baseline observations correspond to enhanced low-k tails of the window functions, which facilitate foreground leakage, whilst an informed choice of bandwidth and frequency taper can reduce said tails. With simple test cases and realistic simulations, we show that, apart from tracing mode mixing, the window functions help accurately reconstruct the power spectrum estimator of simulated visibilities. The window functions depend strongly on the beam chromaticity, and less on its spatial structure - a Gaussian approximation, ignoring side lobes, is sufficient. Finally, we investigate the potential of asymmetric window functions, down-weighting the contribution of low-k power to avoid foreground leakage. The window functions presented here correspond to the latest HERA upper limits for the full Phase I data. They allow an accurate reconstruction of the power spectrum measured by the instrument and will be used in future analyses to confront theoretical models and data directly in cylindrical space. △ Less

Submitted 11 January, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

Comments: 18 pages, 19 figures, accepted for publication in MNRAS

arXiv:2209.04460 [pdf, other]

Figure and Figure Caption Extraction for Mixed Raster and Vector PDFs: Digitization of Astronomical Literature with OCR Features

Authors: J. P. Naiman, Peter K. G. Williams, Alyssa Goodman

Abstract: Scientific articles published prior to the "age of digitization" in the late 1990s contain figures which are "trapped" within their scanned pages. While progress to extract figures and their captions has been made, there is currently no robust method for this process. We present a YOLO-based method for use on scanned pages, post-Optical Character Recognition (OCR), which uses both grayscale and OC… ▽ More Scientific articles published prior to the "age of digitization" in the late 1990s contain figures which are "trapped" within their scanned pages. While progress to extract figures and their captions has been made, there is currently no robust method for this process. We present a YOLO-based method for use on scanned pages, post-Optical Character Recognition (OCR), which uses both grayscale and OCR-features. When applied to the astrophysics literature holdings of the Astrophysics Data System (ADS), we find F1 scores of 90.9% (92.2%) for figures (figure captions) with the intersection-over-union (IOU) cut-off of 0.9 which is a significant improvement over other state-of-the-art methods. △ Less

Submitted 9 September, 2022; originally announced September 2022.

Comments: 16 pages, 3 figures, accepted to TPDL 2022

arXiv:2209.03115 [pdf, other]

doi 10.1162/neco_a_01564

Inference and Learning for Generative Capsule Models

Authors: Alfredo Nazabal, Nikolaos Tsagkas, Christopher K. I. Williams

Abstract: Capsule networks (see e.g. Hinton et al., 2018) aim to encode knowledge of and reason about the relationship between an object and its parts. In this paper we specify a generative model for such data, and derive a variational algorithm for inferring the transformation of each model object in a scene, and the assignments of observed parts to the objects. We derive a learning algorithm for the objec… ▽ More Capsule networks (see e.g. Hinton et al., 2018) aim to encode knowledge of and reason about the relationship between an object and its parts. In this paper we specify a generative model for such data, and derive a variational algorithm for inferring the transformation of each model object in a scene, and the assignments of observed parts to the objects. We derive a learning algorithm for the object models, based on variational expectation maximization (Jordan et al., 1999). We also study an alternative inference algorithm based on the RANSAC method of Fischler and Bolles (1981). We apply these inference methods to (i) data generated from multiple geometric objects like squares and triangles ("constellations"), and (ii) data from a parts-based model of faces. Recent work by Kosiorek et al. (2019) has used amortized inference via stacked capsule autoencoders (SCAEs) to tackle this problem -- our results show that we significantly outperform them where we can make comparisons (on the constellations data). △ Less

Submitted 21 October, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

Comments: 31 pages, 6 figures. This paper extends our previous work (arxiv:2103.06676) by covering the learning of the models as well as inference. Paper accepted for publication in Neural Computation

Journal ref: Neural Computation 35(4) (2023) 727-761

arXiv:2208.12437 [pdf, other]

Detecting Mitoses with a Convolutional Neural Network for MIDOG 2022 Challenge

Authors: Hongyan Gu, Mohammad Haeri, Shuo Ni, Christopher Kazu Williams, Neda Zarrin-Khameh, Shino Magaki, Xiang 'Anthony' Chen

Abstract: This work presents a mitosis detection method with only one vanilla Convolutional Neural Network (CNN). Our method consists of two steps: given an image, we first apply a CNN using a sliding window technique to extract patches that have mitoses; we then calculate each extracted patch's class activation map to obtain the mitosis's precise location. To increase the model performance on high-domain-v… ▽ More This work presents a mitosis detection method with only one vanilla Convolutional Neural Network (CNN). Our method consists of two steps: given an image, we first apply a CNN using a sliding window technique to extract patches that have mitoses; we then calculate each extracted patch's class activation map to obtain the mitosis's precise location. To increase the model performance on high-domain-variance pathology images, we train the CNN with a data augmentation pipeline, a noise-tolerant loss that copes with unlabeled images, and a multi-rounded active learning strategy. In the MIDOG 2022 challenge, our approach, with an EfficientNet-b3 CNN model, achieved an overall F1 score of 0.7323 in the preliminary test phase, and 0.6847 in the final test phase (task 1). Our approach sheds light on the broader applicability of class activation maps for object detections in pathology images. △ Less

Submitted 30 October, 2022; v1 submitted 26 August, 2022; originally announced August 2022.

Comments: 3 pages, 2 figures

arXiv:2207.13763 [pdf, other]

doi 10.3847/1538-3881/ac8543

The Rapid Rotation of the Strongly Magnetic Ultramassive White Dwarf EGGR 156

Authors: K. A. Williams, J. J. Hermes, Z. P. Vanderbosch

Abstract: The distribution of white dwarf rotation periods provides a means for constraining angular momentum evolution during the late stages of stellar evolution, as well as insight into the physics and remnants of double degenerate mergers. Although the rotational distribution of low mass white dwarfs is relatively well constrained via asteroseismology, that of high mass white dwarfs, which can arise fro… ▽ More The distribution of white dwarf rotation periods provides a means for constraining angular momentum evolution during the late stages of stellar evolution, as well as insight into the physics and remnants of double degenerate mergers. Although the rotational distribution of low mass white dwarfs is relatively well constrained via asteroseismology, that of high mass white dwarfs, which can arise from either intermediate mass stellar evolution or white dwarf mergers, is not. Photometric variability in white dwarfs due to rotation of a spotted star is rapidly increasing the sample size of high mass white dwarfs with measured rotation periods. We present the discovery of 22.4 minute photometric variability in the lightcurve of EGGR 156, a strongly magnetic, ultramassive white dwarf. We interpret this variability as rapid rotation, and our data suggest that EGGR 156 is the remnant of a double degenerate merger. Finally, we calculate the rate of period change in rapidly rotating, massive, magnetic WDs due to magnetic dipole radiation. In many cases, including EGGR 156, the period change is not currently detectable over reasonable timescales, indicating that these WDs could be very precise clocks. For the most highly magnetic, rapidly rotating massive WDs, such as ZTF J1901+1450 and RE J0317$-$853, the period change should be detectable and may help constrain the structure and evolution of these exotic white dwarfs. △ Less

Submitted 23 August, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

Comments: Replaced to correct two typos in equations on page 12. No calculations or conclusions affected. 15 pages, 5 figures, accepted for publication in the Astronomical Journal

Showing 1–50 of 544 results for author: Williams, K