Search | arXiv e-print repository

doi 10.1007/JHEP02(2023)003

Poles At Infinity in On-shell Diagrams

Authors: Taro V. Brown, Umut Oktem, Jaroslav Trnka

Abstract: In this paper we study on-shell diagrams in ${\cal N}{<}4$ supersymmetric Yang-Mills (SYM) theory. These are on-shell gauge invariant objects which appear as cuts of loop integrands in the context of generalized unitarity and serve as building blocks for amplitudes in recursion relations. In the dual formulation, they are associated with cells of the positive Grassmannian $G_+(k,n)$ and the on-she… ▽ More In this paper we study on-shell diagrams in ${\cal N}{<}4$ supersymmetric Yang-Mills (SYM) theory. These are on-shell gauge invariant objects which appear as cuts of loop integrands in the context of generalized unitarity and serve as building blocks for amplitudes in recursion relations. In the dual formulation, they are associated with cells of the positive Grassmannian $G_+(k,n)$ and the on-shell functions can be reproduced as canonical differential forms. While for the case of the ${\cal N}{=}4$ maximally supersymmetric Yang-Mills theory all poles in on-shell diagrams correspond to IR poles when the momentum flows in edges are zero, for ${\cal N}{<}4$ SYM theories there are new UV poles when the loop momenta go to infinity. These poles originate from the prefactor of the canonical dlog form and do not correspond to erasing edges in on-shell diagrams. We show that they can be interpreted as a diagrammatic operation which involves pinching a loop and performing a ``non-planar twist'' on external legs, which gives rise to a non-planar on-shell diagram. Our result provides an important clue on the role of poles at infinite momenta in on-shell scattering amplitudes, and the relation to non-planar on-shell functions. △ Less

Submitted 13 December, 2022; originally announced December 2022.

Comments: 59 pages, 88 figures

arXiv:2212.04870 [pdf, other]

Meteorological conditions during Dunkelflauten in Germany: Characteristics, the role of weather regimes and impacts on demand

Authors: Fabian Mockert, Christian M. Grams, Tom Brown, Fabian Neumann

Abstract: Renewable generation from wind and solar power is strongly weather-dependent. To plan future sustainable energy systems that are robust to this variability, a better understanding of why and when periods of low wind and solar power output occur is valuable. We call such periods of low wind and solar power output `Dunkelflauten', the German word for dark wind lulls. In this article, we analyse the… ▽ More Renewable generation from wind and solar power is strongly weather-dependent. To plan future sustainable energy systems that are robust to this variability, a better understanding of why and when periods of low wind and solar power output occur is valuable. We call such periods of low wind and solar power output `Dunkelflauten', the German word for dark wind lulls. In this article, we analyse the meteorological conditions during Dunkelflauten in Germany by applying the concept of weather regimes. Weather regimes are quasi-stationary, recurrent, and persistent large-scale circulation patterns which explain multi-day atmospheric variability (5-15 days). We use a regime definition that allows us to distinguish four different types of blocked regimes, characterised by high pressure situations in the North Atlantic-European region. We find that in Germany, Dunkelflauten mainly occur in winter when the solar power output is anyway low and when the wind power output drops for several consecutive days. A high-pressure system over Germany, associated with the European Blocking regime, is responsible for most of the Dunkelflauten. Dunkelflauten during the Greenland Blocking regime are associated with colder temperatures than usual, causing higher electricity demand and presenting a particular challenge as space heating demand electrifies in future. Furthermore, we show that Dunkelflauten occur predominantly when a weather regime is well-established and persists longer than usual. Our study provides novel insight on the occurrence and meteorological characteristics of Dunkelflauten, which is essential for planning resilient energy systems and supporting grid operators to prepare for potential shortages in supply. △ Less

Submitted 9 December, 2022; originally announced December 2022.

Comments: 20pages, 11figures, submitted to "Meteorological Applications" by Royal Meteorological Society (https://rmets.onlinelibrary.wiley.com/journal/14698080)

arXiv:2212.00810 [pdf, other]

doi 10.3847/1538-4357/aca9d1

Timing the r-Process Enrichment of the Ultra-Faint Dwarf Galaxy Reticulum II

Authors: Joshua D. Simon, Thomas M. Brown, Burçin Mutlu-Pakdil, Alexander P. Ji, Alex Drlica-Wagner, Roberto J. Avila, Clara E. Martínez-Vázquez, Ting S. Li, Eduardo Balbinot, Keith Bechtol, Anna Frebel, Marla Geha, Terese T. Hansen, David J. James, Andrew B. Pace, M. Aguena, O. Alves, F. Andrade-Oliveira, J. Annis, D. Bacon, E. Bertin, D. Brooks, D. L. Burke, A. Carnero Rosell, M. Carrasco Kind , et al. (43 additional authors not shown)

Abstract: The ultra-faint dwarf galaxy Reticulum II (Ret II) exhibits a unique chemical evolution history, with 72 +10/-12% of its stars strongly enhanced in r-process elements. We present deep Hubble Space Telescope photometry of Ret II and analyze its star formation history. As in other ultra-faint dwarfs, the color-magnitude diagram is best fit by a model consisting of two bursts of star formation. If we… ▽ More The ultra-faint dwarf galaxy Reticulum II (Ret II) exhibits a unique chemical evolution history, with 72 +10/-12% of its stars strongly enhanced in r-process elements. We present deep Hubble Space Telescope photometry of Ret II and analyze its star formation history. As in other ultra-faint dwarfs, the color-magnitude diagram is best fit by a model consisting of two bursts of star formation. If we assume that the bursts were instantaneous, then the older burst occurred around the epoch of reionization and formed ~80% of the stars in the galaxy, while the remainder of the stars formed ~3 Gyr later. When the bursts are allowed to have nonzero durations we obtain slightly better fits. The best-fitting model in this case consists of two bursts beginning before reionization, with approximately half the stars formed in a short (100 Myr) burst and the other half in a more extended period lasting 2.6 Gyr. Considering the full set of viable star formation history models, we find that 28% of the stars formed within 500 +/- 200 Myr of the onset of star formation. The combination of the star formation history and the prevalence of r-process-enhanced stars demonstrates that the r-process elements in Ret II must have been synthesized early in its initial star-forming phase. We therefore constrain the delay time between the formation of the first stars in Ret II and the r-process nucleosynthesis to be less than 500 Myr. This measurement rules out an r-process source with a delay time of several Gyr or more such as GW170817. △ Less

Submitted 1 December, 2022; originally announced December 2022.

Comments: 14 pages, 5 figures, 1 table. Accepted for publication in ApJ

arXiv:2212.00521 [pdf, other]

Unexpected Scaling in Path Copying Trees

Authors: Ilya Kokorin, Alexander Fedorov, Trevor Brown, Vitaly Aksenov

Abstract: Although a wide variety of handcrafted concurrent data structures have been proposed, there is considerable interest in universal approaches (henceforth called Universal Constructions or UCs) for building concurrent data structures. These approaches (semi-)automatically convert a sequential data structure into a concurrent one. The simplest approach uses locks that protect a sequential data struct… ▽ More Although a wide variety of handcrafted concurrent data structures have been proposed, there is considerable interest in universal approaches (henceforth called Universal Constructions or UCs) for building concurrent data structures. These approaches (semi-)automatically convert a sequential data structure into a concurrent one. The simplest approach uses locks that protect a sequential data structure and allow only one process to access it at a time. The resulting data structures use locks, and hence are blocking. Most work on UCs instead focuses on obtaining non-blocking progress guarantees such as obstruction-freedom, lock-freedom, or wait-freedom. Many non-blocking UCs have appeared. Key examples include the seminal wait-free UC by Herlihy, a NUMA-aware UC by Yi et al., and an efficient UC for large objects by Fatourou et al. We borrow ideas from persistent data structures and multi-version concurrency control (MVCC), most notably path copying, and use them to implement concurrent versions of sequential persistent data structures. Despite our expectation that our data structures would not scale under write-heavy workloads, they scale in practice. We confirm this scaling analytically in our model with private per-process caches. △ Less

Submitted 2 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

arXiv:2211.16521 [pdf, other]

doi 10.1051/0004-6361/202244718

VERTICO III: The Kennicutt-Schmidt relation in Virgo cluster galaxies

Authors: M. J. Jiménez-Donaire, T. Brown, C. D. Wilson, I. D. Roberts, N. Zabel, S. L. Ellison, M. Thorp, V. Villanueva, R. Chown, D. Bisaria, A. D. Bolatto, A. Boselli, B. Catinella, A. Chung, L. Cortese, T. A. Davis, C. D. P. Lagos, B. Lee, L. C. Parker, K. Spekkens, A. R. H. Stevens, J. Sun

Abstract: In this VERTICO science paper we aim to study how the star formation process depends on galactic environment and gravitational interactions in the context of galaxy evolution. We explore the scaling relation between the star formation rate (SFR) surface density and the molecular gas surface density, also known as the Kennicutt-Schmidt (KS) relation, in a subsample of Virgo cluster spiral galaxies.… ▽ More In this VERTICO science paper we aim to study how the star formation process depends on galactic environment and gravitational interactions in the context of galaxy evolution. We explore the scaling relation between the star formation rate (SFR) surface density and the molecular gas surface density, also known as the Kennicutt-Schmidt (KS) relation, in a subsample of Virgo cluster spiral galaxies. We use new ACA and TP observations from the VERTICO-ALMA Large Program at 720pc resolution to resolve the molecular gas content, as traced by the 12CO(2-1) transition, across the disks of 37 spiral galaxies in the Virgo cluster. In combination with archival observations, we estimate the parameters of the KS relation for the entire ensemble of galaxies, and within individual galaxies. We find the KS slope for the entire population to be N=0.97+/-0.07, with a characteristic molecular gas depletion time of 1.86Gyr for our full sample, in agreement with previous work in isolated star-forming galaxies. In individual galaxies, we find KS slopes ranging between 0.69 and 1.40, and typical star formation efficiencies (SFE) that can vary from galaxy to galaxy by a factor of ~4. These galaxy-to-galaxy variations account for ~0.20dex in scatter in the ensemble KS relation, which is characterized by a 0.42dex scatter. We find that the HI-deficient galaxies in the Virgo cluster show a steeper resolved KS relation and lower molecular gas efficiencies than HI-normal cluster galaxies. While the molecular gas content in Virgo cluster galaxies appears to behave similarly to that in isolated galaxies, our VERTICO sample shows that cluster environments play a key role in regulating star formation. The environmental mechanisms affecting the HI galaxy content also have a direct impact in the SFE of molecular gas in cluster galaxies, leading to longer depletion times in HI-deficient members. △ Less

Submitted 29 November, 2022; originally announced November 2022.

Comments: Accepted for publication in Astronomy & Astrophysics

Journal ref: A&A 671, A3 (2023)

arXiv:2211.03540 [pdf, other]

Measuring Progress on Scalable Oversight for Large Language Models

Authors: Samuel R. Bowman, Jeeyoon Hyun, Ethan Perez, Edwin Chen, Craig Pettit, Scott Heiner, Kamilė Lukošiūtė, Amanda Askell, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Christopher Olah, Daniela Amodei, Dario Amodei, Dawn Drain, Dustin Li, Eli Tran-Johnson, Jackson Kernion, Jamie Kerr, Jared Mueller, Jeffrey Ladish, Joshua Landau, Kamal Ndousse , et al. (21 additional authors not shown)

Abstract: Develo** safe and useful general-purpose AI systems will require us to make progress on scalable oversight: the problem of supervising systems that potentially outperform us on most skills relevant to the task at hand. Empirical work on this problem is not straightforward, since we do not yet have systems that broadly exceed our abilities. This paper discusses one of the major ways we think abou… ▽ More Develo** safe and useful general-purpose AI systems will require us to make progress on scalable oversight: the problem of supervising systems that potentially outperform us on most skills relevant to the task at hand. Empirical work on this problem is not straightforward, since we do not yet have systems that broadly exceed our abilities. This paper discusses one of the major ways we think about this problem, with a focus on ways it can be studied empirically. We first present an experimental design centered on tasks for which human specialists succeed but unaided humans and current general AI systems fail. We then present a proof-of-concept experiment meant to demonstrate a key feature of this experimental design and show its viability with two question-answering tasks: MMLU and time-limited QuALITY. On these tasks, we find that human participants who interact with an unreliable large-language-model dialog assistant through chat -- a trivial baseline strategy for scalable oversight -- substantially outperform both the model alone and their own unaided performance. These results are an encouraging sign that scalable oversight will be tractable to study with present models and bolster recent findings that large language models can productively assist humans with difficult tasks. △ Less

Submitted 11 November, 2022; v1 submitted 4 November, 2022; originally announced November 2022.

Comments: v2 fixes a few typos from v1

arXiv:2211.00970 [pdf, other]

doi 10.1016/j.ijepes.2021.107702

Topology-based Approximations for $\mathcal{N}-1$ Contingency Constraints in Power Transmission Networks

Authors: Amin Shokri Gazafroudi, Fabian Neumann, Tom Brown

Abstract: It is crucial for maintaining the security of supply that transmission networks continue to operate even if a single line fails. Modeling $\mathcal{N} - 1$ security in power system capacity expansion problems introduces many extra constraints if all possible outages are accounted for, which leads to a high computational burden. Typical approaches to avoid this burden consider only a subset of poss… ▽ More It is crucial for maintaining the security of supply that transmission networks continue to operate even if a single line fails. Modeling $\mathcal{N} - 1$ security in power system capacity expansion problems introduces many extra constraints if all possible outages are accounted for, which leads to a high computational burden. Typical approaches to avoid this burden consider only a subset of possible outages relevant to a given dispatch situation. However, this relies on knowing the dispatch situation beforehand, and it is not suitable for investment optimization problems where the generation fleet is not known in advance. In this paper, we introduce a heuristic approach to model the fully secured $\mathcal{N}-1$ feasible space using a smaller number of constraints in a way that only depends on the topology of transmission networks. In our proposed approach, the network's security is modelled by comparing the polytope of the feasible space of nodal net power obtained from the security-constrained linearized AC optimal power flow problem. To approximate this polytope, a buffer capacity factor is defined for transmission lines in the $\mathcal{N}-0$ secure case, thereby avoiding the introduction of many additional constraints. In this way, three approaches are introduced for obtaining a buffer capacity factor consisting of approximate, robust and line-specific approaches. Finally, the performance of our proposed approaches is assessed in different scales of transmission networks for determining the proposed buffer capacity factors, contingency analysis and economic evaluation. Moreover, we find that our proposed heuristics provide excellent approximations of the fully secured $\mathcal{N}-1$ solutions with a much lower computational burden. △ Less

Submitted 2 November, 2022; originally announced November 2022.

Journal ref: International Journal of Electrical Power & Energy Systems, 2021

arXiv:2210.08179 [pdf, other]

doi 10.1093/mnras/stad894

A sub-Neptune transiting the young field star HD 18599 at 40 pc

Authors: Jerome P. de Leon, John H. Livingston, James S. Jenkins, Jose I. Vines, Robert A. Wittenmyer, Jake T. Clark, Joshua I. M. Winn, Brett Addison, Sarah Ballard, Daniel Bayliss, Charles Beichman, Björn Benneke, David Anthony Berardo, Brendan P. Bowler, Tim Brown, Edward M. Bryant, Jessie Christiansen, David Ciardi, Karen A. Collins, Kevin I. Collins, Ian Crossfield, Drake Deming, Diana Dragomir, Courtney D. Dressing, Akihiko Fukui , et al. (45 additional authors not shown)

Abstract: Transiting exoplanets orbiting young nearby stars are ideal laboratories for testing theories of planet formation and evolution. However, to date only a handful of stars with age <1 Gyr have been found to host transiting exoplanets. Here we present the discovery and validation of a sub-Neptune around HD 18599, a young (300 Myr), nearby (d=40 pc) K star. We validate the transiting planet candidate… ▽ More Transiting exoplanets orbiting young nearby stars are ideal laboratories for testing theories of planet formation and evolution. However, to date only a handful of stars with age <1 Gyr have been found to host transiting exoplanets. Here we present the discovery and validation of a sub-Neptune around HD 18599, a young (300 Myr), nearby (d=40 pc) K star. We validate the transiting planet candidate as a bona fide planet using data from the TESS, Spitzer, and Gaia missions, ground-based photometry from IRSF, LCO, PEST, and NGTS, speckle imaging from Gemini, and spectroscopy from CHIRON, NRES, FEROS, and Minerva-Australis. The planet has an orbital period of 4.13 d, and a radius of 2.7Rearth. The RV data yields a 3-sigma mass upper limit of 30.5Mearth which is explained by either a massive companion or the large observed jitter typical for a young star. The brightness of the host star (V~9 mag) makes it conducive to detailed characterization via Doppler mass measurement which will provide a rare view into the interior structure of young planets. △ Less

Submitted 14 October, 2022; originally announced October 2022.

Comments: submitted to MNRAS

arXiv:2210.05381 [pdf, other]

doi 10.3847/1538-4357/ac9d3c

VERTICO IV: Environmental Effects on the Gas Distribution and Star Formation Efficiency of Virgo Cluster Spirals

Authors: Vicente Villanueva, Alberto D. Bolatto, Stuart Vogel, Tobias Brown, Christine D. Wilson, Nikki Zabel, Sara Ellison, Adam R. H. Stevens, Maria Jesus Jimenez Donaire, Kristine Spekkens, Mallory Thorp, Timothy A. Davis, Laura C. Parker, Ian D. Roberts, Dhruv Bisaria, Alessandro Boselli, Barbara Catinella, Aeree Chung, Luca Cortese, Bumhyun Lee, Adam Watts

Abstract: We measure the molecular-to-atomic gas ratio, $R_{\rm mol}$, and the star formation rate (SFR) per unit molecular gas mass, SFE$_{\rm mol}$, in 38 nearby galaxies selected from the Virgo Environment Traced in CO (VERTICO) survey. We determine their scale-lengths for the molecular and stellar components and find a roughly 3:5 ratio between them compared to $\sim$1:1 in field galaxies, indicating th… ▽ More We measure the molecular-to-atomic gas ratio, $R_{\rm mol}$, and the star formation rate (SFR) per unit molecular gas mass, SFE$_{\rm mol}$, in 38 nearby galaxies selected from the Virgo Environment Traced in CO (VERTICO) survey. We determine their scale-lengths for the molecular and stellar components and find a roughly 3:5 ratio between them compared to $\sim$1:1 in field galaxies, indicating that the CO emission is more centrally concentrated than the stars. We compute $R_{\rm mol}$ as a function of different physical quantities. While the spatially-resolved $R_{\rm mol}$ on average decreases with increasing radius, we find that the mean molecular-to-atomic gas ratio within the stellar effective radius $R_{\rm e}$, $R_{\rm mol}(r<R_{\rm e})$, shows a systematic increase with the level of H$_{\rm I}$, truncation and/or asymmetry (H$_{\rm I}$ perturbation). Analysis of the molecular- and the atomic-to-stellar mass ratios within $R_{\rm e}$, $R^{\rm mol}_{\star}(r<R_{\rm e})$ and $R^{\rm atom}_{\star}(r<R_{\rm e})$, shows that VERTICO galaxies have increasingly lower $R^{\rm atom}_{\star}(r<R_{\rm e})$ for larger levels of H$_{\rm I}$perturbation (compared to field galaxies matched in stellar mass), but no significant change in $R^{\rm mol}_{\star}(r<R_{\rm e})$. We also measure a clear systematic decrease of the SFE$_{\rm mol}$ within $R_{\rm e}$, SFE$_{\rm mol}(r<R_{\rm e})$, with increasingly perturbed H$_{\rm I}$. Therefore, compared to galaxies from the field, VERTICO galaxies are more compact in CO emission in relation to their stellar distribution, but increasingly perturbed atomic gas increases their $R_{\rm mol}$ and decreases the efficiency with which their molecular gas forms stars. (abridged) △ Less

Submitted 1 November, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

arXiv:2209.11895 [pdf]

In-context Learning and Induction Heads

Authors: Catherine Olsson, Nelson Elhage, Neel Nanda, Nicholas Joseph, Nova DasSarma, Tom Henighan, Ben Mann, Amanda Askell, Yuntao Bai, Anna Chen, Tom Conerly, Dawn Drain, Deep Ganguli, Zac Hatfield-Dodds, Danny Hernandez, Scott Johnston, Andy Jones, Jackson Kernion, Liane Lovitt, Kamal Ndousse, Dario Amodei, Tom Brown, Jack Clark, Jared Kaplan, Sam McCandlish , et al. (1 additional authors not shown)

Abstract: "Induction heads" are attention heads that implement a simple algorithm to complete token sequences like [A][B] ... [A] -> [B]. In this work, we present preliminary and indirect evidence for a hypothesis that induction heads might constitute the mechanism for the majority of all "in-context learning" in large transformer models (i.e. decreasing loss at increasing token indices). We find that induc… ▽ More "Induction heads" are attention heads that implement a simple algorithm to complete token sequences like [A][B] ... [A] -> [B]. In this work, we present preliminary and indirect evidence for a hypothesis that induction heads might constitute the mechanism for the majority of all "in-context learning" in large transformer models (i.e. decreasing loss at increasing token indices). We find that induction heads develop at precisely the same point as a sudden sharp increase in in-context learning ability, visible as a bump in the training loss. We present six complementary lines of evidence, arguing that induction heads may be the mechanistic source of general in-context learning in transformer models of any size. For small attention-only models, we present strong, causal evidence; for larger models with MLPs, we present correlational evidence. △ Less

Submitted 23 September, 2022; originally announced September 2022.

arXiv:2209.08950 [pdf, other]

doi 10.1364/JOSAA.474837

Using fluorescent beads to emulate single flurophores

Authors: Luis A. Aleman-Castaneda, Sherry Yi-Ting Feng, Rodrigo Gutierrez-Cuevas, Isael Herrera, Thomas G. Brown, Sophie Brasselet, Miguel A. Alonso

Abstract: In this work, we study the conditions under which fluorescent beads can be used to emulate single fluorescent molecules in the calibration of optical microscopes. Although beads are widely used due to their brightness and easy manipulation, there can be notable differences between the point spread functions (PSFs) they produce and those for single-molecule fluorophores, caused by their different e… ▽ More In this work, we study the conditions under which fluorescent beads can be used to emulate single fluorescent molecules in the calibration of optical microscopes. Although beads are widely used due to their brightness and easy manipulation, there can be notable differences between the point spread functions (PSFs) they produce and those for single-molecule fluorophores, caused by their different emission pattern and their size. We study theoretically these differences for various scenarios, e.g. with or without polarization channel splitting, to determine the conditions under which the use of beads as a model for single molecules is valid. We also propose methods to model the blurring due to the size difference and compensate for it to produce PSFs that are more similar to those for single molecules. △ Less

Submitted 6 December, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

Journal ref: J. Opt. Soc. Am. A 39, C167-C178 (2022)

arXiv:2209.07858 [pdf, other]

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

Authors: Deep Ganguli, Liane Lovitt, Jackson Kernion, Amanda Askell, Yuntao Bai, Saurav Kadavath, Ben Mann, Ethan Perez, Nicholas Schiefer, Kamal Ndousse, Andy Jones, Sam Bowman, Anna Chen, Tom Conerly, Nova DasSarma, Dawn Drain, Nelson Elhage, Sheer El-Showk, Stanislav Fort, Zac Hatfield-Dodds, Tom Henighan, Danny Hernandez, Tristan Hume, Josh Jacobson, Scott Johnston , et al. (11 additional authors not shown)

Abstract: We describe our early efforts to red team language models in order to simultaneously discover, measure, and attempt to reduce their potentially harmful outputs. We make three main contributions. First, we investigate scaling behaviors for red teaming across 3 model sizes (2.7B, 13B, and 52B parameters) and 4 model types: a plain language model (LM); an LM prompted to be helpful, honest, and harmle… ▽ More We describe our early efforts to red team language models in order to simultaneously discover, measure, and attempt to reduce their potentially harmful outputs. We make three main contributions. First, we investigate scaling behaviors for red teaming across 3 model sizes (2.7B, 13B, and 52B parameters) and 4 model types: a plain language model (LM); an LM prompted to be helpful, honest, and harmless; an LM with rejection sampling; and a model trained to be helpful and harmless using reinforcement learning from human feedback (RLHF). We find that the RLHF models are increasingly difficult to red team as they scale, and we find a flat trend with scale for the other model types. Second, we release our dataset of 38,961 red team attacks for others to analyze and learn from. We provide our own analysis of the data and find a variety of harmful outputs, which range from offensive language to more subtly harmful non-violent unethical outputs. Third, we exhaustively describe our instructions, processes, statistical methodologies, and uncertainty about red teaming. We hope that this transparency accelerates our ability to work together as a community in order to develop shared norms, practices, and technical standards for how to red team language models. △ Less

Submitted 22 November, 2022; v1 submitted 23 August, 2022; originally announced September 2022.

arXiv:2209.04465 [pdf, other]

doi 10.1093/mnras/stac2590

Evaluating the efficacy of sonification for signal detection in univariate, evenly sampled light curves using astronify

Authors: J. Tucker Brown, C. M. Harrison, A. Zanella, J. Trayford

Abstract: Sonification is the technique of representing data with sound, with potential applications in astronomy research for aiding discovery and accessibility. Several astronomy-focused sonification tools have been developed; however, efficacy testing is extremely limited. We performed testing of astronify, a prototype tool for sonification functionality within the Barbara A. Mikulski Archive for Space T… ▽ More Sonification is the technique of representing data with sound, with potential applications in astronomy research for aiding discovery and accessibility. Several astronomy-focused sonification tools have been developed; however, efficacy testing is extremely limited. We performed testing of astronify, a prototype tool for sonification functionality within the Barbara A. Mikulski Archive for Space Telescopes (MAST). We created synthetic light curves containing zero, one, or two transit-like signals with a range of signal-to-noise ratios (SNRs=3-100) and applied the default map** of brightness to pitch. We performed remote testing, asking participants to count signals when presented with light curves as a sonification, visual plot, or combination of both. We obtained 192 responses, of which 118 self-classified as experts in astronomy and data analysis. For high SNRs (=30 and 100), experts and non-experts performed well with sonified data (85-100% successful signal counting). At low SNRs (=3 and 5) both groups were consistent with guessing with sonifications. At medium SNRs (=7 and 10), experts performed no better than non-experts with sonifications but significantly better (factor of ~2-3) with visuals. We infer that sonification training, like that experienced by experts for visual data inspection, will be important if this sonification method is to be useful for moderate SNR signal detection within astronomical archives and broader research. Nonetheless, we show that even a very simple, and non-optimised, sonification approach allows users to identify high SNR signals. A more optimised approach, for which we present ideas, would likely yield higher success for lower SNR signals. △ Less

Submitted 9 September, 2022; originally announced September 2022.

Comments: Accepted for publication in MNRAS (10 pages, 5 figures). Sonifications of Figure 1 (4 audio files) and Figure 5 (2 movie files) are available in the ancillary files folder. These, plus all other data products associated with this article are also available at: https://doi.org/10.25405/data.ncl.20936749

arXiv:2209.02364 [pdf, other]

doi 10.1016/j.energy.2023.128133

Inverse methods: How feasible are spatially low-resolved capacity expansion modelling results when disaggregated at high spatial resolution?

Authors: Martha Maria Frysztacki, Veit Hagenmeyer, Tom Brown

Abstract: Spatially highly-resolved capacity expansion models are often simplified to a lower spatial resolution because they are computationally intensive. The simplification mixes sites with different renewable features while ignoring transmission lines that can cause congestion. As a consequence, the results may represent an infeasible system when the capacities are fed back at higher spatial detail. Thu… ▽ More Spatially highly-resolved capacity expansion models are often simplified to a lower spatial resolution because they are computationally intensive. The simplification mixes sites with different renewable features while ignoring transmission lines that can cause congestion. As a consequence, the results may represent an infeasible system when the capacities are fed back at higher spatial detail. Thus far there has been no detailed investigation of how to disaggregate results and whether the spatially highly-resolved disaggregated model is feasible. This is challenging since there is no unique way to invert the clustering. This article is split into two parts to tackle these challenges. First, methods to disaggregate spatially low-resolved results are presented: (a) an uniform distribution of regional results across its original highly-resolved regions, (b) a re-optimisation for each region separately, (c) an approach that minimises the "excess electricity". Second, the resulting highly-resolved models' feasibility is investigated by running an operational dispatch. While re-optimising yields the best results, the third inverse method provides comparable results for less computational effort. Feasibility-wise, the study design strengthens that modelling countries by single regions is insufficient. State-of-the-art reduced models with 100-200 regions for Europe still yield 3%-7% of load-shedding, depending on model resolution and inverse method. △ Less

Submitted 3 July, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

Comments: Post-print

Journal ref: Energy, 2023

arXiv:2208.13854 [pdf, other]

doi 10.1073/pnas.2220033120

Microscopic motility of isolated E. coli flagella

Authors: Franky Djutanta, Peter T. Brown, Bonfilio Nainggolan, Alexis Coullomb, Sritharini Radhakrishnan, Jason Sentosa, Bernard Yurke, Rizal F. Hariadi, Douglas P. Shepherd

Abstract: The fluctuation-dissipation theorem describes the intimate connection between the Brownian diffusion of thermal particles and their drag coefficients. In the simple case of spherical particles, it takes the form of the Stokes-Einstein relationship that links the particle geometry, fluid viscosity, and diffusive behavior. However, studying the fundamental properties of microscopic asymmetric partic… ▽ More The fluctuation-dissipation theorem describes the intimate connection between the Brownian diffusion of thermal particles and their drag coefficients. In the simple case of spherical particles, it takes the form of the Stokes-Einstein relationship that links the particle geometry, fluid viscosity, and diffusive behavior. However, studying the fundamental properties of microscopic asymmetric particles, such as the helical-shaped propeller used by $\textit{E. coli}$, has remained out of reach for experimental approaches due to the need to quantify correlated translation and rotation simultaneously with sufficient spatial and temporal resolution. To solve this outstanding problem, we generated volumetric movies of fluorophore-labeled, freely diffusing, isolated $\textit{E. Coli}$ flagella using oblique plane microscopy. From these movies, we extracted trajectories and determined the hydrodynamic propulsion matrix directly from the diffusion of flagella via a generalized Einstein relation. Our results validate prior proposals, based on macroscopic wire helices and low Reynolds number scaling laws, that the average flagellum is a highly inefficient propeller. Specifically, we found the maximum propulsion efficiency of flagella is less than 5%. Beyond extending Brownian motion analysis to asymmetric 3D particles, our approach opens new avenues to study the propulsion matrix of particles in complex environments where direct hydrodynamic approaches are not feasible. △ Less

Submitted 31 August, 2022; v1 submitted 29 August, 2022; originally announced August 2022.

Comments: 6 pages, 4 figures, 9 supplemental sections, 7 supplemental figures, 3 supplemental movies *authors contributed equally and reserve the right to change order for first authorship

Journal ref: PNAS 120 (22) e2220033120 (2023)

arXiv:2208.08469 [pdf, other]

Performance Anomalies in Concurrent Data Structure Microbenchmarks

Authors: Rosina F. Kharal, Trevor Brown

Abstract: Recent decades have witnessed a surge in the development of concurrent data structures with an increasing interest in data structures implementing concurrent sets (CSets). Microbenchmarking tools are frequently utilized to evaluate and compare the performance differences across concurrent data structures. The underlying structure and design of the microbenchmarks themselves can play a hidden but i… ▽ More Recent decades have witnessed a surge in the development of concurrent data structures with an increasing interest in data structures implementing concurrent sets (CSets). Microbenchmarking tools are frequently utilized to evaluate and compare the performance differences across concurrent data structures. The underlying structure and design of the microbenchmarks themselves can play a hidden but influential role in performance results. However, the impact of microbenchmark design has not been well investigated. In this work, we illustrate instances where concurrent data structure performance results reported by a microbenchmark can vary 10-100x depending on the microbenchmark implementation details. We investigate factors leading to performance variance across three popular microbenchmarks and outline cases in which flawed microbenchmark design can lead to an inversion of performance results between two concurrent data structure implementations. We further derive a set of recommendations for best practices in the design and usage of concurrent data structure microbenchmarks and explore advanced features in the Setbench microbenchmark. △ Less

Submitted 8 December, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

arXiv:2208.05561 [pdf, other]

SSDBCODI: Semi-Supervised Density-Based Clustering with Outliers Detection Integrated

Authors: Jiahao Deng, Eli T. Brown

Abstract: Clustering analysis is one of the critical tasks in machine learning. Traditionally, clustering has been an independent task, separate from outlier detection. Due to the fact that the performance of clustering can be significantly eroded by outliers, a small number of algorithms try to incorporate outlier detection in the process of clustering. However, most of those algorithms are based on unsupe… ▽ More Clustering analysis is one of the critical tasks in machine learning. Traditionally, clustering has been an independent task, separate from outlier detection. Due to the fact that the performance of clustering can be significantly eroded by outliers, a small number of algorithms try to incorporate outlier detection in the process of clustering. However, most of those algorithms are based on unsupervised partition-based algorithms such as k-means. Given the nature of those algorithms, they often fail to deal with clusters of complex, non-convex shapes. To tackle this challenge, we have proposed SSDBCODI, a semi-supervised density-based algorithm. SSDBCODI combines the advantage of density-based algorithms, which are capable of dealing with clusters of complex shapes, with the semi-supervised element, which offers flexibility to adjust the clustering results based on a few user labels. We also merge an outlier detection component with the clustering process. Potential outliers are detected based on three scores generated during the process: (1) reachability-score, which measures how density-reachable a point is to a labeled normal object, (2) local-density-score, which measures the neighboring density of data objects, and (3) similarity-score, which measures the closeness of a point to its nearest labeled outliers. Then in the following step, instance weights are generated for each data instance based on those three scores before being used to train a classifier for further clustering and outlier detection. To enhance the understanding of the proposed algorithm, for our evaluation, we have run our proposed algorithm against some of the state-of-art approaches on multiple datasets and separately listed the results of outlier detection apart from clustering. Our results indicate that our algorithm can achieve superior results with a small percentage of labels. △ Less

Submitted 10 August, 2022; originally announced August 2022.

arXiv:2208.03842 [pdf, other]

doi 10.1093/mnras/stac2193

The cold gas and dust properties of red star-forming galaxies

Authors: Ryan Chown, Laura C. Parker, Christine D. Wilson, Toby Brown, Fraser A. Evans, Yang Gao, Ho Seong Hwang, Lihwai Lin, Amelie Saintonge, Mark Sargent, Matthew W. L. Smith, Ting Xiao

Abstract: We study the cold gas and dust properties for a sample of red star forming galaxies called "red misfits." We collect single-dish CO observations and HI observations from representative samples of low-redshift galaxies, as well as our own JCMT CO observations of red misfits. We also obtain SCUBA-2 850 um observations for a subset of these galaxies. With these data we compare the molecular gas, tota… ▽ More We study the cold gas and dust properties for a sample of red star forming galaxies called "red misfits." We collect single-dish CO observations and HI observations from representative samples of low-redshift galaxies, as well as our own JCMT CO observations of red misfits. We also obtain SCUBA-2 850 um observations for a subset of these galaxies. With these data we compare the molecular gas, total cold gas, and dust properties of red misfits against those of their blue counterparts ("blue actives") taking non-detections into account using a survival analysis technique. We compare these properties at fixed position in the log SFR-log M* plane, as well as versus offset from the star-forming main sequence. Compared to blue actives, red misfits have slightly longer molecular gas depletion times, similar total gas depletion times, significantly lower molecular- and total-gas mass fractions, lower dust-to-stellar mass ratios, similar dust-to-gas ratios, and a significantly flatter slope in the $\log M_\mathrm{mol}$-$\log M_\star$ plane. Our results suggest that red misfits as a population are likely quenching due to a shortage in gas supply. △ Less

Submitted 24 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

Comments: 16 pages, 7 Figures, accepted to MNRAS

arXiv:2208.02968 [pdf, other]

A Case-Study of Sample-Based Bayesian Forecasting Algorithms

Authors: Taylor R. Brown

Abstract: For a Bayesian, real-time forecasting with the posterior predictive distribution can be challenging for a variety of time series models. First, estimating the parameters of a time series model can be difficult with sample-based approaches when the model's likelihood is intractable and/or when the data set being used is large. Second, once samples from a parameter posterior are obtained on a fixed… ▽ More For a Bayesian, real-time forecasting with the posterior predictive distribution can be challenging for a variety of time series models. First, estimating the parameters of a time series model can be difficult with sample-based approaches when the model's likelihood is intractable and/or when the data set being used is large. Second, once samples from a parameter posterior are obtained on a fixed window of data, it is not clear how they will be used to generate forecasts, nor is it clear how, and in what sense, they will be ``updated" as interest shifts to newer posteriors as new data arrive. This paper provides a comparison of the sample-based forecasting algorithms that are available for Bayesians interested in real-time forecasting with nonlinear/non-Gaussian state space models. An applied analysis of financial returns is provided using a well-established stochastic volatility model. The principal aim of this paper is to provide guidance on how to select one of these algorithms, and to describe a variety of benefits and pitfalls associated with each approach. △ Less

Submitted 4 August, 2022; originally announced August 2022.

arXiv:2207.05816 [pdf, other]

doi 10.1016/j.joule.2023.06.016

The Potential Role of a Hydrogen Network in Europe

Authors: Fabian Neumann, Elisabeth Zeyen, Marta Victoria, Tom Brown

Abstract: Electricity transmission expansion has suffered many delays in Europe in recent decades, despite its significance for integrating renewable electricity into the energy system. A hydrogen network which reuses the existing fossil gas network could not only help to supply demand for low-emission fuels, but could also to balance variations in wind and solar energy across the continent and thus avoid p… ▽ More Electricity transmission expansion has suffered many delays in Europe in recent decades, despite its significance for integrating renewable electricity into the energy system. A hydrogen network which reuses the existing fossil gas network could not only help to supply demand for low-emission fuels, but could also to balance variations in wind and solar energy across the continent and thus avoid power grid expansion. We pursue this idea by varying the allowed expansion of electricity and hydrogen grids in net-zero CO2 scenarios for a sector-coupled and self-sufficient European energy system with high shares of renewables. We cover the electricity, buildings, transport, agriculture, and industry sectors across 181 regions and model every third hour of a year. With this high spatio-temporal resolution, the model can capture bottlenecks in transmission networks, the variability of demand and renewable supply, as well as regional opportunities for the retrofitting of legacy gas infrastructure and the development of geological hydrogen storage. Our results show consistent system cost reductions with a pan-continental hydrogen network that connects regions with low-cost and abundant renewable potentials to demand centres, synthetic fuel production and cavern storage sites. Develo** a hydrogen network reduces system costs by up to 26 billion Euros per year (3.4%), with the highest benefits when electricity grid reinforcements cannot be realised. Between 64% and 69% of this network could be built from repurposed natural gas pipelines. However, we find that hydrogen networks can only partially substitute for power grid expansion. While the expansion of both networks together can achieve the largest cost savings of 10%, the expansion of neither is truly essential as long as higher costs can be accepted and regulatory changes are made to manage grid bottlenecks. △ Less

Submitted 13 March, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

Comments: including supplementary material

arXiv:2207.05221 [pdf, other]

Language Models (Mostly) Know What They Know

Authors: Saurav Kadavath, Tom Conerly, Amanda Askell, Tom Henighan, Dawn Drain, Ethan Perez, Nicholas Schiefer, Zac Hatfield-Dodds, Nova DasSarma, Eli Tran-Johnson, Scott Johnston, Sheer El-Showk, Andy Jones, Nelson Elhage, Tristan Hume, Anna Chen, Yuntao Bai, Sam Bowman, Stanislav Fort, Deep Ganguli, Danny Hernandez, Josh Jacobson, Jackson Kernion, Shauna Kravec, Liane Lovitt , et al. (11 additional authors not shown)

Abstract: We study whether language models can evaluate the validity of their own claims and predict which questions they will be able to answer correctly. We first show that larger models are well-calibrated on diverse multiple choice and true/false questions when they are provided in the right format. Thus we can approach self-evaluation on open-ended sampling tasks by asking models to first propose answe… ▽ More We study whether language models can evaluate the validity of their own claims and predict which questions they will be able to answer correctly. We first show that larger models are well-calibrated on diverse multiple choice and true/false questions when they are provided in the right format. Thus we can approach self-evaluation on open-ended sampling tasks by asking models to first propose answers, and then to evaluate the probability "P(True)" that their answers are correct. We find encouraging performance, calibration, and scaling for P(True) on a diverse array of tasks. Performance at self-evaluation further improves when we allow models to consider many of their own samples before predicting the validity of one specific possibility. Next, we investigate whether models can be trained to predict "P(IK)", the probability that "I know" the answer to a question, without reference to any particular proposed answer. Models perform well at predicting P(IK) and partially generalize across tasks, though they struggle with calibration of P(IK) on new tasks. The predicted P(IK) probabilities also increase appropriately in the presence of relevant source materials in the context, and in the presence of hints towards the solution of mathematical word problems. We hope these observations lay the groundwork for training more honest models, and for investigating how honesty generalizes to cases where models are trained on objectives other than the imitation of human writing. △ Less

Submitted 21 November, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

Comments: 23+17 pages; refs added, typos fixed

arXiv:2207.03000 [pdf, other]

doi 10.1016/j.apenergy.2022.120016

Are biofuel mandates cost-effective? -- an analysis of transport fuels and biomass usage to achieve emissions targets in the European energy system

Authors: Markus Millinger, Lina Reichenberg, Fredrik Hedenus, Göran Berndes, Elisabeth Zeyen, Tom Brown

Abstract: Abatement options for the hard-to-electrify parts of the transport sector are needed to achieve ambitious emissions targets. Biofuels based on biomass, electrofuels based on renewable hydrogen and a carbon source, as well as fossil fuels compensated by carbon dioxide removal (CDR) are the main options. Currently, biofuels are the only renewable fuels available at scale and are stimulated by blendi… ▽ More Abatement options for the hard-to-electrify parts of the transport sector are needed to achieve ambitious emissions targets. Biofuels based on biomass, electrofuels based on renewable hydrogen and a carbon source, as well as fossil fuels compensated by carbon dioxide removal (CDR) are the main options. Currently, biofuels are the only renewable fuels available at scale and are stimulated by blending mandates. Here, we estimate the system cost of enforcing such mandates in addition to an overall emissions cap for all energy sectors. We model overnight scenarios for 2040 and 2060 with the sector-coupled European energy system model PyPSA-Eur-Sec, with a high temporal resolution. The following cost drivers are identified: (i) high biomass costs due to scarcity, (ii) opportunity costs for competing usages of biomass for industry heat and combined heat and power (CHP) with carbon capture, and (iii) lower scalability and generally higher cost for biofuels compared to electrofuels and fossil fuels combined with CDR. With a -80% emissions reduction target in 2040, variable renewables, partial electrification of heat, industry and transport and biomass use for CHP and industrial heat are important for achieving the target at minimal cost. Abatement of remaining liquid fossil fuel use increases system cost, with a 50% biofuel mandate increasing costs by 128-229 billion EUR, or 39-82% of the liquid fuel cost. With a negative -105% emissions target in 2060, fuel abatement options are necessary, and electrofuels or the use of CDR to offset fossil fuel emissions are more competitive than biofuels. Biomass is preferred in CHP and industry heat, combined with carbon capture to serve negative emissions or electrofuel production, thereby utilising biogenic carbon several times. Sensitivity analyses reveal significant uncertainties but consistently support that higher biofuel mandates lead to higher costs. △ Less

Submitted 6 July, 2022; originally announced July 2022.

Comments: 25 pages, 9 figures

arXiv:2206.14699 [pdf, other]

Validation and results of an approximate model for the stress of a Tokamak toroidal field coil at the inboard midplane

Authors: C. P. S. Swanson, S. Kahn, C. Rana, P. H. Titus, A. W. Brooks, W. Guttenfelder, Y. Zhai, T. G. Brown, J. E. Menard

Abstract: We present the verification, validation, and results of an approximate, analytic model for the radial profile of the stress, strain, and displacement within the toroidal field (TF) coil of a Tokamak at the inner midplane, where stress management is of the most concern. The model is designed to have high execution speed yet capture the essential physics, suitable for sco** studies, rapid evaluati… ▽ More We present the verification, validation, and results of an approximate, analytic model for the radial profile of the stress, strain, and displacement within the toroidal field (TF) coil of a Tokamak at the inner midplane, where stress management is of the most concern. The model is designed to have high execution speed yet capture the essential physics, suitable for sco** studies, rapid evaluation of designs, and in the inner loop of an optimizer. It is implemented in the PROCESS fusion reactor systems code. The model solves a many-layer axisymmetric extended plane strain problem. It includes linear elastic deformation, Poisson effects, transverse-isotropic materials properties, radial Lorentz force profiles, and axial tension applied to layer subsets. The model does not include out-of-plane forces from poloidal field coils. We benchmark the model against 2D and 3D Finite Element Analyses (FEA) using Ansys and COMSOL. We find the Tresca stress accuracy of the model to be within 10\% of the FEA result. We show that this model allows PROCESS to optimize a fusion pilot plant, subject to the TF coil winding pack and coil case yield constraints. This model sets an upper limit on the magnetic field strength at the coil surface of $29$ Tesla for steel TF coil cases, with the practical limit being significantly below this. △ Less

Submitted 21 September, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

arXiv:2206.03719 [pdf, other]

doi 10.1145/3535044.3535059

Low-power option Greeks: Efficiency-driven market risk analysis using FPGAs

Authors: Mark Klaisoongnoen, Nick Brown, Oliver Thomson Brown

Abstract: Quantitative finance is the use of mathematical models to analyse financial markets and securities. Typically requiring significant amounts of computation, an important question is the role that novel architectures can play in accelerating these models. In this paper we explore the acceleration of the industry standard Securities Technology Analysis Center's (STAC) derivatives risk analysis benchm… ▽ More Quantitative finance is the use of mathematical models to analyse financial markets and securities. Typically requiring significant amounts of computation, an important question is the role that novel architectures can play in accelerating these models. In this paper we explore the acceleration of the industry standard Securities Technology Analysis Center's (STAC) derivatives risk analysis benchmark STAC-A2\texttrademark{} by porting the Heston stochastic volatility model and Longstaff and Schwartz path reduction onto a Xilinx Alveo U280 FPGA with a focus on efficiency-driven computing. Describing in detail the steps undertaken to optimise the algorithm for the FPGA, we then leverage the flexibility provided by the reconfigurable architecture to explore choices around numerical precision and representation. Insights gained are then exploited in our final performance and energy measurements, where for the efficiency improvement metric we achieve between an 8 times and 185 times improvement on the FPGA compared to two 24-core Intel Xeon Platinum CPUs. The result of this work is not only a show-case for the market risk analysis workload on FPGAs, but furthermore a set of efficiency driven techniques and lessons learnt that can be applied to quantitative finance and computational workloads on reconfigurable architectures more generally. △ Less

Submitted 8 June, 2022; originally announced June 2022.

Comments: Extended preprint of paper accepted to The International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART 2022)

Journal ref: In International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies (HEART2022). Association for Computing Machinery, New York, NY, USA, 95 to 101

arXiv:2205.11901 [pdf, other]

doi 10.1038/s41467-023-39397-2

Endogenous learning for green hydrogen in a sector-coupled energy model for Europe

Authors: Elisabeth Zeyen, Marta Victoria, Tom Brown

Abstract: Many studies have shown that hydrogen could play a large role in the energy transition for hard-to-electrify sectors, but previous modelling has not included the necessary features to assess its role. They have either left out important sectors of hydrogen demand, ignored the temporal variability in the system or neglected the dynamics of learning effects. We address these limitations and consider… ▽ More Many studies have shown that hydrogen could play a large role in the energy transition for hard-to-electrify sectors, but previous modelling has not included the necessary features to assess its role. They have either left out important sectors of hydrogen demand, ignored the temporal variability in the system or neglected the dynamics of learning effects. We address these limitations and consider learning-by-doing for the full green hydrogen production chain with different climate targets in a detailed European sector-coupled model. Here, we show that in the next 10 years a faster scale-up of electrolysis and renewable capacities than envisaged by the EU in the REPowerEU Plan is cost-optimal in order to reach the +1.5°C target. This reduces the costs for hydrogen production to 1.26 Eur/kg by 2050. Hydrogen production switches from grey to green hydrogen, omitting the option of blue hydrogen. If electrolysis costs are modelled without dynamic learning-by-doing, then the electrolysis scale-up is significantly delayed, while total system costs are overestimated by up to 13% and the levelised cost of hydrogen is overestimated by 67%. △ Less

Submitted 3 February, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

Comments: 10 pages, 5 figures

arXiv:2205.10487 [pdf, other]

Scaling Laws and Interpretability of Learning from Repeated Data

Authors: Danny Hernandez, Tom Brown, Tom Conerly, Nova DasSarma, Dawn Drain, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Tom Henighan, Tristan Hume, Scott Johnston, Ben Mann, Chris Olah, Catherine Olsson, Dario Amodei, Nicholas Joseph, Jared Kaplan, Sam McCandlish

Abstract: Recent large language models have been trained on vast datasets, but also often on repeated data, either intentionally for the purpose of upweighting higher quality data, or unintentionally because data deduplication is not perfect and the model is exposed to repeated data at the sentence, paragraph, or document level. Some works have reported substantial negative performance effects of this repea… ▽ More Recent large language models have been trained on vast datasets, but also often on repeated data, either intentionally for the purpose of upweighting higher quality data, or unintentionally because data deduplication is not perfect and the model is exposed to repeated data at the sentence, paragraph, or document level. Some works have reported substantial negative performance effects of this repeated data. In this paper we attempt to study repeated data systematically and to understand its effects mechanistically. To do this, we train a family of models where most of the data is unique but a small fraction of it is repeated many times. We find a strong double descent phenomenon, in which repeated data can lead test loss to increase midway through training. A predictable range of repetition frequency leads to surprisingly severe degradation in performance. For instance, performance of an 800M parameter model can be degraded to that of a 2x smaller model (400M params) by repeating 0.1% of the data 100 times, despite the other 90% of the training tokens remaining unique. We suspect there is a range in the middle where the data can be memorized and doing so consumes a large fraction of the model's capacity, and this may be where the peak of degradation occurs. Finally, we connect these observations to recent mechanistic interpretability work - attempting to reverse engineer the detailed computations performed by the model - by showing that data repetition disproportionately damages copying and internal structures associated with generalization, such as induction heads, providing a possible mechanism for the shift from generalization to memorization. Taken together, these results provide a hypothesis for why repeating a relatively small fraction of data in large language models could lead to disproportionately large harms to performance. △ Less

Submitted 20 May, 2022; originally announced May 2022.

Comments: 23 pages, 22 figures

arXiv:2205.05698 [pdf, other]

doi 10.3847/1538-4357/ac6e68

VERTICO II: effects of HI-identified environmental mechanisms on molecular gas

Authors: Nikki Zabel, Toby Brown, Christine D. Wilson, Timothy A. Davis, Luca Cortese, Laura C. Parker, Alessandro Boselli, Barbara Catinella, Ryan Chown, Aeree Chung, Tirna Deb, Sara L. Ellison, María J. Jiménez-Donaire, Bumhyun Lee, Ian D. Roberts, Kristine Spekkens, Adam R. H. Stevens, Mallory Thorp, Stephanie Tonnesen, Vicente Villanueva

Abstract: In this VERTICO early science paper we explore in detail how environmental mechanisms, identified in HI, affect the resolved properties of molecular gas reservoirs in cluster galaxies. The molecular gas is probed using ALMA ACA (+TP) observations of 12CO(2-1) in 51 spiral galaxies in the Virgo cluster (of which 49 are detected), all of which are included in the VIVA HI survey. The sample spans a s… ▽ More In this VERTICO early science paper we explore in detail how environmental mechanisms, identified in HI, affect the resolved properties of molecular gas reservoirs in cluster galaxies. The molecular gas is probed using ALMA ACA (+TP) observations of 12CO(2-1) in 51 spiral galaxies in the Virgo cluster (of which 49 are detected), all of which are included in the VIVA HI survey. The sample spans a stellar mass range of 9 < log M*/Msol < 11. We study molecular gas radial profiles, isodensity radii, and surface densities as a function of galaxy HI deficiency and morphology. There is a weak correlation between global HI and H2 deficiencies, and resolved properties of molecular gas correlate with HI deficiency: galaxies that have large HI deficiencies have relatively steep and truncated molecular gas radial profiles, which is due to the removal of low-surface density molecular gas on the outskirts. Therefore, while the environmental mechanisms observed in HI also affect molecular gas reservoirs, there is only a moderate reduction of the total amount of molecular gas. △ Less

Submitted 12 January, 2023; v1 submitted 11 May, 2022; originally announced May 2022.

Comments: Published in ApJ. 22 pages, 6 figures, 1 table, 1 appendix. Erratum accepted for publication in ApJ

arXiv:2204.05862 [pdf, other]

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Authors: Yuntao Bai, Andy Jones, Kamal Ndousse, Amanda Askell, Anna Chen, Nova DasSarma, Dawn Drain, Stanislav Fort, Deep Ganguli, Tom Henighan, Nicholas Joseph, Saurav Kadavath, Jackson Kernion, Tom Conerly, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Tristan Hume, Scott Johnston, Shauna Kravec, Liane Lovitt, Neel Nanda, Catherine Olsson, Dario Amodei , et al. (6 additional authors not shown)

Abstract: We apply preference modeling and reinforcement learning from human feedback (RLHF) to finetune language models to act as helpful and harmless assistants. We find this alignment training improves performance on almost all NLP evaluations, and is fully compatible with training for specialized skills such as python coding and summarization. We explore an iterated online mode of training, where prefer… ▽ More We apply preference modeling and reinforcement learning from human feedback (RLHF) to finetune language models to act as helpful and harmless assistants. We find this alignment training improves performance on almost all NLP evaluations, and is fully compatible with training for specialized skills such as python coding and summarization. We explore an iterated online mode of training, where preference models and RL policies are updated on a weekly cadence with fresh human feedback data, efficiently improving our datasets and models. Finally, we investigate the robustness of RLHF training, and identify a roughly linear relation between the RL reward and the square root of the KL divergence between the policy and its initialization. Alongside our main results, we perform peripheral analyses on calibration, competing objectives, and the use of OOD detection, compare our models with human writers, and provide samples from our models using prompts appearing in recent related work. △ Less

Submitted 12 April, 2022; originally announced April 2022.

Comments: Data available at https://github.com/anthropics/hh-rlhf

arXiv:2204.03721 [pdf, other]

doi 10.1073/pnas.2206096119

Encapsulated bacteria deform lipid vesicles into flagellated swimmers

Authors: Lucas Le Nagard, Aidan T. Brown, Angela Dawson, Vincent A. Martinez, Wilson C. K. Poon, Margarita Staykova

Abstract: We study a synthetic system of motile Escherichia coli bacteria encapsulated inside giant lipid vesicles. Forces exerted by the bacteria on the inner side of the membrane are sufficient to extrude membrane tubes filled with one or several bacteria. We show that a physical coupling between the membrane tube and the flagella of the enclosed cells transforms the tube into an effective helical flagell… ▽ More We study a synthetic system of motile Escherichia coli bacteria encapsulated inside giant lipid vesicles. Forces exerted by the bacteria on the inner side of the membrane are sufficient to extrude membrane tubes filled with one or several bacteria. We show that a physical coupling between the membrane tube and the flagella of the enclosed cells transforms the tube into an effective helical flagellum propelling the vesicle. We develop a simple theoretical model to estimate the propulsive force from the speed of the vesicles, and demonstrate the good efficiency of this coupling mechanism. Together, these results point to design principles for conferring motility to synthetic cells. △ Less

Submitted 29 August, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

Comments: 22 pages, 12 figures

Journal ref: Proceedings of the National Academy of Sciences, 119(34), e2206096119 (2022)

arXiv:2202.07785 [pdf, other]

doi 10.1145/3531146.3533229

Predictability and Surprise in Large Generative Models

Authors: Deep Ganguli, Danny Hernandez, Liane Lovitt, Nova DasSarma, Tom Henighan, Andy Jones, Nicholas Joseph, Jackson Kernion, Ben Mann, Amanda Askell, Yuntao Bai, Anna Chen, Tom Conerly, Dawn Drain, Nelson Elhage, Sheer El Showk, Stanislav Fort, Zac Hatfield-Dodds, Scott Johnston, Shauna Kravec, Neel Nanda, Kamal Ndousse, Catherine Olsson, Daniela Amodei, Dario Amodei , et al. (5 additional authors not shown)

Abstract: Large-scale pre-training has recently emerged as a technique for creating capable, general purpose, generative models such as GPT-3, Megatron-Turing NLG, Gopher, and many others. In this paper, we highlight a counterintuitive property of such models and discuss the policy implications of this property. Namely, these generative models have an unusual combination of predictable loss on a broad train… ▽ More Large-scale pre-training has recently emerged as a technique for creating capable, general purpose, generative models such as GPT-3, Megatron-Turing NLG, Gopher, and many others. In this paper, we highlight a counterintuitive property of such models and discuss the policy implications of this property. Namely, these generative models have an unusual combination of predictable loss on a broad training distribution (as embodied in their "scaling laws"), and unpredictable specific capabilities, inputs, and outputs. We believe that the high-level predictability and appearance of useful capabilities drives rapid development of such models, while the unpredictable qualities make it difficult to anticipate the consequences of model deployment. We go through examples of how this combination can lead to socially harmful behavior with examples from the literature and real world observations, and we also perform two novel experiments to illustrate our point about harms from unpredictability. Furthermore, we analyze how these conflicting properties combine to give model developers various motivations for deploying these models, and challenges that can hinder deployment. We conclude with a list of possible interventions the AI community may take to increase the chance of these models having a beneficial impact. We intend this paper to be useful to policymakers who want to understand and regulate AI systems, technologists who care about the potential policy impact of their work, and academics who want to analyze, critique, and potentially develop large generative models. △ Less

Submitted 3 October, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

Comments: Updated to reflect the version submitted (and accepted) to ACM FAccT '22. This update incorporates feedback from peer-review and fixes minor typos. See open access FAccT conference version at: https://dl.acm.org/doi/abs/10.1145/3531146.3533229

arXiv:2201.13296 [pdf, other]

doi 10.3847/1538-4357/ac739e

An Isolated Stellar-Mass Black Hole Detected Through Astrometric Microlensing

Authors: Kailash C. Sahu, Jay Anderson, Stefano Casertano, Howard E. Bond, Andrzej Udalski, Martin Dominik, Annalisa Calamida, Andrea Bellini, Thomas M. Brown, Marina Rejkuba, Varun Bajaj, Noe Kains, Henry C. Ferguson, Chris L. Fryer, Philip Yock, Przemek Mroz, Szymon Kozlowski, Pawel Pietrukowicz, Radek Poleski, Jan Skowron, Igor Soszynski, Michael K. Szymanski, Krzysztof Ulaczyk, Lukasz Wyrzykowski, Richard Barry , et al. (68 additional authors not shown)

Abstract: We report the first unambiguous detection and mass measurement of an isolated stellar-mass black hole (BH). We used the Hubble Space Telescope (HST) to carry out precise astrometry of the source star of the long-duration (t_E~270 days), high-magnification microlensing event MOA-2011-BLG-191/OGLE-2011-BLG-0462 (hereafter designated as MOA-11-191/OGLE-11-462), in the direction of the Galactic bulge.… ▽ More We report the first unambiguous detection and mass measurement of an isolated stellar-mass black hole (BH). We used the Hubble Space Telescope (HST) to carry out precise astrometry of the source star of the long-duration (t_E~270 days), high-magnification microlensing event MOA-2011-BLG-191/OGLE-2011-BLG-0462 (hereafter designated as MOA-11-191/OGLE-11-462), in the direction of the Galactic bulge. HST imaging, conducted at eight epochs over an interval of six years, reveals a clear relativistic astrometric deflection of the background star's apparent position. Ground-based photometry of MOA-11-191/OGLE-11-462 shows a parallactic signature of the effect of the Earth's motion on the microlensing light curve. Combining the HST astrometry with the ground-based light curve and the derived parallax, we obtain a lens mass of 7.1 +/- 1.3 Msun and a distance of 1.58 +/- 0.18 kpc. We show that the lens emits no detectable light, which, along with having a mass higher than is possible for a white dwarf or neutron star, confirms its BH nature. Our analysis also provides an absolute proper motion for the BH. The proper motion is offset from the mean motion of Galactic-disk stars at similar distances by an amount corresponding to a transverse space velocity of ~45 km/s, suggesting that the BH received a 'natal kick' from its supernova explosion. Previous mass determinations for stellar-mass BHs have come from radial-velocity measurements of Galactic X-ray binaries, and from gravitational radiation emitted by merging BHs in binary systems in external galaxies. Our mass measurement is the first for an isolated stellar-mass BH using any technique. △ Less

Submitted 22 July, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

Comments: 37 pages, Published in ApJ

Journal ref: ApJ, 933, 83 (2022)

arXiv:2201.04644 [pdf, other]

doi 10.1093/mnras/stac304

Intrinsic alignments of the extended radio continuum emission of galaxies in the EAGLE simulations

Authors: Alexander D. Hill, Robert A. Crain, Ian G. McCarthy, Shaun T. Brown

Abstract: We present measurements of the intrinsic alignments (IAs) of the star-forming gas of galaxies in the EAGLE simulations. Radio continuum imaging of this gas enables cosmic shear measurements complementary to optical surveys. We measure the orientation of star-forming gas with respect to the direction to, and orientation of, neighbouring galaxies. Star-forming gas exhibits a preferentially radial or… ▽ More We present measurements of the intrinsic alignments (IAs) of the star-forming gas of galaxies in the EAGLE simulations. Radio continuum imaging of this gas enables cosmic shear measurements complementary to optical surveys. We measure the orientation of star-forming gas with respect to the direction to, and orientation of, neighbouring galaxies. Star-forming gas exhibits a preferentially radial orientation-direction alignment that is a decreasing function of galaxy pair separation, but remains significant to $\gtrsim 1$ Mpc at $z=0$. The alignment is qualitatively similar to that exhibited by the stars, but is weaker at fixed separation. Pairs of galaxies hosted by more massive subhaloes exhibit stronger alignment at fixed separation, but the strong alignment of close pairs is dominated by ${\sim}L^\star$ galaxies and their satellites. At fixed comoving separation, the radial alignment is stronger at higher redshift. The orientation-orientation alignment is consistent with random at all separations, despite subhaloes exhibiting preferential parallel minor axis alignment. The weaker IA of star-forming gas than for stars stems from the former's tendency to be less well aligned with the dark matter structure of galaxies than the latter, and implies that the systematic uncertainty due to IA may be less severe in radio continuum weak lensing surveys than in optical counterparts. Alignment models equating the orientation of star-forming gas discs to that of stellar discs or the DM structure of host subhaloes will therefore overestimate the impact of IAs on radio continuum cosmic shear measurements. △ Less

Submitted 12 January, 2022; originally announced January 2022.

Comments: 18 pages, 13 figures. Paper submitted to MNRAS

arXiv:2201.01239 [pdf]

The Most Difference in Means: A Statistic for the Strength of Null and Near-Zero Results

Authors: Bruce A. Corliss, Taylor R. Brown, Tingting Zhang, Kevin A. Janes, Heman Shakeri, Philip E. Bourne

Abstract: Statistical insignificance does not suggest the absence of effect, yet scientists must often use null results as evidence of negligible (near-zero) effect size to falsify scientific hypotheses. Doing so must assess a result's null strength, defined as the evidence for a negligible effect size. Such an assessment would differentiate strong null results that suggest a negligible effect size from wea… ▽ More Statistical insignificance does not suggest the absence of effect, yet scientists must often use null results as evidence of negligible (near-zero) effect size to falsify scientific hypotheses. Doing so must assess a result's null strength, defined as the evidence for a negligible effect size. Such an assessment would differentiate strong null results that suggest a negligible effect size from weak null results that suggest a broad range of potential effect sizes. We propose the most difference in means ($δ_M$) as a two-sample statistic that can both quantify null strength and perform a hypothesis test for negligible effect size. To facilitate consensus when interpreting results, our statistic allows scientists to conclude that a result has negligible effect size using different thresholds with no recalculation required. To assist with selecting a threshold, $δ_M$ can also compare null strength between related results. Both $δ_M$ and the relative form of $δ_M$ outperform other candidate statistics in comparing null strength. We compile broadly related results and use the relative $δ_M$ to compare null strength across different treatments, measurement methods, and experiment models. Reporting the relative $δ_M$ may provide a technical solution to the file drawer problem by encouraging the publication of null and near-zero results. △ Less

Submitted 24 May, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

arXiv:2112.15259 [pdf, other]

Elimination (a,b)-trees with fast, durable updates

Authors: Anubhav Srivastava, Trevor Brown

Abstract: Many concurrent dictionary implementations are designed and optimized for read-mostly workloads with uniformly distributed keys, and often perform poorly on update-heavy workloads. In this work, we first present a concurrent (a,b)-tree, the OCC-ABtree, which outperforms its fastest competitor by up to 2x on uniform update-heavy workloads, and is competitive on other workloads. We then turn our att… ▽ More Many concurrent dictionary implementations are designed and optimized for read-mostly workloads with uniformly distributed keys, and often perform poorly on update-heavy workloads. In this work, we first present a concurrent (a,b)-tree, the OCC-ABtree, which outperforms its fastest competitor by up to 2x on uniform update-heavy workloads, and is competitive on other workloads. We then turn our attention to skewed update-heavy workloads (which feature many inserts/deletes on the same key) and introduce the Elim-ABtree, which uses a new optimization called publishing elimination. In publishing elimination, concurrent inserts and deletes to a key are reordered to eliminate them. This reduces the number of writes in the data structure. The Elim-ABtree achieves up to 2.5x the performance of its fastest competitor (including the OCC-ABtree). The OCC-ABtree and Elim-ABtree are linearizable. We also introduce durable linearizable versions (for systems with Intel Optane DCPMM non-volatile main memory) that are nearly as fast. △ Less

Submitted 30 December, 2021; originally announced December 2021.

Comments: 22 pages, 17 figures, 1 table. Full version of the paper to published in Principles and Practice of Parallel Programming (PPoPP) 2022

ACM Class: E.1

arXiv:2112.06667 [pdf, other]

Long-Term Benefits of Network Boosters for Renewables Integration and Corrective Grid Security

Authors: Amin Shokri Gazafroudi, Elisabeth Zeyen, Martha Frysztacki, Fabian Neumann, Tom Brown

Abstract: The preventative strategies for $N-1$ network security dominant in European networks mean that network capacity is kept free in case a line fails. If instead fast corrective actions are used to overcome network overloading when single lines fail, this has the potential to free up network capacity that is otherwise underused in preventive $N-1$ security strategies. In this paper, we investigate the… ▽ More The preventative strategies for $N-1$ network security dominant in European networks mean that network capacity is kept free in case a line fails. If instead fast corrective actions are used to overcome network overloading when single lines fail, this has the potential to free up network capacity that is otherwise underused in preventive $N-1$ security strategies. In this paper, we investigate the impact on renewable integration of a corrective network security strategy, whereby storage or other flexibility assets are used to correct overloading shortly after line outages. In this way, we find significant cost savings for the integration of renewable energy of up to 2.4 billion euros per year in an aggregated 50-bus model of the German power system utilizing these flexibility assets, so-called network boosters (NB). This offers a role for storage beyond energy arbitrage or ancillary services like frequency control. While previous literature has focused on the potential savings of NB in the short-term operation, we focus on the long-term benefits in systems with high shares of renewable energy sources, where the capacities and dispatch of generation and NB are optimised. We demonstrate the benefits of NB for various shares of renewable energy, NB and flexibility costs, as well as different allowed levels of temporary overloading the lines in both (i) a sequential model, where long-run generation investments are optimised separately from the NB capacities, and (ii) a simultaneous model, where generation is co-optimised with NB investment so that mixed preventive-corrective approaches are possible. △ Less

Submitted 16 May, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

Comments: Preprint submitted to International Journal of Electrical Power & Energy Systems

arXiv:2112.00861 [pdf, other]

A General Language Assistant as a Laboratory for Alignment

Authors: Amanda Askell, Yuntao Bai, Anna Chen, Dawn Drain, Deep Ganguli, Tom Henighan, Andy Jones, Nicholas Joseph, Ben Mann, Nova DasSarma, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Jackson Kernion, Kamal Ndousse, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam McCandlish, Chris Olah, Jared Kaplan

Abstract: Given the broad capabilities of large language models, it should be possible to work towards a general-purpose, text-based assistant that is aligned with human values, meaning that it is helpful, honest, and harmless. As an initial foray in this direction we study simple baseline techniques and evaluations, such as prompting. We find that the benefits from modest interventions increase with model… ▽ More Given the broad capabilities of large language models, it should be possible to work towards a general-purpose, text-based assistant that is aligned with human values, meaning that it is helpful, honest, and harmless. As an initial foray in this direction we study simple baseline techniques and evaluations, such as prompting. We find that the benefits from modest interventions increase with model size, generalize to a variety of alignment evaluations, and do not compromise the performance of large models. Next we investigate scaling trends for several training objectives relevant to alignment, comparing imitation learning, binary discrimination, and ranked preference modeling. We find that ranked preference modeling performs much better than imitation learning, and often scales more favorably with model size. In contrast, binary discrimination typically performs and scales very similarly to imitation learning. Finally we study a `preference model pre-training' stage of training, with the goal of improving sample efficiency when finetuning on human preferences. △ Less

Submitted 9 December, 2021; v1 submitted 1 December, 2021; originally announced December 2021.

Comments: 26+19 pages; v2 typos fixed, refs added, figure scale / colors fixed; v3 correct very non-standard TruthfulQA formatting and metric, alignment implications slightly improved

arXiv:2111.14443 [pdf, other]

Broad Ranges of Investment Configurations for Renewable Power Systems, Robust to Cost Uncertainty and Near-Optimality

Authors: Fabian Neumann, Tom Brown

Abstract: To achieve ambitious greenhouse gas emission reduction targets in time, the planning of future energy systems needs to accommodate societal preferences, e.g. low levels of acceptance for transmission expansion or onshore wind turbines, and must also acknowledge the inherent uncertainties of technology cost projections. To date, however, many capacity expansion models lean heavily towards only mini… ▽ More To achieve ambitious greenhouse gas emission reduction targets in time, the planning of future energy systems needs to accommodate societal preferences, e.g. low levels of acceptance for transmission expansion or onshore wind turbines, and must also acknowledge the inherent uncertainties of technology cost projections. To date, however, many capacity expansion models lean heavily towards only minimising system cost and only studying a few cost projections. Here, we address both criticisms in unison. While taking account of technology cost uncertainties, we apply methods from multi-objective optimisation to explore trade-offs in a fully renewable European electricity system between increasing system cost and extremising the use of individual technologies for generating, storing and transmitting electricity to build robust insights about what actions are viable within given cost ranges. We identify boundary conditions that must be met for cost-efficiency regardless of how cost developments will unfold; for instance, that some grid reinforcement and long-term storage alongside a significant amount of wind capacity appear essential. But, foremost, we reveal that near the cost-optimum a broad spectrum of regionally and technologically diverse options exists in any case, which allows policymakers to navigate around public acceptance issues. The analysis requires managing many computationally demanding scenario runs efficiently, for which we leverage multi-fidelity surrogate modelling techniques using sparse polynomial chaos expansions and low-discrepancy sampling. △ Less

Submitted 29 November, 2021; originally announced November 2021.

arXiv:2111.10331 [pdf, other]

Maximum arrangements of nonattacking kings on the $2n\times 2n$ chessboard

Authors: Tricia Muldoon Brown

Abstract: To count the number of maximum independent arrangements of $n^2$ kings on a $2n\times 2n$ chessboard, we build a $2^n \times (n+1)$ matrix whose entries are independent arrangements of $n$ kings on $2\times 2n$ rectangles. Utilizing upper and lower bound functions dependent of the entries of the matrix, we recursively construct independent solutions, and provide a straight-forward formula and algo… ▽ More To count the number of maximum independent arrangements of $n^2$ kings on a $2n\times 2n$ chessboard, we build a $2^n \times (n+1)$ matrix whose entries are independent arrangements of $n$ kings on $2\times 2n$ rectangles. Utilizing upper and lower bound functions dependent of the entries of the matrix, we recursively construct independent solutions, and provide a straight-forward formula and algorithm. △ Less

Submitted 14 January, 2022; v1 submitted 19 November, 2021; originally announced November 2021.

arXiv:2111.00937 [pdf, other]

doi 10.3847/1538-4365/ac28f5

VERTICO: The Virgo Environment Traced In CO Survey

Authors: Toby Brown, Christine D. Wilson, Nikki Zabel, Timothy A. Davis, Alessandro Boselli, Aeree Chung, Sara L. Ellison, Claudia D. P. Lagos, Adam R. H. Stevens, Luca Cortese, Yannick M. Bahé, Dhruv Bisaria, Alberto D. Bolatto, Claire R. Cashmore, Barbara Catinella, Ryan Chown, Benedikt Diemer, Pascal J. Elahi, Maan H. Hani, María J. Jiménez-Donaire, Bumhyun Lee, Katya Leidig, Angus Mok, Karen Pardos Olsen, Laura C. Parker , et al. (11 additional authors not shown)

Abstract: We present the Virgo Environment Traced in CO (VERTICO) survey, a new effort to map $^{12}$CO($2-1$), $^{13}$CO($2-1$), and C$^{18}$O($2-1$) in 51 Virgo Cluster galaxies with the Atacama Compact Array, part of the Atacama Large Millimeter/submillimeter Array (ALMA). The primary motivation of VERTICO is to understand the physical mechanisms that perturb molecular gas disks, and therefore star forma… ▽ More We present the Virgo Environment Traced in CO (VERTICO) survey, a new effort to map $^{12}$CO($2-1$), $^{13}$CO($2-1$), and C$^{18}$O($2-1$) in 51 Virgo Cluster galaxies with the Atacama Compact Array, part of the Atacama Large Millimeter/submillimeter Array (ALMA). The primary motivation of VERTICO is to understand the physical mechanisms that perturb molecular gas disks, and therefore star formation and galaxy evolution, in dense environments. This first paper contains an overview of VERTICO's design and sample selection, $^{12}$CO($2-1$) observations, and data reduction procedures. We characterize global $^{12}$CO($2-1$) fluxes and molecular gas masses for the 49 detected VERTICO galaxies, provide upper limits for the two non-detections, and produce resolved $^{12}$CO($2-1$) data products (median resolution $= 8^{\prime\prime} \approx 640~{\rm pc}$). Azimuthally averaged $^{12}$CO($2-1$) radial intensity profiles are presented along with derived molecular gas radii. We demonstrate the scientific power of VERTICO by comparing the molecular gas size--mass scaling relation for our galaxies with a control sample of field galaxies, highlighting the strong effect that radius definition has on this correlation. We discuss the drivers of the form and scatter in the size--mass relation and highlight areas for future work. VERTICO is an ideal resource for studying the fate of molecular gas in cluster galaxies and the physics of environment-driven processes that perturb the star formation cycle. Upon public release, the survey will provide a homogeneous legacy dataset for studying galaxy evolution in our closest cluster. △ Less

Submitted 1 November, 2021; originally announced November 2021.

Comments: 68 pages, 13 Figures, 2 Figure Sets, Accepted for publication in ApJS, Online FITS versions of Tables 1, 2, and 3 are available with the journal publication

arXiv:2110.01632 [pdf, other]

doi 10.1093/mnras/stab3394

Towards a universal model for the density profiles of dark matter haloes

Authors: Shaun T. Brown, Ian G. McCarthy, Sam G. Stafford, Andreea S. Font

Abstract: It is well established from cosmological simulations that dark matter haloes are not precisely self-similar and an additional parameter, beyond their concentration, is required to accurately describe their spherically-averaged mass density profiles. We present, for the first time, a model to consistently predict both halo concentration, $c$, and this additional `shape' parameter, $α$, for a halo o… ▽ More It is well established from cosmological simulations that dark matter haloes are not precisely self-similar and an additional parameter, beyond their concentration, is required to accurately describe their spherically-averaged mass density profiles. We present, for the first time, a model to consistently predict both halo concentration, $c$, and this additional `shape' parameter, $α$, for a halo of given mass and redshift for a specified cosmology. Following recent studies, we recast the dependency on mass, redshift, and cosmology to a dependence on `peak height'. We show that, when adopting the standard definition of peak height, which employs the so-called spherical top hat (STH) window function, the concentration--peak height relation has a strong residual dependence on cosmology (i.e., it is not uniquely determined by peak height), whereas the $α$--peak height relation is approximately universal when employing the STH window function. Given the freedom in the choice of window function, we explore a simple modification of the STH function, constraining its form so that it produces universal relations for concentration and $α$ as a function of peak height using a large suite of cosmological simulations. It is found that universal relations for the two density profile parameters can indeed be derived and that these parameters are set by the linear power spectrum, $P(k)$, filtered on different scales. We show that the results of this work generalise to any (reasonable) combination of $P(k)$ and background expansion history, $H(z)$, resulting in accurate predictions of the density profiles of dark matter haloes for a wide range of cosmologies. △ Less

Submitted 22 November, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

Comments: 16 pages, 8 figures, accepted to MNRAS

arXiv:2109.11956 [pdf, other]

doi 10.1093/mnras/stab2787

Testing extensions to LCDM on small scales with forthcoming cosmic shear surveys

Authors: Sam G. Stafford, Ian G McCarthy, Juliana Kwan, Shaun T. Brown, Andreea S. Font, Andrew Robertson

Abstract: We investigate the constraining power of forthcoming Stage-IV weak lensing surveys (Euclid, LSST, and NGRST) for extensions to the LCDM model on small scales, via their impact on the cosmic shear power spectrum. We use high-resolution cosmological simulations to calculate how warm dark matter (WDM), self-interacting dark matter (SIDM) and a running of the spectral index affect the non-linear matte… ▽ More We investigate the constraining power of forthcoming Stage-IV weak lensing surveys (Euclid, LSST, and NGRST) for extensions to the LCDM model on small scales, via their impact on the cosmic shear power spectrum. We use high-resolution cosmological simulations to calculate how warm dark matter (WDM), self-interacting dark matter (SIDM) and a running of the spectral index affect the non-linear matter power spectrum, P(k), as a function of scale and redshift. We evaluate the cosmological constraining power using synthetic weak lensing observations derived from these power spectra and that take into account the anticipated source densities, shape noise and cosmic variance errors of upcoming surveys. We show that upcoming Stage-IV surveys will be able to place useful, independent constraints on both WDM models (ruling out models with a particle mass of < 0.5 keV) and SIDM models (ruling out models with a velocity-independent cross-section of > 10 cm^2 g^-1) through their effects on the small-scale cosmic shear power spectrum. Similarly, they will be able to strongly constrain cosmologies with a running spectral index. Finally, we explore the error associated with the cosmic shear cross-spectrum between tomographic bins, finding that it can be significantly affected by Poisson noise (the standard assumption is that the Poisson noise cancels between tomographic bins). We provide a new analytic form for the error on the cross-spectrum which accurately captures this effect. △ Less

Submitted 24 September, 2021; originally announced September 2021.

Comments: 19 pages, 7 figures, accepted for publication in MNRAS

arXiv:2109.09563 [pdf, other]

doi 10.1016/j.joule.2022.04.016

Speed of technological transformations required in Europe to achieve different climate goals

Authors: Marta Victoria, Elisabeth Zeyen, Tom Brown

Abstract: Europe's contribution to global warming will be determined by the cumulative emissions until climate neutrality is achieved. In this paper, we investigate alternative transition paths under carbon budgets corresponding to temperature increases between 1.5 and 2C. We use PyPSA-Eur-Sec, an open model of the sector-coupled European energy system with high spatial and temporal resolution. All the path… ▽ More Europe's contribution to global warming will be determined by the cumulative emissions until climate neutrality is achieved. In this paper, we investigate alternative transition paths under carbon budgets corresponding to temperature increases between 1.5 and 2C. We use PyPSA-Eur-Sec, an open model of the sector-coupled European energy system with high spatial and temporal resolution. All the paths entail similar technological transformations, but the timing of the scale-up of important technologies like water electrolysis, carbon capture and hydrogen networks differs in the model. In our results, solar PV, onshore and offshore wind become the cornerstone of a net-zero energy system enabling the decarbonisation of other sectors via direct electrification (e.g. heat pumps and electric vehicles) or indirect electrification (e.g. using synthetic fuels). Under the cost and performance assumptions applied, for a social cost of carbon (SCC) of 120EUR/tCO2, transition paths under 1.5 and 1.6C budgets are, respectively, 8%, and 1% more expensive than the 2C-budget because building assets earlier costs more. These pathways also see a faster ramp-up of new technologies before 2035. Under these assumptions, the 1.5C-budget is cost-optimal in our model, if SCC of at least 300 EUR/tCO2 is considered. Moreover, we discuss the strong implications of the SCC and discount rate assumed when comparing alternative paths. We also analyse the consequences of different assumptions on the cost and potential of CO2 sequestration. △ Less

Submitted 28 January, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

Journal ref: Joule, 2022

arXiv:2109.08708 [pdf, other]

doi 10.3847/1538-3881/ac281f

Relative Ages of Nine Inner Milky Way Globular Clusters from Proper Motion Cleaned Color-Magnitude Diagrams

Authors: Roger E. Cohen, Andrea Bellini, Luca Casagrande, Thomas M. Brown, Matteo Correnti, Jason S. Kalirai

Abstract: Our picture of the age-metallicity relation for Milky Way globular clusters (MWGCs) is still highly incomplete, and the majority of MWGCs lack self-consistent age measurements. Here, we exploit deep, homogenous multi-epoch Hubble Space Telescope (HST) imaging of nine MWGCs located towards the inner Milky Way to measure their relative ages, in most cases for the first time. Our relative age measure… ▽ More Our picture of the age-metallicity relation for Milky Way globular clusters (MWGCs) is still highly incomplete, and the majority of MWGCs lack self-consistent age measurements. Here, we exploit deep, homogenous multi-epoch Hubble Space Telescope (HST) imaging of nine MWGCs located towards the inner Milky Way to measure their relative ages, in most cases for the first time. Our relative age measurements are designed to be directly comparable to the large set of MWGC ages presented by VandenBerg et al. (2013, V13), using identical filters, evolutionary models, and bolometric corrections, extended to the higher extinction values relevant to our target clusters. Adopting the V13 MWGC age scale, our relative age measurements imply that our target clusters are consistently very old, with a mean age of 12.9$\pm$0.4 Gyr, with the exception of the young metal-rich MWGC NGC 6342. We perform two tests to validate the precision of our methodology, and discuss the implications of our target cluster loci in the MWGC age-metallicity plane. In addition, we use our fully self-consistent bolometric corrections to assess the systematic impact of variations in the total-to-selective extinction ratio $R_{V}$ on relative age measurements. △ Less

Submitted 17 September, 2021; originally announced September 2021.

Comments: AJ Accepted. 15 pages, 5 figures, 3 tables

arXiv:2109.06215 [pdf, other]

doi 10.1093/mnras/stac183

Quenching of satellite galaxies of Milky Way analogues: reconciling theory and observations

Authors: Andreea S. Font, Ian G. McCarthy, Vasily Belokurov, Shaun T. Brown, Sam G. Stafford

Abstract: The vast majority of low-mass satellite galaxies around the Milky Way and M31 appear virtually devoid of cool gas and show no signs of recent or ongoing star formation. Cosmological simulations demonstrate that such quenching is expected and is due to the harsh environmental conditions that satellites face when joining the Local Group (LG). However, recent observations of Milky Way analogues in th… ▽ More The vast majority of low-mass satellite galaxies around the Milky Way and M31 appear virtually devoid of cool gas and show no signs of recent or ongoing star formation. Cosmological simulations demonstrate that such quenching is expected and is due to the harsh environmental conditions that satellites face when joining the Local Group (LG). However, recent observations of Milky Way analogues in the SAGA survey present a very different picture, showing the majority of observed satellites to be actively forming stars, calling into question the realism of current simulations and the typicality of the LG. Here we use the ARTEMIS suite of high-resolution cosmological hydrodynamical simulations to carry out a careful comparison with observations of dwarf satellites in the LG, SAGA, and the Local Volume (LV) survey. We show that differences between SAGA and the LG and LV surveys, as well as between SAGA and the ARTEMIS simulations, can be strongly reduced by considering differences in the host mass distributions and (more importantly) observational selection effects, specifically that low-mass satellites which have only recently been accreted are more likely to be star-forming, have a higher optical surface brightness, and are therefore more likely to be included in the SAGA survey. This picture is confirmed using data from the deeper LV survey, which shows pronounced quenching at low masses, in accordance with the predictions of LCDM-based simulations. △ Less

Submitted 19 January, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

Comments: 13 pages, 8 figures, MNRAS, accepted for publication. Host mass rescaling corrected and additional discussion of observational selection effects included

arXiv:2108.04271 [pdf, other]

doi 10.3847/2041-8213/ac2aa3

Star Formation Histories of Ultra-Faint Dwarf Galaxies: environmental differences between Magellanic and non-Magellanic satellites?

Authors: Elena Sacchi, Hannah Richstein, Nitya Kallivayalil, Roeland van der Marel, Mattia Libralato, Paul Zivick, Gurtina Besla, Thomas M. Brown, Yumi Choi, Alis Deason, Tobias Fritz, Marla Geha, Puragra Guhathakurta, Myoungwon Jeon, Evan Kirby, Steven R. Majewski, Ekta Patel, Joshua D. Simon, Sangmo Tony Sohn, Erik Tollerud, Andrew Wetzel

Abstract: We present the color-magnitude diagrams and star formation histories (SFHs) of seven ultra-faint dwarf galaxies: Horologium 1, Hydra 2, Phoenix 2, Reticulum 2, Sagittarius 2, Triangulum 2, and Tucana 2, derived from high-precision Hubble Space Telescope photometry. We find that the SFH of each galaxy is consistent with them having created at least 80% of the stellar mass by $z\sim6$. For all galax… ▽ More We present the color-magnitude diagrams and star formation histories (SFHs) of seven ultra-faint dwarf galaxies: Horologium 1, Hydra 2, Phoenix 2, Reticulum 2, Sagittarius 2, Triangulum 2, and Tucana 2, derived from high-precision Hubble Space Telescope photometry. We find that the SFH of each galaxy is consistent with them having created at least 80% of the stellar mass by $z\sim6$. For all galaxies, we find quenching times older than 11.5 Gyr ago, compatible with the scenario in which reionization suppresses the star formation of small dark matter halos. However, our analysis also reveals some differences in the SFHs of candidate Magellanic Cloud satellites, i.e., galaxies that are likely satellites of the Large Magellanic Cloud and that entered the Milky Way potential only recently. Indeed, Magellanic satellites show quenching times about 600 Myr more recent with respect to those of other Milky Way satellites, on average, even though the respective timings are still compatible within the errors. This finding is consistent with theoretical models that suggest that satellites' SFHs may depend on their host environment at early times, although we caution that within the error bars all galaxies in our sample are consistent with being quenched at a single epoch. △ Less

Submitted 28 September, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

Comments: 7 pages, 3 figures, 2 tables. Accepted for publication in ApJL

arXiv:2108.03982 [pdf, other]

Optimisation of an FPGA Credit Default Swap engine by embracing dataflow techniques

Authors: Nick Brown, Mark Klaisoongnoen, Oliver Thomson Brown

Abstract: Quantitative finance is the use of mathematical models to analyse financial markets and securities. Typically requiring significant amounts of computation, an important question is the role that novel architectures can play in accelerating these models in the future on HPC machines. In this paper we explore the optimisation of an existing, open source, FPGA based Credit Default Swap (CDS) engine u… ▽ More Quantitative finance is the use of mathematical models to analyse financial markets and securities. Typically requiring significant amounts of computation, an important question is the role that novel architectures can play in accelerating these models in the future on HPC machines. In this paper we explore the optimisation of an existing, open source, FPGA based Credit Default Swap (CDS) engine using High Level Synthesis (HLS). Developed by Xilinx, and part of their open source Vitis libraries, the implementation of this engine currently favours flexibility and ease of integration over performance. We explore redesigning the engine to fully embrace the dataflow approach, ultimately resulting in an engine which is around eight times faster on an Alveo U280 FPGA than the original Xilinx library version. We then compare five of our engines on the U280 against a 24-core Xeon Platinum Cascade Lake CPU, outperforming the CPU by around 1.55 times, with the FPGA consuming 4.7 times less power and delivering around seven times the power efficiency of the CPU. △ Less

Submitted 28 July, 2021; originally announced August 2021.

Comments: Preprint of article in the IEEE Cluster FPGA for HPC Workshop 2021 (HPC FPGA 2021)

arXiv:2107.13579 [pdf, other]

doi 10.1038/s41467-022-31638-0

Trade off-Free Entanglement Stabilization in a Superconducting Qutrit-Qubit System

Authors: Tristan Brown, Emery Doucet, Diego Ristè, Guilhem Ribeill, Katarina Cicak, Joe Aumentado, Ray Simmonds, Luke Govia, Archana Kamal, Leonardo Ranzani

Abstract: Quantum reservoir engineering is a powerful framework for autonomous quantum state preparation and error correction. However, traditional approaches to reservoir engineering are hindered by unavoidable coherent leakage out of the target state, which imposes an inherent trade off between achievable steady-state state fidelity and stabilization rate. In this work we demonstrate a protocol that achie… ▽ More Quantum reservoir engineering is a powerful framework for autonomous quantum state preparation and error correction. However, traditional approaches to reservoir engineering are hindered by unavoidable coherent leakage out of the target state, which imposes an inherent trade off between achievable steady-state state fidelity and stabilization rate. In this work we demonstrate a protocol that achieves trade off-free Bell state stabilization in a qutrit-qubit system realized on a circuit-QED platform. We accomplish this by creating a purely dissipative channel for population transfer into the target state, mediated by strong parametric interactions coupling the second-excited state of a superconducting transmon and the engineered bath resonator. Our scheme achieves a state preparation fidelity of 84% with a stabilization time constant of 339 ns, leading to the lowest error-time product reported in solid-state quantum information platforms to date. △ Less

Submitted 13 August, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

Comments: 19 pages, 14 figures

arXiv:2107.12159 [pdf, other]

Enhanced Meta-Displays Using Advanced Phase-Change Materials

Authors: Omid Hemmatyar, Sajjad Abdollahramezani, Ioannis Zeimpekis, Sergey Lepeshov, Alex Krasnok, Asir Intisar Khan, Kathryn M. Neilson, Christian Teichrib, Tyler Brown, Eric Pop, Daniel W. Hewak, Matthias Wuttig, Andrea Alu, Otto L. Muskens, Ali Adibi

Abstract: Structural colors generated due to light scattering from static all-dielectric metasurfaces have successfully enabled high-resolution, high-saturation, and wide-gamut color printing applications. Despite recent advances, most demonstrations of these structure-dependent colors lack post-fabrication tunability. This hinders their applicability for front-end dynamic display technologies. Phase-change… ▽ More Structural colors generated due to light scattering from static all-dielectric metasurfaces have successfully enabled high-resolution, high-saturation, and wide-gamut color printing applications. Despite recent advances, most demonstrations of these structure-dependent colors lack post-fabrication tunability. This hinders their applicability for front-end dynamic display technologies. Phase-change materials (PCMs), with significant contrast of their optical properties between their amorphous and crystalline states, have demonstrated promising potentials in reconfigurable nanophotonics. Herein, we leverage tunable all-dielectric reflective metasurfaces made of newly emerged classes of low-loss optical PCMs, i.e., antimony trisulphide (Sb$_2$S$_3$) and antimony triselenide (Sb$_2$Se$_3$), with superb characteristics to realize switchable, high-saturation, high-efficiency and high-resolution dynamic meta-pixels. Exploiting polarization-sensitive building blocks, the presented meta-pixel can generate two different colors when illuminated by either one of two orthogonally polarized incident beams. Such degrees of freedom (i.e., material phase and polarization state) enable a single reconfigurable metasurface with fixed geometrical parameters to generate four distinct wide-gamut colors. We experimentally demonstrate, for the first time, an electrically-driven micro-scale display through the integration of phase-change metasurfaces with an on-chip heater formed by transparent conductive oxide. Our experimental findings enable a versatile platform suitable for a wide range of applications, including tunable full-color printing, enhanced dynamic displays, information encryption, and anti-counterfeiting. △ Less

Submitted 19 July, 2021; originally announced July 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:2105.01313

arXiv:2107.01308 [pdf, other]

Deep Neural Nets with Fixed Bias Configuration

Authors: Harbir Antil, Thomas S. Brown, Rainald Löhner, Fumiya Togashi, Deepanshu Verma

Abstract: For any given neural network architecture a permutation of weights and biases results in the same functional network. This implies that optimization algorithms used to `train' or `learn' the network are faced with a very large number (in the millions even for small networks) of equivalent optimal solutions in the parameter space. To the best of our knowledge, this observation is absent in the lite… ▽ More For any given neural network architecture a permutation of weights and biases results in the same functional network. This implies that optimization algorithms used to `train' or `learn' the network are faced with a very large number (in the millions even for small networks) of equivalent optimal solutions in the parameter space. To the best of our knowledge, this observation is absent in the literature. In order to narrow down the parameter search space, a novel technique is introduced in order to fix the bias vector configurations to be monotonically increasing. This is achieved by augmenting a typical learning problem with inequality constraints on the bias vectors in each layer. A Moreau-Yosida regularization based algorithm is proposed to handle these inequality constraints and a theoretical convergence of the this algorithm is established. Applications of the proposed approach to standard trigonometric functions and more challenging stiff ordinary differential equations arising in chemically reacting ows clearly illustrate the benefits of the proposed approach. △ Less

Submitted 18 February, 2022; v1 submitted 2 July, 2021; originally announced July 2021.

arXiv:2107.01092 [pdf, other]

doi 10.1371/journal.pone.0281380

Import options for chemical energy carriers from renewable sources to Germany

Authors: Johannes Hampp, Michael Düren, Tom Brown

Abstract: Import and export of fossil energy carriers are cornerstones of energy systems world-wide. If energy systems are to become climate neutral and sustainable, fossil carriers need to be substituted with carbon neutral alternatives or electrified if possible. We investigate synthetic chemical energy carriers, H2, CH4, MeOH, NH3 and Fischer-Tropsch fuels (FTF), produced using electricity from RES as fo… ▽ More Import and export of fossil energy carriers are cornerstones of energy systems world-wide. If energy systems are to become climate neutral and sustainable, fossil carriers need to be substituted with carbon neutral alternatives or electrified if possible. We investigate synthetic chemical energy carriers, H2, CH4, MeOH, NH3 and Fischer-Tropsch fuels (FTF), produced using electricity from RES as fossil substitutes. [...] We model the sourcing of feedstock chemicals, synthesis and transport along nine different Energy Supply Chains to Germany (DE) and compare import options for seven locations around the world against each other and with domestically sourced alternatives on the basis of their respective cost per unit of H2 and energy delivered. We find that for each type of chemical energy carrier, there is an import option with lower costs compared to domestic production in DE. No single exporting country or energy carrier has a unique cost advantage, since for each energy carrier and country there are cost-competitive alternatives. This allows exporter and infrastructure decisions to be made based on other criteria than energy and cost. The lowest cost means for importing of energy and H2 are by H2 pipeline from Denmark, Spain and Western Asia and Northern Africa starting at 36 EUR/MWhLHV to 42 EUR/MWh-LHV or 1.0 EUR/kg-H2 to 1.3 EUR/kg-H2 (in 2050, assuming 5 % p.a. capital cost). For complex energy carriers derived from H2 like CH4, NH3, MeOH or FTF, imports from Argentina by ship to DE are lower cost than closer exporters in the European Union or Western Asia and Northern Africa. For meeting H2 demand, direct H2 imports are more attractive than indirect routes using CH4, MeOH or NH3 imports and subsequent decomposition to H2 because of high capital investment costs and energetic losses. We make our model and data available under open licenses for adaptation and reuse. △ Less

Submitted 2 February, 2023; v1 submitted 2 July, 2021; originally announced July 2021.

Journal ref: PLOS ONE, 2023

Showing 51–100 of 550 results for author: Brown, T