-
A global evidence map of human well-being and biodiversity co-benefits and trade-offs of natural climate solutions
Authors:
Charlotte H. Chang,
James T. Erbaugh,
Paola Fajardo,
Luci Lu,
István Molnár,
Dávid Papp,
Brian E. Robinson,
Kemen Austin,
Susan Cook-Patton,
Timm Kroeger,
Lindsey Smart,
Miguel Castro,
Samantha H. Cheng,
Peter W. Ellis,
Rob I. McDonald,
Teevrat Garg,
Erin E. Poor,
Preston Welker,
Andrew R. Tilman,
Stephen A. Wood,
Yuta J. Masuda
Abstract:
Natural climate solutions (NCS) are critical for mitigating climate change through ecosystem-based carbon removal and emissions reductions. NCS implementation can also generate biodiversity and human well-being co-benefits and trade-offs ("NCS co-impacts"), but the volume of evidence on NCS co-impacts has grown rapidly across disciplines, is poorly understood, and remains to be systematically coll…
▽ More
Natural climate solutions (NCS) are critical for mitigating climate change through ecosystem-based carbon removal and emissions reductions. NCS implementation can also generate biodiversity and human well-being co-benefits and trade-offs ("NCS co-impacts"), but the volume of evidence on NCS co-impacts has grown rapidly across disciplines, is poorly understood, and remains to be systematically collated and synthesized. A global evidence map of NCS co-impacts would overcome key barriers to NCS implementation by providing relevant information on co-benefits and trade-offs where carbon mitigation potential alone does not justify NCS projects. We employ large language models to assess over two million articles, finding 257,266 relevant articles on NCS co-impacts. We analyze this large and dispersed body of literature using innovative machine learning methods to extract relevant data (e.g., study location, species, and other key variables), and create a global evidence map on NCS co-impacts. Evidence on NCS co-impacts has grown approximately ten-fold in three decades, although some of the most abundant evidence is associated with pathways that have less mitigation potential. We find that studies often examine multiple NCS pathways, indicating natural NCS pathway complements, and each NCS is often associated with two or more coimpacts. Finally, NCS co-impacts evidence and priority areas for NCS are often mismatched--some countries with high mitigation potential from NCS have few published studies on the broader co-impacts of NCS implementation. Our work advances and makes available novel methods and systematic and representative data of NCS co-impacts studies, thus providing timely insights to inform NCS research and action globally.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
Rare events in a polling system: Rays and Spirals
Authors:
Robert D. Foley,
David R. McDonald
Abstract:
It's a situation everyone dreads. A road is down to one lane for repairs. Traffic is let through one way until the backlog clears and then traffic is let through the other way to clear that backlog and so on. When stuck in a very long queue it is inevitable to wonder how did I get into this mess?
We study a polling model with a server having exponential service time with mean $1/μ$ alternating b…
▽ More
It's a situation everyone dreads. A road is down to one lane for repairs. Traffic is let through one way until the backlog clears and then traffic is let through the other way to clear that backlog and so on. When stuck in a very long queue it is inevitable to wonder how did I get into this mess?
We study a polling model with a server having exponential service time with mean $1/μ$ alternating between two queues, emptying one queue before switching to the other. Customers arrive at queue one according to a Poisson process with rate $λ_1$ and at queue two with rate $λ_2$. We discuss how we get at a rare event with a large number of customers in the system. In fact this can happen in two different ways depending on the parameters. In one case one queue simply explodes and runs away without emptying. We call this the ray case. In the other spiral case the queues are successively emptied but in a losing battle as the system zigzags to the rare event. This dichotomy extends to the steady state distribution and leads to quite different asymptotic behaviour in the two cases.
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
Multi-flavor quantum criticality
Authors:
A. Khansili,
A. Bangura,
R. D. McDonald,
B. J. Ramshaw,
A. Rydh,
A. Shekhter
Abstract:
In a quantum critical metal, the electronic density of states, or quasiparticle mass on the Fermi surface, is strongly enhanced through electronic correlations. The density of states in the quantum critical unconventional superconductor CeCoIn$_5$, can be readily accessed in the normal state because all energy scales are small. However, the experimental challenges associated with large nuclear spe…
▽ More
In a quantum critical metal, the electronic density of states, or quasiparticle mass on the Fermi surface, is strongly enhanced through electronic correlations. The density of states in the quantum critical unconventional superconductor CeCoIn$_5$, can be readily accessed in the normal state because all energy scales are small. However, the experimental challenges associated with large nuclear specific heat and long nuclear spin-lattice relaxation times have impeded unveiling a more detailed physical picture. Here we report an extensive thermal impedance spectroscopy study of CeCoIn$_5$ that assesses the density of states in two independent ways, via the nuclear spin-lattice relaxation rate and via the specific heat. We establish that the temperature- and magnetic field dependence of the nuclear spin-lattice relaxation rate is determined entirely by the energy-scale competition near the quantum critical point. In particular, mass enhancement is cut off at finite magnetic fields. However, the specific heat measurements reveal excess entropy in addition to that associated with the density of states on the Fermi surface. This excess entropy is direct thermodynamic evidence for a "second flavor" of fluctuating boson in CeCoIn$_5$. The electronic nature of this excess entropy is evidenced by its suppression in the superconducting state. We suggest such a multi-flavour character for a broader class of quantum critical metals.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
Multi-Step Dialogue Workflow Action Prediction
Authors:
Ramya Ramakrishnan,
Ethan R. Elenberg,
Hashan Narangodage,
Ryan McDonald
Abstract:
In task-oriented dialogue, a system often needs to follow a sequence of actions, called a workflow, that complies with a set of guidelines in order to complete a task. In this paper, we propose the novel problem of multi-step workflow action prediction, in which the system predicts multiple future workflow actions. Accurate prediction of multiple steps allows for multi-turn automation, which can f…
▽ More
In task-oriented dialogue, a system often needs to follow a sequence of actions, called a workflow, that complies with a set of guidelines in order to complete a task. In this paper, we propose the novel problem of multi-step workflow action prediction, in which the system predicts multiple future workflow actions. Accurate prediction of multiple steps allows for multi-turn automation, which can free up time to focus on more complex tasks. We propose three modeling approaches that are simple to implement yet lead to more action automation: 1) fine-tuning on a training dataset, 2) few-shot in-context learning leveraging retrieval and large language model prompting, and 3) zero-shot graph traversal, which aggregates historical action sequences into a graph for prediction. We show that multi-step action prediction produces features that improve accuracy on downstream dialogue tasks like predicting task success, and can increase automation of steps by 20% without requiring as much feedback from a human overseeing the system.
△ Less
Submitted 12 February, 2024; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Temperature dependence and limiting mechanisms of the upper critical field of FeSe thin films
Authors:
M. Stanley,
Y. Li,
J. C. Palmstrom,
J. L. Thompson,
K. D. Halanayake,
D. Reifsnyder-Hickey,
R. D. McDonald,
S. A. Crooker,
N. Trivedi,
N. Samarth
Abstract:
We use magnetoresistance measurements at high magnetic field (B \leq 65 T) and low temperature (T \geq 500 mK) to gain fresh insights into the behavior of the upper critical field, Hc2, in superconducting ultrathin FeSe films of varying degrees of disorder, grown by molecular beam epitaxy on SrTiO3. Measurements of Hc2 across samples with a widely varying superconducting critical temperature (1.2…
▽ More
We use magnetoresistance measurements at high magnetic field (B \leq 65 T) and low temperature (T \geq 500 mK) to gain fresh insights into the behavior of the upper critical field, Hc2, in superconducting ultrathin FeSe films of varying degrees of disorder, grown by molecular beam epitaxy on SrTiO3. Measurements of Hc2 across samples with a widely varying superconducting critical temperature (1.2 K \leq Tc \leq 21 K) generically show similar qualitative temperature dependence. We analyze the temperature dependence of Hc2 in the context of Werthamer-Helfand-Hohenberg (WHH) theory. The analysis yields parameters that indicate a strong Pauli paramagnetic pair-breaking mechanism which is also reflected by pseudo-isotropic superconductivity in the limit of zero temperature. In the lower Tc samples, we observe a spin-orbit scattering driven enhancement of Hc2 above the strongly-coupled Pauli paramagnetic limit. We also observe clear deviations from WHH theory at low temperature, regardless of Tc. We attribute this to the multi-band superconductivity of FeSe and possibly to the emergence of a low temperature, high field superconducting phase.
△ Less
Submitted 29 October, 2023;
originally announced October 2023.
-
High-field immiscibility of electrons belonging to adjacent twinned bismuth crystals
Authors:
Yuhao Ye,
Akiyoshi Yamada,
Yuto Kinoshita,
**hua Wang,
Pan Nie,
Liangcai Xu,
Huakun Zuo,
Masashi Tokunaga,
Neil Harrison,
Ross D. McDonald,
Alexey V. Suslov,
Arzhang Ardavan,
Moon-Sun Nam,
David LeBoeuf,
Cyril Proust,
Benoît Fauqué,
Yuki Fuseya,
Zengwei Zhu,
Kamran Behnia
Abstract:
Bulk bismuth has a complex Landau spectrum. The small effective masses and the large g-factors are anisotropic. The chemical potential drifts at high magnetic fields. Moreover, twin boundaries further complexify the interpretation of the data by producing extra anomalies in the extreme quantum limit. Here, we present a study of angle dependence of magnetoresistance up to 65 T in bismuth complement…
▽ More
Bulk bismuth has a complex Landau spectrum. The small effective masses and the large g-factors are anisotropic. The chemical potential drifts at high magnetic fields. Moreover, twin boundaries further complexify the interpretation of the data by producing extra anomalies in the extreme quantum limit. Here, we present a study of angle dependence of magnetoresistance up to 65 T in bismuth complemented with Nernst, ultrasound, and magneto-optic data. All observed anomalies can be explained in a single-particle picture of a sample consisting of two twinned crystals tilted by 108$^{\circ}$ and with two adjacent crystals kee** their own chemical potentials despite a shift between chemical potentials as large as 68 meV at 65 T. This implies an energy barrier between adjacent twinned crystals reminiscent of a metal-semiconductor Schottky barrier or a p-n junction. We argue that this barrier is built by accumulating charge carriers of opposite signs across a twin boundary.
△ Less
Submitted 15 February, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
SteP: Stacked LLM Policies for Web Actions
Authors:
Paloma Sodhi,
S. R. K. Branavan,
Yoav Artzi,
Ryan McDonald
Abstract:
Performing tasks on the web presents fundamental challenges to large language models (LLMs), including combinatorially large open-world tasks and variations across web interfaces. Simply specifying a large prompt to handle all possible behaviors and states is extremely complex, and results in behavior leaks between unrelated behaviors. Decomposition to distinct policies can address this challenge,…
▽ More
Performing tasks on the web presents fundamental challenges to large language models (LLMs), including combinatorially large open-world tasks and variations across web interfaces. Simply specifying a large prompt to handle all possible behaviors and states is extremely complex, and results in behavior leaks between unrelated behaviors. Decomposition to distinct policies can address this challenge, but requires carefully handing off control between policies. We propose Stacked LLM Policies for Web Actions (SteP), an approach to dynamically compose policies to solve a diverse set of web tasks. SteP defines a Markov Decision Process where the state is a stack of policies representing the control state, i.e., the chain of policy calls. Unlike traditional methods that are restricted to static hierarchies, SteP enables dynamic control that adapts to the complexity of the task. We evaluate SteP against multiple baselines and web environments including WebArena, MiniWoB++, and a CRM simulator. On WebArena, SteP improves (14.9% to 35.8%) over SOTA that use GPT-4 policies, while on MiniWob++, SteP is competitive with prior works while using significantly less data. Our code and data is available at https://asappresearch.github.io/webagents-step.
△ Less
Submitted 22 April, 2024; v1 submitted 5 October, 2023;
originally announced October 2023.
-
On the Effectiveness of Offline RL for Dialogue Response Generation
Authors:
Paloma Sodhi,
Felix Wu,
Ethan R. Elenberg,
Kilian Q. Weinberger,
Ryan McDonald
Abstract:
A common training technique for language models is teacher forcing (TF). TF attempts to match human language exactly, even though identical meanings can be expressed in different ways. This motivates use of sequence-level objectives for dialogue response generation. In this paper, we study the efficacy of various offline reinforcement learning (RL) methods to maximize such objectives. We present a…
▽ More
A common training technique for language models is teacher forcing (TF). TF attempts to match human language exactly, even though identical meanings can be expressed in different ways. This motivates use of sequence-level objectives for dialogue response generation. In this paper, we study the efficacy of various offline reinforcement learning (RL) methods to maximize such objectives. We present a comprehensive evaluation across multiple datasets, models, and metrics. Offline RL shows a clear performance improvement over teacher forcing while not inducing training instability or sacrificing practical training budgets.
△ Less
Submitted 23 July, 2023;
originally announced July 2023.
-
Penguin huddling: a continuum model
Authors:
Samuel J. Harris,
N. R. McDonald
Abstract:
Penguins huddling in a cold wind are represented by a two-dimensional, continuum model. The huddle boundary evolves due to heat loss to the huddle exterior and through the reorganisation of penguins as they seek to regulate their heat production within the huddle. These two heat transfer mechanisms, along with area, or penguin number, conservation, gives a free boundary problem whose dynamics depe…
▽ More
Penguins huddling in a cold wind are represented by a two-dimensional, continuum model. The huddle boundary evolves due to heat loss to the huddle exterior and through the reorganisation of penguins as they seek to regulate their heat production within the huddle. These two heat transfer mechanisms, along with area, or penguin number, conservation, gives a free boundary problem whose dynamics depend on both the dynamics interior and exterior to the huddle. Assuming the huddle shape evolves slowly compared to the advective timescale of the exterior wind, the interior temperature is governed by a Poisson equation and the exterior temperature by the steady advection-diffusion equation. The exterior, advective wind velocity is the gradient of a harmonic, scalar field. The conformal invariance of the exterior governing equations is used to convert the system to a Polubarinova-Galin type equation, with forcing depending on both the interior and exterior temperature gradients at the huddle boundary. The interior Poisson equation is not conformally invariant, so the interior temperature gradient is found numerically using a combined adaptive Antoulas-Anderson and least squares algorithm. The results show that, irrespective of the starting shape, penguin huddles evolve into an egg-like steady shape. This shape is dependent on the wind strength, parameterised by the Péclet number Pe, and a parameter \b{eta} which effectively measures the strength of the interior self-generation of heat by the penguins. The numerical method developed is applicable to a further five free boundary problems.
△ Less
Submitted 12 May, 2023;
originally announced May 2023.
-
Calorimetric measurement of nuclear spin-lattice relaxation rate in metals
Authors:
A. Khansili,
A. Bangura,
R. D. McDonald,
B. J. Ramshaw,
A. Rydh,
A. Shekhter
Abstract:
The quasiparticle density of states in correlated and quantum-critical metals directly probes the effect of electronic correlations on the Fermi surface. Measurements of the nuclear spin-lattice relaxation rate provide one such experimental probe of quasiparticle mass through the electronic density of states. By far the most common way of accessing the spin-lattice relaxation rate is via nuclear m…
▽ More
The quasiparticle density of states in correlated and quantum-critical metals directly probes the effect of electronic correlations on the Fermi surface. Measurements of the nuclear spin-lattice relaxation rate provide one such experimental probe of quasiparticle mass through the electronic density of states. By far the most common way of accessing the spin-lattice relaxation rate is via nuclear magnetic resonance and nuclear quadrupole resonance experiments, which require resonant excitation of nuclear spin transitions. Here we report non-resonant access to spin-lattice relaxation dynamics in AC-calorimetric measurements. The nuclear spin-lattice relaxation rate is inferred in our measurements from its effect on the frequency dispersion of the thermal response of the calorimeter-sample assembly. We use fast, lithographically-defined nanocalorimeters to access the nuclear spin-lattice relaxation times in metallic indium from 0.3~K to 7~K and in magnetic fields up to 35~T.
△ Less
Submitted 11 May, 2023; v1 submitted 9 February, 2023;
originally announced February 2023.
-
Zigzag persistence for coral reef resilience using a stochastic spatial model
Authors:
Robert A. McDonald,
Rosanna Neuhausler,
Martin Robinson,
Laurel G. Larsen,
Heather A. Harrington,
Maria Bruna
Abstract:
A complex interplay between species governs the evolution of spatial patterns in ecology. An open problem in the biological sciences is characterising spatio-temporal data and understanding how changes at the local scale affect global dynamics/behaviour. Here, we extend a well-studied temporal mathematical model of coral reef dynamics to include stochastic and spatial interactions and generate dat…
▽ More
A complex interplay between species governs the evolution of spatial patterns in ecology. An open problem in the biological sciences is characterising spatio-temporal data and understanding how changes at the local scale affect global dynamics/behaviour. Here, we extend a well-studied temporal mathematical model of coral reef dynamics to include stochastic and spatial interactions and generate data to study different ecological scenarios. We present descriptors to characterise patterns in heterogeneous spatio-temporal data surpassing spatially averaged measures. We apply these descriptors to simulated coral data and demonstrate the utility of two topological data analysis techniques--persistent homology and zigzag persistence--for characterising mechanisms of reef resilience. We show that the introduction of local competition between species leads to the appearance of coral clusters in the reef. We use our analyses to distinguish temporal dynamics stemming from different initial configurations of coral, showing that the neighbourhood composition of coral sites determines their long-term survival. Using zigzag persistence, we determine which spatial configurations protect coral from extinction in different environments. Finally, we apply this toolkit of multi-scale methods to empirical coral reef data, which distinguish spatio-temporal reef dynamics in different locations, and demonstrate the applicability to a range of datasets.
△ Less
Submitted 12 August, 2023; v1 submitted 19 September, 2022;
originally announced September 2022.
-
Magnetotropic susceptibility
Authors:
A. Shekhter,
R. D. McDonald,
B. J. Ramshaw,
K. A. Modic
Abstract:
The magnetotropic susceptibility is the thermodynamic coefficient associated with the rotational anisotropy of the free energy in an external magnetic field, and is closely related to the magnetic susceptibility. It emerges naturally in frequency-shift measurements of oscillating mechanical cantilevers, which are becoming an increasingly important tool in the quantitative study of the thermodynami…
▽ More
The magnetotropic susceptibility is the thermodynamic coefficient associated with the rotational anisotropy of the free energy in an external magnetic field, and is closely related to the magnetic susceptibility. It emerges naturally in frequency-shift measurements of oscillating mechanical cantilevers, which are becoming an increasingly important tool in the quantitative study of the thermodynamics of modern condensed matter systems. Here we discuss the basic properties of the magnetotropic susceptibility as they relate to the experimental aspects of frequency-shift measurements, as well as to the interpretation of those experiments in terms of the intrinsic properties of the system under study.
△ Less
Submitted 25 June, 2023; v1 submitted 21 August, 2022;
originally announced August 2022.
-
Energy-scale competition in the Hall resistivity of a strange metal
Authors:
A. Shekhter,
K. A. Modic,
L. E. Winter,
Y. Lai,
M. K. Chan,
F. F. Balakirev,
J. B. Betts,
S. Komiya,
S. Ono,
G. S. Boebinger,
B. J. Ramshaw,
R. D. McDonald
Abstract:
Anomalous transport behavior -- both longitudinal and Hall -- is the defining characteristic of the strange-metal state of High-Tc cuprates. The temperature, frequency, and magnetic field dependence of the resistivity is understood within strange metal phenomenology as resulting from energy-scale competition to set the inelastic relaxation rate. The anomalously strong temperature dependence of the…
▽ More
Anomalous transport behavior -- both longitudinal and Hall -- is the defining characteristic of the strange-metal state of High-Tc cuprates. The temperature, frequency, and magnetic field dependence of the resistivity is understood within strange metal phenomenology as resulting from energy-scale competition to set the inelastic relaxation rate. The anomalously strong temperature dependence of the Hall coefficient, however, is at odds with this phenomenology. Here we report measurements of the Hall resistivity in the strange metal state of cuprates over a broad range of magnetic fields and temperatures. The observed field and temperature dependent Hall resistivity at very high magnetic fields reveals a distinct high-field regime which is controlled by energy-scale competition. This extends the strange metal phenomenology in the cuprates to include the Hall resistivity and suggests, in particular, that the direct effect of magnetic field on the relaxation dynamics of quantum fluctuations may be at least partially responsible for the anomalous Hall resistivity of the strange metal state.
△ Less
Submitted 20 July, 2022;
originally announced July 2022.
-
Sudden adiabaticity entering field-induced state in UTe2
Authors:
Rico Schönemann,
Priscila F. S. Rosa,
Sean M. Thomas,
You Lai,
Doan N. Nguyen,
John Singleton,
Eric L. Brosha,
Ross D. McDonald,
Vivien Zapf,
Boris Maiorov,
Marcelo Jaime
Abstract:
There has been a recent surge of interest in UTe$_2$ due to its unconventional magnetic field (H) reinforced spin-triplet superconducting phases persisting at fields far above the simple Pauli limit for H $\parallel$ [010]. Magnetic fields in excess of 35 T then induce a field-polarized magnetic state via a first-order-like phase transition. More controversially, for field orientations close to H…
▽ More
There has been a recent surge of interest in UTe$_2$ due to its unconventional magnetic field (H) reinforced spin-triplet superconducting phases persisting at fields far above the simple Pauli limit for H $\parallel$ [010]. Magnetic fields in excess of 35 T then induce a field-polarized magnetic state via a first-order-like phase transition. More controversially, for field orientations close to H $\parallel$ [011] and above 40 T, electrical resistivity measurements suggest that a further superconducting state may exist. However, no Meissner effect or thermodynamic evidence exists to date for this phase making it difficult to exclude a simple low-resistance metallic state. In this paper, we describe a study using thermal, electrical, and magnetic probes in magnetic fields of up to 55 T applied between the [010] ($b$) and [001] ($c$) directions. Our MHz conductivity data reveal the field-induced state of low or vanishing electrical resistance; simultaneous magnetocaloric effect measurements (i.e. changes in sample temperature due to changing magnetic field), show the first definitive evidence for adiabaticity and thermal behavior characteristic of bulk field-induced superconductivity.
△ Less
Submitted 2 July, 2023; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Long-term Control for Dialogue Generation: Methods and Evaluation
Authors:
Ramya Ramakrishnan,
Hashan Buddhika Narangodage,
Mauro Schilman,
Kilian Q. Weinberger,
Ryan McDonald
Abstract:
Current approaches for controlling dialogue response generation are primarily focused on high-level attributes like style, sentiment, or topic. In this work, we focus on constrained long-term dialogue generation, which involves more fine-grained control and requires a given set of control words to appear in generated responses. This setting requires a model to not only consider the generation of t…
▽ More
Current approaches for controlling dialogue response generation are primarily focused on high-level attributes like style, sentiment, or topic. In this work, we focus on constrained long-term dialogue generation, which involves more fine-grained control and requires a given set of control words to appear in generated responses. This setting requires a model to not only consider the generation of these control words in the immediate context, but also produce utterances that will encourage the generation of the words at some time in the (possibly distant) future. We define the problem of constrained long-term control for dialogue generation, identify gaps in current methods for evaluation, and propose new metrics that better measure long-term control. We also propose a retrieval-augmented method that improves performance of long-term controlled generation via logit modification techniques. We show through experiments on three task-oriented dialogue datasets that our metrics better assess dialogue control relative to current alternatives and that our method outperforms state-of-the-art constrained generation baselines.
△ Less
Submitted 15 May, 2022;
originally announced May 2022.
-
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages
Authors:
Felix Wu,
Kwangyoun Kim,
Shinji Watanabe,
Kyu Han,
Ryan McDonald,
Kilian Q. Weinberger,
Yoav Artzi
Abstract:
We introduce Wav2Seq, the first self-supervised approach to pre-train both parts of encoder-decoder models for speech data. We induce a pseudo language as a compact discrete representation, and formulate a self-supervised pseudo speech recognition task -- transcribing audio inputs into pseudo subword sequences. This process stands on its own, or can be applied as low-cost second-stage pre-training…
▽ More
We introduce Wav2Seq, the first self-supervised approach to pre-train both parts of encoder-decoder models for speech data. We induce a pseudo language as a compact discrete representation, and formulate a self-supervised pseudo speech recognition task -- transcribing audio inputs into pseudo subword sequences. This process stands on its own, or can be applied as low-cost second-stage pre-training. We experiment with automatic speech recognition (ASR), spoken named entity recognition, and speech-to-text translation. We set new state-of-the-art results for end-to-end spoken named entity recognition, and show consistent improvements on 20 language pairs for speech-to-text translation, even when competing methods use additional text data for training. Finally, on ASR, our approach enables encoder-decoder methods to benefit from pre-training for all parts of the network, and shows comparable performance to highly optimized recent methods.
△ Less
Submitted 2 May, 2022;
originally announced May 2022.
-
COVID-19 Multidimensional Kaggle Literature Organization
Authors:
Maksim E. Eren,
Nick Solovyev,
Chris Hamer,
Renee McDonald,
Boian S. Alexandrov,
Charles Nicholas
Abstract:
The unprecedented outbreak of Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), or COVID-19, continues to be a significant worldwide problem. As a result, a surge of new COVID-19 related research has followed suit. The growing number of publications requires document organization methods to identify relevant information. In this paper, we expand upon our previous work with clustering t…
▽ More
The unprecedented outbreak of Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), or COVID-19, continues to be a significant worldwide problem. As a result, a surge of new COVID-19 related research has followed suit. The growing number of publications requires document organization methods to identify relevant information. In this paper, we expand upon our previous work with clustering the CORD-19 dataset by applying multi-dimensional analysis methods. Tensor factorization is a powerful unsupervised learning method capable of discovering hidden patterns in a document corpus. We show that a higher-order representation of the corpus allows for the simultaneous grou** of similar articles, relevant journals, authors with similar research interests, and topic keywords. These grou**s are identified within and among the latent components extracted via tensor decomposition. We further demonstrate the application of this method with a publicly available interactive visualization of the dataset.
△ Less
Submitted 19 July, 2021; v1 submitted 17 July, 2021;
originally announced July 2021.
-
Non-linear Visual Knowledge Discovery with Elliptic Paired Coordinates
Authors:
Rose McDonald,
Boris Kovalerchuk
Abstract:
It is challenging for humans to enable visual knowledge discovery in data with more than 2-3 dimensions with a naked eye. This chapter explores the efficiency of discovering predictive machine learning models interactively using new Elliptic Paired coordinates (EPC) visualizations. It is shown that EPC are capable to visualize multidimensional data and support visual machine learning with preserva…
▽ More
It is challenging for humans to enable visual knowledge discovery in data with more than 2-3 dimensions with a naked eye. This chapter explores the efficiency of discovering predictive machine learning models interactively using new Elliptic Paired coordinates (EPC) visualizations. It is shown that EPC are capable to visualize multidimensional data and support visual machine learning with preservation of multidimensional information in 2-D. Relative to parallel and radial coordinates, EPC visualization requires only a half of the visual elements for each n-D point. An interactive software system EllipseVis, which is developed in this work, processes high-dimensional datasets, creates EPC visualizations, and produces predictive classification models by discovering dominance rules in EPC. By using interactive and automatic processes it discovers zones in EPC with a high dominance of a single class. The EPC methodology has been successful in discovering non-linear predictive models with high coverage and precision in the computational experiments. This can benefit multiple domains by producing visually appealing dominance rules. This chapter presents results of successful testing the EPC non-linear methodology in experiments using real and simulated data, EPC generalized to the Dynamic Elliptic Paired Coordinates (DEPC), incorporation of the weights of coordinates to optimize the visual discovery, introduction of an alternative EPC design and introduction of the concept of incompact machine learning methodology based on EPC/DEPC.
△ Less
Submitted 11 July, 2021;
originally announced July 2021.
-
Focus Attention: Promoting Faithfulness and Diversity in Summarization
Authors:
Rahul Aralikatte,
Shashi Narayan,
Joshua Maynez,
Sascha Rothe,
Ryan McDonald
Abstract:
Professional summaries are written with document-level information, such as the theme of the document, in mind. This is in contrast with most seq2seq decoders which simultaneously learn to focus on salient content, while deciding what to generate, at each decoding step. With the motivation to narrow this gap, we introduce Focus Attention Mechanism, a simple yet effective method to encourage decode…
▽ More
Professional summaries are written with document-level information, such as the theme of the document, in mind. This is in contrast with most seq2seq decoders which simultaneously learn to focus on salient content, while deciding what to generate, at each decoding step. With the motivation to narrow this gap, we introduce Focus Attention Mechanism, a simple yet effective method to encourage decoders to proactively generate tokens that are similar or topical to the input document. Further, we propose a Focus Sampling method to enable generation of diverse summaries, an area currently understudied in summarization. When evaluated on the BBC extreme summarization task, two state-of-the-art models augmented with Focus Attention generate summaries that are closer to the target and more faithful to their input documents, outperforming their vanilla counterparts on \rouge and multiple faithfulness measures. We also empirically demonstrate that Focus Sampling is more effective in generating diverse and faithful summaries than top-$k$ or nucleus sampling-based decoding methods.
△ Less
Submitted 25 May, 2021;
originally announced May 2021.
-
Planning with Learned Entity Prompts for Abstractive Summarization
Authors:
Shashi Narayan,
Yao Zhao,
Joshua Maynez,
Gonçalo Simoes,
Vitaly Nikolaev,
Ryan McDonald
Abstract:
We introduce a simple but flexible mechanism to learn an intermediate plan to ground the generation of abstractive summaries. Specifically, we prepend (or prompt) target summaries with entity chains -- ordered sequences of entities mentioned in the summary. Transformer-based sequence-to-sequence models are then trained to generate the entity chain and then continue generating the summary condition…
▽ More
We introduce a simple but flexible mechanism to learn an intermediate plan to ground the generation of abstractive summaries. Specifically, we prepend (or prompt) target summaries with entity chains -- ordered sequences of entities mentioned in the summary. Transformer-based sequence-to-sequence models are then trained to generate the entity chain and then continue generating the summary conditioned on the entity chain and the input. We experimented with both pretraining and finetuning with this content planning objective. When evaluated on CNN/DailyMail, XSum, SAMSum and BillSum, we demonstrate empirically that the grounded generation with the planning objective improves entity specificity and planning in summaries for all datasets, and achieves state-of-the-art performance on XSum and SAMSum in terms of Rouge. Moreover, we demonstrate empirically that planning with entity chains provides a mechanism to control hallucinations in abstractive summaries. By prompting the decoder with a modified content plan that drops hallucinated entities, we outperform state-of-the-art approaches for faithfulness when evaluated automatically and by humans.
△ Less
Submitted 5 September, 2021; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Mid-Range Wireless Power Transfer at 100 MHz using Magnetically-Coupled Loop-Gap Resonators
Authors:
David M. Roberts,
Aaron P. Clements,
Rowan McDonald,
Jake S. Bobowski,
Thomas Johnson
Abstract:
We describe efficient four-coil inductive power transfer (IPT) systems that operate at 100 MHz. The magnetically-coupled transmitter and receiver were made from electrically-small and high-Q loop-gap resonators (LGRs). In contrast to the commonly-used helical and spiral resonators, the LGR design has the distinct advantage that electric fields are strongly confined to the capacitive gap of the res…
▽ More
We describe efficient four-coil inductive power transfer (IPT) systems that operate at 100 MHz. The magnetically-coupled transmitter and receiver were made from electrically-small and high-Q loop-gap resonators (LGRs). In contrast to the commonly-used helical and spiral resonators, the LGR design has the distinct advantage that electric fields are strongly confined to the capacitive gap of the resonator. With negligible fringing electric fields in the surrounding space, the IPT system is immune to interference from nearby dielectric objects, even when they are in close proximity to the transmitter and/or receiver. We experimented with both cylindrical and split-toroidal LGR geometries. Although both systems performed well under laboratory conditions, the toroidal geometry has the additional advantage that the magnetic flux is weak everywhere except within the bore of the LGR and in the space directly between the transmitter and receiver. Furthermore, we show that the toroidal LGR system can be operated efficiently at a fixed frequency for a wide range of transmitter-receiver distances. The experimental results are complimented by 3-D finite-element simulations which were used to investigate the electromagnetic field profiles and surface current density distributions. Finally, we demonstrate the use of our IPT system at powers up to 32 W and discuss possible applications.
△ Less
Submitted 30 April, 2021; v1 submitted 26 March, 2021;
originally announced March 2021.
-
Passive frustrated nanomagnet reservoir computing
Authors:
Alexander J. Edwards,
Dhritiman Bhattacharya,
Peng Zhou,
Nathan R. McDonald,
Walid Al Misba,
Lisa Loomis,
Felipe Garcia-Sanchez,
Naimul Hassan,
Xuan Hu,
Md. Fahim Chowdhury,
Clare D. Thiem,
Jayasimha Atulasimha,
Joseph S. Friedman
Abstract:
Reservoir computing (RC) has received recent interest because reservoir weights do not need to be trained, enabling extremely low-resource consumption implementations, which could have a transformative impact on edge computing and in-situ learning where resources are severely constrained. Ideally, a natural hardware reservoir should be passive, minimal, expressive, and feasible; to date, proposed…
▽ More
Reservoir computing (RC) has received recent interest because reservoir weights do not need to be trained, enabling extremely low-resource consumption implementations, which could have a transformative impact on edge computing and in-situ learning where resources are severely constrained. Ideally, a natural hardware reservoir should be passive, minimal, expressive, and feasible; to date, proposed hardware reservoirs have had difficulty meeting all of these criteria. We therefore propose a reservoir that meets all of these criteria by leveraging the passive interactions of dipole-coupled, frustrated nanomagnets. The frustration significantly increases the number of stable reservoir states, enriching reservoir dynamics, and as such these frustrated nanomagnets fulfill all of the criteria for a natural hardware reservoir. We likewise propose a complete frustrated nanomagnet reservoir computing (NMRC) system with low-power complementary metal-oxide semiconductor (CMOS) circuitry to interface with the reservoir, and initial experimental results demonstrate the reservoir's feasibility. The reservoir is verified with micromagnetic simulations on three separate tasks demonstrating expressivity. The proposed system is compared with a CMOS echo-state-network (ESN), demonstrating an overall resource decrease by a factor of over 10,000,000, demonstrating that because NMRC is naturally passive and minimal it has the potential to be extremely resource efficient.
△ Less
Submitted 16 September, 2022; v1 submitted 16 March, 2021;
originally announced March 2021.
-
Magnetoelastic standing waves induced in UO$_{2}$ by microsecond magnetic field pulses
Authors:
Rico Schönemann,
George Rodriguez,
Dwight Rickel,
Fedor Balakirev,
Ross D. McDonald,
Jordan Evans,
Boris Maiorov,
Charles Paillard,
Laurent Bellaiche,
Myron B. Salamon,
Krzysztof Gofryk,
Marcelo Jaime
Abstract:
Magnetoelastic measurements in the piezomagnetic antiferromagnet UO$_{2}$ were performed via the fiber Bragg grating method in magnetic fields up to $150\,\mathrm{T}$ generated by a single-turn coil setup. We show that in short timescales, order of a few micro seconds, pulsed-magnetic fields excite mechanical resonances at temperatures ranging from $10\,\mathrm{K}$ to $300\,\mathrm{K}$, in the par…
▽ More
Magnetoelastic measurements in the piezomagnetic antiferromagnet UO$_{2}$ were performed via the fiber Bragg grating method in magnetic fields up to $150\,\mathrm{T}$ generated by a single-turn coil setup. We show that in short timescales, order of a few micro seconds, pulsed-magnetic fields excite mechanical resonances at temperatures ranging from $10\,\mathrm{K}$ to $300\,\mathrm{K}$, in the paramagnetic as well as within the robust antiferromagnetic state of the material. These resonances, which are barely attenuated within the 100 ms observations, are attributed to the strong magnetoelastic coupling in UO$_{2}$ combined with the high crystallographic quality of the single crystal samples. They compare well with mechanical resonances obtained by a resonant ultrasound technique and superimpose on the known non-monotonic magnetostriction background. A clear phase-shift of $π$ in the lattice oscillations is, unexpectedly, observed in the antiferromagnetic state when the magnetic field overcomes the piezomagnetic switch-field $H_c \simeq -18\,\mathrm{T}$. We further present simulations and a theoretical argument to explain the observed phenomena.
△ Less
Submitted 15 March, 2021;
originally announced March 2021.
-
The fundamental solutions of the curve shortening problem via the Schwarz function
Authors:
Robb McDonald
Abstract:
Curve shortening in the $z$-plane in which, at a given point on the curve, the normal velocity of the curve is equal to the curvature, is shown to satisfy $S_tS_z=S_{zz}$, where $S(z,t)$ is the Schwarz function of the curve. This equation is shown to have a parametric solution from which the known explicit solutions for curve shortening flow; the circle, grim reaper, paperclip and hairclip, can be…
▽ More
Curve shortening in the $z$-plane in which, at a given point on the curve, the normal velocity of the curve is equal to the curvature, is shown to satisfy $S_tS_z=S_{zz}$, where $S(z,t)$ is the Schwarz function of the curve. This equation is shown to have a parametric solution from which the known explicit solutions for curve shortening flow; the circle, grim reaper, paperclip and hairclip, can be recovered.
△ Less
Submitted 14 January, 2022; v1 submitted 10 March, 2021;
originally announced March 2021.
-
Defect-driven ferrimagnetism and hidden magnetization in MnBi$_2$Te$_4$
Authors:
You Lai,
Liqin Ke,
Jiaqiang Yan,
Ross D. McDonald,
Robert J. McQueeney
Abstract:
MnBi$_2$Te$_4$ (MBT) materials are promising antiferromagnetic topological insulators where field driven ferromagnetism is predicted to cause a transition between axion insulator and Weyl semimetallic states. However, the presence of antiferromagnetic coupling between Mn/Bi antisite defects and the main Mn layer can reduce the low-field magnetization, and it has been shown that such defects are mo…
▽ More
MnBi$_2$Te$_4$ (MBT) materials are promising antiferromagnetic topological insulators where field driven ferromagnetism is predicted to cause a transition between axion insulator and Weyl semimetallic states. However, the presence of antiferromagnetic coupling between Mn/Bi antisite defects and the main Mn layer can reduce the low-field magnetization, and it has been shown that such defects are more prevalent in the structurally identical trivial magnetic insulator MnSb$_2$Te$_4$ (MST). We use high-field magnetization measurements to show that the magnetization of MBT and MST occur in stages and full saturation requires fields of~$\sim$~60 Tesla. As a consequence, the low-field magnetization plateau state in MBT, where many determinations of quantum anomalous Hall state are studied, actually consists of ferrimagnetic septuple blocks containing both a uniform and staggered magnetization component.
△ Less
Submitted 1 June, 2021; v1 submitted 10 February, 2021;
originally announced February 2021.
-
Framing energetic top-quark pair production at the LHC
Authors:
Fabrizio Caola,
Frédéric A. Dreyer,
Ross W. McDonald,
Gavin P. Salam
Abstract:
Top-quark pair production is central to many facets of LHC physics. At leading order, the top and anti-top are produced in a back-to-back topology, however this topology accounts only for a minority of $t \bar t$ events with TeV-scale momentum transfer. The remaining events instead involve the splitting of an initial or final-state gluon to $t \bar t$. We provide simple quantitative arguments that…
▽ More
Top-quark pair production is central to many facets of LHC physics. At leading order, the top and anti-top are produced in a back-to-back topology, however this topology accounts only for a minority of $t \bar t$ events with TeV-scale momentum transfer. The remaining events instead involve the splitting of an initial or final-state gluon to $t \bar t$. We provide simple quantitative arguments that explain why this is the case and examine the interplay between different topologies and a range of variables that characterise the event hardness. We then develop a method to classify the topologies of individual events and use it to illustrate our findings in the context of simulated events, using both top partons and suitably defined fiducial tops. For events with large $t \bar t$ invariant mass, we comment on additional features that have important experimental and theoretical implications.
△ Less
Submitted 15 January, 2021;
originally announced January 2021.
-
GaN/AlGaN 2DEGs in the quantum regime: Magneto-transport and photoluminescence to 60 tesla
Authors:
S. A. Crooker,
M. Lee,
R. D. McDonald,
J. L. Doorn,
I. Zimmermann,
Y. Lai,
L. E. Winter,
Y. Ren,
Y. -J. Cho,
B. J. Ramshaw,
H. G. Xing,
D. Jena
Abstract:
Using high magnetic fields up to 60 T, we report magneto-transport and photoluminescence (PL) studies of a two-dimensional electron gas (2DEG) in a GaN/AlGaN heterojunction grown by molecular-beam epitaxy. Transport measurements demonstrate that the quantum limit can be exceeded (Landau level filling factor $ν< 1$), and show evidence for the $ν=2/3$ fractional quantum Hall state. Simultaneous opti…
▽ More
Using high magnetic fields up to 60 T, we report magneto-transport and photoluminescence (PL) studies of a two-dimensional electron gas (2DEG) in a GaN/AlGaN heterojunction grown by molecular-beam epitaxy. Transport measurements demonstrate that the quantum limit can be exceeded (Landau level filling factor $ν< 1$), and show evidence for the $ν=2/3$ fractional quantum Hall state. Simultaneous optical and transport measurements reveal synchronous quantum oscillations of both the PL intensity and longitudinal resistivity in the integer quantum Hall regime. PL spectra directly reveal the dispersion of occupied Landau levels in the 2DEG and therefore the electron mass. These results demonstrate the utility of high (pulsed) magnetic fields for detailed measurements of quantum phenomena in high-density 2DEGs.
△ Less
Submitted 2 November, 2020;
originally announced November 2020.
-
Stepwise Extractive Summarization and Planning with Structured Transformers
Authors:
Shashi Narayan,
Joshua Maynez,
Jakub Adamek,
Daniele Pighin,
Blaž Bratanič,
Ryan McDonald
Abstract:
We propose encoder-centric stepwise models for extractive summarization using structured transformers -- HiBERT and Extended Transformers. We enable stepwise summarization by injecting the previously generated summary into the structured transformer as an auxiliary sub-structure. Our models are not only efficient in modeling the structure of long inputs, but they also do not rely on task-specific…
▽ More
We propose encoder-centric stepwise models for extractive summarization using structured transformers -- HiBERT and Extended Transformers. We enable stepwise summarization by injecting the previously generated summary into the structured transformer as an auxiliary sub-structure. Our models are not only efficient in modeling the structure of long inputs, but they also do not rely on task-specific redundancy-aware modeling, making them a general purpose extractive content planner for different tasks. When evaluated on CNN/DailyMail extractive summarization, stepwise models achieve state-of-the-art performance in terms of Rouge without any redundancy aware modeling or sentence filtering. This also holds true for Rotowire table-to-text generation, where our models surpass previously reported metrics for content selection, planning and ordering, highlighting the strength of stepwise modeling. Amongst the two structured transformers we test, stepwise Extended Transformers provides the best performance across both datasets and sets a new standard for these challenges.
△ Less
Submitted 6 October, 2020;
originally announced October 2020.
-
Strong antiferromagnetic proximity coupling in a heterostructured superconductor Sr$_2$VO$_3$FeAs
Authors:
Jong Mok Ok,
Chang Il Kwon,
O. E. Ayala Valenzuela,
Sunghun Kim,
Ross D. McDonald,
Jeehoon Kim,
E. S. Choi,
Woun Kang,
Y. J. Jo,
C. Kim,
E. G. Moon,
Y. K. Kim,
Jun Sung Kim
Abstract:
We report observation of strong magnetic proximity coupling in a heterostructured superconductor Sr$_2$VO$_3$FeAs, determined by the upper critical fields $H_{c2}(T)$ measurements up to 65 T. Using the resistivity and the radio-frequency measurements for both $H \parallel ab$ and $H \parallel c$, we found a strong upward curvature of $H_{c2}^c(T)$, together with a steep increase of…
▽ More
We report observation of strong magnetic proximity coupling in a heterostructured superconductor Sr$_2$VO$_3$FeAs, determined by the upper critical fields $H_{c2}(T)$ measurements up to 65 T. Using the resistivity and the radio-frequency measurements for both $H \parallel ab$ and $H \parallel c$, we found a strong upward curvature of $H_{c2}^c(T)$, together with a steep increase of $H_{c2}^{ab}(T)$ near $T_c$, yielding the anisotropic factor $γ_H=H_{c2}^{ab}/H_{c2}^c$ up to $\sim$ 20, the largest value among iron-based superconductors. These are attributed to the Jaccarino-Peter effect, rather than to the multiband effect, due to strong exchange interaction between itinerant Fe spins of the FeAs layers and localized V spins of Mott-insulating SrVO$_3$ layers. These findings provide evidence for strong antiferromagnetic proximity coupling, comparable with the intralayer superexchange interaction of SrVO$_3$ layer and sufficient to induce magnetic frustration in Sr$_2$VO$_3$FeAs.
△ Less
Submitted 4 October, 2020;
originally announced October 2020.
-
RRF102: Meeting the TREC-COVID Challenge with a 100+ Runs Ensemble
Authors:
Michael Bendersky,
Honglei Zhuang,
Ji Ma,
Shuguang Han,
Keith Hall,
Ryan McDonald
Abstract:
In this paper, we report the results of our participation in the TREC-COVID challenge. To meet the challenge of building a search engine for rapidly evolving biomedical collection, we propose a simple yet effective weighted hierarchical rank fusion approach, that ensembles together 102 runs from (a) lexical and semantic retrieval systems, (b) pre-trained and fine-tuned BERT rankers, and (c) releva…
▽ More
In this paper, we report the results of our participation in the TREC-COVID challenge. To meet the challenge of building a search engine for rapidly evolving biomedical collection, we propose a simple yet effective weighted hierarchical rank fusion approach, that ensembles together 102 runs from (a) lexical and semantic retrieval systems, (b) pre-trained and fine-tuned BERT rankers, and (c) relevance feedback runs. Our ablation studies demonstrate the contributions of each of these systems to the overall ensemble. The submitted ensemble runs achieved state-of-the-art performance in rounds 4 and 5 of the TREC-COVID challenge.
△ Less
Submitted 1 October, 2020;
originally announced October 2020.
-
Geodesic Loewner paths with varying boundary conditions
Authors:
Robb McDonald
Abstract:
Equations of the Loewner class subject to non-constant boundary conditions along the real axis, are formulated and solved giving the geodesic paths of slits growing in the upper half complex plane. The problem is motivated by Laplacian growth in which the slits represent thin fingers growing in a diffusion field. A single finger follows a curved path determined by the forcing function appearing in…
▽ More
Equations of the Loewner class subject to non-constant boundary conditions along the real axis, are formulated and solved giving the geodesic paths of slits growing in the upper half complex plane. The problem is motivated by Laplacian growth in which the slits represent thin fingers growing in a diffusion field. A single finger follows a curved path determined by the forcing function appearing in Loewner's equation. This function is found by solving an ordinary differential equation whose terms depend on curvature properties of the streamlines of the diffusive field in the conformally mapped `mathematical' plane. The effect of boundary conditions specifying either piecewise constant values of the field variable along the real axis, or a dipole placed on the real axis, reveal a range of behaviours for the growing slit. These include regions along the real axis from which no slit growth is possible, regions where paths grow to infinity, or regions where paths curve back toward the real axis terminating in finite time. Symmetric pairs of paths subject to the piecewise constant boundary condition along the real axis are also computed, demonstrating that paths which grow to infinity evolve asymptotically toward an angle of bifurcation of $π/5$.
△ Less
Submitted 8 October, 2020; v1 submitted 18 June, 2020;
originally announced June 2020.
-
Wannier quasi-classical approach to high harmonic generation in semiconductors
Authors:
Andrew M. Parks,
Guilmot Ernotte,
Adam Thorpe,
Chris R. McDonald,
Paul B. Corkum,
Marco Taucer,
Thomas Brabec
Abstract:
We develop a quasi-classical theory of high harmonic generation in semiconductors based on an interband current that has been transformed from Bloch to Wannier basis. The Wannier quasi-classical approach reveals a complete picture of the mechanisms sha** high harmonic generation, such that quantitative agreement with full quantum calculations is obtained. The intuitive picture revealed by quasi-…
▽ More
We develop a quasi-classical theory of high harmonic generation in semiconductors based on an interband current that has been transformed from Bloch to Wannier basis. The Wannier quasi-classical approach reveals a complete picture of the mechanisms sha** high harmonic generation, such that quantitative agreement with full quantum calculations is obtained. The intuitive picture revealed by quasi-classical wavepacket propagation will be helpful in the interpretation and design of high harmonic and attosecond experiments. Beyond that, the capacity to quantitatively model quantum dynamics with classical trajectories should prove useful for a wider spectrum of condensed matter research, including coherent control, transport theory, and strong field physics.
△ Less
Submitted 17 June, 2020;
originally announced June 2020.
-
Observation of cyclotron resonance and measurement of the hole mass in optimally-doped La$_{2-x}$Sr$_{x}$CuO$_4$
Authors:
K. W. Post,
A. Legros,
D. G. Rickel,
J. Singleton,
R. D. McDonald,
Xi He,
I. Bozovic,
X. Xu,
X. Shi,
N. P. Armitage,
S. A. Crooker
Abstract:
Using time-domain terahertz spectroscopy in pulsed magnetic fields up to 31 T, we measure the terahertz optical conductivity in an optimally-doped thin film of the high temperature superconducting cuprate La$_{1.84}$Sr$_{0.16}$CuO$_4$. We observe systematic changes in the circularly-polarized complex optical conductivity that are consistent with cyclotron absorption of p-type charge carriers chara…
▽ More
Using time-domain terahertz spectroscopy in pulsed magnetic fields up to 31 T, we measure the terahertz optical conductivity in an optimally-doped thin film of the high temperature superconducting cuprate La$_{1.84}$Sr$_{0.16}$CuO$_4$. We observe systematic changes in the circularly-polarized complex optical conductivity that are consistent with cyclotron absorption of p-type charge carriers characterized by a cyclotron mass of $4.9\pm 0.8$ $m_{\rm e}$, and a scattering rate that increases with magnetic field. These results open the door to studies aimed at characterizing the degree to which electron-electron interactions influence carrier masses in cuprate superconductors.
△ Less
Submitted 16 June, 2020;
originally announced June 2020.
-
Hard antinodal gap revealed by quantum oscillations in the pseudogap regime of underdoped high-$T_{\rm c}$ superconductors
Authors:
Mate Hartstein,
Yu-Te Hsu,
Kimberly A. Modic,
Juan Porras,
Toshinao Loew,
Matthieu Le Tacon,
Huakun Zuo,
**hua Wang,
Zengwei Zhu,
Mun K. Chan,
Ross D. McDonald,
Gilbert G. Lonzarich,
Bernhard Keimer,
Suchitra E. Sebastian,
Neil Harrison
Abstract:
An understanding of the missing antinodal electronic excitations in the pseudogap state is essential for uncovering the physics of the underdoped cuprate high temperature superconductors. The majority of high temperature experiments performed thus far, however, have been unable to discern whether the antinodal states are rendered unobservable due to their dam**, or whether they vanish due to the…
▽ More
An understanding of the missing antinodal electronic excitations in the pseudogap state is essential for uncovering the physics of the underdoped cuprate high temperature superconductors. The majority of high temperature experiments performed thus far, however, have been unable to discern whether the antinodal states are rendered unobservable due to their dam**, or whether they vanish due to their gap**. Here we distinguish between these two scenarios by using quantum oscillations to examine whether the small Fermi surface pocket, found to occupy only 2% of the Brillouin zone in the underdoped cuprates, exists in isolation against a majority of completely gapped density of states spanning the antinodes, or whether it is thermodynamically coupled to a background of ungapped antinodal states. We find that quantum oscillations associated with the small Fermi surface pocket exhibit a signature sawtooth waveform characteristic of an isolated two-dimensional Fermi surface pocket. This finding reveals that the antinodal states are destroyed by a hard gap that extends over the majority of the Brillouin zone, placing strong constraints on a drastic underlying origin of quasiparticle disappearance over almost the entire Brillouin zone in the pseudogap regime.
△ Less
Submitted 28 May, 2020;
originally announced May 2020.
-
BIOMRC: A Dataset for Biomedical Machine Reading Comprehension
Authors:
Petros Stavropoulos,
Dimitris Pappas,
Ion Androutsopoulos,
Ryan McDonald
Abstract:
We introduce BIOMRC, a large-scale cloze-style biomedical MRC dataset. Care was taken to reduce noise, compared to the previous BIOREAD dataset of Pappas et al. (2018). Experiments show that simple heuristics do not perform well on the new dataset, and that two neural MRC models that had been tested on BIOREAD perform much better on BIOMRC, indicating that the new dataset is indeed less noisy or a…
▽ More
We introduce BIOMRC, a large-scale cloze-style biomedical MRC dataset. Care was taken to reduce noise, compared to the previous BIOREAD dataset of Pappas et al. (2018). Experiments show that simple heuristics do not perform well on the new dataset, and that two neural MRC models that had been tested on BIOREAD perform much better on BIOMRC, indicating that the new dataset is indeed less noisy or at least that its task is more feasible. Non-expert human performance is also higher on the new dataset compared to BIOREAD, and biomedical experts perform even better. We also introduce a new BERT-based MRC model, the best version of which substantially outperforms all other methods tested, reaching or surpassing the accuracy of biomedical experts in some experiments. We make the new dataset available in three different sizes, also releasing our code, and providing a leaderboard.
△ Less
Submitted 13 May, 2020;
originally announced May 2020.
-
Scale-invariant magnetic anisotropy in RuCl$_3$ at high magnetic fields
Authors:
K. A. Modic,
Ross D. McDonald,
J. P. C. Ruff,
Maja D. Bachmann,
You Lai,
Johanna C. Palmstrom,
David Graf,
Mun Chan,
F. F. Balakirev,
J. B. Betts,
G. S. Boebinger,
Marcus Schmidt,
D. A. Sokolov,
Philip J. W. Moll,
B. J. Ramshaw,
Arkady Shekhter
Abstract:
In RuCl$_3$, inelastic neutron scattering and Raman spectroscopy reveal a continuum of non-spin-wave excitations that persists to high temperature, suggesting the presence of a spin liquid state on a honeycomb lattice. In the context of the Kitaev model, magnetic fields introduce finite interactions between the elementary excitations, and thus the effects of high magnetic fields - comparable to th…
▽ More
In RuCl$_3$, inelastic neutron scattering and Raman spectroscopy reveal a continuum of non-spin-wave excitations that persists to high temperature, suggesting the presence of a spin liquid state on a honeycomb lattice. In the context of the Kitaev model, magnetic fields introduce finite interactions between the elementary excitations, and thus the effects of high magnetic fields - comparable to the spin exchange energy scale - must be explored. Here we report measurements of the magnetotropic coefficient - the second derivative of the free energy with respect to magnetic field orientation - over a wide range of magnetic fields and temperatures. We find that magnetic field and temperature compete to determine the magnetic response in a way that is independent of the large intrinsic exchange interaction energy. This emergent scale-invariant magnetic anisotropy provides evidence for a high degree of exchange frustration that favors the formation of a spin liquid state in RuCl$_3$.
△ Less
Submitted 8 May, 2020;
originally announced May 2020.
-
On Faithfulness and Factuality in Abstractive Summarization
Authors:
Joshua Maynez,
Shashi Narayan,
Bernd Bohnet,
Ryan McDonald
Abstract:
It is well known that the standard likelihood training and approximate decoding objectives in neural text generation models lead to less human-like responses for open-ended tasks such as language modeling and story generation. In this paper we have analyzed limitations of these models for abstractive document summarization and found that these models are highly prone to hallucinate content that is…
▽ More
It is well known that the standard likelihood training and approximate decoding objectives in neural text generation models lead to less human-like responses for open-ended tasks such as language modeling and story generation. In this paper we have analyzed limitations of these models for abstractive document summarization and found that these models are highly prone to hallucinate content that is unfaithful to the input document. We conducted a large scale human evaluation of several neural abstractive summarization systems to better understand the types of hallucinations they produce. Our human annotators found substantial amounts of hallucinated content in all model generated summaries. However, our analysis does show that pretrained models are better summarizers not only in terms of raw metrics, i.e., ROUGE, but also in generating faithful and factual summaries as evaluated by humans. Furthermore, we show that textual entailment measures better correlate with faithfulness than standard metrics, potentially leading the way to automatic evaluation metrics as well as training and decoding criteria.
△ Less
Submitted 1 May, 2020;
originally announced May 2020.
-
Zero-shot Neural Passage Retrieval via Domain-targeted Synthetic Question Generation
Authors:
Ji Ma,
Ivan Korotkov,
Yinfei Yang,
Keith Hall,
Ryan McDonald
Abstract:
A major obstacle to the wide-spread adoption of neural retrieval models is that they require large supervised training sets to surpass traditional term-based techniques, which are constructed from raw corpora. In this paper, we propose an approach to zero-shot learning for passage retrieval that uses synthetic question generation to close this gap. The question generation system is trained on gene…
▽ More
A major obstacle to the wide-spread adoption of neural retrieval models is that they require large supervised training sets to surpass traditional term-based techniques, which are constructed from raw corpora. In this paper, we propose an approach to zero-shot learning for passage retrieval that uses synthetic question generation to close this gap. The question generation system is trained on general domain data, but is applied to documents in the targeted domain. This allows us to create arbitrarily large, yet noisy, question-passage relevance pairs that are domain specific. Furthermore, when this is coupled with a simple hybrid term-neural model, first-stage retrieval performance can be improved further. Empirically, we show that this is an effective strategy for building neural passage retrieval models in the absence of large training corpora. Depending on the domain, this technique can even approach the accuracy of supervised models.
△ Less
Submitted 27 January, 2021; v1 submitted 29 April, 2020;
originally announced April 2020.
-
QURIOUS: Question Generation Pretraining for Text Generation
Authors:
Shashi Narayan,
Gonçalo Simoes,
Ji Ma,
Hannah Craighead,
Ryan Mcdonald
Abstract:
Recent trends in natural language processing using pretraining have shifted focus towards pretraining and fine-tuning approaches for text generation. Often the focus has been on task-agnostic approaches that generalize the language modeling objective. We propose question generation as a pretraining method, which better aligns with the text generation objectives. Our text generation models pretrain…
▽ More
Recent trends in natural language processing using pretraining have shifted focus towards pretraining and fine-tuning approaches for text generation. Often the focus has been on task-agnostic approaches that generalize the language modeling objective. We propose question generation as a pretraining method, which better aligns with the text generation objectives. Our text generation models pretrained with this method are better at understanding the essence of the input and are better language models for the target task. When evaluated on two text generation tasks, abstractive summarization and answer-focused question generation, our models result in state-of-the-art performances in terms of automatic metrics. Human evaluators also found our summaries and generated questions to be more natural, concise and informative.
△ Less
Submitted 23 April, 2020;
originally announced April 2020.
-
Reservoir Computing with Planar Nanomagnet Arrays
Authors:
Peng Zhou,
Nathan R. McDonald,
Alexander J. Edwards,
Lisa Loomis,
Clare D. Thiem,
Joseph S. Friedman
Abstract:
Reservoir computing is an emerging methodology for neuromorphic computing that is especially well-suited for hardware implementations in size, weight, and power (SWaP) constrained environments. This work proposes a novel hardware implementation of a reservoir computer using a planar nanomagnet array. A small nanomagnet reservoir is demonstrated via micromagnetic simulations to be able to identify…
▽ More
Reservoir computing is an emerging methodology for neuromorphic computing that is especially well-suited for hardware implementations in size, weight, and power (SWaP) constrained environments. This work proposes a novel hardware implementation of a reservoir computer using a planar nanomagnet array. A small nanomagnet reservoir is demonstrated via micromagnetic simulations to be able to identify simple waveforms with 100% accuracy. Planar nanomagnet reservoirs are a promising new solution to the growing need for dedicated neuromorphic hardware.
△ Less
Submitted 24 March, 2020;
originally announced March 2020.
-
Magnetic breakdown and charge density wave formation: a quantum oscillation study of the rare-earth tritellurides
Authors:
P. Walmsley,
S. Aeschlimann,
J. A. W. Straquadine,
P. Giraldo-Gallo,
S. C. Riggs,
M. K. Chan,
R. D. McDonald,
I. R. Fisher
Abstract:
The rare-earth tritellurides ($R$Te$_3$, where $R$ = La, Ce, Pr, Nd, Sm, Gd, Tb, Dy, Ho, Er, Tm, Y) form a charge density wave state consisting of a single unidirectional charge density wave for lighter $R$, with a second unidirectional charge density wave, perpendicular and in addition to the first, also present at low temperatures for heavier $R$. We present a quantum oscillation study in magnet…
▽ More
The rare-earth tritellurides ($R$Te$_3$, where $R$ = La, Ce, Pr, Nd, Sm, Gd, Tb, Dy, Ho, Er, Tm, Y) form a charge density wave state consisting of a single unidirectional charge density wave for lighter $R$, with a second unidirectional charge density wave, perpendicular and in addition to the first, also present at low temperatures for heavier $R$. We present a quantum oscillation study in magnetic fields up to 65T that compares the single charge density wave state with the double charge density wave state both above and below the magnetic breakdown field of the second charge density wave. In the double charge density wave state it is observed that there remain several small, light pockets with the largest occupying around 0.5% of the Brillouin zone. By applying magnetic fields above the independently determined magnetic breakown field, the quantum oscillation frequencies of the single charge density wave state are recovered, as expected in a magnetic breakdown scenario. Measurements of the electronic effective mass do not show any divergence or significant increase on the pockets of Fermi surface observed here as the putative quantum phase transition between the single and double charge density wave states is approached.
△ Less
Submitted 10 May, 2020; v1 submitted 21 March, 2020;
originally announced March 2020.
-
Extent of Fermi-surface reconstruction in the high-temperature superconductor HgBa$_2$CuO$_{4+δ}$
Authors:
Mun K. Chan,
Ross D. McDonald,
Brad J. Ramshaw,
Jon B. Betts,
Arkady Shekhter,
Eric D. Bauer,
Neil Harrison
Abstract:
High magnetic fields have revealed a surprisingly small Fermi-surface in underdoped cuprates, possibly resulting from Fermi-surface reconstruction due to an order parameter that breaks translational symmetry of the crystal lattice. A crucial issue concerns the do** extent of this state and its relationship to the principal pseudogap and superconducting phases. We employ pulsed magnetic field mea…
▽ More
High magnetic fields have revealed a surprisingly small Fermi-surface in underdoped cuprates, possibly resulting from Fermi-surface reconstruction due to an order parameter that breaks translational symmetry of the crystal lattice. A crucial issue concerns the do** extent of this state and its relationship to the principal pseudogap and superconducting phases. We employ pulsed magnetic field measurements on the cuprate HgBa$_2$CuO$_{4+δ}$ to identify signatures of Fermi surface reconstruction from a sign change of the Hall effect and a peak in the temperature-dependent planar resistivity. We trace the termination of Fermi-surface reconstruction to two hole concentrations where the superconducting upper critical fields are found to be enhanced. One of these points is associated with the pseudogap end-point near optimal do**. These results connect the Fermi-surface reconstruction to both superconductivity and the pseudogap phenomena.
△ Less
Submitted 12 March, 2020;
originally announced March 2020.
-
The strange metal Hall effect connects quantum criticality and superconductivity in an iron-based superconductor
Authors:
Ian M. Hayes,
Nikola Maksimovic,
Mun K. Chan,
Gilbert N. Lopez,
B. J. Ramshaw,
Ross D. McDonald,
James G. Analytis
Abstract:
Many unconventional superconductors exhibit a common set of anomalous charge transport properties that characterize them as `strange metals', which provides hope that there is single theory that describes them. However, model-independent connections between the strange metal and superconductivity have remained elusive. In this letter, we show that the Hall effect of the unconventional superconduct…
▽ More
Many unconventional superconductors exhibit a common set of anomalous charge transport properties that characterize them as `strange metals', which provides hope that there is single theory that describes them. However, model-independent connections between the strange metal and superconductivity have remained elusive. In this letter, we show that the Hall effect of the unconventional superconductor BaFe$_2$(As$_{1-x}$P$_x$)$_2$ contains an anomalous contribution arising from the correlations within the strange metal. This term has a distinctive dependence on magnetic field, which allows us to track its behavior across the do**-temperature phase diagram, even under the superconducting dome. These measurements demonstrate that the strange metal Hall component emanates from a quantum critical point and, in the zero temperature limit, decays in proportion to the superconducting critical temperature. This creates a clear and novel connection between quantum criticality and superconductivity, and suggests that similar connections exist in other strange metal superconductors.
△ Less
Submitted 12 December, 2019;
originally announced December 2019.
-
Measuring Domain Portability and ErrorPropagation in Biomedical QA
Authors:
Stefan Hosein,
Daniel Andor,
Ryan McDonald
Abstract:
In this work we present Google's submission to the BioASQ 7 biomedical question answering (QA) task (specifically Task 7b, Phase B). The core of our systems are based on BERT QA models, specifically the model of \cite{alberti2019bert}. In this report, and via our submissions, we aimed to investigate two research questions. We start by studying how domain portable are QA systems that have been pre-…
▽ More
In this work we present Google's submission to the BioASQ 7 biomedical question answering (QA) task (specifically Task 7b, Phase B). The core of our systems are based on BERT QA models, specifically the model of \cite{alberti2019bert}. In this report, and via our submissions, we aimed to investigate two research questions. We start by studying how domain portable are QA systems that have been pre-trained and fine-tuned on general texts, e.g., Wikipedia. We measure this via two submissions. The first is a non-adapted model that uses a public pre-trained BERT model and is fine-tuned on the Natural Questions data set \cite{kwiatkowski2019natural}. The second system takes this non-adapted model and fine-tunes it with the BioASQ training data. Next, we study the impact of error propagation in end-to-end retrieval and QA systems. Again we test this via two submissions. The first uses human annotated relevant documents and snippets as input to the model and the second predicted documents and snippets. Our main findings are that domain specific fine-tuning can benefit Biomedical QA. However, the biggest quality bottleneck is at the retrieval stage, where we see large drops in metrics -- over 10pts absolute -- when using non gold inputs to the QA model.
△ Less
Submitted 24 September, 2019; v1 submitted 12 September, 2019;
originally announced September 2019.
-
Dynamics of Hot Bose-Einstein Condensates: stochastic Ehrenfest relations for number and energy dam**
Authors:
Rob G. McDonald,
Peter S. Barnett,
Fradom Atayee,
Ashton S. Bradley
Abstract:
Describing partially-condensed Bose gases poses a long-standing theoretical challenge. We present exact stochastic Ehrenfest relations for the stochastic projected Gross-Pitaevskii equation, including both number and energy dam** mechanisms, and all projector terms that arise from the energy cutoff separating system from reservoir. We test the theory by applying it to the centre of mass fluctuat…
▽ More
Describing partially-condensed Bose gases poses a long-standing theoretical challenge. We present exact stochastic Ehrenfest relations for the stochastic projected Gross-Pitaevskii equation, including both number and energy dam** mechanisms, and all projector terms that arise from the energy cutoff separating system from reservoir. We test the theory by applying it to the centre of mass fluctuations of a harmonically trapped prolate system, finding close agreement between c-field simulations and analytical results. The formalism lays the foundation to analytically explore experimentally accessible hot Bose-Einstein condensates.
△ Less
Submitted 12 December, 2019; v1 submitted 15 August, 2019;
originally announced August 2019.
-
Growth of nematic susceptibility in the field-induced normal state of an iron-based superconductor revealed by elastoresistivity measurements in a 65 T pulsed magnet
Authors:
J. A. W. Straquadine,
J. C. Palmstrom,
P. Walmsley,
A. T. Hristov,
F. Weickert,
F. F. Balakirev,
M. Jaime,
R. McDonald,
I. R. Fisher
Abstract:
In the iron-based superconductors, both nematic and magnetic fluctuations are expected to enhance superconductivity and may originate from a quantum critical point hidden beneath the superconducting dome. The behavior of the non-superconducting state can be an important piece of the puzzle, motivating in this paper the use of high magnetic fields to suppress superconductivity and measure the nemat…
▽ More
In the iron-based superconductors, both nematic and magnetic fluctuations are expected to enhance superconductivity and may originate from a quantum critical point hidden beneath the superconducting dome. The behavior of the non-superconducting state can be an important piece of the puzzle, motivating in this paper the use of high magnetic fields to suppress superconductivity and measure the nematic susceptibility of the normal state at low temperatures. We describe experimental advances which make it possible to measure a resistive gauge factor (which is a proxy for the nematic susceptibility) in the field-induced normal state in a 65 T pulsed magnet, and report measurements of the gauge factor of a micromachined single crystal of Ba(Fe$_{0.926}$Co$_{0.074}$)$_2$As$_2$ at temperatures down to 1.2 K. The nematic susceptibility increases monotonically in the field-induced normal state as the temperature decreases, consistent with the presence of a quantum critical point nearby in composition.
△ Less
Submitted 28 July, 2019;
originally announced July 2019.
-
Exchange biased Anomalous Hall Effect driven by frustration in a magnetic Kagome lattice
Authors:
E. Lachman,
N. Maksimovic,
R. Kealhofer,
S. Haley,
R. McDonald,
James G. Analytis
Abstract:
Co3Sn2S2 is a ferromagnetic Weyl semimetal that has been the subject of intense scientific interest due to its large anomalous Hall effect. We show that the coupling of this material's topological properties to its magnetic texture leads to a strongly exchange biased anomalous Hall effect. We argue that this is likely caused by the coexistence of ferromagnetism and spin glass phases, the latter be…
▽ More
Co3Sn2S2 is a ferromagnetic Weyl semimetal that has been the subject of intense scientific interest due to its large anomalous Hall effect. We show that the coupling of this material's topological properties to its magnetic texture leads to a strongly exchange biased anomalous Hall effect. We argue that this is likely caused by the coexistence of ferromagnetism and spin glass phases, the latter being driven by the geometric frustration intrinsic to the Kagome network of magnetic ions.
△ Less
Submitted 15 July, 2019;
originally announced July 2019.
-
Spin-valley locking, bulk quantum Hall effect and chiral surface state in a noncentrosymmetric Dirac semimetal BaMnSb$_2$
Authors:
J. Y. Liu,
J. Yu,
J. L. Ning,
H. M. Yi,
L. Miao,
L. J. Min,
Y. F. Zhao,
W. Ning,
K. A. Lopez,
Y. L. Zhu,
T. Pillsbury,
Y. B. Zhang,
Y. Wang,
J. Hu,
H. B. Cao,
F. Balakirev,
F. Weickert,
M. Jaime,
Y. Lai,
Kun Yang,
J. W. Sun,
N. Alem,
V. Gopalan,
C. Z. Chang,
N. Samarth
, et al. (3 additional authors not shown)
Abstract:
Spin-valley locking in the band structure of monolayers of MoS$_2$ and other group-VI dichalcogenides has attracted enormous interest, since it offers potential for valleytronic and optoelectronic applications. Such an exotic electronic state has sparsely been seen in bulk materials. Here, we report spin-valley locking in a bulk Dirac semimetal BaMnSb$_2$. We find valley and spin are inherently co…
▽ More
Spin-valley locking in the band structure of monolayers of MoS$_2$ and other group-VI dichalcogenides has attracted enormous interest, since it offers potential for valleytronic and optoelectronic applications. Such an exotic electronic state has sparsely been seen in bulk materials. Here, we report spin-valley locking in a bulk Dirac semimetal BaMnSb$_2$. We find valley and spin are inherently coupled for both valence and conduction bands in this material. This is revealed by comprehensive studies using first principle calculations, tight-binding and effective model analyses, angle-resolved photoemission spectroscopy and quantum transport measurements. Moreover, this material also exhibits a stacked quantum Hall effect. The spin-valley degeneracy extracted from the plateau height of quantized Hall resistivity is close to 2. This result, together with the observed Landau level spin splitting, further confirms the spin-valley locking picture. In the extreme quantum limit, we have also observed a two-dimensional chiral metal at the side surface, which represents a novel topological quantum liquid. These findings establish BaMnSb$_2$ as a rare platform for exploring coupled spin and valley physics in bulk single crystals and accessing 3D interacting topological states.
△ Less
Submitted 4 November, 2020; v1 submitted 14 July, 2019;
originally announced July 2019.
-
Embedding Biomedical Ontologies by Jointly Encoding Network Structure and Textual Node Descriptors
Authors:
Sotiris Kotitsas,
Dimitris Pappas,
Ion Androutsopoulos,
Ryan McDonald,
Marianna Apidianaki
Abstract:
Network Embedding (NE) methods, which map network nodes to low-dimensional feature vectors, have wide applications in network analysis and bioinformatics. Many existing NE methods rely only on network structure, overlooking other information associated with the nodes, e.g., text describing the nodes. Recent attempts to combine the two sources of information only consider local network structure. W…
▽ More
Network Embedding (NE) methods, which map network nodes to low-dimensional feature vectors, have wide applications in network analysis and bioinformatics. Many existing NE methods rely only on network structure, overlooking other information associated with the nodes, e.g., text describing the nodes. Recent attempts to combine the two sources of information only consider local network structure. We extend NODE2VEC, a well-known NE method that considers broader network structure, to also consider textual node descriptors using recurrent neural encoders. Our method is evaluated on link prediction in two networks derived from UMLS. Experimental results demonstrate the effectiveness of the proposed approach compared to previous work.
△ Less
Submitted 20 June, 2019; v1 submitted 13 June, 2019;
originally announced June 2019.
-
Dirac fermions and flat bands in the ideal kagome metal FeSn
Authors:
Mingu Kang,
Linda Ye,
Shiang Fang,
Jhih-Shih You,
Abe Levitan,
Minyong Han,
Jorge I. Facio,
Chris Jozwiak,
Aaron Bostwick,
Eli Rotenberg,
Mun K. Chan,
Ross D. McDonald,
David Graf,
Konstantine Kaznatcheev,
Elio Vescovo,
David C. Bell,
Efthimios Kaxiras,
Jeroen van den Brink,
Manuel Richter,
Madhav Prasad Ghimire,
Joseph G. Checkelsky,
Riccardo Comin
Abstract:
The kagome lattice based on 3d transition metals is a versatile platform for novel topological phases hosting symmetry-protected electronic excitations and exotic magnetic ground states. However, the paradigmatic states of the idealized two-dimensional (2D) kagome lattice - Dirac fermions and topological flat bands - have not been simultaneously observed, partly owing to the complex stacking struc…
▽ More
The kagome lattice based on 3d transition metals is a versatile platform for novel topological phases hosting symmetry-protected electronic excitations and exotic magnetic ground states. However, the paradigmatic states of the idealized two-dimensional (2D) kagome lattice - Dirac fermions and topological flat bands - have not been simultaneously observed, partly owing to the complex stacking structure of the kagome compounds studied to date. Here, we take the approach of examining FeSn, an antiferromagnetic single-layer kagome metal with spatially-decoupled kagome planes. Using polarization- and termination-dependent angle-resolved photoemission spectroscopy (ARPES), we detect the momentum-space signatures of coexisting flat bands and Dirac fermions in the vicinity of the Fermi energy. Intriguingly, when complemented with bulk-sensitive de Haas-van Alphen (dHvA) measurements, our data reveal an even richer electronic structure that exhibits robust surface Dirac fermions on specific crystalline terminations. Through band structure calculations and matrix element simulations, we demonstrate that the bulk Dirac bands arise from in-plane localized Fe-3d orbitals under kagome symmetry, while the surface state realizes a rare example of fully spin-polarized 2D Dirac fermions when combined with spin-layer locking in FeSn. These results highlight FeSn as a prototypical host for the emergent excitations of the kagome lattice. The prospect to harness these excitations for novel topological phases and spintronic devices is a frontier of great promise at the confluence of topology, magnetism, and strongly-correlated electron physics.
△ Less
Submitted 5 June, 2019;
originally announced June 2019.