Search | arXiv e-print repository

What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment

Authors: Matthew Finlayson, Kyle Richardson, Ashish Sabharwal, Peter Clark

Abstract: The instruction learning paradigm -- where a model learns to perform new tasks from task descriptions alone -- has become popular in general-purpose model research. The capabilities of large transformer models as instruction learners, however, remain poorly understood. We use a controlled synthetic environment to characterize such capabilities. Specifically, we use the task of deciding whether a g… ▽ More The instruction learning paradigm -- where a model learns to perform new tasks from task descriptions alone -- has become popular in general-purpose model research. The capabilities of large transformer models as instruction learners, however, remain poorly understood. We use a controlled synthetic environment to characterize such capabilities. Specifically, we use the task of deciding whether a given string matches a regular expression (viewed as an instruction) to identify properties of tasks, instructions, and instances that make instruction learning challenging. For instance, we find that our model, a fine-tuned T5-based text2text transformer, struggles with large regular languages, suggesting that less precise instructions are challenging for models. Additionally, instruction executions that require tracking longer contexts of prior steps are also more difficult. We use our findings to systematically construct a challenging instruction learning dataset, which we call Hard RegSet. Fine-tuning on Hard RegSet, our large transformer learns to correctly interpret only 65.6% of test instructions (with at least 90% accuracy), and 11%-24% of the instructions in out-of-distribution generalization settings. We propose Hard RegSet as a challenging instruction learning task, and a controlled environment for studying instruction learning. △ Less

Submitted 24 May, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

Comments: Typos corrected, rewordings

MSC Class: 68T50 ACM Class: I.2.7

arXiv:2203.13337 [pdf]

Electrical Programmable Multi-Level Non-volatile Photonic Random-Access Memory

Authors: Jiawei Meng, Yaliang Gui, Behrouz Movahhed Nouri, Gelu Comanescu, Xiaoxuan Ma, Yifei Zhang, Cosmin-Constantin Popescu, Myungkoo Kang, Mario Miscuglio, Nicola Peserico, Kathleen A. Richardson, Juejun Hu, Hamed Dalir, Volker J. Sorger

Abstract: Photonic Random-Access Memories (P-RAM) are an essential component for the on-chip non-von Neumann photonic computing by eliminating optoelectronic conversion losses in data links. Emerging Phase Change Materials (PCMs) have been showed multilevel memory capability, but demonstrations still yield relatively high optical loss and require cumbersome WRITE-ERASE approaches increasing power consumptio… ▽ More Photonic Random-Access Memories (P-RAM) are an essential component for the on-chip non-von Neumann photonic computing by eliminating optoelectronic conversion losses in data links. Emerging Phase Change Materials (PCMs) have been showed multilevel memory capability, but demonstrations still yield relatively high optical loss and require cumbersome WRITE-ERASE approaches increasing power consumption and system package challenges. Here we demonstrate a multi-state electrically-programmed low-loss non-volatile photonic memory based on a broadband transparent phase change material (Ge2Sb2Se5, GSSe) with ultra-low absorption in the amorphous state. A zero-static-power and electrically-programmed multi-bit P-RAM is demonstrated on a silicon-on-insulator platform, featuring efficient amplitude modulation up to 0.2 dB/μm and an ultra-low insertion loss of total 0.12 dB for a 4-bit memory showing a 100x improved signal to loss ratio compared to other phase-change-materials based photonic memories. We further optimize the positioning of dual micro-heaters validating performance tradeoffs. Experimentally we demonstrate a half-a million cyclability test showcasing the robust approach of this material and device. Low-loss photonic retention-of-state adds a key feature for photonic functional and programmable circuits impacting many applications including neural networks, LiDAR, and sensors for example. △ Less

Submitted 21 June, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

arXiv:2202.13127 [pdf]

doi 10.1002/qute.202200022

Plug-and-play quantum devices with efficient fiber-quantum dot interface

Authors: Woong Bae Jeon, Jong Sung Moon, Kyu-Young Kim, Young-Ho Ko, Christopher J. K. Richardson, Edo Waks, Je-Hyung Kim

Abstract: Incorporating solid-state quantum emitters into optical fiber networks enables the long-distance transmission of quantum information and the remote connection of distributed quantum nodes. However, interfacing quantum emitters with fiber optics encounters several challenges, including low coupling efficiency and stability. Here, we demonstrate a highly efficient fiber-interfacing photonic device t… ▽ More Incorporating solid-state quantum emitters into optical fiber networks enables the long-distance transmission of quantum information and the remote connection of distributed quantum nodes. However, interfacing quantum emitters with fiber optics encounters several challenges, including low coupling efficiency and stability. Here, we demonstrate a highly efficient fiber-interfacing photonic device that directly launches single photons from quantum dots into a standard FC/PC-connectorized single-mode fiber (SMF28). Optimally designed photonic structures based on hole gratings produce an ultra-narrow directional beam that matches the small numerical aperture of a single-mode fiber. A pick-and-place technique selectively integrates a single miniaturized device into the core of the fiber. Our approach realizes a plug-and-play single-photon device that does not require any optical alignment and thus guarantees long-term stability. The results thus represent a major step toward practical and reliable quantum lights across a fiber network. △ Less

Submitted 26 February, 2022; originally announced February 2022.

Comments: 19 pages, 10 figures including supporting information

Journal ref: Advanced Quantum Technologies 2200022 (2022)

arXiv:2112.14857 [pdf, other]

doi 10.1038/s41467-022-31607-7

Waveguide-Integrated Mid-Infrared Photodetection using Graphene on a Scalable Chalcogenide Glass Platform

Authors: Jordan Goldstein, Hongtao Lin, Skylar Deckoff-Jones, Marek Hempel, Ang-Yu Lu, Kathleen A. Richardson, Tomas Palacios, **g Kong, Juejun Hu, Dirk Englund

Abstract: The development of compact and fieldable mid-infrared (mid-IR) spectroscopy devices represents a critical challenge for distributed sensing with applications from gas leak detection to environmental monitoring. Recent work has focused on mid-IR photonic integrated circuit (PIC) sensing platforms and waveguide-integrated mid-IR light sources and detectors based on semiconductors such as PbTe, black… ▽ More The development of compact and fieldable mid-infrared (mid-IR) spectroscopy devices represents a critical challenge for distributed sensing with applications from gas leak detection to environmental monitoring. Recent work has focused on mid-IR photonic integrated circuit (PIC) sensing platforms and waveguide-integrated mid-IR light sources and detectors based on semiconductors such as PbTe, black phosphorus and tellurene. However, material bandgaps and reliance on SiO$_2$ substrates limit operation to wavelengths $λ\lesssim4\,μ\textrm{m}$. Here we overcome these challenges with a chalcogenide glass-on-CaF$_2$ PIC architecture incorporating split-gate photothermoelectric graphene photodetectors. Our design extends operation to $λ=5.2\,μ\textrm{m}$ with a Johnson noise-limited noise-equivalent power of $1.1\,\mathrm{nW}/\mathrm{Hz}^{1/2}$, no fall-off in photoresponse up to $f = 1\,\mathrm{MHz}$, and a predicted 3-dB bandwidth of $f_{3\textrm{dB}}>1\,\mathrm{GHz}$. This mid-IR PIC platform readily extends to longer wavelengths and opens the door to applications from distributed gas sensing and portable dual comb spectroscopy to weather-resilient free space optical communications. △ Less

Submitted 29 December, 2021; originally announced December 2021.

Comments: 15 pages, 11 figures

arXiv:2112.09054 [pdf, other]

Pushing the Limits of Rule Reasoning in Transformers through Natural Language Satisfiability

Authors: Kyle Richardson, Ashish Sabharwal

Abstract: Investigating the reasoning abilities of transformer models, and discovering new challenging tasks for them, has been a topic of much interest. Recent studies have found these models to be surprisingly strong at performing deductive reasoning over formal logical theories expressed in natural language. A shortcoming of these studies, however, is that they do not take into account that logical theor… ▽ More Investigating the reasoning abilities of transformer models, and discovering new challenging tasks for them, has been a topic of much interest. Recent studies have found these models to be surprisingly strong at performing deductive reasoning over formal logical theories expressed in natural language. A shortcoming of these studies, however, is that they do not take into account that logical theories, when sampled uniformly at random, do not necessarily lead to hard instances. We propose a new methodology for creating challenging algorithmic reasoning datasets that focus on natural language satisfiability (NLSat) problems. The key idea is to draw insights from empirical sampling of hard propositional SAT problems and from complexity-theoretic studies of language. This methodology allows us to distinguish easy from hard instances, and to systematically increase the complexity of existing reasoning benchmarks such as RuleTaker. We find that current transformers, given sufficient training data, are surprisingly robust at solving the resulting NLSat problems of substantially increased difficulty. They also exhibit some degree of scale-invariance - the ability to generalize to problems of larger size and scope. Our results, however, reveal important limitations too: a careful sampling of training data is crucial for building models that generalize to larger problems, and transformer models' limited scale-invariance suggests they are far from learning robust deductive reasoning algorithms. △ Less

Submitted 16 December, 2021; originally announced December 2021.

Comments: Accepted to AAAI-2022, AAAI preprint

arXiv:2112.08348 [pdf, other]

Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

Authors: Daniel Khashabi, Shane Lyu, Sewon Min, Lianhui Qin, Kyle Richardson, Sean Welleck, Hannaneh Hajishirzi, Tushar Khot, Ashish Sabharwal, Sameer Singh, Ye** Choi

Abstract: Fine-tuning continuous prompts for target tasks has recently emerged as a compact alternative to full model fine-tuning. Motivated by these promising results, we investigate the feasibility of extracting a discrete (textual) interpretation of continuous prompts that is faithful to the problem they solve. In practice, we observe a "wayward" behavior between the task solved by continuous prompts and… ▽ More Fine-tuning continuous prompts for target tasks has recently emerged as a compact alternative to full model fine-tuning. Motivated by these promising results, we investigate the feasibility of extracting a discrete (textual) interpretation of continuous prompts that is faithful to the problem they solve. In practice, we observe a "wayward" behavior between the task solved by continuous prompts and their nearest neighbor discrete projections: We can find continuous prompts that solve a task while being projected to an arbitrary text (e.g., definition of a different or even a contradictory task), while being within a very small (2%) margin of the best continuous prompt of the same size for the task. We provide intuitions behind this odd and surprising behavior, as well as extensive empirical analyses quantifying the effect of various parameters. For instance, for larger model sizes we observe higher waywardness, i.e, we can find prompts that more closely map to any arbitrary text with a smaller drop in accuracy. These findings have important implications relating to the difficulty of faithfully interpreting continuous prompts and their generalization across models and tasks, providing guidance for future progress in prompting language models. △ Less

Submitted 4 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

Comments: NAACL 2022

arXiv:2112.00086 [pdf, other]

Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking

Authors: Ronen Tamari, Kyle Richardson, Aviad Sar-Shalom, Noam Kahlon, Nelson Liu, Reut Tsarfaty, Dafna Shahaf

Abstract: While neural language models often perform surprisingly well on natural language understanding (NLU) tasks, their strengths and limitations remain poorly understood. Controlled synthetic tasks are thus an increasingly important resource for diagnosing model behavior. In this work we focus on story understanding, a core competency for NLU systems. However, the main synthetic resource for story unde… ▽ More While neural language models often perform surprisingly well on natural language understanding (NLU) tasks, their strengths and limitations remain poorly understood. Controlled synthetic tasks are thus an increasingly important resource for diagnosing model behavior. In this work we focus on story understanding, a core competency for NLU systems. However, the main synthetic resource for story understanding, the bAbI benchmark, lacks such a systematic mechanism for controllable task generation. We develop Dyna-bAbI, a dynamic framework providing fine-grained control over task generation in bAbI. We demonstrate our ideas by constructing three new tasks requiring compositional generalization, an important evaluation setting absent from the original benchmark. We tested both special-purpose models developed for bAbI as well as state-of-the-art pre-trained methods, and found that while both approaches solve the original tasks (>99% accuracy), neither approach succeeded in the compositional generalization setting, indicating the limitations of the original training data. We explored ways to augment the original data, and found that though diversifying training data was far more useful than simply increasing dataset size, it was still insufficient for driving robust compositional generalization (with <70% accuracy for complex compositions). Our results underscore the importance of highly controllable task generators for creating robust NLU systems through a virtuous cycle of model and data development. △ Less

Submitted 30 November, 2021; originally announced December 2021.

Comments: Code and data will be made available at project page: https://tiny.one/8wjxwd7z

arXiv:2110.08542 [pdf, other]

Hey AI, Can You Solve Complex Tasks by Talking to Agents?

Authors: Tushar Khot, Kyle Richardson, Daniel Khashabi, Ashish Sabharwal

Abstract: Training giant models from scratch for each complex task is resource- and data-inefficient. To help develop models that can leverage existing systems, we propose a new challenge: Learning to solve complex tasks by communicating with existing agents (or models) in natural language. We design a synthetic benchmark, CommaQA, with three complex reasoning tasks (explicit, implicit, numeric) designed to… ▽ More Training giant models from scratch for each complex task is resource- and data-inefficient. To help develop models that can leverage existing systems, we propose a new challenge: Learning to solve complex tasks by communicating with existing agents (or models) in natural language. We design a synthetic benchmark, CommaQA, with three complex reasoning tasks (explicit, implicit, numeric) designed to be solved by communicating with existing QA agents. For instance, using text and table QA agents to answer questions such as "Who had the longest javelin throw from USA?". We show that black-box models struggle to learn this task from scratch (accuracy under 50\%) even with access to each agent's knowledge and gold facts supervision. In contrast, models that learn to communicate with agents outperform black-box models, reaching scores of 100\% when given gold decomposition supervision. However, we show that the challenge of learning to solve complex tasks by communicating with existing agents \emph{without relying on any auxiliary supervision or data} still remains highly elusive. We release CommaQA, along with a compositional generalization test split, to advance research in this direction. Dataset and Code available at https://github.com/allenai/commaqa. △ Less

Submitted 9 May, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

Comments: Accepted to Findings of ACL 2022

arXiv:2110.01509 [pdf, other]

DeepA2: A Modular Framework for Deep Argument Analysis with Pretrained Neural Text2Text Language Models

Authors: Gregor Betz, Kyle Richardson

Abstract: In this paper, we present and implement a multi-dimensional, modular framework for performing deep argument analysis (DeepA2) using current pre-trained language models (PTLMs). ArgumentAnalyst -- a T5 model (Raffel et al. 2020) set up and trained within DeepA2 -- reconstructs argumentative texts, which advance an informal argumentation, as valid arguments: It inserts, e.g., missing premises and co… ▽ More In this paper, we present and implement a multi-dimensional, modular framework for performing deep argument analysis (DeepA2) using current pre-trained language models (PTLMs). ArgumentAnalyst -- a T5 model (Raffel et al. 2020) set up and trained within DeepA2 -- reconstructs argumentative texts, which advance an informal argumentation, as valid arguments: It inserts, e.g., missing premises and conclusions, formalizes inferences, and coherently links the logical reconstruction to the source text. We create a synthetic corpus for deep argument analysis, and evaluate ArgumentAnalyst on this new dataset as well as on existing data, specifically EntailmentBank (Dalvi et al. 2021). Our empirical findings vindicate the overall framework and highlight the advantages of a modular design, in particular its ability to emulate established heuristics (such as hermeneutic cycles), to explore the model's uncertainty, to cope with the plurality of correct solutions (underdetermination), and to exploit higher-order evidence. △ Less

Submitted 1 July, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

Comments: A Demo is available at https://huggingface.co/spaces/debatelab/deepa2-demo , the model can be downloaded from https://huggingface.co/debatelab/argument-analyst , and the datasets can be accessed at https://huggingface.co/datasets/debatelab/aaac

Journal ref: *SEM 2022

arXiv:2106.03983 [pdf, other]

Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference

Authors: Hai Hu, He Zhou, Zuoyu Tian, Yiwen Zhang, Yina Ma, Yanting Li, Yixin Nie, Kyle Richardson

Abstract: Multilingual transformers (XLM, mT5) have been shown to have remarkable transfer skills in zero-shot settings. Most transfer studies, however, rely on automatically translated resources (XNLI, XQuAD), making it hard to discern the particular linguistic knowledge that is being transferred, and the role of expert annotated monolingual datasets when develo** task-specific models. We investigate the… ▽ More Multilingual transformers (XLM, mT5) have been shown to have remarkable transfer skills in zero-shot settings. Most transfer studies, however, rely on automatically translated resources (XNLI, XQuAD), making it hard to discern the particular linguistic knowledge that is being transferred, and the role of expert annotated monolingual datasets when develo** task-specific models. We investigate the cross-lingual transfer abilities of XLM-R for Chinese and English natural language inference (NLI), with a focus on the recent large-scale Chinese dataset OCNLI. To better understand linguistic transfer, we created 4 categories of challenge and adversarial tasks (totaling 17 new datasets) for Chinese that build on several well-known resources for English (e.g., HANS, NLI stress-tests). We find that cross-lingual models trained on English NLI do transfer well across our Chinese tasks (e.g., in 3/4 of our challenge categories, they perform as well/better than the best monolingual models, even on 3/5 uniquely Chinese linguistic phenomena such as idioms, pro drop). These results, however, come with important caveats: cross-lingual models often perform best when trained on a mixture of English and high-quality monolingual NLI data (OCNLI), and are often hindered by automatically translated resources (XNLI-zh). For many phenomena, all models continue to struggle, highlighting the need for our new diagnostics to help benchmark Chinese and cross-lingual models. All new datasets/code are released at https://github.com/huhailinguist/ChineseNLIProbing. △ Less

Submitted 7 June, 2021; originally announced June 2021.

Comments: accepted to ACL Findings 2021

arXiv:2105.06010 [pdf]

Ultra-compact nonvolatile phase shifter based on electrically reprogrammable transparent phase change materials

Authors: Carlos Ríos, Qingyang Du, Yifei Zhang, Cosmin-Constantin Popescu, Mikhail Y. Shalaginov, Paul Miller, Christopher Roberts, Myungkoo Kang, Kathleen A. Richardson, Tian Gu, Steven A. Vitale, Juejun Hu

Abstract: Energy-efficient programmable photonic integrated circuits (PICs) are the cornerstone of on-chip classical and quantum optical technologies. Optical phase shifters constitute the fundamental building blocks which enable these programmable PICs. Thus far, carrier modulation and thermo-optical effect are the chosen phenomena for ultrafast and low-loss phase shifters, respectively; however, the state… ▽ More Energy-efficient programmable photonic integrated circuits (PICs) are the cornerstone of on-chip classical and quantum optical technologies. Optical phase shifters constitute the fundamental building blocks which enable these programmable PICs. Thus far, carrier modulation and thermo-optical effect are the chosen phenomena for ultrafast and low-loss phase shifters, respectively; however, the state and information they carry are lost once the power is turned off-they are volatile. The volatility not only compromises energy efficiency due to their demand for constant power supply, but also precludes them from emerging applications such as in-memory computing. To circumvent this limitation, we introduce a novel phase shifting mechanism that exploits the nonvolatile refractive index modulation upon structural phase transition of Sb$_{2}$Se$_{3}$, a bi-stable transparent phase change material. A zero-static power and electrically-driven phase shifter was realized on a foundry-processed silicon-on-insulator platform, featuring record phase modulation up to 0.09 $π$/$μ$m and a low insertion loss of 0.3 dB/$π$, which can be further improved upon streamlined design. We also pioneered a one-step partial amorphization scheme to enhance the speed and energy efficiency of PCM devices. A diverse cohort of programmable photonic devices were demonstrated based on the ultra-compact PCM phase shifter. △ Less

Submitted 21 March, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

Comments: 15 pages with 6 figures and 1 table

arXiv:2103.13033 [pdf, other]

Thinking Aloud: Dynamic Context Generation Improves Zero-Shot Reasoning Performance of GPT-2

Authors: Gregor Betz, Kyle Richardson, Christian Voigt

Abstract: Thinking aloud is an effective meta-cognitive strategy human reasoners apply to solve difficult problems. We suggest to improve the reasoning ability of pre-trained neural language models in a similar way, namely by expanding a task's context with problem elaborations that are dynamically generated by the language model itself. Our main result is that dynamic problem elaboration significantly impr… ▽ More Thinking aloud is an effective meta-cognitive strategy human reasoners apply to solve difficult problems. We suggest to improve the reasoning ability of pre-trained neural language models in a similar way, namely by expanding a task's context with problem elaborations that are dynamically generated by the language model itself. Our main result is that dynamic problem elaboration significantly improves the zero-shot performance of GPT-2 in a deductive reasoning and natural language inference task: While the model uses a syntactic heuristic for predicting an answer, it is capable (to some degree) of generating reasoned additional context which facilitates the successful application of its heuristic. We explore different ways of generating elaborations, including fewshot learning, and find that their relative performance varies with the specific problem characteristics (such as problem difficulty). Moreover, the effectiveness of an elaboration can be explained in terms of the degree to which the elaboration semantically coheres with the corresponding problem. In particular, elaborations that are most faithful to the original problem description may boost accuracy by up to 24%. △ Less

Submitted 24 March, 2021; originally announced March 2021.

arXiv:2102.06722 [pdf, other]

doi 10.1103/PhysRevLett.127.081801

The search for low-mass axion dark matter with ABRACADABRA-10cm

Authors: Chiara P. Salemi, Joshua W. Foster, Jonathan L. Ouellet, Andrew Gavin, Kaliroe M. W. Pappas, Sabrina Cheng, Kate A. Richardson, Reyco Henning, Yonatan Kahn, Rachel Nguyen, Nicholas L. Rodd, Benjamin R. Safdi, Lindley Winslow

Abstract: Two of the most pressing questions in physics are the microscopic nature of the dark matter that comprises 84% of the mass in the universe and the absence of a neutron electric dipole moment. These questions would be resolved by the existence of a hypothetical particle known as the quantum chromodynamics (QCD) axion. In this work, we probe the hypothesis that axions constitute dark matter, using t… ▽ More Two of the most pressing questions in physics are the microscopic nature of the dark matter that comprises 84% of the mass in the universe and the absence of a neutron electric dipole moment. These questions would be resolved by the existence of a hypothetical particle known as the quantum chromodynamics (QCD) axion. In this work, we probe the hypothesis that axions constitute dark matter, using the ABRACADABRA-10cm experiment in a broadband configuration, with world-leading sensitivity. We find no significant evidence for axions, and we present 95% upper limits on the axion-photon coupling down to the world-leading level $g_{aγγ}<3.2 \times10^{-11}$ GeV$^{-1}$, representing one of the most sensitive searches for axions in the 0.41 - 8.27 neV mass range. Our work paves a direct path for future experiments capable of confirming or excluding the hypothesis that dark matter is a QCD axion in the mass range motivated by String Theory and Grand Unified Theories. △ Less

Submitted 12 February, 2021; originally announced February 2021.

Comments: 17 pages, 12 figures

Journal ref: Phys. Rev. Lett. 127, 081801 (2021)

arXiv:2102.03315 [pdf, other]

Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge

Authors: Sumithra Bhakthavatsalam, Daniel Khashabi, Tushar Khot, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord, Peter Clark

Abstract: We present the ARC-DA dataset, a direct-answer ("open response", "freeform") version of the ARC (AI2 Reasoning Challenge) multiple-choice dataset. While ARC has been influential in the community, its multiple-choice format is unrepresentative of real-world questions, and multiple choice formats can be particularly susceptible to artifacts. The ARC-DA dataset addresses these concerns by converting… ▽ More We present the ARC-DA dataset, a direct-answer ("open response", "freeform") version of the ARC (AI2 Reasoning Challenge) multiple-choice dataset. While ARC has been influential in the community, its multiple-choice format is unrepresentative of real-world questions, and multiple choice formats can be particularly susceptible to artifacts. The ARC-DA dataset addresses these concerns by converting questions to direct-answer format using a combination of crowdsourcing and expert review. The resulting dataset contains 2985 questions with a total of 8436 valid answers (questions typically have more than one valid answer). ARC-DA is one of the first DA datasets of natural questions that often require reasoning, and where appropriate question decompositions are not evident from the questions themselves. We describe the conversion approach taken, appropriate evaluation metrics, and several strong models. Although high, the best scores (81% GENIE, 61.4% F1, 63.2% ROUGE-L) still leave considerable room for improvement. In addition, the dataset provides a natural setting for new research on explanation, as many questions require reasoning to construct answers. We hope the dataset spurs further advances in complex question-answering by the community. ARC-DA is available at https://allenai.org/data/arc-da △ Less

Submitted 5 February, 2021; originally announced February 2021.

arXiv:2102.01761 [pdf]

Deep Convolutional Neural Networks to Predict Mutual Coupling Effects in Metasurfaces

Authors: Sensong An, Bowen Zheng, Mikhail Y. Shalaginov, Hong Tang, Hang Li, Li Zhou, Yunxi Dong, Mohammad Haerinia, Anuradha Murthy Agarwal, Clara Rivero-Baleine, Myungkoo Kang, Kathleen A. Richardson, Tian Gu, Juejun Hu, Clayton Fowler, Hualiang Zhang

Abstract: Metasurfaces have provided a novel and promising platform for the realization of compact and large-scale optical devices. The conventional metasurface design approach assumes periodic boundary conditions for each element, which is inaccurate in most cases since the near-field coupling effects between elements will change when surrounded by non-identical structures. In this paper, we propose a deep… ▽ More Metasurfaces have provided a novel and promising platform for the realization of compact and large-scale optical devices. The conventional metasurface design approach assumes periodic boundary conditions for each element, which is inaccurate in most cases since the near-field coupling effects between elements will change when surrounded by non-identical structures. In this paper, we propose a deep learning approach to predict the actual electromagnetic (EM) responses of each target meta-atom placed in a large array with near-field coupling effects taken into account. The predicting neural network takes the physical specifications of the target meta-atom and its neighbors as input, and calculates its phase and amplitude in milliseconds. This approach can be applied to explain metasurfaces' performance deterioration caused by mutual coupling and further used to optimize their efficiencies once combined with optimization algorithms. To demonstrate the efficacy of this methodology, we obtain large improvements in efficiency for a beam deflector and a metalens over the conventional design approach. Moreover, we show the correlations between a metasurface's performance and its design errors caused by mutual coupling are not bound to certain specifications (materials, shapes, etc.). As such, we envision that this approach can be readily applied to explore the mutual coupling effects and improve the performance of various metasurface designs. △ Less

Submitted 2 February, 2021; originally announced February 2021.

Comments: 16 pages, 10 figures

arXiv:2011.08092 [pdf, other]

A Dataset for Tracking Entities in Open Domain Procedural Text

Authors: Niket Tandon, Keisuke Sakaguchi, Bhavana Dalvi Mishra, Dheeraj Rajagopal, Peter Clark, Michal Guerquin, Kyle Richardson, Eduard Hovy

Abstract: We present the first dataset for tracking state changes in procedural text from arbitrary domains by using an unrestricted (open) vocabulary. For example, in a text describing fog removal using potatoes, a car window may transition between being foggy, sticky,opaque, and clear. Previous formulations of this task provide the text and entities involved,and ask how those entities change for just a sm… ▽ More We present the first dataset for tracking state changes in procedural text from arbitrary domains by using an unrestricted (open) vocabulary. For example, in a text describing fog removal using potatoes, a car window may transition between being foggy, sticky,opaque, and clear. Previous formulations of this task provide the text and entities involved,and ask how those entities change for just a small, pre-defined set of attributes (e.g., location), limiting their fidelity. Our solution is a new task formulation where given just a procedural text as input, the task is to generate a set of state change tuples(entity, at-tribute, before-state, after-state)for each step,where the entity, attribute, and state values must be predicted from an open vocabulary. Using crowdsourcing, we create OPENPI1, a high-quality (91.5% coverage as judged by humans and completely vetted), and large-scale dataset comprising 29,928 state changes over 4,050 sentences from 810 procedural real-world paragraphs from WikiHow.com. A current state-of-the-art generation model on this task achieves 16.1% F1 based on BLEU metric, leaving enough room for novel model architectures. △ Less

Submitted 30 October, 2020; originally announced November 2020.

Comments: To appear in EMNLP 2020

arXiv:2010.13778 [pdf]

doi 10.1088/2058-9565/abfa64

Achieving a quantum smart workforce

Authors: Clarice D. Aiello, D. D. Awschalom, Hannes Bernien, Tina Brower-Thomas, Kenneth R. Brown, Todd A. Brun, Justin R. Caram, Eric Chitambar, Rosa Di Felice, Michael F. J. Fox, Stephan Haas, Alexander W. Holleitner, Eric R. Hudson, Jeffrey H. Hunt, Robert Joynt, Scott Koziol, H. J. Lewandowski, Douglas T. McClure, Jens Palsberg, Gina Passante, Kristen L. Pudenz, Christopher J. K. Richardson, Jessica L. Rosenberg, R. S. Ross, Mark Saffman , et al. (7 additional authors not shown)

Abstract: Interest in building dedicated Quantum Information Science and Engineering (QISE) education programs has greatly expanded in recent years. These programs are inherently convergent, complex, often resource intensive and likely require collaboration with a broad variety of stakeholders. In order to address this combination of challenges, we have captured ideas from many members in the community. Thi… ▽ More Interest in building dedicated Quantum Information Science and Engineering (QISE) education programs has greatly expanded in recent years. These programs are inherently convergent, complex, often resource intensive and likely require collaboration with a broad variety of stakeholders. In order to address this combination of challenges, we have captured ideas from many members in the community. This manuscript not only addresses policy makers and funding agencies (both public and private and from the regional to the international level) but also contains needs identified by industry leaders and discusses the difficulties inherent in creating an inclusive QISE curriculum. We report on the status of eighteen post-secondary education programs in QISE and provide guidance for building new programs. Lastly, we encourage the development of a comprehensive strategic plan for quantum education and workforce development as a means to make the most of the ongoing substantial investments being made in QISE. △ Less

Submitted 23 October, 2020; originally announced October 2020.

Comments: 18 pages, 2 figures, 1 table

Journal ref: Quantum Sci. Technol. 6 030501 (2021)

arXiv:2010.12753 [pdf, other]

Temporal Reasoning on Implicit Events from Distant Supervision

Authors: Ben Zhou, Kyle Richardson, Qiang Ning, Tushar Khot, Ashish Sabharwal, Dan Roth

Abstract: We propose TRACIE, a novel temporal reasoning dataset that evaluates the degree to which systems understand implicit events -- events that are not mentioned explicitly in natural language text but can be inferred from it. This introduces a new challenge in temporal reasoning research, where prior work has focused on explicitly mentioned events. Human readers can infer implicit events via commonsen… ▽ More We propose TRACIE, a novel temporal reasoning dataset that evaluates the degree to which systems understand implicit events -- events that are not mentioned explicitly in natural language text but can be inferred from it. This introduces a new challenge in temporal reasoning research, where prior work has focused on explicitly mentioned events. Human readers can infer implicit events via commonsense reasoning, resulting in a more comprehensive understanding of the situation and, consequently, better reasoning about time. We find, however, that state-of-the-art models struggle when predicting temporal relationships between implicit and explicit events. To address this, we propose a neuro-symbolic temporal reasoning model, SYMTIME, which exploits distant supervision signals from large-scale text and uses temporal rules to combine start times and durations to infer end times. SYMTIME outperforms strong baseline systems on TRACIE by 5%, and by 11% in a zero prior knowledge training setting. Our approach also generalizes to other temporal reasoning tasks, as evidenced by a gain of 1%-9% on MATRES, an explicit event benchmark. △ Less

Submitted 7 May, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

Comments: Accepted at NAACL 2021

arXiv:2010.05444 [pdf, other]

OCNLI: Original Chinese Natural Language Inference

Authors: Hai Hu, Kyle Richardson, Liang Xu, Lu Li, Sandra Kuebler, Lawrence S. Moss

Abstract: Despite the tremendous recent progress on natural language inference (NLI), driven largely by large-scale investment in new datasets (e.g., SNLI, MNLI) and advances in modeling, most progress has been limited to English due to a lack of reliable datasets for most of the world's languages. In this paper, we present the first large-scale NLI dataset (consisting of ~56,000 annotated sentence pairs) f… ▽ More Despite the tremendous recent progress on natural language inference (NLI), driven largely by large-scale investment in new datasets (e.g., SNLI, MNLI) and advances in modeling, most progress has been limited to English due to a lack of reliable datasets for most of the world's languages. In this paper, we present the first large-scale NLI dataset (consisting of ~56,000 annotated sentence pairs) for Chinese called the Original Chinese Natural Language Inference dataset (OCNLI). Unlike recent attempts at extending NLI to other languages, our dataset does not rely on any automatic translation or non-expert annotation. Instead, we elicit annotations from native speakers specializing in linguistics. We follow closely the annotation protocol used for MNLI, but create new strategies for eliciting diverse hypotheses. We establish several baseline results on our dataset using state-of-the-art pre-trained models for Chinese, and find even the best performing models to be far outpaced by human performance (~12% absolute performance gap), making it a challenging new resource that we hope will help to accelerate progress in Chinese NLU. To the best of our knowledge, this is the first human-elicited MNLI-style corpus for a non-English language. △ Less

Submitted 12 October, 2020; originally announced October 2020.

Comments: Findings of EMNLP 2020

arXiv:2009.07185 [pdf, other]

Critical Thinking for Language Models

Authors: Gregor Betz, Christian Voigt, Kyle Richardson

Abstract: This paper takes a first step towards a critical thinking curriculum for neural auto-regressive language models. We introduce a synthetic corpus of deductively valid arguments, and generate artificial argumentative texts to train and evaluate GPT-2. Significant transfer learning effects can be observed: Training a model on three simple core schemes allows it to accurately complete conclusions of d… ▽ More This paper takes a first step towards a critical thinking curriculum for neural auto-regressive language models. We introduce a synthetic corpus of deductively valid arguments, and generate artificial argumentative texts to train and evaluate GPT-2. Significant transfer learning effects can be observed: Training a model on three simple core schemes allows it to accurately complete conclusions of different, and more complex types of arguments, too. The language models generalize the core argument schemes in a correct way. Moreover, we obtain consistent and promising results for NLU benchmarks. In particular, pre-training on the argument schemes raises zero-shot accuracy on the GLUE diagnostics by up to 15 percentage points. The findings suggest that intermediary pre-training on texts that exemplify basic reasoning abilities (such as typically covered in critical thinking textbooks) might help language models to acquire a broad range of reasoning skills. The synthetic argumentative texts presented in this paper are a promising starting point for building such a "critical thinking curriculum for language models." △ Less

Submitted 17 December, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

arXiv:2009.00751 [pdf, other]

Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models

Authors: Tushar Khot, Daniel Khashabi, Kyle Richardson, Peter Clark, Ashish Sabharwal

Abstract: We propose a general framework called Text Modular Networks(TMNs) for building interpretable systems that learn to solve complex tasks by decomposing them into simpler ones solvable by existing models. To ensure solvability of simpler tasks, TMNs learn the textual input-output behavior (i.e., language) of existing models through their datasets. This differs from prior decomposition-based approache… ▽ More We propose a general framework called Text Modular Networks(TMNs) for building interpretable systems that learn to solve complex tasks by decomposing them into simpler ones solvable by existing models. To ensure solvability of simpler tasks, TMNs learn the textual input-output behavior (i.e., language) of existing models through their datasets. This differs from prior decomposition-based approaches which, besides being designed specifically for each complex task, produce decompositions independent of existing sub-models. Specifically, we focus on Question Answering (QA) and show how to train a next-question generator to sequentially produce sub-questions targeting appropriate sub-models, without additional human annotation. These sub-questions and answers provide a faithful natural language explanation of the model's reasoning. We use this framework to build ModularQA, a system that can answer multi-hop reasoning questions by decomposing them into sub-questions answerable by a neural factoid single-span QA model and a symbolic calculator. Our experiments show that ModularQA is more versatile than existing explainable systems for DROP and HotpotQA datasets, is more robust than state-of-the-art blackbox (uninterpretable) systems, and generates more understandable and trustworthy explanations compared to prior work. △ Less

Submitted 12 April, 2021; v1 submitted 1 September, 2020; originally announced September 2020.

Comments: Accepted to NAACL 2021

arXiv:2008.13332 [pdf]

Nonlinear Mid-infrared Metasurface based on a Phase-Change Material

Authors: Fuyong Yue, Riccardo Piccoli, Mikhail Y. Shalaginov, Tian Gu, Kathleen Richardson, Roberto Morandotti, Juejun Hu, Luca Razzari

Abstract: The mid-wave infrared (MWIR) spectral region (3-5 μm) is important to a vast variety of applications in imaging, sensing, spectroscopy, surgery, and optical communications. Efficient third-harmonic generation (THG), converting light from the MWIR range into the near-infrared, a region with mature optical detection and manipulation technologies, offers the opportunity to mitigate a commonly recogni… ▽ More The mid-wave infrared (MWIR) spectral region (3-5 μm) is important to a vast variety of applications in imaging, sensing, spectroscopy, surgery, and optical communications. Efficient third-harmonic generation (THG), converting light from the MWIR range into the near-infrared, a region with mature optical detection and manipulation technologies, offers the opportunity to mitigate a commonly recognized limitation of current MWIR systems. In this work, we present the possibility of boosting THG in the MWIR through a metasurface design. Specifically, we demonstrate a 30-fold enhancement in a highly nonlinear phase change material Ge2Sb2Se4Te1 (GSST), by patterning arrays of subwavelength cylinders supporting a magnetic dipolar resonance. The unprecedented broadband transparency, large refractive index, and remarkably high nonlinear response, together with unique phase-change properties, make GSST-based metasurfaces an appealing solution for reconfigurable and ultra-compact nonlinear devices operating in the MWIR. △ Less

Submitted 30 August, 2020; originally announced August 2020.

Comments: 15 pages, 3 figures

arXiv:2008.06659 [pdf]

doi 10.1038/s41565-021-00881-9

Electrically Reconfigurable Nonvolatile Metasurface Using Low-Loss Optical Phase Change Material

Authors: Yifei Zhang, Clayton Fowler, Junhao Liang, Bilal Azhar, Mikhail Y. Shalaginov, Skylar Deckoff-Jones, Sensong An, Jeffrey B. Chou, Christopher M. Roberts, Vladimir Liberman, Myungkoo Kang, Carlos Ríos, Kathleen A. Richardson, Clara Rivero-Baleine, Tian Gu, Hualiang Zhang, Juejun Hu

Abstract: Active metasurfaces promise reconfigurable optics with drastically improved compactness, ruggedness, manufacturability, and functionality compared to their traditional bulk counterparts. Optical phase change materials (O-PCMs) offer an appealing material solution for active metasurface devices with their large index contrast and nonvolatile switching characteristics. Here we report what we believe… ▽ More Active metasurfaces promise reconfigurable optics with drastically improved compactness, ruggedness, manufacturability, and functionality compared to their traditional bulk counterparts. Optical phase change materials (O-PCMs) offer an appealing material solution for active metasurface devices with their large index contrast and nonvolatile switching characteristics. Here we report what we believe to be the first electrically reconfigurable nonvolatile metasurfaces based on O-PCMs. The O-PCM alloy used in the devices, Ge2Sb2Se4Te1 (GSST), uniquely combines giant non-volatile index modulation capability, broadband low optical loss, and a large reversible switching volume, enabling significantly enhanced light-matter interactions within the active O-PCM medium. Capitalizing on these favorable attributes, we demonstrated continuously tunable active metasurfaces with record half-octave spectral tuning range and large optical contrast of over 400%. We further prototyped a polarization-insensitive phase-gradient metasurface to realize dynamic optical beam steering. △ Less

Submitted 2 September, 2020; v1 submitted 15 August, 2020; originally announced August 2020.

Comments: 12 pages, 5 figures

arXiv:2007.07944 [pdf]

Multi-level Electro-thermal Switching of Optical Phase-Change Materials Using Graphene

Authors: Carlos Ríos, Yifei Zhang, Mikhail Shalaginov, Skylar Deckoff-Jones, Haozhe Wang, Sensong An, Hualiang Zhang, Myungkoo Kang, Kathleen A. Richardson, Christopher Roberts, Jeffrey B. Chou, Vladimir Liberman, Steven A. Vitale, **g Kong, Tian Gu, Juejun Hu

Abstract: Reconfigurable photonic systems featuring minimal power consumption are crucial for integrated optical devices in real-world technology. Current active devices available in foundries, however, use volatile methods to modulate light, requiring a constant supply of power and significant form factors. Essential aspects to overcoming these issues are the development of nonvolatile optical reconfigurat… ▽ More Reconfigurable photonic systems featuring minimal power consumption are crucial for integrated optical devices in real-world technology. Current active devices available in foundries, however, use volatile methods to modulate light, requiring a constant supply of power and significant form factors. Essential aspects to overcoming these issues are the development of nonvolatile optical reconfiguration techniques which are compatible with on-chip integration with different photonic platforms and do not disrupt their optical performances. In this paper, a solution is demonstrated using an optoelectronic framework for nonvolatile tunable photonics that employs undoped-graphene microheaters to thermally and reversibly switch the optical phase-change material Ge$_2$Sb$_2$Se$_4$Te$_1$ (GSST). An in-situ Raman spectroscopy method is utilized to demonstrate, in real-time, reversible switching between four different levels of crystallinity. Moreover, a 3D computational model is developed to precisely interpret the switching characteristics, and to quantify the impact of current saturation on power dissipation, thermal diffusion, and switching speed. This model is used to inform the design of nonvolatile active photonic devices; namely, broadband Si$_3$N$_4$ integrated photonic circuits with small form-factor modulators and reconfigurable metasurfaces displaying 2$π$ phase coverage through neural-network-designed GSST meta-atoms. This framework will enable scalable, low-loss nonvolatile applications across a diverse range of photonics platforms. △ Less

Submitted 15 July, 2020; originally announced July 2020.

Comments: 22 pages, 5 Figures, 2 tables

arXiv:2006.07510 [pdf, ps, other]

Do Dogs have Whiskers? A New Knowledge Base of hasPart Relations

Authors: Sumithra Bhakthavatsalam, Kyle Richardson, Niket Tandon, Peter Clark

Abstract: We present a new knowledge-base of hasPart relationships, extracted from a large corpus of generic statements. Complementary to other resources available, it is the first which is all three of: accurate (90% precision), salient (covers relationships a person may mention), and has high coverage of common terms (approximated as within a 10 year old's vocabulary), as well as having several times more… ▽ More We present a new knowledge-base of hasPart relationships, extracted from a large corpus of generic statements. Complementary to other resources available, it is the first which is all three of: accurate (90% precision), salient (covers relationships a person may mention), and has high coverage of common terms (approximated as within a 10 year old's vocabulary), as well as having several times more hasPart entries than in the popular ontologies ConceptNet and WordNet. In addition, it contains information about quantifiers, argument modifiers, and links the entities to appropriate concepts in Wikipedia and WordNet. The knowledge base is available at https://allenai.org/data/haspartkb △ Less

Submitted 12 June, 2020; originally announced June 2020.

arXiv:2005.13359 [pdf, other]

NDD20: A large-scale few-shot dolphin dataset for coarse and fine-grained categorisation

Authors: Cameron Trotter, Georgia Atkinson, Matt Sharpe, Kirsten Richardson, A. Stephen McGough, Nick Wright, Ben Burville, Per Berggren

Abstract: We introduce the Northumberland Dolphin Dataset 2020 (NDD20), a challenging image dataset annotated for both coarse and fine-grained instance segmentation and categorisation. This dataset, the first release of the NDD, was created in response to the rapid expansion of computer vision into conservation research and the production of field-deployable systems suited to extreme environmental condition… ▽ More We introduce the Northumberland Dolphin Dataset 2020 (NDD20), a challenging image dataset annotated for both coarse and fine-grained instance segmentation and categorisation. This dataset, the first release of the NDD, was created in response to the rapid expansion of computer vision into conservation research and the production of field-deployable systems suited to extreme environmental conditions -- an area with few open source datasets. NDD20 contains a large collection of above and below water images of two different dolphin species for traditional coarse and fine-grained segmentation. All data contained in NDD20 was obtained via manual collection in the North Sea around the Northumberland coastline, UK. We present experimentation using standard deep learning network architecture trained using NDD20 and report baselines results. △ Less

Submitted 27 May, 2020; originally announced May 2020.

Comments: 5 pages, 6 figures, download link, submitted to FGVC7 Workshop @ CVPR20

arXiv:2005.12058 [pdf]

doi 10.1103/PhysRevB.102.064431

Extremely well isolated 2D spin-$1/2$ antiferromagnetic Heisenberg layers with small exchange coupling in the molecular-based magnet CuPOF

Authors: D. Opherden, N. Nizar, K. Richardson, J. C. Monroe, M. M. Turnbull, M. Polson, S. Vela, W. J. A. Blackmore, P. A. Goddard, J. Singleton, E. S. Choi, F. Xiao, R. C. Williams, T. Lancaster, F. L. Pratt, S. J. Blundell, Y. Skourski, M. Uhlarz, A. N. Ponomaryov, S. A. Zvyagin, J. Wosnitza, M. Baenitz, I. Heinmaa, R. Stern, H. Kühne , et al. (1 additional authors not shown)

Abstract: We report on a comprehensive characterization of the newly synthesized Cu$^{2+}$-based molecular magnet [Cu(pz)$_2$(2-HOpy)$_2$](PF$_6$)$_2$ (CuPOF), where pz = C$_4$H$_4$N$_2$ and 2-HOpy = C$_5$H$_4$NHO. From a comparison of theoretical modeling to results of bulk magnetometry, specific heat, $μ^+$SR, ESR, and NMR spectroscopy, this material is determined as an excellent realization of the 2D squ… ▽ More We report on a comprehensive characterization of the newly synthesized Cu$^{2+}$-based molecular magnet [Cu(pz)$_2$(2-HOpy)$_2$](PF$_6$)$_2$ (CuPOF), where pz = C$_4$H$_4$N$_2$ and 2-HOpy = C$_5$H$_4$NHO. From a comparison of theoretical modeling to results of bulk magnetometry, specific heat, $μ^+$SR, ESR, and NMR spectroscopy, this material is determined as an excellent realization of the 2D square-lattice $S=1/2$ antiferromagnetic Heisenberg model with a moderate intraplane nearest-neighbor exchange coupling of $J/k_\mathrm{B} = 6.80(5)$ K, and an extremely small interlayer interaction of about 1 mK. At zero field, the bulk magnetometry reveals a temperature-driven crossover of spin correlations from isotropic to $XY$ type, caused by the presence of a weak intrinsic easy-plane anisotropy. A transition to long-range order, driven by the low-temperature $XY$ anisotropy under the influence of the interlayer coupling, occurs at $T_\mathrm{N} = 1.38(2)$ K, as revealed by $μ^+$SR. In applied magnetic fields, our $^1$H-NMR data reveal a strong increase of the magnetic anisotropy, manifested by a pronounced enhancement of the transition temperature to commensurate long-range order at $T_\mathrm{N} =2.8$ K and 7 T. △ Less

Submitted 1 September, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

Comments: 14 pages, 8 figures, as well as 10 pages and 18 figures of supplemental material

Journal ref: Phys. Rev. B 102, 064431 (2020)

arXiv:2004.14623 [pdf, ps, other]

Neural Natural Language Inference Models Partially Embed Theories of Lexical Entailment and Negation

Authors: Atticus Geiger, Kyle Richardson, Christopher Potts

Abstract: We address whether neural models for Natural Language Inference (NLI) can learn the compositional interactions between lexical entailment and negation, using four methods: the behavioral evaluation methods of (1) challenge test sets and (2) systematic generalization tasks, and the structural evaluation methods of (3) probes and (4) interventions. To facilitate this holistic evaluation, we present… ▽ More We address whether neural models for Natural Language Inference (NLI) can learn the compositional interactions between lexical entailment and negation, using four methods: the behavioral evaluation methods of (1) challenge test sets and (2) systematic generalization tasks, and the structural evaluation methods of (3) probes and (4) interventions. To facilitate this holistic evaluation, we present Monotonicity NLI (MoNLI), a new naturalistic dataset focused on lexical entailment and negation. In our behavioral evaluations, we find that models trained on general-purpose NLI datasets fail systematically on MoNLI examples containing negation, but that MoNLI fine-tuning addresses this failure. In our structural evaluations, we look for evidence that our top-performing BERT-based model has learned to implement the monotonicity algorithm behind MoNLI. Probes yield evidence consistent with this conclusion, and our intervention experiments bolster this, showing that the causal dynamics of the model mirror the causal dynamics of this algorithm on subsets of MoNLI. This suggests that the BERT model at least partially embeds a theory of lexical entailment and negation at an algorithmic level. △ Less

Submitted 20 November, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

Comments: In Proceedings of BlackBoxNLP 2020 at EMNLP 2020

arXiv:2004.05986 [pdf, other]

CLUE: A Chinese Language Understanding Evaluation Benchmark

Authors: Liang Xu, Hai Hu, Xuanwei Zhang, Lu Li, Chenjie Cao, Yudong Li, Yechen Xu, Kai Sun, Dian Yu, Cong Yu, Yin Tian, Qianqian Dong, Weitang Liu, Bo Shi, Yiming Cui, Junyi Li, Jun Zeng, Rongzhao Wang, Weijian Xie, Yanting Li, Yina Patterson, Zuoyu Tian, Yiwen Zhang, He Zhou, Shaoweihua Liu , et al. (7 additional authors not shown)

Abstract: The advent of natural language understanding (NLU) benchmarks for English, such as GLUE and SuperGLUE allows new NLU models to be evaluated across a diverse set of tasks. These comprehensive benchmarks have facilitated a broad range of research and applications in natural language processing (NLP). The problem, however, is that most such benchmarks are limited to English, which has made it difficu… ▽ More The advent of natural language understanding (NLU) benchmarks for English, such as GLUE and SuperGLUE allows new NLU models to be evaluated across a diverse set of tasks. These comprehensive benchmarks have facilitated a broad range of research and applications in natural language processing (NLP). The problem, however, is that most such benchmarks are limited to English, which has made it difficult to replicate many of the successes in English NLU for other languages. To help remedy this issue, we introduce the first large-scale Chinese Language Understanding Evaluation (CLUE) benchmark. CLUE is an open-ended, community-driven project that brings together 9 tasks spanning several well-established single-sentence/sentence-pair classification tasks, as well as machine reading comprehension, all on original Chinese text. To establish results on these tasks, we report scores using an exhaustive set of current state-of-the-art pre-trained Chinese models (9 in total). We also introduce a number of supplementary datasets and additional tools to help facilitate further progress on Chinese NLU. Our benchmark is released at https://www.CLUEbenchmarks.com △ Less

Submitted 5 November, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

Comments: Accepted by COLING2020; 10 pages, 4 figures

arXiv:2003.02704 [pdf]

doi 10.1103/PhysRevPhysEducRes.17.010101

Positive attitudinal shifts and a narrowing gender gap: Do expertlike attitudes correlate to higher learning gains for women in the physics classroom?

Authors: Alma Robinson, John H. Simonetti, Kasey Richardson, Megan Wawro

Abstract: A large body of research shows that using interactive engagement pedagogy in the introductory physics classroom consistently results in significant student learning gains; however, with a few exceptions, those learning gains tend not to be accompanied by more expertlike attitudes and beliefs about physics and learning physics. In fact, in both traditionally taught and active learning classroom env… ▽ More A large body of research shows that using interactive engagement pedagogy in the introductory physics classroom consistently results in significant student learning gains; however, with a few exceptions, those learning gains tend not to be accompanied by more expertlike attitudes and beliefs about physics and learning physics. In fact, in both traditionally taught and active learning classroom environments, students often become more novicelike in their attitudes and beliefs following a semester of instruction. Further, prior to instruction, men typically score higher than women on conceptual inventories, such as the Force Concept Inventory (FCI), and more expertlike on attitudinal surveys, such as the Colorado Learning Attitudes about Science Survey (CLASS), and those gender gaps generally persist following instruction. In this paper, we analyze three years of pre-post matched data for physics majors at Virginia Tech on the FCI and the CLASS. The courses were taught using a blended pedagogical model of peer instruction, group problem solving, and direct instruction, along with an explicit focus on the importance of conceptual understanding and a growth mindset. We found that the FCI gender gap decreased, and both men and women showed positive, expertlike shifts on the CLASS. Perhaps most surprisingly, we found a meaningful correlation between a student's post- CLASS score and normalized FCI gain for women, but not for men. △ Less

Submitted 13 January, 2021; v1 submitted 5 March, 2020; originally announced March 2020.

Comments: 10 pages, 2 figures, 8 tables, will submit to Phys. Rev. PER

Journal ref: Phys. Rev. Phys. Educ. Res. 17, 010101 (2021)

arXiv:2002.05867 [pdf, other]

Transformers as Soft Reasoners over Language

Authors: Peter Clark, Oyvind Tafjord, Kyle Richardson

Abstract: Beginning with McCarthy's Advice Taker (1959), AI has pursued the goal of providing a system with explicit, general knowledge and having the system reason over that knowledge. However, expressing the knowledge in a formal (logical or probabilistic) representation has been a major obstacle to this research. This paper investigates a modern approach to this problem where the facts and rules are prov… ▽ More Beginning with McCarthy's Advice Taker (1959), AI has pursued the goal of providing a system with explicit, general knowledge and having the system reason over that knowledge. However, expressing the knowledge in a formal (logical or probabilistic) representation has been a major obstacle to this research. This paper investigates a modern approach to this problem where the facts and rules are provided as natural language sentences, thus bypassing a formal representation. We train transformers to reason (or emulate reasoning) over these sentences using synthetically generated data. Our models, that we call RuleTakers, provide the first empirical demonstration that this kind of soft reasoning over language is learnable, can achieve high (99%) accuracy, and generalizes to test data requiring substantially deeper chaining than seen during training (95%+ scores). We also demonstrate that the models transfer well to two hand-authored rulebases, and to rulebases paraphrased into more natural language. These findings are significant as it suggests a new role for transformers, namely as limited "soft theorem provers" operating over explicit theories in language. This in turn suggests new possibilities for explainability, correctability, and counterfactual reasoning in question-answering. △ Less

Submitted 5 May, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

Comments: IJCAI 2020

arXiv:2001.00121 [pdf]

A Freeform Dielectric Metasurface Modeling Approach Based on Deep Neural Networks

Authors: Sensong An, Bowen Zheng, Mikhail Y. Shalaginov, Hong Tang, Hang Li, Li Zhou, Jun Ding, Anuradha Murthy Agarwal, Clara Rivero-Baleine, Myungkoo Kang, Kathleen A. Richardson, Tian Gu, Juejun Hu, Clayton Fowler, Hualiang Zhang

Abstract: Metasurfaces have shown promising potentials in sha** optical wavefronts while remaining compact compared to bulky geometric optics devices. Design of meta-atoms, the fundamental building blocks of metasurfaces, relies on trial-and-error method to achieve target electromagnetic responses. This process includes the characterization of an enormous amount of different meta-atom designs with differe… ▽ More Metasurfaces have shown promising potentials in sha** optical wavefronts while remaining compact compared to bulky geometric optics devices. Design of meta-atoms, the fundamental building blocks of metasurfaces, relies on trial-and-error method to achieve target electromagnetic responses. This process includes the characterization of an enormous amount of different meta-atom designs with different physical and geometric parameters, which normally demands huge computational resources. In this paper, a deep learning-based metasurface/meta-atom modeling approach is introduced to significantly reduce the characterization time while maintaining accuracy. Based on a convolutional neural network (CNN) structure, the proposed deep learning network is able to model meta-atoms with free-form 2D patterns and different lattice sizes, material refractive indexes and thicknesses. Moreover, the presented approach features the capability to predict meta-atoms' wide spectrum responses in the timescale of milliseconds, which makes it attractive for applications such as fast meta-atom/metasurface on-demand designs and optimizations. △ Less

Submitted 31 December, 2019; originally announced January 2020.

arXiv:1912.13337 [pdf, other]

What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge

Authors: Kyle Richardson, Ashish Sabharwal

Abstract: Open-domain question answering (QA) is known to involve several underlying knowledge and reasoning challenges, but are models actually learning such knowledge when trained on benchmark tasks? To investigate this, we introduce several new challenge tasks that probe whether state-of-the-art QA models have general knowledge about word definitions and general taxonomic reasoning, both of which are fun… ▽ More Open-domain question answering (QA) is known to involve several underlying knowledge and reasoning challenges, but are models actually learning such knowledge when trained on benchmark tasks? To investigate this, we introduce several new challenge tasks that probe whether state-of-the-art QA models have general knowledge about word definitions and general taxonomic reasoning, both of which are fundamental to more complex forms of reasoning and are widespread in benchmark datasets. As an alternative to expensive crowd-sourcing, we introduce a methodology for automatically building datasets from various types of expert knowledge (e.g., knowledge graphs and lexical taxonomies), allowing for systematic control over the resulting probes and for a more comprehensive evaluation. We find automatically constructing probes to be vulnerable to annotation artifacts, which we carefully control for. Our evaluation confirms that transformer-based QA models are already predisposed to recognize certain types of structural lexical knowledge. However, it also reveals a more nuanced picture: their performance degrades substantially with even a slight increase in the number of hops in the underlying taxonomic hierarchy, or as more challenging distractor candidate answers are introduced. Further, even when these models succeed at the standard instance-level evaluation, they leave much room for improvement when assessed at the level of clusters of semantically connected probes (e.g., all Isa questions about a concept). △ Less

Submitted 1 September, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

Comments: TACL 2020

arXiv:1911.12970 [pdf]

doi 10.1038/s41467-021-21440-9

Reconfigurable all-dielectric metalens with diffraction limited performance

Authors: Mikhail Y. Shalaginov, Sensong An, Yifei Zhang, Fan Yang, Peter Su, Vladimir Liberman, Jeffrey B. Chou, Christopher M. Roberts, Myungkoo Kang, Carlos Rios, Qingyang Du, Clayton Fowler, Anuradha Agarwal, Kathleen Richardson, Clara Rivero-Baleine, Hualiang Zhang, Juejun Hu, Tian Gu

Abstract: Active metasurfaces, whose optical properties can be modulated post-fabrication, have emerged as an intensively explored field in recent years. The efforts to date, however, still face major performance limitations in tuning range, optical quality, and efficiency especially for non mechanical actuation mechanisms. In this paper, we introduce an active metasurface platform combining phase tuning co… ▽ More Active metasurfaces, whose optical properties can be modulated post-fabrication, have emerged as an intensively explored field in recent years. The efforts to date, however, still face major performance limitations in tuning range, optical quality, and efficiency especially for non mechanical actuation mechanisms. In this paper, we introduce an active metasurface platform combining phase tuning covering the full 2$π$ range and diffraction-limited performance using an all-dielectric, low-loss architecture based on optical phase change materials (O-PCMs). We present a generic design principle enabling switching of metasurfaces between two arbitrary phase profiles and propose a new figure-of-merit (FOM) tailored for active meta-optics. We implement the approach to realize a high-performance varifocal metalens operating at 5.2 $μ$m wavelength. The metalens is constructed using Ge2Sb2Se4Te1 (GSST), an O-PCM with a large refractive index contrast ($Δ$ n > 1) and unique broadband low-loss characteristics in both amorphous and crystalline states. The reconfigurable metalens features focusing efficiencies above 20% at both states for linearly polarized light and a record large switching contrast ratio of 29.5 dB. We further validated aberration-free imaging using the metalens at both optical states, which represents the first experimental demonstration of a non-mechanical active metalens with diffraction-limited performance. △ Less

Submitted 10 December, 2019; v1 submitted 29 November, 2019; originally announced November 2019.

arXiv:1910.08772 [pdf, ps, other]

MonaLog: a Lightweight System for Natural Language Inference Based on Monotonicity

Authors: Hai Hu, Qi Chen, Kyle Richardson, Atreyee Mukherjee, Lawrence S. Moss, Sandra Kuebler

Abstract: We present a new logic-based inference engine for natural language inference (NLI) called MonaLog, which is based on natural logic and the monotonicity calculus. In contrast to existing logic-based approaches, our system is intentionally designed to be as lightweight as possible, and operates using a small set of well-known (surface-level) monotonicity facts about quantifiers, lexical items and to… ▽ More We present a new logic-based inference engine for natural language inference (NLI) called MonaLog, which is based on natural logic and the monotonicity calculus. In contrast to existing logic-based approaches, our system is intentionally designed to be as lightweight as possible, and operates using a small set of well-known (surface-level) monotonicity facts about quantifiers, lexical items and tokenlevel polarity information. Despite its simplicity, we find our approach to be competitive with other logic-based NLI models on the SICK benchmark. We also use MonaLog in combination with the current state-of-the-art model BERT in a variety of settings, including for compositional data augmentation. We show that MonaLog is capable of generating large amounts of high-quality training data for BERT, improving its accuracy on SICK. △ Less

Submitted 19 October, 2019; originally announced October 2019.

Comments: accepted to SCIL 2020

arXiv:1909.07521 [pdf, other]

Probing Natural Language Inference Models through Semantic Fragments

Authors: Kyle Richardson, Hai Hu, Lawrence S. Moss, Ashish Sabharwal

Abstract: Do state-of-the-art models for language understanding already have, or can they easily learn, abilities such as boolean coordination, quantification, conditionals, comparatives, and monotonicity reasoning (i.e., reasoning about word substitutions in sentential contexts)? While such phenomena are involved in natural language inference (NLI) and go beyond basic linguistic understanding, it is unclea… ▽ More Do state-of-the-art models for language understanding already have, or can they easily learn, abilities such as boolean coordination, quantification, conditionals, comparatives, and monotonicity reasoning (i.e., reasoning about word substitutions in sentential contexts)? While such phenomena are involved in natural language inference (NLI) and go beyond basic linguistic understanding, it is unclear the extent to which they are captured in existing NLI benchmarks and effectively learned by models. To investigate this, we propose the use of semantic fragments---systematically generated datasets that each target a different semantic phenomenon---for probing, and efficiently improving, such capabilities of linguistic models. This approach to creating challenge datasets allows direct control over the semantic diversity and complexity of the targeted linguistic phenomena, and results in a more precise characterization of a model's linguistic behavior. Our experiments, using a library of 8 such semantic fragments, reveal two remarkable findings: (a) State-of-the-art models, including BERT, that are pre-trained on existing NLI benchmark datasets perform poorly on these new fragments, even though the phenomena probed here are central to the NLI task. (b) On the other hand, with only a few minutes of additional fine-tuning---with a carefully selected learning rate and a novel variation of "inoculation"---a BERT-based model can master all of these logic and monotonicity fragments while retaining its performance on established NLI benchmarks. △ Less

Submitted 1 December, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

Comments: AAAI camera-ready version

arXiv:1909.01958 [pdf, other]

From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project

Authors: Peter Clark, Oren Etzioni, Daniel Khashabi, Tushar Khot, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord, Niket Tandon, Sumithra Bhakthavatsalam, Dirk Groeneveld, Michal Guerquin, Michael Schmitz

Abstract: AI has achieved remarkable mastery over games such as Chess, Go, and Poker, and even Jeopardy, but the rich variety of standardized exams has remained a landmark challenge. Even in 2016, the best AI system achieved merely 59.3% on an 8th Grade science exam challenge. This paper reports unprecedented success on the Grade 8 New York Regents Science Exam, where for the first time a system scores more… ▽ More AI has achieved remarkable mastery over games such as Chess, Go, and Poker, and even Jeopardy, but the rich variety of standardized exams has remained a landmark challenge. Even in 2016, the best AI system achieved merely 59.3% on an 8th Grade science exam challenge. This paper reports unprecedented success on the Grade 8 New York Regents Science Exam, where for the first time a system scores more than 90% on the exam's non-diagram, multiple choice (NDMC) questions. In addition, our Aristo system, building upon the success of recent language models, exceeded 83% on the corresponding Grade 12 Science Exam NDMC questions. The results, on unseen test questions, are robust across different test years and different variations of this kind of test. They demonstrate that modern NLP methods can result in mastery on this task. While not a full solution to general question-answering (the questions are multiple choice, and the domain is restricted to 8th Grade science), it represents a significant milestone for the field. △ Less

Submitted 1 February, 2021; v1 submitted 4 September, 2019; originally announced September 2019.

Comments: AI Magazine 41 (4) Winter 2020. New analysis sections added

arXiv:1908.05165 [pdf, ps, other]

Some remarks on equivariant elliptic operators and their invariants

Authors: Jochen Brüning, Ken Richardson

Abstract: In this expository article, we consider first order elliptic differential operators acting on smooth vector bundles over compact manifolds, and certain invariants derived from the analysis of these operators, namely the eta invariant} and the equivariant index. Many researchers have previously considered these invariants before. What makes this work different is that we are evaluating integer-valu… ▽ More In this expository article, we consider first order elliptic differential operators acting on smooth vector bundles over compact manifolds, and certain invariants derived from the analysis of these operators, namely the eta invariant} and the equivariant index. Many researchers have previously considered these invariants before. What makes this work different is that we are evaluating integer-valued indices corresponding to multiplicities of group representations, and our eta invariant is a number dependent on the entire group at once. Moreover, the techniques of proof and formulas obtained are new and depend on equivariant heat asymptotics that may involve logarithmic terms. For simplicity, we consider only elliptic differential operators, even though the proofs outlined apply to transversally elliptic operators. In every case, we outline the well-known proofs and theorems without Lie group actions first and then show how these same ideas can be applied in the equivariant cases with appropriate modifications. A more detailed and expanded article that applies to transversally elliptic operators will appear in due time. △ Less

Submitted 14 August, 2019; originally announced August 2019.

Comments: 15 pages, to appear in "Differential Equations on Manifolds and Mathematical Physics," published by Birkäuser, Editors: V. Manuilov, A.S. Mishenko, V.E. Nazaikinskii, B.-W. Schulze, W. Zhang

arXiv:1906.03746 [pdf, ps, other]

New cohomological invariants of foliations

Authors: Georges Habib, Ken Richardson

Abstract: Given a smooth foliation on a closed manifold, basic forms are differential forms that can be expressed locally in terms of the transverse variables. The space of basic forms yields a differential complex, because the exterior derivative fixes this set. The basic cohomology is the cohomology of this complex, and this has been studied extensively. Given a Riemannian metric, the adjoint of the exter… ▽ More Given a smooth foliation on a closed manifold, basic forms are differential forms that can be expressed locally in terms of the transverse variables. The space of basic forms yields a differential complex, because the exterior derivative fixes this set. The basic cohomology is the cohomology of this complex, and this has been studied extensively. Given a Riemannian metric, the adjoint of the exterior derivative maps the orthogonal complement of the basic forms to itself, and we call the resulting cohomology the "antibasic cohomology". Although these groups are defined using the metric, the dimensions of the antibasic cohomology groups are invariant under diffeomorphism and metric changes. If the underlying foliation is Riemannian, the groups are foliated homotopy invariants that are independent of basic cohomology and ordinary cohomology of the manifold. For this class of foliations we use the codifferential on antibasic forms to obtain the corresponding Laplace operator, develop its analytic properties, and prove a Hodge theorem. We then find some topological and geometric properties that impose restrictions on the antibasic Betti numbers. △ Less

Submitted 21 May, 2024; v1 submitted 9 June, 2019; originally announced June 2019.

Comments: 29 pages, a few minor corrections were made

MSC Class: 57R30; 53C12; 58A14

arXiv:1906.03387 [pdf]

A Novel Modeling Approach for All-Dielectric Metasurfaces Using Deep Neural Networks

Authors: Sensong An, Clayton Fowler, Bowen Zheng, Mikhail Y. Shalaginov, Hong Tang, Hang Li, Li Zhou, Jun Ding, Anuradha Murthy Agarwal, Clara Rivero-Baleine, Kathleen A. Richardson, Tian Gu, Juejun Hu, Hualiang Zhang

Abstract: Metasurfaces have become a promising means for manipulating optical wavefronts in flat and high-performance optical devices. Conventional metasurface device design relies on trial-and-error methods to obtain target electromagnetic (EM) response, an approach that demands significant efforts to investigate the enormous number of possible meta-atom structures. In this paper, a deep neural network app… ▽ More Metasurfaces have become a promising means for manipulating optical wavefronts in flat and high-performance optical devices. Conventional metasurface device design relies on trial-and-error methods to obtain target electromagnetic (EM) response, an approach that demands significant efforts to investigate the enormous number of possible meta-atom structures. In this paper, a deep neural network approach is introduced that significantly improves on both speed and accuracy compared to techniques currently used to assemble metasurface-based devices. Our neural network approach overcomes three key challenges that have limited previous neural-network-based design schemes: input/output vector dimensional mismatch, accurate EM-wave phase prediction, as well as adaptation to 3-D dielectric structures, and can be generically applied to a wide variety of metasurface device designs across the entire electromagnetic spectrum. Using this new methodology, examples of neural networks capable of producing on-demand designs for meta-atoms, metasurface filters, and phase-change reconfigurable metasurfaces are demonstrated. △ Less

Submitted 8 June, 2019; originally announced June 2019.

Comments: 18 pages, 8 figures

arXiv:1902.05150 [pdf]

doi 10.1063/1.5089907

A fiber-integrated single photon source emitting at telecom wavelengths

Authors: Chang-Min Lee, Mustafa Atabey Buyukkaya, Shahriar Aghaeimeibodi, Christopher J. K. Richardson, Edo Waks

Abstract: Fiber-coupled single photon sources are essential components of photonics-based quantum information processors. Most fiber-coupled single photon sources require careful alignment between fibers and quantum emitters. In this work, we present an alignment-free fiber-integrated single photon source based on an InAs/InP quantum dot emitting at telecom wavelengths. We designed a nanobeam containing the… ▽ More Fiber-coupled single photon sources are essential components of photonics-based quantum information processors. Most fiber-coupled single photon sources require careful alignment between fibers and quantum emitters. In this work, we present an alignment-free fiber-integrated single photon source based on an InAs/InP quantum dot emitting at telecom wavelengths. We designed a nanobeam containing the quantum dots attached to a fiber taper. The adiabatic tapered coupler of the nanobeam enables efficient light coupling to the fiber taper. Using a tungsten probe in a focused ion beam system, we transferred the nanobeam to the fiber taper. The observed fiber-coupled single photon emission occurs with a brightness of 1.5% and purity of 86%. This device provides a building block for fiber-optic quantum circuits that have various applications, such as quantum communication and distributed quantum computing. △ Less

Submitted 13 February, 2019; originally announced February 2019.

arXiv:1901.11036 [pdf, ps, other]

doi 10.1093/mnras/stz376

Black hole scaling relations of active and quiescent galaxies: Addressing selection effects and constraining virial factors

Authors: Francesco Shankar, Mariangela Bernardi, Kayleigh Richardson, Christopher Marsden, Ravi K. Sheth, Viola Allevato, Luca Graziani, Mar Mezcua, Federica Ricci, Samantha J. Penny, Fabio La Franca, Fabio Pacucci

Abstract: Local samples of quiescent galaxies with dynamically measured black hole masses (Mbh) may suffer from an angular resolution-related selection effect, which could bias the observed scaling relations between Mbh and host galaxy properties away from the intrinsic relations. In particular, previous work has shown that the observed Mbh-Mstar (stellar mass) relation is more strongly biased than the Mbh-… ▽ More Local samples of quiescent galaxies with dynamically measured black hole masses (Mbh) may suffer from an angular resolution-related selection effect, which could bias the observed scaling relations between Mbh and host galaxy properties away from the intrinsic relations. In particular, previous work has shown that the observed Mbh-Mstar (stellar mass) relation is more strongly biased than the Mbh-sigma (velocity dispersion) relation. Local samples of active galactic nuclei (AGN) do not suffer from this selection effect, as in these samples Mbh is estimated from megamasers and/or reverberation map**-based techniques. With the exception of megamasers, Mbh-estimates in these AGN samples are proportional to a virial coefficient fvir. Direct modelling of the broad line region suggests that fvir~3.5. However, this results in a Mbh-Mstar relation for AGN which lies below and is steeper than the one observed for quiescent black hole samples. A similar though milder trend is seen for the Mbh-sigma relation. Matching the high-mass end of the Mbh-Mstar and Mbh-sigma relations observed in quiescent samples requires fvir~15 and fvir~7, respectively. On the other hand, fvir~3.5 yields Mbh-sigma and Mbh-Mstar relations for AGN which are remarkably consistent with the expected `intrinsic' correlations for quiescent samples (i.e., once account has been made of the angular resolution-related selection effect), providing additional evidence that the sample of local quiescent black holes is biased. We also show that, as is the case for quiescent black holes, the Mbh-Mstar scaling relation of AGN is driven by velocity dispersion, thus providing additional key constraints to black hole-galaxy co-evolution models. △ Less

Submitted 30 January, 2019; originally announced January 2019.

Comments: 15 pages, 5 Figures. MNRAS, accepted

arXiv:1812.01616 [pdf]

doi 10.1063/1.5082560

Large Stark Tuning of InAs/InP Quantum Dots

Authors: Shahriar Aghaeimeibodi, Chang-Min Lee, Mustafa Atabey Buyukkaya, Christopher J. K. Richardson, Edo Waks

Abstract: InAs/InP quantum dots are excellent sources of telecom single-photon emission and are among the most promising candidates for scalable quantum photonic circuits. However, geometric differences in each quantum dot leads to slightly different emission wavelengths and hinders the possibility of generating multiple identical quantum emitters on the same chip. Stark tuning is an efficient technique to… ▽ More InAs/InP quantum dots are excellent sources of telecom single-photon emission and are among the most promising candidates for scalable quantum photonic circuits. However, geometric differences in each quantum dot leads to slightly different emission wavelengths and hinders the possibility of generating multiple identical quantum emitters on the same chip. Stark tuning is an efficient technique to overcome this issue as it can control the emission energy of individual quantum dots through the quantum-confined Stark effect. Realizing this technique in InAs/InP quantum dots has previously been limited to shifts of less than 0.8 meV due to jumps in the emission energy because of additional charges at high electric field intensities. We demonstrate up to 5.1 meV of Stark tuning in the emission wavelength of InAs/InP quantum dots. To eliminate undesirable jumps to charged state, we use a thin oxide insulator to prevent carrier injection from the contacts, thereby significantly improves the tuning range of the Stark effect. Moreover, the single-photon nature and narrow linewidth of the quantum dot emission is preserved under a wide range of applied electric fields. Using photoluminescence intensity measurements and time-resolved lifetime spectroscopy we confirmed that this Stark tuning range is limited by carrier tunneling at high electric fields. This result is an important step toward integrating multiple identical quantum emitters at telecom wavelengths on-a-chip, which is crucial for realizing complex quantum photonic circuits for quantum information processing. △ Less

Submitted 4 December, 2018; originally announced December 2018.

Journal ref: Appl. Phys. Lett. 114, 071105 (2019)

arXiv:1811.00526 [pdf]

doi 10.1038/s41467-019-12196-4

Extreme Broadband Transparent Optical Phase Change Materials for High-Performance Nonvolatile Photonics

Authors: Yifei Zhang, Jeffrey B. Chou, Junying Li, Huashan Li, Qingyang Du, Anupama Yadav, Si Zhou, Mikhail Y. Shalaginov, Zhuoran Fang, Huikai Zhong, Christopher Roberts, Paul Robinson, Bridget Bohlin, Carlos Ríos, Hongtao Lin, Myungkoo Kang, Tian Gu, Jamie Warner, Vladimir Liberman, Kathleen Richardson, Juejun Hu

Abstract: Optical phase change materials (O-PCMs), a unique group of materials featuring drastic optical property contrast upon solid-state phase transition, have found widespread adoption in photonic switches and routers, reconfigurable meta-optics, reflective display, and optical neuromorphic computers. Current phase change materials, such as Ge-Sb-Te (GST), exhibit large contrast of both refractive index… ▽ More Optical phase change materials (O-PCMs), a unique group of materials featuring drastic optical property contrast upon solid-state phase transition, have found widespread adoption in photonic switches and routers, reconfigurable meta-optics, reflective display, and optical neuromorphic computers. Current phase change materials, such as Ge-Sb-Te (GST), exhibit large contrast of both refractive index (delta n) and optical loss (delta k), simultaneously. The coupling of both optical properties fundamentally limits the function and performance of many potential applications. In this article, we introduce a new class of O-PCMs, Ge-Sb-Se-Te (GSST) which breaks this traditional coupling, as demonstrated with an optical figure of merit improvement of more than two orders of magnitude. The first-principle computationally optimized alloy, Ge2Sb2Se4Te1, combines broadband low optical loss (1-18.5 micron), large optical contrast (delta n = 2.0), and significantly improved glass forming ability, enabling an entirely new field of infrared and thermal photonic devices. We further leverage the material to demonstrate nonvolatile integrated optical switches with record low loss and large contrast ratio, as well as an electrically addressed, microsecond switched pixel level spatial light modulator, thereby validating its promise as a platform material for scalable nonvolatile photonics. △ Less

Submitted 5 November, 2018; v1 submitted 1 November, 2018; originally announced November 2018.

Comments: 16 pages, 6 figures

arXiv:1810.05701 [pdf]

doi 10.1063/1.5054865

Integration of Quantum Emitters with Lithium Niobate Photonics

Authors: Shahriar Aghaeimeibodi, Boris Desiatov, Je-Hyung Kim, Chang-Min Lee, Mustafa Atabey Buyukkaya, Aziz Karasahin, Christopher J. K. Richardson, Richard P. Leavitt, Marko Lončar, Edo Waks

Abstract: The integration of quantum emitters with integrated photonics enables complex quantum photonic circuits that are necessary for photonic implementation of quantum simulators, computers, and networks. Thin-film lithium niobate is an ideal material substrate for quantum photonics because it can tightly confine light in small waveguides and has a strong electro-optic effect that can switch and modulat… ▽ More The integration of quantum emitters with integrated photonics enables complex quantum photonic circuits that are necessary for photonic implementation of quantum simulators, computers, and networks. Thin-film lithium niobate is an ideal material substrate for quantum photonics because it can tightly confine light in small waveguides and has a strong electro-optic effect that can switch and modulate single photons at low power and high speed. However, lithium niobite lacks efficient single-photon emitters, which are essential for scalable quantum photonic circuits. We demonstrate deterministic coupling of single-photon emitters with a lithium niobate photonic chip. The emitters are composed of InAs quantum dots embedded in an InP nanobeam, which we transfer to a lithium niobate waveguide with nanoscale accuracy using a pick-and place approach. An adiabatic taper transfers single photons emitted into the nanobeam to the lithium niobate waveguide with high efficiency. We verify the single photon nature of the emission using photon correlation measurements performed with an on-chip beamsplitter. Our results demonstrate an important step toward fast, reconfigurable quantum photonic circuits for quantum information processing. △ Less

Submitted 12 October, 2018; originally announced October 2018.

Journal ref: Appl. Phys. Lett. 113, 221102 (2018)

arXiv:1808.10289 [pdf, ps, other]

doi 10.25537/dm.2019v24.995-1031

The mean curvature of transverse Kähler foliations

Authors: Seoung Dal Jung, Ken Richardson

Abstract: We study properties of the mean curvature one-form and its holomorphic and antiholomorphic cousins on a transverse Kähler foliation. If the mean curvature of the foliation is automorphic, then there are some restrictions on basic cohomology similar to that on Kähler manifolds, such as the requirement that the odd basic Betti numbers must be even. However, the full Hodge diamond structure does not… ▽ More We study properties of the mean curvature one-form and its holomorphic and antiholomorphic cousins on a transverse Kähler foliation. If the mean curvature of the foliation is automorphic, then there are some restrictions on basic cohomology similar to that on Kähler manifolds, such as the requirement that the odd basic Betti numbers must be even. However, the full Hodge diamond structure does not apply to basic Dolbeault cohomology unless the foliation is taut. △ Less

Submitted 22 October, 2018; v1 submitted 29 August, 2018; originally announced August 2018.

Comments: 30 pages, revised notation section and historical information so that there is not much overlap with previous paper

MSC Class: 53C12 (Primary) 53C21; 53C55; 57R30; 58J50 (Secondary)

Journal ref: Doc. Math. 24 (2019), 995-1031

arXiv:1804.03631 [pdf]

doi 10.1021/acs.nanolett.8b01133

Super-radiant emission from quantum dots in a nanophotonic waveguide

Authors: Je-Hyung Kim, Shahriar Aghaeimeibodi, Christopher J. K. Richardson, Richard P. Leavitt, Edo Waks

Abstract: Future scalable photonic quantum information processing relies on the ability of integrating multiple interacting quantum emitters into a single chip. Quantum dots provide ideal on-chip quantum light sources. However, achieving quantum interaction between multiple quantum dots on-a-chip is a challenging task due to the randomness in their frequency and position, requiring local tuning technique an… ▽ More Future scalable photonic quantum information processing relies on the ability of integrating multiple interacting quantum emitters into a single chip. Quantum dots provide ideal on-chip quantum light sources. However, achieving quantum interaction between multiple quantum dots on-a-chip is a challenging task due to the randomness in their frequency and position, requiring local tuning technique and long-range quantum interaction. Here, we demonstrate quantum interactions between distant two quantum dots on a nanophotonic waveguide. We achieve a photon-mediated long-range interaction by integrating the quantum dots to the same optical mode of a nanophotonic waveguide and overcome spectral mismatch by incorporating on-chip thermal tuners. We observe their quantum interactions of the form of super-radiant emission, where the two dots collectively emit faster than each dot individually. Creating super-radiant emission from integrated quantum emitters could enable compact chip-integrated photonic structures that exhibit long-range quantum interactions. Therefore, these results represent a major step towards establishing photonic quantum information processors composed of multiple interacting quantum emitters on a semiconductor chip. △ Less

Submitted 10 April, 2018; originally announced April 2018.

Comments: 23 pages, 7 figures,

Journal ref: Nanoletters 2018

arXiv:1804.00987 [pdf, ps, other]

A Language for Function Signature Representations

Authors: Kyle Richardson

Abstract: Recent work by (Richardson and Kuhn, 2017a,b; Richardson et al., 2018) looks at semantic parser induction and question answering in the domain of source code libraries and APIs. In this brief note, we formalize the representations being learned in these studies and introduce a simple domain specific language and a systematic translation from this language to first-order logic. By recasting the tar… ▽ More Recent work by (Richardson and Kuhn, 2017a,b; Richardson et al., 2018) looks at semantic parser induction and question answering in the domain of source code libraries and APIs. In this brief note, we formalize the representations being learned in these studies and introduce a simple domain specific language and a systematic translation from this language to first-order logic. By recasting the target representations in terms of classical logic, we aim to broaden the applicability of existing code datasets for investigating more complex natural language understanding and reasoning problems in the software domain. △ Less

Submitted 18 April, 2018; v1 submitted 31 March, 2018; originally announced April 2018.

Comments: short note

arXiv:1803.06966 [pdf, other]

doi 10.18653/v1/N18-1066

Polyglot Semantic Parsing in APIs

Authors: Kyle Richardson, Jonathan Berant, Jonas Kuhn

Abstract: Traditional approaches to semantic parsing (SP) work by training individual models for each available parallel dataset of text-meaning pairs. In this paper, we explore the idea of polyglot semantic translation, or learning semantic parsing models that are trained on multiple datasets and natural languages. In particular, we focus on translating text to code signature representations using the soft… ▽ More Traditional approaches to semantic parsing (SP) work by training individual models for each available parallel dataset of text-meaning pairs. In this paper, we explore the idea of polyglot semantic translation, or learning semantic parsing models that are trained on multiple datasets and natural languages. In particular, we focus on translating text to code signature representations using the software component datasets of Richardson and Kuhn (2017a,b). The advantage of such models is that they can be used for parsing a wide variety of input natural languages and output programming languages, or mixed input languages, using a single unified model. To facilitate modeling of this type, we develop a novel graph-based decoding framework that achieves state-of-the-art performance on the above datasets, and apply this method to two other benchmark SP tasks. △ Less

Submitted 18 April, 2018; v1 submitted 19 March, 2018; originally announced March 2018.

Comments: accepted for NAACL-2018 (camera ready version)

arXiv:1709.05469 [pdf, ps, other]

doi 10.1142/S1793525320500260

Basic Dolbeault cohomology and Weitzenböck frmulas on transversely Kähler foliations

Authors: Seoung Dal Jung, Ken Richardson

Abstract: We study basic Dolbeault cohomology and find new Weitzenböck formulas on a transversely Kähler foliation. We investigate conditions on mean curvature and Ricci curvature that impose restrictions on basic Dolbeault cohomology. For example, we prove that on a transversely Kähler foliation with positive transversal Ricci curvature, there are no nonzero basic-harmonic forms of type $(r,0)$, among othe… ▽ More We study basic Dolbeault cohomology and find new Weitzenböck formulas on a transversely Kähler foliation. We investigate conditions on mean curvature and Ricci curvature that impose restrictions on basic Dolbeault cohomology. For example, we prove that on a transversely Kähler foliation with positive transversal Ricci curvature, there are no nonzero basic-harmonic forms of type $(r,0)$, among other results. △ Less

Submitted 16 September, 2017; originally announced September 2017.

Comments: 23 pages

MSC Class: 53C12; 53C21; 53C55; 57R30; 58J50

Journal ref: J. Topol. Anal. 13 (2021), no. 3, 673-698

Showing 101–150 of 194 results for author: Richardson, K