Efficient, indistinguishable telecom C-band photons using a tapered nanobeam

Authors: Mohammad Habibur Rahaman, Samuel Harper, Chang-Min Lee, Kyu-Young Kim, Mustafa Atabey Buyukkaya, Victor J. Patel, Samuel D. Hawkins, Je-Hyung Kim, Sadhvikas Addamane, Edo Waks

Abstract: Telecom C-band single photons exhibit the lowest attenuation in optical fibers, enabling long-haul quantum-secured communication. However, efficient coupling with optical fibers is crucial for these single photons to be effective carriers in long-distance transmission. In this work, we demonstrate an efficient fiber-coupled single photon source at the telecom C-band using InAs/InP quantum dots coupled to a tapered nanobeam. The tapered nanobeam structure facilitates directional emission that is mode-matched to a lensed fiber, resulting in a collection efficiency of up to 65% from the nanobeam to a single-mode fiber. Using this approach, we demonstrate single photon count rates of 575 $\pm$ 5 kcps and a single photon purity of $g^{(2)}(0)$ = 0.015 $\pm$ 0.003. Additionally, we demonstrate Hong-Ou-Mandel interference from the emitted photons with a visibility of 0.84 $\pm$ 0.06. From these measurements, we determine a photon coherence time of 450 $\pm$ 20 ps, a factor of just 8.3 away from the lifetime limit. This work represents an important step towards the development of telecom C-band single-photon sources emitting bright, pure, and indistinguishable photons, which are necessary to realize fiber-based long-distance quantum networks.

Submitted 5 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

arXiv:2310.11614 [pdf, other]

Learning a Hierarchical Planner from Humans in Multiple Generations

Authors: Leonardo Hernandez Cano, Yewen Pu, Robert D. Hawkins, Josh Tenenbaum, Armando Solar-Lezama

Abstract: A typical way in which a machine acquires knowledge from humans is by programming. Compared to learning from demonstrations or experiences, programmatic learning allows the machine to acquire a novel skill as soon as the program is written, and, by building a library of programs, a machine can quickly learn how to perform complex tasks. However, as programs often take their execution contexts for granted, they are brittle when the contexts change, making it difficult to adapt complex programs to new contexts. We present natural programming, a library learning system that combines programmatic learning with a hierarchical planner. Natural programming maintains a library of decompositions, consisting of a goal, a linguistic description of how this goal decomposes into sub-goals, and a concrete instance of its decomposition into sub-goals. A user teaches the system via curriculum building, by identifying a challenging yet not impossible goal along with linguistic hints on how this goal may be decomposed into sub-goals. The system solves for the goal via hierarchical planning, using the linguistic hints to guide its probability distribution in proposing the right plans. The system learns from this interaction by adding newly found decompositions in the successful search into its library. Simulated studies and a human experiment (n=360) on a controlled environment demonstrate that natural programming can robustly compose programs learned from different users and contexts, adapting faster and solving more complex tasks when compared to programmatic baselines.

Submitted 17 October, 2023; originally announced October 2023.

Comments: First two authors contributed equally

arXiv:2306.03882 [pdf, other]

Causal interventions expose implicit situation models for commonsense language understanding

Authors: Takateru Yamakoshi, James L. McClelland, Adele E. Goldberg, Robert D. Hawkins

Abstract: Accounts of human language processing have long appealed to implicit "situation models" that enrich comprehension with relevant but unstated world knowledge. Here, we apply causal intervention techniques to recent transformer models to analyze performance on the Winograd Schema Challenge (WSC), where a single context cue shifts interpretation of an ambiguous pronoun. We identify a relatively small circuit of attention heads that are responsible for propagating information from the context word that guides which of the candidate noun phrases the pronoun ultimately attends to. We then compare how this circuit behaves in a closely matched "syntactic" control where the situation model is not strictly necessary. These analyses suggest distinct pathways through which implicit situation models are constructed to guide pronoun resolution.

Submitted 7 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

Comments: Findings of ACL

arXiv:2305.07151 [pdf, other]

Overinformative Question Answering by Humans and Machines

Authors: Polina Tsvilodub, Michael Franke, Robert D. Hawkins, Noah D. Goodman

Abstract: When faced with a polar question, speakers often provide overinformative answers going beyond a simple "yes" or "no". But what principles guide the selection of additional information? In this paper, we provide experimental evidence from two studies suggesting that overinformativeness in human answering is driven by considerations of relevance to the questioner's goals which they flexibly adjust given the functional context in which the question is uttered. We take these human results as a strong benchmark for investigating question-answering performance in state-of-the-art neural language models, conducting an extensive evaluation on items from human experiments. We find that most models fail to adjust their answering behavior in a human-like way and tend to include irrelevant information. We show that GPT-3 is highly sensitive to the form of the prompt and only achieves human-like answer patterns when guided by an example and cognitively-motivated explanation.

Submitted 11 May, 2023; originally announced May 2023.

Comments: 7 pages, 2 figures, to appear in the Proceedings of the 45th Annual Conference of the Cognitive Science Society (2023)

arXiv:2305.06539 [pdf, other]

Semantic uncertainty guides the extension of conventions to new referents

Authors: Ron Eliav, Anya Ji, Yoav Artzi, Robert D. Hawkins

Abstract: A long tradition of studies in psycholinguistics has examined the formation and generalization of ad hoc conventions in reference games, showing how newly acquired conventions for a given target transfer to new referential contexts. However, another axis of generalization remains understudied: how do conventions formed for one target transfer to completely distinct targets, when specific lexical choices are unlikely to repeat? This paper presents two dyadic studies (N = 240) that address this axis of generalization, focusing on the role of nameability -- the a priori likelihood that two individuals will share the same label. We leverage the recently-released KiloGram dataset, a collection of abstract tangram images that is orders of magnitude larger than previously available, exhibiting high diversity of properties like nameability. Our first study asks how nameability shapes convention formation, while the second asks how new conventions generalize to entirely new targets of reference. Our results raise new questions about how ad hoc conventions extend beyond target-specific re-use of specific lexical choices.

Submitted 10 May, 2023; originally announced May 2023.

Comments: Proceedings of the 45th Annual Conference of the Cognitive Science Society

arXiv:2303.03215 [pdf]

Quantile-Quantile Methodology -- Detailed Results

Authors: Douglas M Hawkins

Abstract: The linear quantile-quantile relationship provides an easy-to-implement yet effective tool for transformation to and testing for normality. Its good performance is verified in this report.

Submitted 15 October, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

arXiv:2212.00869 [pdf, other]

Flexible social inference facilitates targeted social learning when rewards are not observable

Authors: Robert D. Hawkins, Andrew M. Berdahl, Alex "Sandy" Pentland, Joshua B. Tenenbaum, Noah D. Goodman, P. M. Krafft

Abstract: Groups coordinate more effectively when individuals are able to learn from others' successes. But acquiring such knowledge is not always easy, especially in real-world environments where success is hidden from public view. We suggest that social inference capacities may help bridge this gap, allowing individuals to update their beliefs about others' underlying knowledge and success from observable trajectories of behavior. We compared our social inference model against simpler heuristics in three studies of human behavior in a collective sensing task. In Experiment 1, we found that average performance improves as a function of group size at a rate greater than predicted by non-inferential models. Experiment 2 introduced artificial agents to evaluate how individuals selectively rely on social information. Experiment 3 generalized these findings to a more complex reward landscape. Taken together, our findings provide insight into the relationship between individual social cognition and the flexibility of collective behavior.

Submitted 5 August, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

Comments: Nature Human Behaviour

arXiv:2211.16492 [pdf, other]

Abstract Visual Reasoning with Tangram Shapes

Authors: Anya Ji, Noriyuki Kojima, Noah Rush, Alane Suhr, Wai Keen Vong, Robert D. Hawkins, Yoav Artzi

Abstract: We introduce KiloGram, a resource for studying abstract visual reasoning in humans and machines. Drawing on the history of tangram puzzles as stimuli in cognitive science, we build a richly annotated dataset that, with >1k distinct stimuli, is orders of magnitude larger and more diverse than prior resources. It is both visually and linguistically richer, moving beyond whole shape descriptions to include segmentation maps and part labels. We use this resource to evaluate the abstract visual reasoning capacities of recent multi-modal models. We observe that pre-trained weights demonstrate limited abstract reasoning, which dramatically improves with fine-tuning. We also observe that explicitly describing parts aids abstract reasoning for both humans and models, especially when jointly encoding the linguistic and visual inputs. KiloGram is available at https://lil.nlp.cornell.edu/kilogram .

Submitted 29 November, 2022; originally announced November 2022.

Comments: EMNLP 2022 long paper

arXiv:2206.07870 [pdf, other]

How to talk so AI will learn: Instructions, descriptions, and autonomy

Authors: Theodore R Sumers, Robert D Hawkins, Mark K Ho, Thomas L Griffiths, Dylan Hadfield-Menell

Abstract: From the earliest years of our lives, humans use language to express our beliefs and desires. Being able to talk to artificial agents about our preferences would thus fulfill a central goal of value alignment. Yet today, we lack computational models explaining such language use. To address this challenge, we formalize learning from language in a contextual bandit setting and ask how a human might communicate preferences over behaviors. We study two distinct types of language: $\textit{instructions}$, which provide information about the desired policy, and $\textit{descriptions}$, which provide information about the reward function. We show that the agent's degree of autonomy determines which form of language is optimal: instructions are better in low-autonomy settings, but descriptions are better when the agent will need to act independently. We then define a pragmatic listener agent that robustly infers the speaker's reward function by reasoning about $\textit{how}$ the speaker expresses themselves. We validate our models with a behavioral experiment, demonstrating that (1) our speaker model predicts human behavior, and (2) our pragmatic listener successfully recovers humans' reward functions. Finally, we show that this form of social learning can integrate with and reduce regret in traditional reinforcement learning. We hope these insights facilitate a shift from developing agents that $\textit{obey}$ language to agents that $\textit{learn}$ from it.

Submitted 10 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

Comments: 10 pages, 5 figures. Published as a conference paper at NeurIPS 2022

arXiv:2205.11558 [pdf, other]

Using Natural Language and Program Abstractions to Instill Human Inductive Biases in Machines

Authors: Sreejan Kumar, Carlos G. Correa, Ishita Dasgupta, Raja Marjieh, Michael Y. Hu, Robert D. Hawkins, Nathaniel D. Daw, Jonathan D. Cohen, Karthik Narasimhan, Thomas L. Griffiths

Abstract: Strong inductive biases give humans the ability to quickly learn to perform a variety of tasks. Although meta-learning is a method to endow neural networks with useful inductive biases, agents trained by meta-learning may sometimes acquire very different strategies from humans. We show that co-training these agents on predicting representations from natural language task descriptions and programs induced to generate such tasks guides them toward more human-like inductive biases. Human-generated language descriptions and program induction models that add new learned primitives both contain abstract concepts that can compress description length. Co-training on these representations results in more human-like behavior in downstream meta-reinforcement learning agents than less abstract controls (synthetic language descriptions, program induction without learned primitives), suggesting that the abstraction supported by these representations is key.

Submitted 5 February, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

Comments: In Proceedings of the 36th Conference on Neural Information Processing Systems (NeurIPS 2022), winner of Outstanding Paper Award

arXiv:2205.05666 [pdf, other]

Identifying concept libraries from language about object structure

Authors: Catherine Wong, William P. McCarthy, Gabriel Grand, Yoni Friedman, Joshua B. Tenenbaum, Jacob Andreas, Robert D. Hawkins, Judith E. Fan

Abstract: Our understanding of the visual world goes beyond naming objects, encompassing our ability to parse objects into meaningful parts, attributes, and relations. In this work, we leverage natural language descriptions for a diverse set of 2K procedurally generated objects to identify the parts people use and the principles leading these parts to be favored over others. We formalize our problem as search over a space of program libraries that contain different part concepts, using tools from machine translation to evaluate how well programs expressed in each library align to human language. By combining naturalistic language at scale with structured program representations, we discover a fundamental information-theoretic tradeoff governing the part concepts people name: people favor a lexicon that allows concise descriptions of each object, while also minimizing the size of the lexicon itself.

Submitted 11 May, 2022; originally announced May 2022.

Comments: Appears in the conference proceedings of CogSci 2022

arXiv:2204.05091 [pdf, other]

Linguistic communication as (inverse) reward design

Authors: Theodore R. Sumers, Robert D. Hawkins, Mark K. Ho, Thomas L. Griffiths, Dylan Hadfield-Menell

Abstract: Natural language is an intuitive and expressive way to communicate reward information to autonomous agents. It encompasses everything from concrete instructions to abstract descriptions of the world. Despite this, natural language is often challenging to learn from: it is difficult for machine learning methods to make appropriate inferences from such a wide range of input. This paper proposes a generalization of reward design as a unifying principle to ground linguistic communication: speakers choose utterances to maximize expected rewards from the listener's future behaviors. We first extend reward design to incorporate reasoning about unknown future states in a linear bandit setting. We then define a speaker model which chooses utterances according to this objective. Simulations show that short-horizon speakers (reasoning primarily about a single, known state) tend to use instructions, while long-horizon speakers (reasoning primarily about unknown, future states) tend to describe the reward function. We then define a pragmatic listener which performs inverse reward design by jointly inferring the speaker's latent horizon and rewards. Our findings suggest that this extension of reward design to linguistic communication, including the notion of a latent speaker horizon, is a promising direction for achieving more robust alignment outcomes from natural language supervision.

Submitted 11 April, 2022; originally announced April 2022.

Comments: 6 pages, 3 figures. Accepted at Learning from Natural Language Supervision workshop (ACL 2022)

arXiv:2202.12226 [pdf, other]

Probing BERT's priors with serial reproduction chains

Authors: Takateru Yamakoshi, Thomas L. Griffiths, Robert D. Hawkins

Abstract: Sampling is a promising bottom-up method for exposing what generative models have learned about language, but it remains unclear how to generate representative samples from popular masked language models (MLMs) like BERT. The MLM objective yields a dependency network with no guarantee of consistent conditional distributions, posing a problem for naive approaches. Drawing from theories of iterated learning in cognitive science, we explore the use of serial reproduction chains to sample from BERT's priors. In particular, we observe that a unique and consistent estimator of the ground-truth joint distribution is given by a Generative Stochastic Network (GSN) sampler, which randomly selects which token to mask and reconstruct on each step. We show that the lexical and syntactic statistics of sentences from GSN chains closely match the ground-truth corpus distribution and perform better than other methods in a large corpus of naturalness judgments. Our findings establish a firmer theoretical foundation for bottom-up probing and highlight richer deviations from human priors.

Submitted 18 March, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

Comments: Findings of ACL 2022

arXiv:2112.03945 [pdf]

Case Study: Evaluation of a meta-analysis of the association between soy protein and cardiovascular disease

Authors: S. Stanley Young, Warren B. Kindzierski, Douglas Hawkins, Paul Fogel, Terry Meyer

Abstract: It is well-known that claims coming from observational studies most often fail to replicate. Experimental (randomized) trials, where conditions are under researcher control, have a high reputation and meta-analyses of experimental trials are considered the best possible evidence. Given the irreproducibility crisis, experiments lately are starting to be questioned. There is a need to know the reliability of claims coming from randomized trials. A case study is presented here independently examining a published meta-analysis of randomized trials claiming that soy protein intake improves cardiovascular health. Counting and p-value plotting techniques (standard p-value plot, p-value expectation plot, and volcano plot) are used. Counting (search space) analysis indicates that reported p-values from the meta-analysis could be biased low due to multiple testing and multiple modeling. Plotting techniques used to visualize the behavior of the data set used for meta-analysis suggest that statistics drawn from the base papers do not satisfy key assumptions of a random-effects meta-analysis. These assumptions include using unbiased statistics all drawn from the same population. Also, publication bias is unaddressed in the meta-analysis. The claim that soy protein intake should improve cardiovascular health is not supported by our analysis.

Submitted 28 November, 2021; originally announced December 2021.

Comments: 23 pages, 5 figures, 3 Tables

arXiv:2112.03799 [pdf, other]

A pragmatic account of the weak evidence effect

Authors: Samuel A. Barnett, Thomas L. Griffiths, Robert D. Hawkins

Abstract: Language is not only used to transmit neutral information; we often seek to persuade by arguing in favor of a particular view. Persuasion raises a number of challenges for classical accounts of belief updating, as information cannot be taken at face value. How should listeners account for a speaker's "hidden agenda" when incorporating new information? Here, we extend recent probabilistic models of recursive social reasoning to allow for persuasive goals and show that our model provides a pragmatic account for why weakly favorable arguments may backfire, a phenomenon known as the weak evidence effect. Critically, this model predicts a systematic relationship between belief updates and expectations about the information source: weak evidence should only backfire when speakers are expected to act under persuasive goals and prefer the strongest evidence. We introduce a simple experimental paradigm called the Stick Contest to measure the extent to which the weak evidence effect depends on speaker expectations, and show that a pragmatic listener model accounts for the empirical data better than alternative models. Our findings suggest further avenues for rational models of social reasoning to illuminate classical decision-making phenomena.

Submitted 13 September, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

Comments: in press at Open Mind

arXiv:2109.13861 [pdf, other]

Visual resemblance and communicative context constrain the emergence of graphical conventions

Authors: Robert D. Hawkins, Megumi Sano, Noah D. Goodman, Judith E. Fan

Abstract: From photorealistic sketches to schematic diagrams, drawing provides a versatile medium for communicating about the visual world. How do images spanning such a broad range of appearances reliably convey meaning? Do viewers understand drawings based solely on their ability to resemble the entities they refer to (i.e., as images), or do they understand drawings based on shared but arbitrary associations with these entities (i.e., as symbols)? In this paper, we provide evidence for a cognitive account of pictorial meaning in which both visual and social information is integrated to support effective visual communication. To evaluate this account, we used a communication task where pairs of participants used drawings to repeatedly communicate the identity of a target object among multiple distractor objects. We manipulated social cues across three experiments and a full internal replication, finding pairs of participants develop referent-specific and interaction-specific strategies for communicating more efficiently over time, going beyond what could be explained by either task practice or a pure resemblance-based account alone. Using a combination of model-based image analyses and crowdsourced sketch annotations, we further determined that drawings did not drift toward arbitrariness, as predicted by a pure convention-based account, but systematically preserved those visual features that were most distinctive of the target object. Taken together, these findings advance theories of pictorial meaning and have implications for how successful graphical conventions emerge via complex interactions between visual perception, communicative experience, and social context.

Submitted 17 September, 2021; originally announced September 2021.

Comments: 26 pages; 8 figures; submitted version of manuscript

arXiv:2107.00077 [pdf, other]

Learning to communicate about shared procedural abstractions

Authors: William P. McCarthy, Robert D. Hawkins, Haoliang Wang, Cameron Holdaway, Judith E. Fan

Abstract: Many real-world tasks require agents to coordinate their behavior to achieve shared goals. Successful collaboration requires not only adopting the same communicative conventions, but also grounding these conventions in the same task-appropriate conceptual abstractions. We investigate how humans use natural language to collaboratively solve physical assembly problems more effectively over time. Human participants were paired up in an online environment to reconstruct scenes containing two block towers. One participant could see the target towers, and sent assembly instructions for the other participant to reconstruct. Participants provided increasingly concise instructions across repeated attempts on each pair of towers, using higher-level referring expressions that captured each scene's hierarchical structure. To explain these findings, we extend recent probabilistic models of ad-hoc convention formation with an explicit perceptual learning mechanism. These results shed light on the inductive biases that enable intelligent agents to coordinate upon shared procedural abstractions.

Submitted 30 June, 2021; originally announced July 2021.

arXiv:2105.11950 [pdf, other]

Extending rational models of communication from beliefs to actions

Authors: Theodore R. Sumers, Robert D. Hawkins, Mark K. Ho, Thomas L. Griffiths

Abstract: Speakers communicate to influence their partner's beliefs and shape their actions. Belief- and action-based objectives have been explored independently in recent computational models, but it has been challenging to explicitly compare or integrate them. Indeed, we find that they are conflated in standard referential communication tasks. To distinguish these accounts, we introduce a new paradigm called signaling bandits, generalizing classic Lewis signaling games to a multi-armed bandit setting where all targets in the context have some relative value. We develop three speaker models: a belief-oriented speaker with a purely informative objective; an action-oriented speaker with an instrumental objective; and a combined speaker which integrates the two by inducing listener beliefs that generally lead to desirable actions. We then present a series of simulations demonstrating that grounding production choices in future listener actions results in relevance effects and flexible uses of nonliteral language. More broadly, our findings suggest that language games based on richer decision problems are a promising avenue for insight into rational communication.

Submitted 25 May, 2021; originally announced May 2021.

Comments: 7 pages, 4 figures. Proceedings for the 43rd Annual Meeting of the Cognitive Science Society

arXiv:2105.06546 [pdf, other]

Shades of confusion: Lexical uncertainty modulates ad hoc coordination in an interactive communication task

Authors: Sonia K. Murthy, Thomas L. Griffiths, Robert D. Hawkins

Abstract: There is substantial variability in the expectations that communication partners bring into interactions, creating the potential for misunderstandings. To directly probe these gaps and our ability to overcome them, we propose a communication task based on color-concept associations. In Experiment 1, we establish several key properties of the mental representations of these expectations, or lexical priors, based on recent probabilistic theories. Associations are more variable for abstract concepts, variability is represented as uncertainty within each individual, and uncertainty enables accurate predictions about whether others are likely to share the same association. In Experiment 2, we then examine the downstream consequences of these representations for communication. Accuracy is initially low when communicating about concepts with more variable associations, but rapidly increases as participants form ad hoc conventions. Together, our findings suggest that people cope with variability by maintaining well-calibrated uncertainty about their partner and appropriately adaptable representations of their own.

Submitted 26 April, 2022; v1 submitted 13 May, 2021; originally announced May 2021.

Comments: in press at Cognition

arXiv:2104.05857 [pdf, other]

From partners to populations: A hierarchical Bayesian account of coordination and convention

Authors: Robert D. Hawkins, Michael Franke, Michael C. Frank, Adele E. Goldberg, Kenny Smith, Thomas L. Griffiths, Noah D. Goodman

Abstract: Languages are powerful solutions to coordination problems: they provide stable, shared expectations about how the words we say correspond to the beliefs and intentions in our heads. Yet language use in a variable and non-stationary social environment requires linguistic representations to be flexible: old words acquire new ad hoc or partner-specific meanings on the fly. In this paper, we introduce CHAI (Continual Hierarchical Adaptation through Inference), a hierarchical Bayesian theory of coordination and convention formation that aims to reconcile the long-standing tension between these two basic observations. We argue that the central computational problem of communication is not simply transmission, as in classical formulations, but continual learning and adaptation over multiple timescales. Partner-specific common ground quickly emerges from social inferences within dyadic interactions, while community-wide social conventions are stable priors that have been abstracted away from interactions with multiple partners. We present new empirical data alongside simulations showing how our model provides a computational foundation for several phenomena that have posed a challenge for previous accounts: (1) the convergence to more efficient referring expressions across repeated interaction with the same partner, (2) the gradual transfer of partner-specific common ground to strangers, and (3) the influence of communicative context on which conventions eventually form.

Submitted 2 December, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

Comments: In press at Psychological Review

arXiv:2010.02375 [pdf, other]

Investigating representations of verb bias in neural language models

Authors: Robert D. Hawkins, Takateru Yamakoshi, Thomas L. Griffiths, Adele E. Goldberg

Abstract: Languages typically provide more than one grammatical construction to express certain types of messages. A speaker's choice of construction is known to depend on multiple factors, including the choice of main verb -- a phenomenon known as \emph{verb bias}. Here we introduce DAIS, a large benchmark dataset containing 50K human judgments for 5K distinct sentence pairs in the English dative alternation. This dataset includes 200 unique verbs and systematically varies the definiteness and length of arguments. We use this dataset, as well as an existing corpus of naturally occurring data, to evaluate how well recent neural language models capture human preferences. Results show that larger models perform better than smaller models, and transformer architectures (e.g. GPT-2) tend to out-perform recurrent architectures (e.g. LSTMs) even under comparable parameter and training settings. Additional analyses of internal feature representations suggest that transformers may better integrate specific lexical information with grammatical constructions.

Submitted 15 October, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

Comments: Accepted to EMNLP

arXiv:2009.14715 [pdf, other]

Learning Rewards from Linguistic Feedback

Authors: Theodore R. Sumers, Mark K. Ho, Robert D. Hawkins, Karthik Narasimhan, Thomas L. Griffiths

Abstract: We explore unconstrained natural language feedback as a learning signal for artificial agents. Humans use rich and varied language to teach, yet most prior work on interactive learning from language assumes a particular form of input (e.g., commands). We propose a general framework which does not make this assumption, using aspect-based sentiment analysis to decompose feedback into sentiment about the features of a Markov decision process. We then perform an analogue of inverse reinforcement learning, regressing the sentiment on the features to infer the teacher's latent reward function. To evaluate our approach, we first collect a corpus of teaching behavior in a cooperative task where both teacher and learner are human. We implement three artificial learners: sentiment-based "literal" and "pragmatic" models, and an inference network trained end-to-end to predict latent rewards. We then repeat our initial experiment and pair them with human teachers. All three successfully learn from interactive human feedback. The sentiment models outperform the inference network, with the "pragmatic" model approaching human performance. Our work thus provides insight into the information structure of naturalistic linguistic feedback as well as methods to leverage it for reinforcement learning.

Submitted 3 July, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

Comments: 9 pages, 4 figures. AAAI '21

arXiv:2004.09431 [pdf, other]

doi 10.1093/mnras/staa1109

AT 2016dah and AT 2017fyp: the first classical novae discovered within a tidal stream

Authors: M. J. Darnley, A. M. Newsam, K. Chinetti, I. D. W. Hawkins, A. L. Jannetta, M. M. Kasliwal, J. C. McGarry, M. M. Shara, M. Sitaram, S. C. Williams

Abstract: AT2016dah and AT2017fyp are fairly typical Andromeda Galaxy (M31) classical novae. AT2016dah is an almost textbook example of a 'very fast' declining, yet uncommon, Fe II'b' (broad-lined) nova, discovered during the rise to peak optical luminosity, and decaying with a smooth broken power-law light curve. AT2017fyp is classed as a 'fast' nova; unusually for M31, its early decline spectrum simultaneously shows properties of both Fe II and He/N spectral types - a 'hybrid'. Similarly, the light curve of AT2017fyp has a broken power-law decline but exhibits an extended flat-topped maximum. Both novae were followed in the UV and X-ray by the Neil Gehrels Swift Observatory, but no X-ray source was detected for either nova. The pair were followed photometrically and spectroscopically into their nebular phases. The progenitor systems were not visible in archival optical data, implying that the mass donors are main sequence stars. What makes AT2016dah and AT2017fyp particularly interesting is their position with respect to M31. The pair are close on the sky but are located far from the centre of M31, lying almost along the semi-minor axis of their host. Radial velocity measurements and simulations of the M31 nova population lead to the conclusion that both novae are members of the Andromeda Giant Stellar Stream (GSS). We find the probability of at least two M31 novae appearing coincident with the GSS by chance is ~1%. Therefore, we claim that these novae arose from the GSS progenitor, not M31 - the first confirmed novae discovered in a tidal stream.

Submitted 20 April, 2020; originally announced April 2020.

Comments: 22 pages, 14 figures, 4 tables. Accepted for publication in MNRAS

arXiv:2002.01510 [pdf, other]

Generalizing meanings from partners to populations: Hierarchical inference supports convention formation on networks

Authors: Robert D. Hawkins, Noah D. Goodman, Adele E. Goldberg, Thomas L. Griffiths

Abstract: A key property of linguistic conventions is that they hold over an entire community of speakers, allowing us to communicate efficiently even with people we have never met before. At the same time, much of our language use is partner-specific: we know that words may be understood differently by different people based on our shared history. This poses a challenge for accounts of convention formation. Exactly how do agents make the inferential leap to community-wide expectations while maintaining partner-specific knowledge? We propose a hierarchical Bayesian model to explain how speakers and listeners solve this inductive problem. To evaluate our model's predictions, we conducted an experiment where participants played an extended natural-language communication game with different partners in a small community. We examine several measures of generalization and find key signatures of both partner-specificity and community convergence that distinguish our model from alternatives. These results suggest that partner-specificity is not only compatible with the formation of community-wide conventions, but may facilitate it when coupled with a powerful inductive mechanism.

Submitted 30 May, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

Comments: CogSci 2020

arXiv:1912.07199 [pdf, other]

Characterizing the dynamics of learning in repeated reference games

Authors: Robert D. Hawkins, Michael C. Frank, Noah D. Goodman

Abstract: The language we use over the course of conversation changes as we establish common ground and learn what our partner finds meaningful. Here we draw upon recent advances in natural language processing to provide a finer-grained characterization of the dynamics of this learning process. We release an open corpus (>15,000 utterances) of extended dyadic interactions in a classic repeated reference game task where pairs of participants had to coordinate on how to refer to initially difficult-to-describe tangram stimuli. We find that different pairs discover a wide variety of idiosyncratic but efficient and stable solutions to the problem of reference. Furthermore, these conventions are shaped by the communicative context: words that are more discriminative in the initial context (i.e. that are used for one target more than others) are more likely to persist through the final repetition. Finally, we find systematic structure in how a speaker's referring expressions become more efficient over time: syntactic units drop out in clusters following positive feedback from the listener, eventually leaving short labels containing open-class parts of speech. These findings provide a higher resolution look at the quantitative dynamics of ad hoc convention formation and support further development of computational models of learning in communication.

Submitted 13 April, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

Comments: Accepted at Cognitive Science

arXiv:1911.09896 [pdf, other]

Continual adaptation for efficient machine communication

Authors: Robert D. Hawkins, Minae Kwon, Dorsa Sadigh, Noah D. Goodman

Abstract: To communicate with new partners in new contexts, humans rapidly form new linguistic conventions. Recent neural language models are able to comprehend and produce the existing conventions present in their training data, but are not able to flexibly and interactively adapt those conventions on the fly as humans do. We introduce an interactive repeated reference task as a benchmark for models of adaptation in communication and propose a regularized continual learning framework that allows an artificial agent initialized with a generic language model to more accurately and efficiently communicate with a partner over time. We evaluate this framework through simulations on COCO and in real-time reference game experiments with human partners.

Submitted 13 October, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

Comments: Accepted at CoNLL

arXiv:1906.02950 [pdf, other]

doi 10.1016/j.jcp.2020.109234

Provably Optimal Parallel Transport Sweeps on Semi-Structured Grids

Authors: Michael P. Adams, Marvin L. Adams, W. Daryl Hawkins, Timmie Smith, Lawrence Rauchwerger, Nancy M. Amato, Teresa S. Bailey, Robert D. Falgout, Adam Kunen, Peter Brown

Abstract: We have found provably optimal algorithms for full-domain discrete-ordinate transport sweeps on a class of grids in 2D and 3D Cartesian geometry that are regular at a coarse level but arbitrary within the coarse blocks. We describe these algorithms and show that they always execute the full eight-octant (or four-quadrant if 2D) sweep in the minimum possible number of stages for a given Px x Py x Pz partitioning. Computational results confirm that our optimal scheduling algorithms execute sweeps in the minimum possible stage count. Observed parallel efficiencies agree well with our performance model. Our PDT transport code has achieved approximately 68% parallel efficiency with > 1.5M parallel threads, relative to 8 threads, on a simple weak-scaling problem with only three energy groups, 10 directions per octant, and 4096 cells/core. We demonstrate similar efficiencies on a much more realistic set of nuclear-reactor test problems, with unstructured meshes that resolve fine geometric details. These results demonstrate that discrete-ordinates transport sweeps can be executed with high efficiency using more than 10^6 parallel processes.

Submitted 7 June, 2019; originally announced June 2019.

Comments: intended for journal submission soon

arXiv:1905.02925 [pdf, other]

ShapeGlot: Learning Language for Shape Differentiation

Authors: Panos Achlioptas, Judy Fan, Robert X. D. Hawkins, Noah D. Goodman, Leonidas J. Guibas

Abstract: In this work we explore how fine-grained differences between the shapes of common objects are expressed in language, grounded on images and 3D models of the objects. We first build a large scale, carefully controlled dataset of human utterances that each refers to a 2D rendering of a 3D CAD model so as to distinguish it from a set of shape-wise similar alternatives. Using this dataset, we develop neural language understanding (listening) and production (speaking) models that vary in their grounding (pure 3D forms via point-clouds vs. rendered 2D images), the degree of pragmatic reasoning captured (e.g. speakers that reason about a listener or not), and the neural architecture (e.g. with or without attention). We find models that perform well with both synthetic and human partners, and with held out utterances and objects. We also find that these models are amenable to zero-shot transfer learning to novel object classes (e.g. transfer from training on chairs to testing on lamps), as well as to real-world images drawn from furniture catalogs. Lesion studies indicate that the neural listeners depend heavily on part-related words and associate these words correctly with visual parts of objects (without any explicit network training on object parts), and that transfer to novel classes is most successful when known part-words are available. This work illustrates a practical approach to language grounding, and provides a case study in the relationship between object shape and linguistic structure when it comes to object differentiation.

Submitted 8 May, 2019; originally announced May 2019.

arXiv:1904.00340 [pdf, ps, other]

CUSUM ARL - Conditional or Unconditional?

Authors: F. Lombard, D. M. Hawkins

Abstract: The behavior of CUSUM charts depends strongly on how they are initialized. Recent work has suggested that self-starting CUSUM methods retain some dependence on their very first readings, and introduced the concept of "conditional average run length" (CARL) -- the average run length conditioned on the first few process readings -- as a result of which it is claimed that different practitioners using the same methodology could experience different ARLs because of the random differences in their earliest readings. We cast doubt on whether CARL is relevant to practitioners who use self-starting methods and argue that the unconditional ARL is the relevant measure there.

Submitted 31 March, 2019; originally announced April 2019.

Comments: 7 pages

arXiv:1903.08237 [pdf, other]

When redundancy is useful: A Bayesian approach to 'overinformative' referring expressions

Authors: Judith Degen, Robert D. Hawkins, Caroline Graf, Elisa Kreiss, Noah D. Goodman

Abstract: Referring is one of the most basic and prevalent uses of language. How do speakers choose from the wealth of referring expressions at their disposal? Rational theories of language use have come under attack for decades for not being able to account for the seemingly irrational overinformativeness ubiquitous in referring expressions. Here we present a novel production model of referring expressions within the Rational Speech Act framework that treats speakers as agents that rationally trade off cost and informativeness of utterances. Crucially, we relax the assumption that informativeness is computed with respect to a deterministic Boolean semantics, in favor of a non-deterministic continuous semantics. This innovation allows us to capture a large number of seemingly disparate phenomena within one unified framework: the basic asymmetry in speakers' propensity to overmodify with color rather than size; the increase in overmodification in complex scenes; the increase in overmodification with atypical features; and the increase in specificity in nominal reference as a function of typicality. These findings cast a new light on the production of referring expressions: rather than being wastefully overinformative, reference is usefully redundant.

Submitted 10 December, 2019; v1 submitted 19 March, 2019; originally announced March 2019.

arXiv:1807.09000 [pdf, other]

The division of labor in communication: Speakers help listeners account for asymmetries in visual perspective

Authors: Robert D. Hawkins, Hyowon Gweon, Noah D. Goodman

Abstract: Recent debates over adults' theory of mind use have been fueled by surprising failures of perspective-taking in communication, suggesting that perspective-taking can be relatively effortful. How, then, should speakers and listeners allocate their resources to achieve successful communication? We begin with the observation that this shared goal induces a natural division of labor: the resources one agent chooses to allocate toward perspective-taking should depend on their expectations about the other's allocation. We formalize this idea in a resource-rational model augmenting recent probabilistic weighting accounts with a mechanism for (costly) control over the degree of perspective-taking. In a series of simulations, we first derive an intermediate degree of perspective weighting as an optimal tradeoff between expected costs and benefits of perspective-taking. We then present two behavioral experiments testing novel predictions of our model. In Experiment 1, we manipulated the presence or absence of occlusions in a director-matcher task and found that speakers spontaneously produced more informative descriptions to account for "known unknowns" in their partner's private view. In Experiment 2, we compared the scripted utterances used by confederates in prior work with those produced in interactions with unscripted directors. We found that confederates were systematically less informative than listeners would initially expect given the presence of occlusions, but listeners used violations to adaptively make fewer errors over time.
Taken together, our work suggests that people are not simply "mindblind"; they use contextually appropriate expectations to navigate the division of labor with their partner. We discuss how a resource rational framework may provide a more deeply explanatory foundation for understanding flexible perspective-taking under processing constraints.

Submitted 11 May, 2020; v1 submitted 24 July, 2018; originally announced July 2018.

arXiv:1705.00351 [pdf, ps, other]

Nonparametric Cusum Charts for Angular Data with Applications in Health Science and Astrophysics

Authors: F. Lombard, Douglas M. Hawkins, Cornelis Potgieter

Abstract: This paper develops non-parametric rotation invariant CUSUMs suited to the detection of changes in the mean direction as well as changes in the concentration parameter of angular data. The properties of the CUSUMs are illustrated by theoretical calculations, Monte Carlo simulation and application to sequentially observed angular data from health science and astrophysics.

Submitted 7 June, 2018; v1 submitted 30 April, 2017; originally announced May 2017.

arXiv:1704.04770 [pdf]

doi 10.1103/PhysRevApplied.8.034028

Raman study of lattice vibrations in type II superlattice InAs/InAs1-xSbx

Authors: Henan Liu, Yong Zhang, Elizabeth H. Steenbergen, Shi Liu, Zhiyuan Lin, Yong-Hang Zhang, Jeomoh Kim, Mi-Hee Ji, Theeradetch Detchprohm, Russell D. Dupuis, ** K. Kim, Samuel D. Hawkins, John F. Klem

Abstract: In this work, we report a polarized Raman study on the vibrational properties of the InAs/InAs1-xSbx SLs as well as selected InAs1-xSbx alloys, all grown on GaSb substrates by either MBE or MOCVD, from both growth surface and cleaved edge.

Submitted 16 April, 2017; originally announced April 2017.

Journal ref: Phys. Rev. Applied 8, 034028 (2017)

arXiv:1703.10186 [pdf, other]

Colors in Context: A Pragmatic Neural Model for Grounded Language Understanding

Authors: Will Monroe, Robert X. D. Hawkins, Noah D. Goodman, Christopher Potts

Abstract: We present a model of pragmatic referring expression interpretation in a grounded communication task (identifying colors from descriptions) that draws upon predictions from two recurrent neural network classifiers, a speaker and a listener, unified by a recursive pragmatic reasoning framework. Experiments show that this combined pragmatic model interprets color descriptions more accurately than the classifiers from which it is built, and that much of this improvement results from combining the speaker and listener perspectives. We observe that pragmatic reasoning helps primarily in the hardest cases: when the model must distinguish very similar colors, or when few utterances adequately express the target color. Our findings make use of a newly-collected corpus of human utterances in color reference games, which exhibit a variety of pragmatic behaviors. We also show that the embedded speaker model reproduces many of these pragmatic behaviors.

Submitted 16 May, 2017; v1 submitted 29 March, 2017; originally announced March 2017.

Comments: 14 pages, 3 tables, 6 figures. TACL

arXiv:1701.07417 [pdf]

Evidence for an excitonic insulator phase in a zero-gap InAs/GaSb bilayer

Authors: W. Yu, V. Clericò, C. Hernández Fuentevilla, X. Shi, Y. Jiang, D. Saha, W. K. Lou, K. Chang, D. H. Huang, G. Gumbs, D. Smirnov, C. J. Stanton, Z. Jiang, V. Bellani, Y. Meziani, E. Diez, W. Pan, S. D. Hawkins, J. F. Klem

Abstract: Many-body interactions can produce novel ground states in a condensed-matter system. For example, interacting electrons and holes can spontaneously form excitons, a neutral bound state, provided that the exciton binding energy exceeds the energy separation between the single particle states. Here we report on electrical transport measurements on spatially separated two-dimensional electron and hole gases with nominally degenerate energy subbands, realized in an InAs(10 nm)/GaSb(5 nm) coupled quantum well. We observe a narrow and intense maximum (~500 kΩ) in the four-terminal resistivity in the charge neutrality region, separating the electron-like and hole-like regimes, with a strong activated temperature-dependence above T = 7 K and perfect stability against quantizing magnetic fields. By quantitatively comparing our data with early theoretical predictions, we show that such unexpectedly large resistance in our nominally zero-gap semi-metal system is probably due to the formation of an excitonic insulator state.

Submitted 25 January, 2017; originally announced January 2017.

arXiv:1612.08634 [pdf, other]

doi 10.1063/1.4973562

A facility for the analysis of the electronic structures of solids and their surfaces by synchrotron radiation photoelectron spectroscopy

Authors: M. Hoesch, T. K. Kim, P. Dudin, H. Wang, S. Scott, P. Harris, S. Patel, M. Matthews, D. Hawkins, S. G. Alcock, T. Richter, J. J. Mudd, M. Basham, L. Pratt, P. Leicester, E. C. Longhi, A. Tamai, F. Baumberger

Abstract: A synchrotron radiation beamline in the photon energy range of 18 - 240 eV and an electron spectroscopy end station have been constructed at the 3 GeV Diamond Light Source storage ring. The instrument features a variable polarisation undulator, a high resolution monochromator, a re-focussing system to form a beam spot of 50x50 micrometer^2 and an end station for angle-resolved photoelectron spectroscopy (ARPES) including a 6-degrees-of-freedom cryogenic sample manipulator. The beamline design and its performance allow for a highly productive and precise use of the ARPES technique at an energy resolution of 10 - 15 meV for fast k-space mapping studies with a photon flux up to 2 × 10^13 ph/sec and well below 3 meV for high resolution spectra.

Submitted 21 December, 2016; originally announced December 2016.

Comments: 10 pages, 11 figures

Journal ref: Rev. Sci. Instrum. 88, 013106 (2017)

arXiv:1610.05784 [pdf, other]

doi 10.1103/PhysRevB.95.045116

Probing the semiconductor to semimetal transition in InAs/GaSb double quantum wells by magneto-infrared spectroscopy

Authors: Y. Jiang, S. Thapa, G. D. Sanders, C. J. Stanton, Q. Zhang, J. Kono, W. K. Lou, K. Chang, S. D. Hawkins, J. F. Klem, W. Pan, D. Smirnov, Z. Jiang

Abstract: We perform a magneto-infrared spectroscopy study of the semiconductor to semimetal transition of InAs/GaSb double quantum wells from the normal to the inverted state. We show that owing to the low carrier density of our samples (approaching the intrinsic limit), the magneto-absorption spectra evolve from a single cyclotron resonance peak in the normal state to multiple absorption peaks in the inverted state with distinct magnetic field dependence. Using an eight-band Pidgeon-Brown model, we explain all the major absorption peaks observed in our experiment. We demonstrate that the semiconductor to semimetal transition can be realized by manipulating the quantum confinement, the strain, and the magnetic field. Our work paves the way for band engineering of optimal InAs/GaSb structures for realizing novel topological states as well as for device applications in the terahertz regime.

Submitted 18 October, 2016; originally announced October 2016.

Comments: 15 pages, 9 figures

Journal ref: Phys. Rev. B 95, 045116 (2017)

arXiv:1605.02789 [pdf]

Far Infrared Edge Photoresponse and Persistent Edge Transport in an Inverted InAs/GaSb Heterostructure

Authors: G. C. Dyer, X. Shi, B. V. Olson, S. D. Hawkins, J. F. Klem, E. A. Shaner, W. Pan

Abstract: Direct current (DC) transport and far infrared photoresponse were studied in an InAs/GaSb double quantum well with an inverted band structure. The DC transport depends systematically upon the DC bias configuration and operating temperature. Surprisingly, it reveals robust edge conduction despite prevalent bulk transport in our device of macroscopic size. Under 180 GHz far infrared illumination at oblique incidence, we measured a strong photovoltaic response. We conclude that quantum spin Hall edge transport produces the observed transverse photovoltages. Overall, our experimental results support a hypothesis that the photoresponse arises from direct coupling of the incident radiation field to edge states.

Submitted 9 May, 2016; originally announced May 2016.

Journal ref: Appl. Phys. Lett. 108, 013106 (2016)

arXiv:1510.06744 [pdf, ps, other]

doi 10.1088/0004-637X/814/2/140

First Results from COPSS: The CO Power Spectrum Survey

Authors: Garrett K. Keating, Geoffrey C. Bower, Daniel P. Marrone, David R. DeBoer, Carl Heiles, Tzu-Ching Chang, John E. Carlstrom, Christopher H. Greer, David Hawkins, James W. Lamb, Erik Leitch, Amber D. Miller, Stephen Muchovej, David P. Woody

Abstract: We present constraints on the abundance of carbon-monoxide in the early Universe from the CO Power Spectrum Survey (COPSS). We utilize a data set collected between 2005 and 2008 using the Sunyaev-Zel'dovich Array (SZA), which were previously used to measure arcminute-scale fluctuations of the CMB. This data set features observations of 44 fields, covering an effective area of 1.7 square degrees, over a frequency range of 27 to 35 GHz. Using the technique of intensity mapping, we are able to probe the CO(1-0) transition, with sensitivity to spatial modes between $k=0.5{-}2\ h\,\textrm{Mpc}^{-1}$ over a range in redshift of $z=2.3{-}3.3$, spanning a comoving volume of $3.6\times10^{6}\ h^{-3}\,\textrm{Mpc}^{3}$. We demonstrate our ability to mitigate foregrounds, and present estimates of the impact of continuum sources on our measurement. We constrain the CO power spectrum to $P_{\textrm{CO}}<2.6\times10^{4}\ μ\textrm{K}^{2} (h^{-1}\,\textrm{Mpc})^{3}$, or $Δ^{2}_{\textrm{CO}}(k\! = \! 1 \ h\,\textrm{Mpc}^{-1})<1.3 \times10^{3}\ μ\textrm{K}^{2}$, at $95\%$ confidence. This limit resides near optimistic predictions for the CO power spectrum. Under the assumption that CO emission is proportional to halo mass during bursts of active star formation, this corresponds to a limit on the ratio of $\textrm{CO}(1{-}0)$ luminosity to host halo mass of $A_{\textrm{CO}}<1.2\times10^{-5}\ L_{\odot}\ M_{\odot}^{-1}$. Further assuming a Milky Way-like conversion factor between CO luminosity and molecular gas mass ($α_{\textrm{CO}}=4.3\ M_{\odot}\ (\textrm{K}\ \textrm{km}\ \textrm{s}^{-1}\ \textrm{pc}^{-2})^{-1}$), we constrain the global density of molecular gas to $ρ_{z\sim3}(M_{\textrm{H}_{2}})\leq 2.8 \times10^{8}\ M_{\odot}\ \textrm{Mpc}^{-3}$.

Submitted 22 October, 2015; originally announced October 2015.

Comments: 15 pages, 10 figures, 2 tables; Accepted for publication in ApJ

arXiv:1509.02962 [pdf, other]

Coarse-to-Fine Sequential Monte Carlo for Probabilistic Programs

Authors: Andreas Stuhlmüller, Robert X. D. Hawkins, N. Siddharth, Noah D. Goodman

Abstract: Many practical techniques for probabilistic inference require a sequence of distributions that interpolate between a tractable distribution and an intractable distribution of interest. Usually, the sequences used are simple, e.g., based on geometric averages between distributions. When models are expressed as probabilistic programs, the models themselves are highly structured objects that can be used to derive annealing sequences that are more sensitive to domain structure. We propose an algorithm for transforming probabilistic programs to coarse-to-fine programs which have the same marginal distribution as the original programs, but generate the data at increasing levels of detail, from coarse to fine. We apply this algorithm to an Ising model, its depth-from-disparity variation, and a factorial hidden Markov model. We show preliminary evidence that the use of coarse-to-fine models can make existing generic inference algorithms more efficient.

Submitted 9 September, 2015; originally announced September 2015.

arXiv:1410.7342 [pdf, other]

doi 10.1063/1.4932644

Giant supercurrent states in a superconductor-InAs/GaSb-superconductor junction

Authors: Xiaoyan Shi, Wenlong Yu, Zhigang Jiang, B. Andrei Bernevig, W. Pan, S. D. Hawkins, J. F. Klem

Abstract: Superconductivity in topological materials has attracted a great deal of interest in both electron physics and material sciences since the theoretical predictions that Majorana fermions can be realized in topological superconductors [1-4]. Topological superconductivity could be realized in a type II, band-inverted, InAs/GaSb quantum well if it is in proximity to a conventional superconductor. Here we report observations of the proximity effect induced giant supercurrent states in an InAs/GaSb bilayer system that is sandwiched between two superconducting tantalum electrodes to form a superconductor-InAs/GaSb-superconductor junction. Electron transport results show that the supercurrent states can be preserved in a surprisingly large temperature-magnetic field (T-H) parameter space. In addition, the evolution of differential resistance in T and H reveals an interesting superconducting gap structure.

Submitted 20 November, 2014; v1 submitted 27 October, 2014; originally announced October 2014.

arXiv:1402.7282 [pdf]

Superconducting proximity effect in inverted InAs/GaSb quantum well structures with Ta electrodes

Authors: Wenlong Yu, Yuxuan Jiang, Chao Huan, Xunchi Chen, Zhigang Jiang, Samuel D. Hawkins, John F. Klem, Wei Pan

Abstract: We present our recent electronic transport results in top-gated InAs/GaSb quantum well hybrid structures with superconducting Ta electrodes. We show that the transport across the InAs-Ta junction depends largely on the interfacial transparency, exhibiting distinct zero-bias behavior. For a relatively resistive interface a broad conductance peak is observed at zero bias. When a transparent InAs-Ta interface is achieved, a zero-bias conductance dip appears with two coherent-peak-like features forming at bias voltages corresponding to the superconducting gap of Ta. The conductance spectra of the transparent InAs-Ta junction at different gate voltages can be fit well using the standard Blonder-Tinkham-Klapwijk theory.

Submitted 28 February, 2014; originally announced February 2014.

Comments: submitted

arXiv:1302.0907 [pdf, other]

doi 10.3390/e15062246

Bootstrap Methods for the Empirical Study of Decision-Making and Information Flows in Social Systems

Authors: Simon DeDeo, Robert X. D. Hawkins, Sara Klingenstein, Tim Hitchcock

Abstract: We characterize the statistical bootstrap for the estimation of information-theoretic quantities from data, with particular reference to its use in the study of large-scale social phenomena. Our methods allow one to preserve, approximately, the underlying axiomatic relationships of information theory---in particular, consistency under arbitrary coarse-graining---that motivate use of these quantities in the first place, while providing reliability comparable to the state of the art for Bayesian estimators. We show how information-theoretic quantities allow for rigorous empirical study of the decision-making capacities of rational agents and the time-asymmetric flows of information in distributed systems. We provide illustrative examples by reference to ongoing collaborative work on the semantic structure of the British Criminal Court system and the conflict dynamics of the contemporary Afghanistan insurgency.

Submitted 5 June, 2013; v1 submitted 4 February, 2013; originally announced February 2013.

Comments: 32 pages, 8 figures, 5 tables. Matched published version. Code for NSB, naive, and bootstrap estimation of entropy, mutual information, and other quantities available at http://thoth-python.org

Journal ref: Entropy 2013, 15(6), 2246-2276

arXiv:1202.2411 [pdf, ps, other]

doi 10.1088/0004-637X/748/2/113

Joint analysis of X-ray and Sunyaev Zel'dovich observations of galaxy clusters using an analytic model of the intra-cluster medium

Authors: Nicole Hasler, Esra Bulbul, Massimiliano Bonamente, John E. Carlstrom, Thomas L. Culverhouse, Megan Gralla, Christopher Greer, David Hawkins, Ryan Hennessy, Marshall Joy, Jeffery Kolodziejczak, James W. Lamb, David Landry, Erik M. Leitch, Adam Mantz, Daniel P. Marrone, Amber Miller, Tony Mroczkowski, Stephen Muchovej, Thomas Plagge, Clem Pryke, David Woody

Abstract: We perform a joint analysis of X-ray and Sunyaev Zel'dovich (SZ) effect data using an analytic model that describes the gas properties of galaxy clusters. The joint analysis allows the measurement of the cluster gas mass fraction profile and Hubble constant independent of cosmological parameters. Weak cosmological priors are used to calculate the overdensity radius within which the gas mass fractions are reported. Such an analysis can provide direct constraints on the evolution of the cluster gas mass fraction with redshift. We validate the model and the joint analysis on high signal-to-noise data from the Chandra X-ray Observatory and the Sunyaev-Zel'dovich Array for two clusters, Abell 2631 and Abell 2204.

Submitted 10 February, 2012; originally announced February 2012.

Comments: ApJ in press

arXiv:1112.1599 [pdf, other]

doi 10.1088/1367-2630/14/2/025010

Comparison of Pressure Profiles of Massive Relaxed Galaxy Clusters using Sunyaev-Zel'dovich and X-ray Data

Authors: Massimiliano Bonamente, Nicole Hasler, Esra Bulbul, John E. Carlstrom, Thomas L. Culverhouse, Megan Gralla, Christopher Greer, David Hawkins, Ryan Hennessy, Marshall Joy, Jeffery Kolodziejczak, James W. Lamb, David Landry, Erik M. Leitch, Daniel P. Marrone, Amber Miller, Tony Mroczkowski, Stephen Muchovej, Thomas Plagge, Clem Pryke, Matthew Sharp, David Woody

Abstract: We present Sunyaev-Zel'dovich (SZ) effect observations of a sample of 25 massive relaxed galaxy clusters observed with the Sunyaev-Zel'dovich Array (SZA), an 8-element interferometer that is part of the Combined Array for Research in Millimeter-wave Astronomy (CARMA). We perform an analysis of new SZA data and archival Chandra observations of this sample to investigate the integrated pressure -- a proxy for cluster mass -- determined from X-ray and SZ observations, two independent probes of the intra-cluster medium. This analysis makes use of a model for the intra-cluster medium introduced by Bulbul (2010) which can be applied simultaneously to SZ and X-ray data. With this model, we estimate the pressure profile for each cluster using a joint analysis of the SZ and X-ray data, and using the SZ data alone. We find that the integrated pressures measured from X-ray and SZ data are consistent. This conclusion is in agreement with recent results obtained using WMAP and Planck data, confirming that SZ and X-ray observations of massive clusters detect the same amount of thermal pressure from the intra-cluster medium. To test for possible biases introduced by our choice of model, we also fit the SZ data using the universal pressure profile proposed by Arnaud (2010), and find consistency between the two models out to r500 in the pressure profiles and integrated pressures.

Submitted 20 December, 2011; v1 submitted 7 December, 2011; originally announced December 2011.

Comments: Accepted for New Journal of Physics, Focus Issue on Galaxy Clusters

arXiv:1107.5115 [pdf, other]

doi 10.1088/0004-637X/754/2/119

LoCuSS: The Sunyaev-Zel'dovich Effect and Weak Lensing Mass Scaling Relation

Authors: Daniel P. Marrone, Graham P. Smith, Nobuhiro Okabe, Massimiliano Bonamente, John E. Carlstrom, Thomas L. Culverhouse, Megan Gralla, Christopher H. Greer, Nicole Hasler, David Hawkins, Ryan Hennessy, Marshall Joy, James W. Lamb, Erik M. Leitch, Rossella Martino, Pasquale Mazzotta, Amber Miller, Tony Mroczkowski, Stephen Muchovej, Thomas Plagge, Clem Pryke, Alastair J. R. Sanderson, Masahiro Takada, David Woody, Yu-Ying Zhang

Abstract: We present the first weak-lensing-based scaling relation between galaxy cluster mass, M_wl, and integrated Compton parameter Y_sph. Observations of 18 galaxy clusters at z~0.2 were obtained with the Subaru 8.2-m telescope and the Sunyaev-Zel'dovich Array. The M_wl-Y_sph scaling relations, measured at Delta=500, 1000, and 2500 rho_c, are consistent in slope and normalization with previous results derived under the assumption of hydrostatic equilibrium (HSE). We find an intrinsic scatter in M_wl at fixed Y_sph of 20%, larger than both previous measurements of M_HSE-Y_sph scatter as well as the scatter in true mass at fixed Y_sph found in simulations. Moreover, the scatter in our lensing-based scaling relations is morphology dependent, with 30-40% larger M_wl for undisturbed compared to disturbed clusters at the same Y_sph at r_500. Further examination suggests that the segregation may be explained by the inability of our spherical lens models to faithfully describe the three-dimensional structure of the clusters, in particular, the structure along the line-of-sight. We find that the ellipticity of the brightest cluster galaxy, a proxy for halo orientation, correlates well with the offset in mass from the mean scaling relation, which supports this picture. This provides empirical evidence that line-of-sight projection effects are an important systematic uncertainty in lensing-based scaling relations.

Submitted 17 July, 2012; v1 submitted 26 July, 2011; originally announced July 2011.

Comments: Accepted version

Journal ref: The Astrophysical Journal, 754:119, 2012

arXiv:1012.1610 [pdf, other]

doi 10.1088/0004-637X/732/1/28

Cosmological Constraints from a 31 GHz Sky Survey with the Sunyaev-Zel'dovich Array

Authors: Stephen Muchovej, Erik Leitch, John E Carlstrom, Thomas Culverhouse, Chris Greer, David Hawkins, Ryan Hennessy, Marshall Joy, James Lamb, Michael Loh, Daniel P Marrone, Amber Miller, Tony Mroczkowski, Clem Pryke, Matthew Sharp, David Woody

Abstract: We present the results of a 6.1 square degree survey for clusters of galaxies via their Sunyaev-Zel'dovich (SZ) effect at 31 GHz. From late 2005 to mid 2007 the Sunyaev-Zel'dovich Array (SZA) observed four fields of roughly 1.5 square degrees each. One of the fields shows evidence for significant diffuse Galactic emission, and we therefore restrict our analysis to the remaining 4.4 square degrees. We estimate the cluster detectability for the survey using mock observations of simulations of clusters of galaxies; and determine that, at intermediate redshifts (z ~ 0.8), the survey is 50% complete to a limiting mass (M200 rho mean) of ~ 6.0 x 10^14M_{solar}, with the mass limit decreasing at higher redshifts. We detect no clusters at a significance greater than 5 times the RMS noise level in the maps, and place an upper limit on σ_8, the amplitude of mass density fluctuations on a scale of 8h^-1 Mpc, of 0.84 + 0.07 at 95% confidence, where the uncertainty reflects calibration and systematic effects. This result is consistent with estimates from other cluster surveys and CMB anisotropy experiments.

Submitted 7 December, 2010; originally announced December 2010.

Comments: 10 pages, 7 figures, 3 tables

arXiv:1011.6341 [pdf, ps, other]

doi 10.1088/0004-637X/737/2/74

Sunyaev Zel'dovich Effect Observations of Strong Lensing Galaxy Clusters: Probing the Over-Concentration Problem

Authors: Megan B. Gralla, Keren Sharon, Michael D. Gladders, Daniel P. Marrone, L. Felipe Barrientos, Matthew Bayliss, Massimiliano Bonamente, Esra Bulbul, John E. Carlstrom, Thomas Culverhouse, David G. Gilbank, Christopher Greer, Nicole Hasler, David Hawkins, Ryan Hennessy, Marshall Joy, Benjamin Koester, James Lamb, Erik Leitch, Amber Miller, Tony Mroczkowski, Stephen Muchovej, Masamune Oguri, Tom Plagge, Clem Pryke, et al. (1 additional author not shown)

Abstract: We have measured the Sunyaev Zel'dovich (SZ) effect for a sample of ten strong lensing selected galaxy clusters using the Sunyaev Zel'dovich Array (SZA). The SZA is sensitive to structures on spatial scales of a few arcminutes, while the strong lensing mass modeling constrains the mass at small scales (typically < 30"). Combining the two provides information about the projected concentrations of the strong lensing clusters. The Einstein radii we measure are twice as large as expected given the masses inferred from SZ scaling relations. A Monte Carlo simulation indicates that a sample randomly drawn from the expected distribution would have a larger median Einstein radius than the observed clusters about 3% of the time. The implied overconcentration has been noted in previous studies with smaller samples of lensing clusters. It persists for this sample, with the caveat that this could result from a systematic effect such as if the gas fractions of the strong lensing clusters are substantially below what is expected.

Submitted 29 November, 2010; originally announced November 2010.

Comments: submitted

arXiv:1007.2853 [pdf, other]

doi 10.1088/2041-8205/723/1/L78

Galaxy Clusters at z>=1: Gas Constraints from the Sunyaev-Zel'dovich Array

Authors: T. L. Culverhouse, M. Bonamente, E. Bulbul, J. E. Carlstrom, M. B. Gralla, C. Greer, N. Hasler, D. Hawkins, R. Hennessy, N. N. Jetha, M. Joy, J. W. Lamb, E. M. Leitch, D. P. Marrone, A. Miller, T. Mroczkowski, S. Muchovej, C. Pryke, M. Sharp, D. Woody, S. Andreon, B. Maughan, S. A. Stanford

Abstract: We present gas constraints from Sunyaev-Zel'dovich (SZ) effect measurements in a sample of eleven X-ray and infrared (IR) selected galaxy clusters at z >= 1, using data from the Sunyaev-Zel'dovich Array (SZA). The cylindrically integrated Compton-y parameter, Y, is calculated by fitting the data to a two-parameter gas pressure profile. Where possible, we also determine the temperature of the hot intra-cluster plasma from Chandra and XMM-Newton data, and constrain the gas mass within the same aperture (r_2500) as Y. The SZ effect is detected in the clusters for which the X-ray data indicate gas masses above ~ 10^13 Msun, including XMMU J2235-2557 at redshift z = 1.39, which to date is one of the most distant clusters detected using the SZ effect. None of the IR-selected targets are detected by the SZA measurements, indicating low gas masses for these objects. For these and the four other undetected clusters, we quote upper limits on Y and Mgas_SZ, with the latter derived from scaling relations calibrated with lower redshift clusters. We compare the constraints on Y and X-ray derived gas mass Mgas_X-ray to self-similar scaling relations between these observables determined from observations of lower redshift clusters, finding consistency given the measurement error.

Submitted 16 July, 2010; originally announced July 2010.

Comments: 6 pages, 2 figures, submitted on ApJL

arXiv:0912.2335 [pdf, other]

doi 10.1088/0004-637X/716/1/521

Radio Sources from a 31 GHz Sky Survey with the Sunyaev-Zel'dovich Array

Authors: Stephen Muchovej, Erik Leitch, John E. Carlstrom, Thomas Culverhouse, Chris Greer, David Hawkins, Ryan Hennessy, Marshall Joy, James Lamb, Michael Loh, Daniel P. Marrone, Amber Miller, Tony Mroczkowski, Clem Pryke, Matthew Sharp, David Woody

Abstract: We present the first sample of 31-GHz selected sources to flux levels of 1 mJy. From late 2005 to mid 2007, the Sunyaev-Zel'dovich Array (SZA) observed 7.7 square degrees of the sky at 31 GHz to a median rms of 0.18 mJy/beam. We identify 209 sources at greater than 5 sigma significance in the 31 GHz maps, ranging in flux from 0.7 mJy to ~200 mJy. Archival NVSS data at 1.4 GHz and observations at 5 GHz with the Very Large Array are used to characterize the sources. We determine the maximum-likelihood integrated source count to be N(>S) = (27.2 +- 2.5) deg^-2 x (S_mJy)^(-1.18 +- 0.12) over the flux range 0.7 - 15 mJy. This result is significantly higher than predictions based on 1.4-GHz selected samples, a discrepancy which can be explained by a small shift in the spectral index distribution for faint 1.4-GHz sources. From comparison with previous measurements of sources within the central arcminute of massive clusters, we derive an overdensity of 6.8 +- 4.4, relative to field sources.

Submitted 11 December, 2009; originally announced December 2009.

Comments: 13 pages, 5 figures

Journal ref: Astrophys.J.716:521-529,2010

Showing 1–50 of 61 results for author: Hawkins, D