-
Hydrogen is not necessary for superconductivity in topotactically reduced nickelates
Authors:
Purnima P. Balakrishnan,
Dan Ferenc Segedin,
Lin Er Chow,
P. Quarterman,
Shin Muramoto,
Mythili Surendran,
Ranjan K. Patel,
Harrison LaBollita,
Grace A. Pan,
Qi Song,
Yang Zhang,
Ismail El Baggari,
Koushik Jagadish,
Yu-Tsun Shao,
Berit H. Goodge,
Lena F. Kourkoutis,
Srimanta Middey,
Antia S. Botana,
Jayakanth Ravichandran,
A. Ariando,
Julia A. Mundy,
Alexander J. Grutter
Abstract:
A key open question in the study of layered superconducting nickelate films is the role that hydrogen incorporation into the lattice plays in the appearance of the superconducting state. Due to the challenges of stabilizing highly crystalline square planar nickelate films, films are prepared by the deposition of a more stable parent compound which is then transformed into the target phase via a to…
▽ More
A key open question in the study of layered superconducting nickelate films is the role that hydrogen incorporation into the lattice plays in the appearance of the superconducting state. Due to the challenges of stabilizing highly crystalline square planar nickelate films, films are prepared by the deposition of a more stable parent compound which is then transformed into the target phase via a topotactic reaction with a strongly reducing agent such as CaH$_2$. Recent studies, both experimental and theoretical, have introduced the possibility that the incorporation of hydrogen from the reducing agent into the nickelate lattice may be critical for the superconductivity. In this work, we use secondary ion mass spectrometry to examine superconducting La$_{1-x}$X$_x$NiO$_2$ / SrTiO$_3$ (X = Ca and Sr) and Nd$_6$Ni$_5$O$_{12}$ / NdGaO$_3$ films, along with non-superconducting NdNiO$_2$ / SrTiO$_3$ and (Nd,Sr)NiO$_2$ / SrTiO$_3$. We find no evidence for extensive hydrogen incorporation across a broad range of samples, including both superconducting and non-superconducting films. Theoretical calculations indicate that hydrogen incorporation is broadly energetically unfavorable in these systems, supporting our conclusion that hydrogen incorporation is not generally required to achieve a superconducting state in layered square-planar nickelates.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
In-context learning agents are asymmetric belief updaters
Authors:
Johannes A. Schubert,
Akshay K. Jagadish,
Marcel Binz,
Eric Schulz
Abstract:
We study the in-context learning dynamics of large language models (LLMs) using three instrumental learning tasks adapted from cognitive psychology. We find that LLMs update their beliefs in an asymmetric manner and learn more from better-than-expected outcomes than from worse-than-expected ones. Furthermore, we show that this effect reverses when learning about counterfactual feedback and disappe…
▽ More
We study the in-context learning dynamics of large language models (LLMs) using three instrumental learning tasks adapted from cognitive psychology. We find that LLMs update their beliefs in an asymmetric manner and learn more from better-than-expected outcomes than from worse-than-expected ones. Furthermore, we show that this effect reverses when learning about counterfactual feedback and disappears when no agency is implied. We corroborate these findings by investigating idealized in-context learning agents derived through meta-reinforcement learning, where we observe similar patterns. Taken together, our results contribute to our understanding of how in-context learning works by highlighting that the framing of a problem significantly influences how learning occurs, a phenomenon also observed in human cognition.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
Authors:
Akshay K. Jagadish,
Julian Coda-Forno,
Mirko Thalmann,
Eric Schulz,
Marcel Binz
Abstract:
Ecological rationality refers to the notion that humans are rational agents adapted to their environment. However, testing this theory remains challenging due to two reasons: the difficulty in defining what tasks are ecologically valid and building rational models for these tasks. In this work, we demonstrate that large language models can generate cognitive tasks, specifically category learning t…
▽ More
Ecological rationality refers to the notion that humans are rational agents adapted to their environment. However, testing this theory remains challenging due to two reasons: the difficulty in defining what tasks are ecologically valid and building rational models for these tasks. In this work, we demonstrate that large language models can generate cognitive tasks, specifically category learning tasks, that match the statistics of real-world tasks, thereby addressing the first challenge. We tackle the second challenge by deriving rational agents adapted to these tasks using the framework of meta-learning, leading to a class of models called ecologically rational meta-learned inference (ERMI). ERMI quantitatively explains human data better than seven other cognitive models in two different experiments. It additionally matches human behavior on a qualitative level: (1) it finds the same tasks difficult that humans find difficult, (2) it becomes more reliant on an exemplar-based strategy for assigning categories with learning, and (3) it generalizes to unseen stimuli in a human-like way. Furthermore, we show that ERMI's ecologically valid priors allow it to achieve state-of-the-art performance on the OpenML-CC18 classification benchmark.
△ Less
Submitted 28 May, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Inducing anxiety in large language models increases exploration and bias
Authors:
Julian Coda-Forno,
Kristin Witte,
Akshay K. Jagadish,
Marcel Binz,
Zeynep Akata,
Eric Schulz
Abstract:
Large language models are transforming research on machine learning while galvanizing public debates. Understanding not only when these models work well and succeed but also why they fail and misbehave is of great societal relevance. We propose to turn the lens of computational psychiatry, a framework used to computationally describe and modify aberrant behavior, to the outputs produced by these m…
▽ More
Large language models are transforming research on machine learning while galvanizing public debates. Understanding not only when these models work well and succeed but also why they fail and misbehave is of great societal relevance. We propose to turn the lens of computational psychiatry, a framework used to computationally describe and modify aberrant behavior, to the outputs produced by these models. We focus on the Generative Pre-Trained Transformer 3.5 and subject it to tasks commonly studied in psychiatry. Our results show that GPT-3.5 responds robustly to a common anxiety questionnaire, producing higher anxiety scores than human subjects. Moreover, GPT-3.5's responses can be predictably changed by using emotion-inducing prompts. Emotion-induction not only influences GPT-3.5's behavior in a cognitive task measuring exploratory decision-making but also influences its behavior in a previously-established task measuring biases such as racism and ableism. Crucially, GPT-3.5 shows a strong increase in biases when prompted with anxiety-inducing text. Thus, it is likely that how prompts are communicated to large language models has a strong influence on their behavior in applied settings. These results progress our understanding of prompt engineering and demonstrate the usefulness of methods taken from computational psychiatry for studying the capable algorithms to which we increasingly delegate authority and autonomy.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
Selfie Detection by Synergy-Constraint Based Convolutional Neural Network
Authors:
Yashas Annadani,
Vijayakrishna Naganoor,
Akshay Kumar Jagadish,
Krishnan Chemmangat
Abstract:
Categorisation of huge amount of data on the multimedia platform is a crucial task. In this work, we propose a novel approach to address the subtle problem of selfie detection for image database segregation on the web, given rapid rise in number of selfies clicked. A Convolutional Neural Network (CNN) is modeled to learn a synergy feature in the common subspace of head and shoulder orientation, de…
▽ More
Categorisation of huge amount of data on the multimedia platform is a crucial task. In this work, we propose a novel approach to address the subtle problem of selfie detection for image database segregation on the web, given rapid rise in number of selfies clicked. A Convolutional Neural Network (CNN) is modeled to learn a synergy feature in the common subspace of head and shoulder orientation, derived from Local Binary Pattern (LBP) and Histogram of Oriented Gradients (HOG) features respectively. This synergy was captured by projecting the aforementioned features using Canonical Correlation Analysis (CCA). We show that the resulting network's convolutional activations in the neighbourhood of spatial keypoints captured by SIFT are discriminative for selfie-detection. In general, proposed approach aids in capturing intricacies present in the image data and has the potential for usage in other subtle image analysis scenarios apart from just selfie detection. We investigate and analyse the performance of popular CNN architectures (GoogleNet, AlexNet), used for other image classification tasks, when subjected to the task of detecting the selfies on the multimedia platform. The results of the proposed approach are compared with these popular architectures on a dataset of ninety thousand images comprising of roughly equal number of selfies and non-selfies. Experimental results on this dataset shows the effectiveness of the proposed approach.
△ Less
Submitted 14 November, 2016;
originally announced November 2016.