Skip to main content

Showing 1–30 of 30 results for author: Nie, A

.
  1. arXiv:2406.16218  [pdf, other

    cs.AI cs.LG

    Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows

    Authors: Ching-An Cheng, Allen Nie, Adith Swaminathan

    Abstract: We study a class of optimization problems motivated by automating the design and update of AI systems like coding assistants, robots, and copilots. We propose an end-to-end optimization framework, Trace, which treats the computational workflow of an AI system as a graph akin to neural networks, based on a generalization of back-propagation. Optimization of computational workflows often involves ri… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  2. arXiv:2405.17708  [pdf, other

    cs.LG cs.AI stat.ML

    OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators

    Authors: Allen Nie, Yash Chandak, Christina J. Yuan, Anirudhan Badrinath, Yannis Flet-Berliac, Emma Brunskil

    Abstract: Offline policy evaluation (OPE) allows us to evaluate and estimate a new sequential decision-making policy's performance by leveraging historical interaction data collected from other policies. Evaluating a new policy online without a confident estimate of its performance can lead to costly, unsafe, or hazardous outcomes, especially in education and healthcare. Several OPE estimators have been pro… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 22 pages

  3. arXiv:2405.16434  [pdf, other

    cs.AI cs.CL cs.NE

    The Importance of Directional Feedback for LLM-based Optimizers

    Authors: Allen Nie, Ching-An Cheng, Andrey Kolobov, Adith Swaminathan

    Abstract: We study the potential of using large language models (LLMs) as an interactive optimizer for solving maximization problems in a text space using natural language and numerical feedback. Inspired by the classical optimization literature, we classify the natural language feedback into directional and non-directional, where the former is a generalization of the first-order feedback to the natural lan… ▽ More

    Submitted 20 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted and Presented at Foundation Models for Decision Making at NeurIPS 2023 (December 15, 2023). Work completed from June 2023 to September 2023

  4. arXiv:2312.06853  [pdf, other

    cs.AI

    LLF-Bench: Benchmark for Interactive Learning from Language Feedback

    Authors: Ching-An Cheng, Andrey Kolobov, Dipendra Misra, Allen Nie, Adith Swaminathan

    Abstract: We introduce a new benchmark, LLF-Bench (Learning from Language Feedback Benchmark; pronounced as "elf-bench"), to evaluate the ability of AI agents to interactively learn from natural language feedback and instructions. Learning from language feedback (LLF) is essential for people, largely because the rich information this feedback provides can help a learner avoid much of trial and error and the… ▽ More

    Submitted 13 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

  5. arXiv:2310.19677  [pdf, other

    cs.CL

    MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

    Authors: Allen Nie, Yuhui Zhang, Atharva Amdekar, Chris Piech, Tatsunori Hashimoto, Tobias Gerstenberg

    Abstract: Human commonsense understanding of the physical and social world is organized around intuitive theories. These theories support making causal and moral judgments. When something bad happens, we naturally ask: who did what, and why? A rich literature in cognitive science has studied people's causal and moral intuitions. This work has revealed a number of factors that systematically influence people… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 34 pages, 7 figures. NeurIPS 2023

  6. arXiv:2306.14069  [pdf, other

    cs.LG

    Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets

    Authors: Anirudhan Badrinath, Yannis Flet-Berliac, Allen Nie, Emma Brunskill

    Abstract: Despite the recent advancements in offline reinforcement learning via supervised learning (RvS) and the success of the decision transformer (DT) architecture in various domains, DTs have fallen short in several challenging benchmarks. The root cause of this underperformance lies in their inability to seamlessly connect segments of suboptimal trajectories. To overcome this limitation, we present a… ▽ More

    Submitted 18 November, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    Comments: Accepted to the Conference on Neural Information Processing Systems 2023 (NeurIPS 2023)

  7. arXiv:2304.04933  [pdf, other

    cs.AI cs.CL

    Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task

    Authors: Sherry Ruan, Allen Nie, William Steenbergen, Jiayu He, JQ Zhang, Meng Guo, Yao Liu, Kyle Dang Nguyen, Catherine Y Wang, Rui Ying, James A Landay, Emma Brunskill

    Abstract: Resource limitations make it hard to provide all students with one of the most effective educational interventions: personalized instruction. Reinforcement learning could be a key tool to reduce the development cost and improve the effectiveness of intelligent tutoring software that aims to provide the right support, at the right time, to a student. Here we illustrate that deep reinforcement learn… ▽ More

    Submitted 13 April, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: 23 pages. Under review

  8. arXiv:2301.11426  [pdf, other

    cs.LG

    Model-based Offline Reinforcement Learning with Local Misspecification

    Authors: Kefan Dong, Yannis Flet-Berliac, Allen Nie, Emma Brunskill

    Abstract: We present a model-based offline reinforcement learning policy performance lower bound that explicitly captures dynamics model misspecification and distribution mismatch and we propose an empirical algorithm for optimal offline policy selection. Theoretically, we prove a novel safe policy improvement theorem by establishing pessimism approximations to the value function. Our key insight is to join… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: Accepted by AAAI-23

  9. arXiv:2211.10829  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.app-ph

    Depositing boron on Cu(111): Borophene or boride?

    Authors: Xiao-Ji Weng, Jie Bai, **gyu Hou, Yi Zhu, Li Wang, Penghui Li, Anmin Nie, Bo Xu, Xiang-Feng Zhou, Yongjun Tian

    Abstract: Large-area single-crystal surface structures were successfully prepared on Cu(111) substrate with boron deposition, which is critical for prospective applications. However, the proposed borophene structures do not match the scanning tunneling microscopy (STM) results very well, while the proposed copper boride is at odds with the traditional knowledge that ordered copper-rich borides normally do n… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 15 pages, 4 figures

  10. arXiv:2211.08909  [pdf

    cond-mat.mtrl-sci

    Continuous Electrical Manipulation of Magnetic Anisotropy and Spin Flop** in van der Waals Ferromagnetic Devices

    Authors: Ming Tang, Junwei Huang, Feng Qin, Kun Zhai, Toshiya Ideue, Zeya Li, Fanhao Meng, Anmin Nie, Linglu Wu, Xiangyu Bi, Caorong Zhang, Ling Zhou, Peng Chen, Caiyu Qiu, Peizhe Tang, Haijun Zhang, Xiangang Wan, Lin Wang, Zhongyuan Liu, Yongjun Tian, Yoshihiro Iwasa, Hongtao Yuan

    Abstract: Controlling the magnetic anisotropy of ferromagnetic materials plays a key role in magnetic switching devices and spintronic applications. Examples of spin-orbit torque devices with different magnetic anisotropy geometries (in-plane or out-of-plane directions) have been demonstrated with novel magnetization switching mechanisms for extended device functionalities. Normally, the intrinsic magnetic… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 4 figures

  11. arXiv:2211.08802  [pdf, other

    cs.LG cs.AI stat.ML

    Giving Feedback on Interactive Student Programs with Meta-Exploration

    Authors: Evan Zheran Liu, Moritz Stephan, Allen Nie, Chris Piech, Emma Brunskill, Chelsea Finn

    Abstract: Develo** interactive software, such as websites or games, is a particularly engaging way to learn computer science. However, teaching and giving feedback on such software is time-consuming -- standard approaches require instructors to manually grade student-implemented interactive programs. As a result, online platforms that serve millions, like Code.org, are unable to provide any feedback on as… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2022). Selected as Oral

  12. arXiv:2210.08642  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data

    Authors: Allen Nie, Yannis Flet-Berliac, Deon R. Jordan, William Steenbergen, Emma Brunskill

    Abstract: Offline reinforcement learning (RL) can be used to improve future performance by leveraging historical data. There exist many different algorithms for offline RL, and it is well recognized that these algorithms, and their hyperparameter settings, can lead to decision policies with substantially differing performance. This prompts the need for pipelines that allow practitioners to systematically pe… ▽ More

    Submitted 12 January, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: 32 pages. Published at NeurIPS 2022. Presented at RLDM 2022

  13. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  14. arXiv:2201.03181  [pdf, other

    math.ST math.PR

    Spiked eigenvalues of high-dimensional sample autocovariance matrices: CLT and applications

    Authors: Daning Bi, Xiao Han, Adam Nie, Yanrong Yang

    Abstract: High-dimensional autocovariance matrices play an important role in dimension reduction for high-dimensional time series. In this article, we establish the central limit theorem (CLT) for spiked eigenvalues of high-dimensional sample autocovariance matrices, which are developed under general conditions. The spiked eigenvalues are allowed to go to infinity in a flexible way without restrictions in d… ▽ More

    Submitted 13 May, 2024; v1 submitted 10 January, 2022; originally announced January 2022.

  15. arXiv:2110.14615  [pdf, other

    cs.AI cs.CY cs.LG

    Play to Grade: Testing Coding Games as Classifying Markov Decision Process

    Authors: Allen Nie, Emma Brunskill, Chris Piech

    Abstract: Contemporary coding education often presents students with the task of develo** programs that have user interaction and complex dynamic systems, such as mouse based games. While pedagogically compelling, there are no contemporary autonomous methods for providing feedback. Notably, interactive programs are impossible to grade by traditional unit tests. In this paper we formalize the challenge of… ▽ More

    Submitted 14 December, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021, 16 pages, 7 figures

  16. arXiv:2109.06100  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Atomic-Scale Visualization and Manipulation of Domain boundaries in 2D Ferroelectric In2Se3

    Authors: Fan Zhang, Zhe Wang, Lixuan Liu, Anmin Nie, Yongji Gong, Wenguang Zhu, Chenggang Tao

    Abstract: Domain boundaries in ferroelectric materials exhibit rich and diverse physical properties distinct from their parent materials and have been proposed for novel applications in nanoelectronics and quantum information technology. Due to their complexity and diversity, the internal atomic and electronic structure of domain boundaries that governs the electronic properties as well as the kinetics of d… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: 26 pages (not including SI), 4 figures

  17. arXiv:2108.07258  [pdf, other

    cs.LG cs.AI cs.CY

    On the Opportunities and Risks of Foundation Models

    Authors: Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh , et al. (89 additional authors not shown)

    Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their cap… ▽ More

    Submitted 12 July, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html

  18. arXiv:2011.14819  [pdf

    cond-mat.mtrl-sci

    Discovery of carbon-based strongest and hardest amorphous material

    Authors: Shuangshuang Zhang, Zihe Li, Kun Luo, Julong He, Yufei Gao, Alexander V. Soldatov, Vicente Benavides, Kaiyuan Shi, Anmin Nie, Bin Zhang, Wentao Hu, Mengdong Ma, Yong Liu, Bin Wen, Guoying Gao, Bing Liu, Yang Zhang, Dongli Yu, Xiang-Feng Zhou, Zhisheng Zhao, Bo Xu, Lei Su, Guoqiang Yang, Olga P. Chernogorova, Yongjun Tian

    Abstract: Carbon is likely the most fascinating element of the periodic table because of the diversity of its allotropes stemming from its variable (sp, sp2, and sp3) bonding motifs. Exploration of new forms of carbon has been an eternal theme of contemporary scientific research. Here we report on novel amorphous carbon phases containing high fraction of sp3 bonded atoms recovered after compressing fulleren… ▽ More

    Submitted 25 June, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: 40 pages, 17 figures

    Report number: nwab140

    Journal ref: National Science Review, 2021

  19. arXiv:2005.04926  [pdf

    cond-mat.mes-hall

    Orthogonal electric control of the out-of-plane field-effect in two-dimensional ferroelectric alpha-In2Se3

    Authors: Yue Li, Chen Chen, Wei Li, Xiaoyu Mao, Heng Liu, Jianyong Xiang, Anmin Nie, Zhongyuan Liu, Wenguang Zhu, Hualing Zeng

    Abstract: Tuning the electric properties of crystalline solids is at the heart of material science and electronics. Generating the electric field-effect via an external voltage is a clean, continuous and systematic method. Here, utilizing the unique electric dipole locking in van der Waals (vdW) ferroelectric alpha-In2Se3, we report a new approach to establish the electric gating effect, where the electrost… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

  20. arXiv:2004.14451  [pdf, other

    cs.CL cs.CV

    Pragmatic Issue-Sensitive Image Captioning

    Authors: Allen Nie, Reuben Cohn-Gordon, Christopher Potts

    Abstract: Image captioning systems have recently improved dramatically, but they still tend to produce captions that are insensitive to the communicative goals that captions should meet. To address this, we propose Issue-Sensitive Image Captioning (ISIC). In ISIC, a captioning system is given a target image and an issue, which is a set of images partitioned in a way that specifies what information is releva… ▽ More

    Submitted 5 October, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: 15 pages, 7 figures. EMNLP 2020 Findings Accepted

  21. arXiv:2002.05913  [pdf

    cond-mat.mtrl-sci

    Direct Observation of Room-Temperature Dislocation Plasticity in Diamond

    Authors: Anmin Nie, Yeqiang Bu, Junquan Huang, Yecheng Shao, Yizhi Zhang, Wentao Hu, Jiabin Liu, Yanbin Wang, Bo Xu, Zhongyuan Liu, Hongtao Wang, Wei Yang, Yongjun Tian

    Abstract: It is well known that diamond does not deform plastically at room temperature and usually fails in catastrophic brittle fracture. Here we demonstrate room-temperature dislocation plasticity in sub-micrometer sized diamond pillars by in-situ mechanical testing in the transmission electron microscope. We document in unprecedented details of spatio-temporal features of the dislocations introduced by… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

  22. arXiv:2002.01104  [pdf

    cond-mat.mtrl-sci

    Dislocation Slip or Phase Transformation Lead to Room-Temperature Plasticity in Diamond: Comment on Plastic Deformation of Single-Crystal Diamond Nanopillars

    Authors: Yeqiang Bu, Peng Wang, Anmin Nie, Hongtao Wang

    Abstract: Despite decades of extensive research on mechanical properties of diamond, much remains to be understood in term of plastic deformation mechanisms due to the poor deformability at room temperature. In a recent work in Advanced Materials, it was claimed that room-temperature plasticity occurred in <001>-oriented single-crystal diamond nanopillars based on observation of unrecovered deformation insi… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.

  23. arXiv:1909.10699  [pdf, other

    cs.CL cs.IR cs.LG

    LitGen: Genetic Literature Recommendation Guided by Human Explanations

    Authors: Allen Nie, Arturo L. Pineda, Matt W. Wright Hannah Wand, Bryan Wulf, Helio A. Costa, Ronak Y. Patel, Carlos D. Bustamante, James Zou

    Abstract: As genetic sequencing costs decrease, the lack of clinical interpretation of variants has become the bottleneck in using genetics data. A major rate limiting step in clinical interpretation is the manual curation of evidence in the genetic literature by highly trained biocurators. What makes curation particularly time-consuming is that the curator needs to identify papers that study variant pathog… ▽ More

    Submitted 23 September, 2019; originally announced September 2019.

    Comments: 12 pages; 5 figures. Accepted by PSB 2020 (Pacific Symposium on Biocomputing) track: Artificial Intelligence for Enhancing Clinical Medicine

  24. arXiv:1906.01243  [pdf, other

    cs.CL

    Learning to Explain: Answering Why-Questions via Rephrasing

    Authors: Allen Nie, Erin D. Bennett, Noah D. Goodman

    Abstract: Providing plausible responses to why questions is a challenging but critical goal for language based human-machine interaction. Explanations are challenging in that they require many different forms of abstract knowledge and reasoning. Previous work has either relied on human-curated structured knowledge bases or detailed domain representation to generate satisfactory explanations. They are also o… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: 8 pages, 5 figures. 1st ConvAI Workshop at ACL 2019

  25. arXiv:1811.11958  [pdf, other

    cs.CL

    Large-scale Generative Modeling to Improve Automated Veterinary Disease Coding

    Authors: Yuhui Zhang, Allen Nie, James Zou

    Abstract: Supervised learning is limited both by the quantity and quality of the labeled data. In the field of medical record tagging, writing styles between hospitals vary drastically. The knowledge learned from one hospital might not transfer well to another. This problem is amplified in veterinary medicine domain because veterinary clinics rarely apply medical codes to their records. We proposed and trai… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/83

  26. arXiv:1810.05328  [pdf

    cond-mat.mtrl-sci

    Non-volatile ferroelectric memory effect in ultrathin α-In2Se3

    Authors: Siyuan Wan, Yue Li, Wei Li, Xiaoyu Mao, Chen Wang, Jiyu Dong, Anmin Nie, Jianyong Xiang, Zhongyuan Liu, Wenguang Zhu, Hualing Zeng

    Abstract: Recent experiments on layered α-In2Se3 have confirmed its room-temperature ferroelectricity under ambient condition. This observation renders α-In2Se3 an excellent platform for develo** two-dimensional (2D) layered-material based electronics with nonvolatile functionality. In this letter, we demonstrate non-volatile memory effect in a hybrid 2D ferroelectric field effect transistor (FeFET) made… ▽ More

    Submitted 11 October, 2018; originally announced October 2018.

    Comments: 19 pages, 4 figures

  27. arXiv:1806.10722  [pdf, other

    cs.CL

    DeepTag: inferring all-cause diagnoses from clinical notes in under-resourced medical domain

    Authors: Allen Nie, Ashley Zehnder, Rodney L. Page, Arturo L. Pineda, Manuel A. Rivas, Carlos D. Bustamante, James Zou

    Abstract: Large scale veterinary clinical records can become a powerful resource for patient care and research. However, clinicians lack the time and resource to annotate patient records with standard medical diagnostic codes and most veterinary visits are captured in free text notes. The lack of standard coding makes it challenging to use the clinical data to improve patient care. It is also a major impedi… ▽ More

    Submitted 3 September, 2018; v1 submitted 27 June, 2018; originally announced June 2018.

    Comments: 17 pages, 6 figures. Updated the text for clarity

  28. arXiv:1804.08824  [pdf, other

    math.PR

    A Continuous Time GARCH(p,q) Process with Delay

    Authors: Adam Nie

    Abstract: We investigate the properties of a continuous time GARCH process as the solution to a Lévy driven stochastic functional integral equation. This process occurs as a weak limit of a sequence of discrete time GARCH processes as the time between observations converges to zero and the number of lags grows to infinity. The resulting limit generalizes the COGARCH process and can be interpreted as a COGAR… ▽ More

    Submitted 23 April, 2018; originally announced April 2018.

    Comments: 24 pages, 2 figures

  29. arXiv:1710.04334  [pdf, other

    cs.CL cs.AI

    DisSent: Sentence Representation Learning from Explicit Discourse Relations

    Authors: Allen Nie, Erin D. Bennett, Noah D. Goodman

    Abstract: Learning effective representations of sentences is one of the core missions of natural language understanding. Existing models either train on a vast amount of text, or require costly, manually curated sentence relation datasets. We show that with dependency parsing and rule-based rubrics, we can curate a high quality sentence relation task by leveraging explicit discourse relations. We show that… ▽ More

    Submitted 4 June, 2019; v1 submitted 11 October, 2017; originally announced October 2017.

    Comments: 13 pages, 4 figures. ACL 2019

  30. arXiv:1703.02573  [pdf, other

    cs.LG cs.CL

    Data Noising as Smoothing in Neural Network Language Models

    Authors: Ziang Xie, Sida I. Wang, Jiwei Li, Daniel Lévy, Aiming Nie, Dan Jurafsky, Andrew Y. Ng

    Abstract: Data noising is an effective technique for regularizing neural network models. While noising is widely adopted in application domains such as vision and speech, commonly used noising primitives have not been developed for discrete sequence-level settings such as language modeling. In this paper, we derive a connection between input noising in neural network language models and smoothing in $n$-gra… ▽ More

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: ICLR 2017