Skip to main content

Showing 1–31 of 31 results for author: Shin, R

.
  1. arXiv:2406.14739  [pdf, other

    cs.CL

    Learning to Retrieve Iteratively for In-Context Learning

    Authors: Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme

    Abstract: We introduce iterative retrieval, a novel framework that empowers retrievers to make iterative decisions through policy optimization. Finding an optimal portfolio of retrieved items is a combinatorial optimization problem, generally considered NP-hard. This approach provides a learned approximation to such a solution, meeting specific task requirements under a given family of large language models… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2311.10775  [pdf, other

    cs.CL cs.AI cs.LG

    ToolTalk: Evaluating Tool-Usage in a Conversational Setting

    Authors: Nicholas Farn, Richard Shin

    Abstract: Large language models (LLMs) have displayed massive improvements in reasoning and decision-making skills and can hold natural conversations with users. Many recent works seek to augment LLM-based assistants with external tools so they can access private or up-to-date information and carry out actions on behalf of users. To better measure the performance of these assistants, this paper introduces T… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 10 pages, 1 figure, ICLR 2024 Submission, https://github.com/microsoft/ToolTalk

  3. arXiv:2309.11765  [pdf, other

    cs.LG cs.CR

    Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation

    Authors: Xinyu Tang, Richard Shin, Huseyin A. Inan, Andre Manoel, Fatemehsadat Mireshghallah, Zinan Lin, Sivakanth Gopi, Janardhan Kulkarni, Robert Sim

    Abstract: We study the problem of in-context learning (ICL) with large language models (LLMs) on private datasets. This scenario poses privacy risks, as LLMs may leak or regurgitate the private examples demonstrated in the prompt. We propose a novel algorithm that generates synthetic few-shot demonstrations from the private dataset with formal differential privacy (DP) guarantees, and show empirically that… ▽ More

    Submitted 27 January, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

  4. arXiv:2212.10520  [pdf, other

    cs.CL

    Privacy-Preserving Domain Adaptation of Semantic Parsers

    Authors: Fatemehsadat Mireshghallah, Yu Su, Tatsunori Hashimoto, Jason Eisner, Richard Shin

    Abstract: Task-oriented dialogue systems often assist users with personal or confidential matters. For this reason, the developers of such a system are generally prohibited from observing actual usage. So how can they know where the system is failing and needs more training data or new functionality? In this work, we study ways in which realistic user utterances can be generated synthetically, to help incre… ▽ More

    Submitted 8 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  5. SoK: Modeling Explainability in Security Analytics for Interpretability, Trustworthiness, and Usability

    Authors: Dipkamal Bhusal, Rosalyn Shin, Ajay Ashok Shewale, Monish Kumar Manikya Veerabhadran, Michael Clifford, Sara Rampazzi, Nidhi Rastogi

    Abstract: Interpretability, trustworthiness, and usability are key considerations in high-stake security applications, especially when utilizing deep learning models. While these models are known for their high accuracy, they behave as black boxes in which identifying important features and factors that led to a classification or a prediction is difficult. This can lead to uncertainty and distrust, especial… ▽ More

    Submitted 12 June, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 12 pages, 4 figures

  6. arXiv:2206.10668  [pdf, ps, other

    cs.CL

    BenchCLAMP: A Benchmark for Evaluating Language Models on Syntactic and Semantic Parsing

    Authors: Subhro Roy, Sam Thomson, Tongfei Chen, Richard Shin, Adam Pauls, Jason Eisner, Benjamin Van Durme

    Abstract: Recent work has shown that generation from a prompted or fine-tuned language model can perform well at semantic parsing when the output is constrained to be a valid semantic representation. We introduce BenchCLAMP, a Benchmark to evaluate Constrained LAnguage Model Parsing, that includes context-free grammars for seven semantic parsing datasets and two syntactic parsing datasets with varied output… ▽ More

    Submitted 10 January, 2024; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks

  7. arXiv:2205.08675  [pdf, other

    cs.CL cs.AI

    Addressing Resource and Privacy Constraints in Semantic Parsing Through Data Augmentation

    Authors: Kevin Yang, Olivia Deng, Charles Chen, Richard Shin, Subhro Roy, Benjamin Van Durme

    Abstract: We introduce a novel setup for low-resource task-oriented semantic parsing which incorporates several constraints that may arise in real-world scenarios: (1) lack of similar datasets/models from a related domain, (2) inability to sample useful logical forms directly from a grammar, and (3) privacy requirements for unlabeled natural utterances. Our goal is to improve a low-resource semantic parser… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: ACL Findings 2022

  8. arXiv:2201.08014  [pdf

    eess.SP

    Application of Particle Swarm Optimization method to On-going Monitoring for estimating vehicle-bridge interaction system

    Authors: Kyosuke Yamamoto, Kakeru Murakami, Ryota Shin, Yukihiko Okada

    Abstract: This study proposes a method for estimating the mechanical parameters of vehicles and bridges and the road unevenness, using only vehicle vibration and position data. In the proposed method, vehicle input and bridge vibration are estimated using randomly assumed vehicle and bridge parameters. Then, the road profiles at the front and rear wheels can be determined from the vehicle input and bridge v… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: 31 pages, 22 figures, 6 tables, submited to Journal of Mechanical Systems and Signal Processing

  9. arXiv:2112.08696  [pdf, other

    cs.CL

    Few-Shot Semantic Parsing with Language Models Trained On Code

    Authors: Richard Shin, Benjamin Van Durme

    Abstract: Large language models can perform semantic parsing with little training data, when prompted with in-context examples. It has been shown that this can be improved by formulating the problem as paraphrasing into canonical utterances, which casts the underlying meaning representation into a controlled natural language-like representation. Intuitively, such models can more easily output canonical utte… ▽ More

    Submitted 29 May, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: NAACL 2022

  10. arXiv:2112.05705  [pdf, other

    cs.CL cs.LG

    Pruning Pretrained Encoders with a Multitask Objective

    Authors: Patrick Xia, Richard Shin

    Abstract: The sizes of pretrained language models make them challenging and expensive to use when there are multiple desired downstream tasks. In this work, we adopt recent strategies for model pruning during finetuning to explore the question of whether it is possible to prune a single encoder so that it can be used for multiple tasks. We allocate a fixed parameter budget and compare pruning a single model… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: ENLSP NeurIPS 2021

  11. arXiv:2105.05601  [pdf, other

    cs.CL cs.LG

    OutFlip: Generating Out-of-Domain Samples for Unknown Intent Detection with Natural Language Attack

    Authors: DongHyun Choi, Myeong Cheol Shin, EungGyun Kim, Dong Ryeol Shin

    Abstract: Out-of-domain (OOD) input detection is vital in a task-oriented dialogue system since the acceptance of unsupported inputs could lead to an incorrect response of the system. This paper proposes OutFlip, a method to generate out-of-domain samples using only in-domain training dataset automatically. A white-box natural language attack method HotFlip is revised to generate out-of-domain samples inste… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 9 pages, 3 figures; to be appear in ACL Findings of ACL-IJCNLP 2021

  12. arXiv:2104.08768  [pdf, other

    cs.CL

    Constrained Language Models Yield Few-Shot Semantic Parsers

    Authors: Richard Shin, Christopher H. Lin, Sam Thomson, Charles Chen, Subhro Roy, Emmanouil Antonios Platanios, Adam Pauls, Dan Klein, Jason Eisner, Benjamin Van Durme

    Abstract: We explore the use of large pretrained language models as few-shot semantic parsers. The goal in semantic parsing is to generate a structured meaning representation given a natural language input. However, language models are trained to generate natural language. To bridge the gap, we use language models to paraphrase inputs into a controlled sublanguage resembling English that can be automaticall… ▽ More

    Submitted 16 November, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021. Code is available at https://github.com/microsoft/semantic_parsing_with_constrained_lm

  13. Integrated Eojeol Embedding for Erroneous Sentence Classification in Korean Chatbots

    Authors: DongHyun Choi, IlNam Park, Myeong Cheol Shin, EungGyun Kim, Dong Ryeol Shin

    Abstract: This paper attempts to analyze the Korean sentence classification system for a chatbot. Sentence classification is the task of classifying an input sentence based on predefined categories. However, spelling or space error contained in the input sentence causes problems in morphological analysis and tokenization. This paper proposes a novel approach of Integrated Eojeol (Korean syntactic word separ… ▽ More

    Submitted 12 April, 2020; originally announced April 2020.

    Comments: 9 pages, 2 figures

    Journal ref: IEEE Access, 2021

  14. arXiv:2004.03125  [pdf, other

    cs.CL

    RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases

    Authors: DongHyun Choi, Myeong Cheol Shin, EungGyun Kim, Dong Ryeol Shin

    Abstract: Text-to-SQL is the problem of converting a user question into an SQL query, when the question and database are given. In this paper, we present a neural network approach called RYANSQL (Recursively Yielding Annotation Network for SQL) to solve complex Text-to-SQL tasks for cross-domain databases. State-ment Position Code (SPC) is defined to trans-form a nested SQL query into a set of non-nested SE… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: 10 pages, 1 figure

  15. arXiv:1912.12612  [pdf, other

    cs.LG stat.ML

    Hierarchical Variational Imitation Learning of Control Programs

    Authors: Roy Fox, Richard Shin, William Paul, Yitian Zou, Dawn Song, Ken Goldberg, Pieter Abbeel, Ion Stoica

    Abstract: Autonomous agents can learn by imitating teacher demonstrations of the intended behavior. Hierarchical control policies are ubiquitously useful for such learning, having the potential to break down structured tasks into simpler sub-tasks, thereby improving data efficiency and generalization. In this paper, we propose a variational inference method for imitation learning of a control policy represe… ▽ More

    Submitted 29 December, 2019; originally announced December 2019.

  16. arXiv:1912.12345  [pdf, other

    cs.LG cs.AI cs.PL stat.ML

    Synthetic Datasets for Neural Program Synthesis

    Authors: Richard Shin, Neel Kant, Kavi Gupta, Christopher Bender, Brandon Trabucco, Rishabh Singh, Dawn Song

    Abstract: The goal of program synthesis is to automatically generate programs in a particular language from corresponding specifications, e.g. input-output behavior. Many current approaches achieve impressive results after training on randomly generated I/O examples in limited domain-specific languages (DSLs), as with string transformations in RobustFill. However, we empirically discover that applying test… ▽ More

    Submitted 27 December, 2019; originally announced December 2019.

    Comments: ICLR 2019

  17. arXiv:1911.04942  [pdf, other

    cs.CL cs.AI

    RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers

    Authors: Bailin Wang, Richard Shin, Xiaodong Liu, Oleksandr Polozov, Matthew Richardson

    Abstract: When translating natural language questions into SQL queries to answer questions from a database, contemporary semantic parsing models struggle to generalize to unseen database schemas. The generalization challenge lies in (a) encoding the database relations in an accessible way for the semantic parser, and (b) modeling alignment between database columns and their mentions in a given query. We pre… ▽ More

    Submitted 24 August, 2021; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Fix some errors of ACL 2020 camera-ready; 12 pages, 5 figures, 7 tables. arXiv admin note: text overlap with arXiv:1906.11790

  18. arXiv:1906.11790  [pdf, other

    cs.LG cs.CL stat.ML

    Encoding Database Schemas with Relation-Aware Self-Attention for Text-to-SQL Parsers

    Authors: Richard Shin

    Abstract: When translating natural language questions into SQL queries to answer questions from a database, we would like our methods to generalize to domains and database schemas outside of the training set. To handle complex questions and database schemas with a neural encoder-decoder paradigm, it is critical to properly encode the schema as part of the input with the question. In this paper, we use relat… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

  19. arXiv:1906.10816  [pdf, other

    cs.LG cs.AI cs.CL cs.PL stat.ML

    Program Synthesis and Semantic Parsing with Learned Code Idioms

    Authors: Richard Shin, Miltiadis Allamanis, Marc Brockschmidt, Oleksandr Polozov

    Abstract: Program synthesis of general-purpose source code from natural language specifications is challenging due to the need to reason about high-level patterns in the target program and low-level implementation details at the same time. In this work, we present PATOIS, a system that allows a neural program synthesizer to explicitly interleave high-level and low-level reasoning at every generation step. I… ▽ More

    Submitted 4 November, 2019; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS) 2019. 13 pages total, 9 pages of main text

  20. arXiv:1704.06611  [pdf, other

    cs.LG cs.NE cs.PL

    Making Neural Programming Architectures Generalize via Recursion

    Authors: Jonathon Cai, Richard Shin, Dawn Song

    Abstract: Empirically, neural networks that attempt to learn programs from data have exhibited poor generalizability. Moreover, it has traditionally been difficult to reason about the behavior of these models beyond a certain level of input complexity. In order to address these issues, we propose augmenting neural architectures with a key abstraction: recursion. As an application, we implement recursion in… ▽ More

    Submitted 21 April, 2017; originally announced April 2017.

    Comments: Published in ICLR 2017

  21. arXiv:1704.03118  [pdf, other

    cs.CR

    PIANO: Proximity-based User Authentication on Voice-Powered Internet-of-Things Devices

    Authors: Neil Zhenqiang Gong, Altay Ozen, Yu Wu, Xiaoyu Cao, Richard Shin, Dawn Song, Hongxia **, Xuan Bao

    Abstract: Voice is envisioned to be a popular way for humans to interact with Internet-of-Things (IoT) devices. We propose a proximity-based user authentication method (called PIANO) for access control on such voice-powered IoT devices. PIANO leverages the built-in speaker, microphone, and Bluetooth that voice-powered IoT devices often already have. Specifically, we assume that a user carries a personal voi… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: To appear in ICDCS'17

  22. arXiv:1611.01867  [pdf, other

    cs.CL

    Latent Attention For If-Then Program Synthesis

    Authors: Xinyun Chen, Chang Liu, Richard Shin, Dawn Song, Mingcheng Chen

    Abstract: Automatic translation from natural language descriptions into programs is a longstanding challenging problem. In this work, we consider a simple yet important sub-problem: translation from textual descriptions to If-Then programs. We devise a novel neural network architecture for this task which we train end-to-end. Specifically, we introduce Latent Attention, which computes multiplicative weights… ▽ More

    Submitted 6 November, 2016; originally announced November 2016.

    Comments: Accepted by NIPS 2016

  23. arXiv:1507.08309  [pdf, other

    cs.CR

    Exploring Privacy Preservation in Outsourced K-Nearest Neighbors with Multiple Data Owners

    Authors: Frank Li, Richard Shin, Vern Paxson

    Abstract: The k-nearest neighbors (k-NN) algorithm is a popular and effective classification algorithm. Due to its large storage and computational requirements, it is suitable for cloud outsourcing. However, k-NN is often run on sensitive data such as medical records, user images, or personal information. It is important to protect the privacy of data in an outsourced k-NN system. Prior works have all ass… ▽ More

    Submitted 29 July, 2015; originally announced July 2015.

  24. The study of Anisotropic Flows at LHC with purturbative simulation

    Authors: Ghi R. Shin

    Abstract: We study the harmonic flows, for example, the directed, elliptic, third and fourth flow of the system of partons formed just after relativistic heavy ion collisions. We calculate the minijets produced during the primary collisions using standard parton distributions for the incomming projectile and target nucleus. We solve the Boltzmann equations of motion for the system of minijets by Monte Carlo… ▽ More

    Submitted 23 June, 2013; originally announced June 2013.

    Comments: 7 figures

  25. arXiv:1112.3265  [pdf, other

    cs.SI physics.soc-ph

    Jointly Predicting Links and Inferring Attributes using a Social-Attribute Network (SAN)

    Authors: Neil Zhenqiang Gong, Ameet Talwalkar, Lester Mackey, Ling Huang, Eui Chul Richard Shin, Emil Stefanov, Elaine, Shi, Dawn Song

    Abstract: The effects of social influence and homophily suggest that both network structure and node attribute information should inform the tasks of link prediction and node attribute inference. Recently, Yin et al. proposed Social-Attribute Network (SAN), an attribute-augmented social network, to integrate network structure and node attributes to perform both link prediction and attribute inference. They… ▽ More

    Submitted 22 June, 2012; v1 submitted 14 December, 2011; originally announced December 2011.

    Comments: 9 pages, 4 figures and 4 tables

  26. arXiv:1010.2473  [pdf, other

    cond-mat.mtrl-sci

    Control of defect-mediated tunneling barrier heights in ultrathin MgO films

    Authors: D. J. Kim, W. S. Choi, F. Schleicher, R. H. Shin, S. Boukari, V. Davesne, C. Kieber, J. Arabski, G. Schmerber, E. Beaurepaire, W. Jo, M. Bowen

    Abstract: The impact of oxygen vacancies on local tunneling properties across rf-sputtered MgO thin films was investigated by optical absorption spectroscopy and conducting atomic force microscopy. Adding O$_2$ to the Ar plasma during MgO growth alters the oxygen defect populations, leading to improved local tunneling characteristics such as a lower density of current hotspots and a lower tunnel current amp… ▽ More

    Submitted 22 November, 2010; v1 submitted 12 October, 2010; originally announced October 2010.

    Comments: 4 pages, 4 figures

  27. Transport Theoretical Description of Collisional Energy Loss in Infinite Quark-Gluon Matter

    Authors: Ghi R. Shin, Steffen A. Bass, Berndt Mueller

    Abstract: We study the time evolution of a high-momentum gluon or quark propagating through an infinite, thermalized, partonic medium utilizing a Boltzmann equation approach. We calculate the collisional energy loss of the parton, study its temperature and flavor dependence as well as the the momentum broadening incurred through multiple interactions. Our transport calculations agree well with analytic calc… ▽ More

    Submitted 11 June, 2010; v1 submitted 8 June, 2010; originally announced June 2010.

    Comments: 12 pages, updated with additional references and typos corrected

    Journal ref: J.Phys.G37:105112,2010

  28. Elliptic Flow and Jet Quenching of a Parton System after Relativistic Heavy Ion Collision

    Authors: Ghi R. Shin

    Abstract: We obtain the initial phase space distribution after relativistic heavy ion collision by the CGC shattering method incorporating the uncertainty principle and solve the semi-classical Boltzmann equation which includes the gluon radiation processes. We present as a function of time the attenuation rate of high $p_T$ partons, which have transverse momenta over 6 $GeV/c$, in the medium which is for… ▽ More

    Submitted 26 April, 2007; originally announced April 2007.

    Comments: 14 pages, applied for J. Korean Physical Society

  29. Initial Parton Distribution just after Heavy Ion Collisions

    Authors: Ghi R. Shin, Kang S. Lee

    Abstract: We study the initial distribution of a parton system which is formed just after relativistic heavy ion collision by the elastic scattering among the constituent partons in details and analyze the baryon and strangeness contents of the primary parton system. We present the rapidity and energy distributions of the system.

    Submitted 7 December, 2006; v1 submitted 1 December, 2006; originally announced December 2006.

    Comments: 17 pages

    Journal ref: J.Korean Phys.Soc.50:426-432,2007

  30. Studies of parton thermalization at RHIC

    Authors: Ghi R. Shin, Berndt Müller

    Abstract: We consider the evolution of a parton system which is formed in the central region just after a relativistic heavy ion collision. The parton consist of mostly gluons, minijets, which are produced by elastic scattering between constituent partons of the colliding nuclei. We assume the system can be described by a semi-classical Boltzmann transport equation, which we solve by means of the test par… ▽ More

    Submitted 22 July, 2003; originally announced July 2003.

    Comments: 14 pages

    Journal ref: J.Phys.G29:2485-2495,2003

  31. A relativistic parton cascade with radiation

    Authors: Ghi R. Shin, Berndt Müller

    Abstract: We consider the evolution of a parton system which is formed at the central rapidity region just after an ultrarelativistic heavy ion collision. The evolution of the system, which is composed of gluons, quarks and antiquarks, is described by a relativistic Boltzmann equations with collision terms including radiation and retardation effects. The equations are solved by the test particle method us… ▽ More

    Submitted 8 August, 2002; v1 submitted 15 July, 2002; originally announced July 2002.

    Comments: 14 pages, 4 figures

    Report number: DUKE-TH-02-223

    Journal ref: J.Phys.G28:2643-2656,2002