Skip to main content

Showing 1–20 of 20 results for author: Wang, S I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.03065  [pdf, other

    cs.SE cs.AI cs.LG

    CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

    Authors: Alex Gu, Baptiste Rozière, Hugh Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida I. Wang

    Abstract: We present CRUXEval (Code Reasoning, Understanding, and eXecution Evaluation), a benchmark consisting of 800 Python functions (3-13 lines). Each function comes with an input-output pair, leading to two natural tasks: input prediction and output prediction. First, we propose a generic recipe for generating our execution benchmark which can be used to create future variation of the benchmark. Second… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 71 pages, 29 figures

  2. arXiv:2305.14200  [pdf, other

    cs.CL cs.LG

    Accessing Higher Dimensions for Unsupervised Word Translation

    Authors: Sida I. Wang

    Abstract: The striking ability of unsupervised word translation has been demonstrated with the help of word vectors / pretraining; however, they require large amounts of data and usually fails if the data come from different domains. We propose coocmap, a method that can use either high-dimensional co-occurrence counts or their lower-dimensional approximations. Freed from the limits of low dimensions, we sh… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    ACM Class: I.2.7

  3. arXiv:2305.08195  [pdf, other

    cs.CL

    Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing

    Authors: Hao Yan, Saurabh Srivastava, Yintao Tai, Sida I. Wang, Wen-tau Yih, Ziyu Yao

    Abstract: Interactive semantic parsing based on natural language (NL) feedback, where users provide feedback to correct the parser mistakes, has emerged as a more practical scenario than the traditional one-shot semantic parsing. However, prior work has heavily relied on human-annotated feedback data to train the interactive semantic parser, which is prohibitively expensive and not scalable. In this work, w… ▽ More

    Submitted 4 June, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023. 18 pages, 6 figures

  4. arXiv:2302.08468  [pdf, other

    cs.LG cs.CL cs.PL cs.SE

    LEVER: Learning to Verify Language-to-Code Generation with Execution

    Authors: Ansong Ni, Srini Iyer, Dragomir Radev, Ves Stoyanov, Wen-tau Yih, Sida I. Wang, Xi Victoria Lin

    Abstract: The advent of large language models trained on code (code LLMs) has led to significant progress in language-to-code generation. State-of-the-art approaches in this area combine LLM decoding with sample pruning and reranking using test cases or heuristics based on the execution results. However, it is challenging to obtain test cases for many real-world language-to-code applications, and heuristics… ▽ More

    Submitted 1 September, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: ICML'23; code available at https://github.com/niansong1996/lever

  5. arXiv:2211.16490  [pdf, other

    cs.LG cs.CL cs.PL cs.SE

    Coder Reviewer Reranking for Code Generation

    Authors: Tianyi Zhang, Tao Yu, Tatsunori B. Hashimoto, Mike Lewis, Wen-tau Yih, Daniel Fried, Sida I. Wang

    Abstract: Sampling diverse programs from a code language model and reranking with model likelihood is a popular method for code generation but it is prone to preferring degenerate solutions. Inspired by collaborative programming, we propose Coder-Reviewer reranking. We augment Coder language models from past work, which generate programs given language instructions, with Reviewer models, which evaluate the… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  6. arXiv:2204.11454  [pdf, other

    cs.CL cs.SE

    Natural Language to Code Translation with Execution

    Authors: Freda Shi, Daniel Fried, Marjan Ghazvininejad, Luke Zettlemoyer, Sida I. Wang

    Abstract: Generative models of code, pretrained on large corpora of programs, have shown great success in translating natural language to code (Chen et al., 2021; Austin et al., 2021; Li et al., 2022, inter alia). While these models do not explicitly incorporate program semantics (i.e., execution results) during training, they are able to generate correct solutions for many problems. However, choosing a sin… ▽ More

    Submitted 1 November, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: EMNLP 2022

  7. arXiv:2201.05966  [pdf, other

    cs.CL

    UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

    Authors: Tianbao Xie, Chen Henry Wu, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida I. Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu

    Abstract: Structured knowledge grounding (SKG) leverages structured knowledge to complete user requests, such as semantic parsing over databases and question answering over knowledge bases. Since the inputs and outputs of SKG tasks are heterogeneous, they have been studied separately by different communities, which limits systematic and compatible research on SKG. In this paper, we overcome this limitation… ▽ More

    Submitted 18 October, 2022; v1 submitted 15 January, 2022; originally announced January 2022.

    Comments: EMNLP 2022

  8. arXiv:2111.06787  [pdf, other

    cs.CL

    BitextEdit: Automatic Bitext Editing for Improved Low-Resource Machine Translation

    Authors: Eleftheria Briakou, Sida I. Wang, Luke Zettlemoyer, Marjan Ghazvininejad

    Abstract: Mined bitexts can contain imperfect translations that yield unreliable training signals for Neural Machine Translation (NMT). While filtering such pairs out is known to improve final model quality, we argue that it is suboptimal in low-resource conditions where even mined data can be limited. In our work, we propose instead, to refine the mined bitexts via automatic editing: given a sentence in a… ▽ More

    Submitted 30 May, 2022; v1 submitted 12 November, 2021; originally announced November 2021.

  9. arXiv:2110.10661  [pdf, other

    cs.CL cs.AI cs.LG

    SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark

    Authors: Victor Zhong, Austin W. Hanjie, Sida I. Wang, Karthik Narasimhan, Luke Zettlemoyer

    Abstract: Existing work in language grounding typically study single environments. How do we build unified models that apply across multiple environments? We propose the multi-environment Symbolic Interactive Language Grounding benchmark (SILG), which unifies a collection of diverse grounded language learning environments under a common interface. SILG consists of grid-world environments that require genera… ▽ More

    Submitted 24 January, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021. 14 pages, 8 figures

  10. Using Machine Learning to Predict Game Outcomes Based on Player-Champion Experience in League of Legends

    Authors: Tiffany D. Do, Seong Ioi Wang, Dylan S. Yu, Matthew G. McMillian, Ryan P. McMahan

    Abstract: League of Legends (LoL) is the most widely played multiplayer online battle arena (MOBA) game in the world. An important aspect of LoL is competitive ranked play, which utilizes a skill-based matchmaking system to form fair teams. However, players' skill levels vary widely depending on which champion, or hero, that they choose to play as. In this paper, we propose a method for predicting game outc… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: 5 pages, In The 16th International Conference on the Foundations of Digital Games (FDG) 2021

  11. arXiv:2101.00148  [pdf, other

    cs.CL

    Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment

    Authors: Haoyue Shi, Luke Zettlemoyer, Sida I. Wang

    Abstract: Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear projections to align monolingual word embedding spaces. In this paper, we show it is possible to produce much higher quality lexicons with methods that combine (1) unsupervised bitext mining and (2) unsupervised word alignment. Directly applying a pipeline that uses recent al… ▽ More

    Submitted 12 June, 2021; v1 submitted 31 December, 2020; originally announced January 2021.

    Comments: ACL-IJCNLP 2021 camera-ready version, with full supplementary material

  12. arXiv:2009.07396  [pdf, other

    cs.CL cs.AI cs.DB cs.LG

    Grounded Adaptation for Zero-shot Executable Semantic Parsing

    Authors: Victor Zhong, Mike Lewis, Sida I. Wang, Luke Zettlemoyer

    Abstract: We propose Grounded Adaptation for Zero-shot Executable Semantic Parsing (GAZP) to adapt an existing semantic parser to new environments (e.g. new database schemas). GAZP combines a forward semantic parser with a backward utterance generator to synthesize data (e.g. utterances and SQL queries) in the new environment, then selects cycle-consistent examples to adapt the parser. Unlike data-augmentat… ▽ More

    Submitted 1 February, 2021; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: EMNLP 2020 long paper. 14 pages, 5 figures

  13. arXiv:2006.10191  [pdf, other

    cs.HC

    Using Collaborative Filtering to Recommend Champions in League of Legends

    Authors: Tiffany D. Do, Dylan S. Yu, Salman Anwer, Seong Ioi Wang

    Abstract: League of Legends (LoL), one of the most widely played computer games in the world, has over 140 playable characters known as champions that have highly varying play styles. However, there is not much work on providing champion recommendations to a player in LoL. In this paper, we propose that a recommendation system based on a collaborative filtering approach using singular value decomposition pr… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 4 pages, in proceedings of 2020 IEEE Conference on Games, COG 2020

  14. arXiv:1709.02755  [pdf, other

    cs.CL cs.NE

    Simple Recurrent Units for Highly Parallelizable Recurrence

    Authors: Tao Lei, Yu Zhang, Sida I. Wang, Hui Dai, Yoav Artzi

    Abstract: Common recurrent neural architectures scale poorly due to the intrinsic difficulty in parallelizing their state computations. In this work, we propose the Simple Recurrent Unit (SRU), a light recurrent unit that balances model capacity and scalability. SRU is designed to provide expressive recurrence, enable highly parallelized implementation, and comes with careful initialization to facilitate tr… ▽ More

    Submitted 7 September, 2018; v1 submitted 8 September, 2017; originally announced September 2017.

    Comments: EMNLP

  15. arXiv:1704.06956  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Naturalizing a Programming Language via Interactive Learning

    Authors: Sida I. Wang, Samuel Ginn, Percy Liang, Christoper D. Manning

    Abstract: Our goal is to create a convenient natural language interface for performing well-specified but complex actions such as analyzing data, manipulating text, and querying databases. However, existing natural language interfaces for such tasks are quite primitive compared to the power one wields with a programming language. To bridge this gap, we start with a core programming language and allow users… ▽ More

    Submitted 23 April, 2017; originally announced April 2017.

    Comments: 10 pages, ACL2017

    ACM Class: I.2.7; I.2.6; I.2.1

  16. arXiv:1703.02573  [pdf, other

    cs.LG cs.CL

    Data Noising as Smoothing in Neural Network Language Models

    Authors: Ziang Xie, Sida I. Wang, Jiwei Li, Daniel Lévy, Aiming Nie, Dan Jurafsky, Andrew Y. Ng

    Abstract: Data noising is an effective technique for regularizing neural network models. While noising is widely adopted in application domains such as vision and speech, commonly used noising primitives have not been developed for discrete sequence-level settings such as language modeling. In this paper, we derive a connection between input noising in neural network language models and smoothing in $n$-gra… ▽ More

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: ICLR 2017

  17. arXiv:1606.02447  [pdf, other

    cs.CL cs.AI

    Learning Language Games through Interaction

    Authors: Sida I. Wang, Percy Liang, Christopher D. Manning

    Abstract: We introduce a new language learning setting relevant to building adaptive natural language interfaces. It is inspired by Wittgenstein's language games: a human wishes to accomplish some task (e.g., achieving a certain configuration of blocks), but can only communicate with a computer, who performs the actual actions (e.g., removing all red blocks). The computer initially knows nothing about langu… ▽ More

    Submitted 8 June, 2016; originally announced June 2016.

    Comments: 11 pages, ACL 2016

    ACM Class: I.2.6; I.2.7

  18. arXiv:1603.08482  [pdf, other

    stat.ML cs.LG

    Estimating Mixture Models via Mixtures of Polynomials

    Authors: Sida I. Wang, Arun Tejasvi Chaganty, Percy Liang

    Abstract: Mixture modeling is a general technique for making any simple model more expressive through weighted combination. This generality and simplicity in part explains the success of the Expectation Maximization (EM) algorithm, in which updates are easy to derive for a wide class of mixture models. However, the likelihood of a mixture model is non-convex, so EM has no known global convergence guarantees… ▽ More

    Submitted 28 March, 2016; originally announced March 2016.

    Comments: NIPS 2015

  19. arXiv:1408.2270  [pdf, ps, other

    cs.DS cs.CC math.OC

    A sub-constant improvement in approximating the positive semidefinite Grothendieck problem

    Authors: Roy Frostig, Sida I. Wang

    Abstract: Semidefinite relaxations are a powerful tool for approximately solving combinatorial optimization problems such as MAX-CUT and the Grothendieck problem. By exploiting a bounded rank property of extreme points in the semidefinite cone, we make a sub-constant improvement in the approximation ratio of one such problem. Precisely, we describe a polynomial-time algorithm for the positive semidefinite G… ▽ More

    Submitted 10 August, 2014; originally announced August 2014.

  20. arXiv:1312.6205  [pdf, other

    stat.ML cs.LG

    Relaxations for inference in restricted Boltzmann machines

    Authors: Sida I. Wang, Roy Frostig, Percy Liang, Christopher D. Manning

    Abstract: We propose a relaxation-based approximate inference algorithm that samples near-MAP configurations of a binary pairwise Markov random field. We experiment on MAP inference tasks in several restricted Boltzmann machines. We also use our underlying sampler to estimate the log-partition function of restricted Boltzmann machines and compare against other sampling-based methods.

    Submitted 2 January, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: ICLR 2014 workshop track submission