Skip to main content

Showing 1–27 of 27 results for author: de Vries, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15877  [pdf, other

    cs.SE cs.AI cs.CL

    BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

    Authors: Terry Yue Zhuo, Minh Chien Vu, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, Simon Brunner, Chen Gong, Thong Hoang, Armel Randy Zebaze, Xiaoheng Hong, Wen-Ding Li, Jean Kaddour, Ming Xu, Zhihan Zhang, Prateek Yadav, Naman Jain, Alex Gu, Zhoujun Cheng, Jiawei Liu, Qian Liu , et al. (8 additional authors not shown)

    Abstract: Automated software engineering has been greatly empowered by the recent advances in Large Language Models (LLMs) for programming. While current benchmarks have shown that LLMs can perform various software engineering tasks like human developers, the majority of their evaluations are limited to short and self-contained algorithmic tasks. Solving challenging and practical programming tasks requires… ▽ More

    Submitted 26 June, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: 44 pages, 14 figures, 7 tables, built with love by the BigCode community :)

  2. arXiv:2402.19173  [pdf, other

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  3. arXiv:2401.00788  [pdf, other

    cs.CL cs.AI cs.SE

    Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

    Authors: Terry Yue Zhuo, Armel Zebaze, Nitchakarn Suppattarachai, Leandro von Werra, Harm de Vries, Qian Liu, Niklas Muennighoff

    Abstract: The high cost of full-parameter fine-tuning (FFT) of Large Language Models (LLMs) has led to a series of parameter-efficient fine-tuning (PEFT) methods. However, it remains unclear which methods provide the best cost-performance trade-off at different model scales. We introduce Astraios, a suite of 28 instruction-tuned OctoCoder models using 7 tuning methods and 4 model sizes up to 16 billion para… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 25 pages (12 main), 19 figures, 8 tables

  4. arXiv:2312.03872  [pdf, other

    cs.CY cs.AI cs.CL cs.LG cs.PL

    The BigCode Project Governance Card

    Authors: BigCode collaboration, Sean Hughes, Harm de Vries, Jennifer Robinson, Carlos Muñoz Ferrandis, Loubna Ben Allal, Leandro von Werra, Jennifer Ding, Sebastien Paquet, Yacine Jernite

    Abstract: This document serves as an overview of the different mechanisms and areas of governance in the BigCode project. It aims to support transparency by providing relevant information about choices that were made during the project to the broader public, and to serve as an example of intentional governance of an open research project that future endeavors can leverage to shape their own approach. The fi… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 12 pages, related papers arXiv:2305.06161 and arXiv:2301.03988 and arXiv:2211.15533v1, learn more at https://www.bigcode-project.org/

  5. arXiv:2306.10998  [pdf, other

    cs.LG cs.AI cs.PL cs.SE

    RepoFusion: Training Code Models to Understand Your Repository

    Authors: Disha Shrivastava, Denis Kocetkov, Harm de Vries, Dzmitry Bahdanau, Torsten Scholak

    Abstract: Despite the huge success of Large Language Models (LLMs) in coding assistants like GitHub Copilot, these models struggle to understand the context present in the repository (e.g., imports, parent classes, files with similar names, etc.), thereby producing inaccurate code completions. This effect is more pronounced when using these assistants for repositories that the model has not seen during trai… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  6. arXiv:2305.06161  [pdf, other

    cs.CL cs.AI cs.PL cs.SE

    StarCoder: may the source be with you!

    Authors: Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu , et al. (42 additional authors not shown)

    Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle… ▽ More

    Submitted 13 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  7. arXiv:2304.01412  [pdf, other

    cs.CL

    The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents

    Authors: Xing Han Lu, Siva Reddy, Harm de Vries

    Abstract: We introduce the StatCan Dialogue Dataset consisting of 19,379 conversation turns between agents working at Statistics Canada and online users looking for published data tables. The conversations stem from genuine intents, are held in English or French, and lead to agents retrieving one of over 5000 complex data tables. Based on this dataset, we propose two tasks: (1) automatic retrieval of releva… ▽ More

    Submitted 4 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted at EACL 2023

    Journal ref: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. (2023) 2799-2829

  8. arXiv:2301.03988  [pdf, other

    cs.SE cs.AI cs.LG

    SantaCoder: don't reach for the stars!

    Authors: Loubna Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, Logesh Kumar Umapathi, Carolyn Jane Anderson, Yangtian Zi, Joel Lamy Poirier, Hailey Schoelkopf, Sergey Troshin, Dmitry Abulkhanov, Manuel Romero, Michael Lappert, Francesco De Toni, Bernardo García del Río, Qian Liu, Shamik Bose, Urvashi Bhattacharyya, Terry Yue Zhuo , et al. (16 additional authors not shown)

    Abstract: The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes the progress of the collaboration until December 2022, outlining the current state of the Personally Identifiable Information (PII) redaction pipeline, the experiments conducted to de-risk the model architecture, and the experiments investigat… ▽ More

    Submitted 24 February, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

  9. arXiv:2211.15533  [pdf, other

    cs.CL cs.AI

    The Stack: 3 TB of permissively licensed source code

    Authors: Denis Kocetkov, Raymond Li, Loubna Ben Allal, Jia Li, Chenghao Mou, Carlos Muñoz Ferrandis, Yacine Jernite, Margaret Mitchell, Sean Hughes, Thomas Wolf, Dzmitry Bahdanau, Leandro von Werra, Harm de Vries

    Abstract: Large Language Models (LLMs) play an ever-increasing role in the field of Artificial Intelligence (AI)--not only for natural language processing but also for code understanding and generation. To stimulate open and responsible research on LLMs for code, we introduce The Stack, a 3.1 TB dataset consisting of permissively licensed source code in 30 programming languages. We describe how we collect t… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

  10. arXiv:2110.08525  [pdf, other

    cs.CL

    The Power of Prompt Tuning for Low-Resource Semantic Parsing

    Authors: Nathan Schucher, Siva Reddy, Harm de Vries

    Abstract: Prompt tuning has recently emerged as an effective method for adapting pre-trained language models to a number of language understanding and generation tasks. In this paper, we investigate prompt tuning for semantic parsing -- the task of map** natural language utterances onto formal meaning representations. On the low-resource splits of Overnight and TOPv2, we find that a prompt tuned T5-xl sig… ▽ More

    Submitted 1 April, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: ACL 2022 (main conference); updated results

  11. arXiv:2110.00768  [pdf, other

    cs.CL

    TopiOCQA: Open-domain Conversational Question Answering with Topic Switching

    Authors: Vaibhav Adlakha, Shehzaad Dhuliawala, Kaheer Suleman, Harm de Vries, Siva Reddy

    Abstract: In a conversational question answering scenario, a questioner seeks to extract information about a topic through a series of interdependent questions and answers. As the conversation progresses, they may switch to related topics, a phenomenon commonly observed in information-seeking search sessions. However, current datasets for conversational question answering are limiting in two ways: 1) they d… ▽ More

    Submitted 20 February, 2022; v1 submitted 2 October, 2021; originally announced October 2021.

    Comments: accepted at TACL

  12. DuoRAT: Towards Simpler Text-to-SQL Models

    Authors: Torsten Scholak, Raymond Li, Dzmitry Bahdanau, Harm de Vries, Chris Pal

    Abstract: Recent neural text-to-SQL models can effectively translate natural language questions to corresponding SQL queries on unseen databases. Working mostly on the Spider dataset, researchers have proposed increasingly sophisticated solutions to the problem. Contrary to this trend, in this paper we focus on simplifications. We begin by building DuoRAT, a re-implementation of the state-of-the-art RAT-SQL… ▽ More

    Submitted 10 September, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Accepted to NAACL 2021. 9 pages

  13. arXiv:2007.14435  [pdf, other

    cs.CL

    Towards Ecologically Valid Research on Language User Interfaces

    Authors: Harm de Vries, Dzmitry Bahdanau, Christopher Manning

    Abstract: Language User Interfaces (LUIs) could improve human-machine interaction for a wide variety of tasks, such as playing music, getting insights from databases, or instructing domestic robots. In contrast to traditional hand-crafted approaches, recent work attempts to build LUIs in a data-driven way using modern deep learning methods. To satisfy the data needs of such learning algorithms, researchers… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

  14. arXiv:2007.05756  [pdf, other

    cs.CV cs.LG stat.ML

    Generative Compositional Augmentations for Scene Graph Prediction

    Authors: Boris Knyazev, Harm de Vries, Cătălina Cangea, Graham W. Taylor, Aaron Courville, Eugene Belilovsky

    Abstract: Inferring objects and their relationships from an image in the form of a scene graph is useful in many applications at the intersection of vision and language. We consider a challenging problem of compositional generalization that emerges in this task due to a long tail data distribution. Current scene graph generation models are trained on a tiny fraction of the distribution corresponding to the… ▽ More

    Submitted 1 October, 2021; v1 submitted 11 July, 2020; originally announced July 2020.

    Comments: ICCV 2021 camera ready. Added more baselines, combining GANs with Neural Motifs and t-sne visualizations. Code is available at https://github.com/bknyaz/sgg

  15. arXiv:2005.08230  [pdf, other

    cs.CV cs.LG

    Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation

    Authors: Boris Knyazev, Harm de Vries, Cătălina Cangea, Graham W. Taylor, Aaron Courville, Eugene Belilovsky

    Abstract: Scene graph generation (SGG) aims to predict graph-structured descriptions of input images, in the form of objects and relationships between them. This task is becoming increasingly useful for progress at the interface of vision and language. Here, it is important - yet challenging - to perform well on novel (zero-shot) or rare (few-shot) compositions of objects and relationships. In this paper, w… ▽ More

    Submitted 17 August, 2020; v1 submitted 17 May, 2020; originally announced May 2020.

    Comments: accepted at BMVC 2020, the code is available at https://github.com/bknyaz/sgg

  16. arXiv:1912.05783  [pdf, other

    cs.AI cs.LG

    CLOSURE: Assessing Systematic Generalization of CLEVR Models

    Authors: Dzmitry Bahdanau, Harm de Vries, Timothy J. O'Donnell, Shikhar Murty, Philippe Beaudoin, Yoshua Bengio, Aaron Courville

    Abstract: The CLEVR dataset of natural-looking questions about 3D-rendered scenes has recently received much attention from the research community. A number of models have been proposed for this task, many of which achieved very high accuracies of around 97-99%. In this work, we study how systematic the generalization of such models is, that is to which extent they are capable of handling novel combinations… ▽ More

    Submitted 17 October, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: Technical report

  17. arXiv:1811.12889  [pdf, other

    cs.CL cs.AI

    Systematic Generalization: What Is Required and Can It Be Learned?

    Authors: Dzmitry Bahdanau, Shikhar Murty, Michael Noukhovitch, Thien Huu Nguyen, Harm de Vries, Aaron Courville

    Abstract: Numerous models for grounded language understanding have been recently proposed, including (i) generic models that can be easily adapted to any given task and (ii) intuitively appealing modular models that require background knowledge to be instantiated. We compare both types of models in how much they lend themselves to a particular form of systematic generalization. Using a synthetic VQA test, w… ▽ More

    Submitted 21 April, 2019; v1 submitted 30 November, 2018; originally announced November 2018.

    Comments: Published as a conference paper at ICLR 2019

  18. arXiv:1808.04446  [pdf, other

    cs.CV cs.CL cs.LG stat.ML

    Visual Reasoning with Multi-hop Feature Modulation

    Authors: Florian Strub, Mathieu Seurin, Ethan Perez, Harm de Vries, Jérémie Mary, Philippe Preux, Aaron Courville, Olivier Pietquin

    Abstract: Recent breakthroughs in computer vision and natural language processing have spurred interest in challenging multi-modal tasks such as visual question-answering and visual dialogue. For such tasks, one successful approach is to condition image-based convolutional network computation on language via Feature-wise Linear Modulation (FiLM) layers, i.e., per-channel scaling and shifting. We propose to… ▽ More

    Submitted 12 October, 2018; v1 submitted 3 August, 2018; originally announced August 2018.

    Comments: In Proc of ECCV 2018

  19. arXiv:1807.03367  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Talk the Walk: Navigating New York City through Grounded Dialogue

    Authors: Harm de Vries, Kurt Shuster, Dhruv Batra, Devi Parikh, Jason Weston, Douwe Kiela

    Abstract: We introduce "Talk The Walk", the first large-scale dialogue dataset grounded in action and perception. The task involves two agents (a "guide" and a "tourist") that communicate via natural language in order to achieve a common goal: having the tourist navigate to a given target location. The task and dataset, which are described in detail, are challenging and their full solution is an open proble… ▽ More

    Submitted 23 December, 2018; v1 submitted 9 July, 2018; originally announced July 2018.

  20. arXiv:1709.07871  [pdf, other

    cs.CV cs.AI cs.CL stat.ML

    FiLM: Visual Reasoning with a General Conditioning Layer

    Authors: Ethan Perez, Florian Strub, Harm de Vries, Vincent Dumoulin, Aaron Courville

    Abstract: We introduce a general-purpose conditioning method for neural networks called FiLM: Feature-wise Linear Modulation. FiLM layers influence neural network computation via a simple, feature-wise affine transformation based on conditioning information. We show that FiLM layers are highly effective for visual reasoning - answering image-related questions which require a multi-step, high-level process -… ▽ More

    Submitted 18 December, 2017; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: AAAI 2018. Code available at http://github.com/ethanjperez/film . Extends arXiv:1707.03017

  21. arXiv:1707.03017  [pdf, other

    cs.CV cs.AI cs.CL stat.ML

    Learning Visual Reasoning Without Strong Priors

    Authors: Ethan Perez, Harm de Vries, Florian Strub, Vincent Dumoulin, Aaron Courville

    Abstract: Achieving artificial visual reasoning - the ability to answer image-related questions which require a multi-step, high-level process - is an important step towards artificial general intelligence. This multi-modal task requires learning a question-dependent, structured reasoning process over images from language. Standard deep learning approaches tend to exploit biases in the data rather than lear… ▽ More

    Submitted 18 December, 2017; v1 submitted 10 July, 2017; originally announced July 2017.

    Comments: Full AAAI 2018 paper is at arXiv:1709.07871. Presented at ICML 2017's Machine Learning in Speech and Language Processing Workshop. Code is at http://github.com/ethanjperez/film

  22. arXiv:1707.00683  [pdf, other

    cs.CV cs.CL cs.LG

    Modulating early visual processing by language

    Authors: Harm de Vries, Florian Strub, Jérémie Mary, Hugo Larochelle, Olivier Pietquin, Aaron Courville

    Abstract: It is commonly assumed that language refers to high-level visual concepts while leaving low-level visual processing unaffected. This view dominates the current literature in computational models for language-vision tasks, where visual and linguistic input are mostly processed independently before being fused into a single representation. In this paper, we deviate from this classic pipeline and pro… ▽ More

    Submitted 18 December, 2017; v1 submitted 2 July, 2017; originally announced July 2017.

    Comments: Advances in Neural Information Processing Systems 30 (NIPS 2017)

  23. arXiv:1703.05423  [pdf, other

    cs.CL

    End-to-end optimization of goal-driven and visually grounded dialogue systems

    Authors: Florian Strub, Harm de Vries, Jeremie Mary, Bilal Piot, Aaron Courville, Olivier Pietquin

    Abstract: End-to-end design of dialogue systems has recently become a popular research topic thanks to powerful tools such as encoder-decoder architectures for sequence-to-sequence learning. Yet, most current approaches cast human-machine dialogue management as a supervised learning problem, aiming at predicting the next utterance of a participant given the full history of the dialogue. This vision is too s… ▽ More

    Submitted 15 March, 2017; originally announced March 2017.

  24. arXiv:1611.08481  [pdf, other

    cs.AI cs.CV

    GuessWhat?! Visual object discovery through multi-modal dialogue

    Authors: Harm de Vries, Florian Strub, Sarath Chandar, Olivier Pietquin, Hugo Larochelle, Aaron Courville

    Abstract: We introduce GuessWhat?!, a two-player guessing game as a testbed for research on the interplay of computer vision and dialogue systems. The goal of the game is to locate an unknown object in a rich image scene by asking a sequence of questions. Higher-level image understanding, like spatial reasoning and language grounding, is required to solve the proposed task. Our key contribution is the colle… ▽ More

    Submitted 6 February, 2017; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: 23 pages; CVPR 2017 submission; see https://guesswhat.ai

  25. arXiv:1605.02688  [pdf, other

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  26. arXiv:1505.00359  [pdf, other

    cs.LG

    Can deep learning help you find the perfect match?

    Authors: Harm de Vries, Jason Yosinski

    Abstract: Is he/she my type or not? The answer to this question depends on the personal preferences of the one asking it. The individual process of obtaining a full answer may generally be difficult and time consuming, but often an approximate answer can be obtained simply by looking at a photo of the potential match. Such approximate answers based on visual cues can be produced in a fraction of a second, a… ▽ More

    Submitted 20 June, 2015; v1 submitted 2 May, 2015; originally announced May 2015.

  27. arXiv:1502.04390  [pdf, other

    cs.LG math.NA

    Equilibrated adaptive learning rates for non-convex optimization

    Authors: Yann N. Dauphin, Harm de Vries, Yoshua Bengio

    Abstract: Parameter-specific adaptive learning rate methods are computationally efficient ways to reduce the ill-conditioning problems encountered when training large deep networks. Following recent work that strongly suggests that most of the critical points encountered when training such networks are saddle points, we find how considering the presence of negative eigenvalues of the Hessian could help us d… ▽ More

    Submitted 29 August, 2015; v1 submitted 15 February, 2015; originally announced February 2015.