Skip to main content

Showing 1–46 of 46 results for author: Usunier, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01611  [pdf, other

    cs.IR cs.LG stat.ML

    System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes

    Authors: Arpit Agarwal, Nicolas Usunier, Alessandro Lazaric, Maximilian Nickel

    Abstract: Recommender systems are an important part of the modern human experience whose influence ranges from the food we eat to the news we read. Yet, there is still debate as to what extent recommendation platforms are aligned with the user goals. A core issue fueling this debate is the challenge of inferring a user utility based on engagement signals such as likes, shares, watch time etc., which are the… ▽ More

    Submitted 29 May, 2024; originally announced June 2024.

    Comments: Accepted at FAccT'24

  2. arXiv:2308.12950  [pdf, other

    cs.CL

    Code Llama: Open Foundation Models for Code

    Authors: Baptiste Rozière, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai Gat, Xiaoqing Ellen Tan, Yossi Adi, **gyu Liu, Romain Sauvestre, Tal Remez, Jérémy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish Bhatt, Cristian Canton Ferrer, Aaron Grattafiori, Wenhan Xiong, Alexandre Défossez, Jade Copet, Faisal Azhar, Hugo Touvron, Louis Martin, Nicolas Usunier, Thomas Scialom , et al. (1 additional authors not shown)

    Abstract: We release Code Llama, a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following ability for programming tasks. We provide multiple flavors to cover a wide range of applications: foundation models (Code Llama), Python specializations (Code Llama… ▽ More

    Submitted 31 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

  3. Group fairness without demographics using social networks

    Authors: David Liu, Virginie Do, Nicolas Usunier, Maximilian Nickel

    Abstract: Group fairness is a popular approach to prevent unfavorable treatment of individuals based on sensitive attributes such as race, gender, and disability. However, the reliance of group fairness on access to discrete group information raises several limitations and concerns, especially with regard to privacy, intersectionality, and unforeseen biases. In this work, we propose a "group-free" measure o… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  4. arXiv:2302.08572  [pdf, other

    cs.CV cs.HC cs.SI

    Towards Reliable Assessments of Demographic Disparities in Multi-Label Image Classifiers

    Authors: Melissa Hall, Bobbie Chern, Laura Gustafson, Denisse Ventura, Harshad Kulkarni, Candace Ross, Nicolas Usunier

    Abstract: Disaggregated performance metrics across demographic groups are a hallmark of fairness assessments in computer vision. These metrics successfully incentivized performance improvements on person-centric tasks such as face analysis and are used to understand risks of modern models. However, there is a lack of discussion on the vulnerabilities of these measurements for more complex computer vision ta… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  5. arXiv:2210.14685  [pdf, other

    cs.LG cs.AI cs.RO

    Leveraging Demonstrations with Latent Space Priors

    Authors: Jonas Gehring, Deepak Gopinath, Jungdam Won, Andreas Krause, Gabriel Synnaeve, Nicolas Usunier

    Abstract: Demonstrations provide insight into relevant state or action space regions, bearing great potential to boost the efficiency and practicality of reinforcement learning agents. In this work, we propose to leverage demonstration datasets by combining skill learning and sequence modeling. Starting with a learned joint latent space, we separately train a generative model of demonstration sequences and… ▽ More

    Submitted 13 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Published in Transactions on Machine Learning Research (03/2023)

  6. arXiv:2210.09957  [pdf, other

    cs.LG cs.AI cs.CY cs.IR stat.ML

    Contextual bandits with concave rewards, and an application to fair ranking

    Authors: Virginie Do, Elvis Dohmatob, Matteo Pirotta, Alessandro Lazaric, Nicolas Usunier

    Abstract: We consider Contextual Bandits with Concave Rewards (CBCR), a multi-objective bandit problem where the desired trade-off between the rewards is defined by a known concave objective function, and the reward vector depends on an observed stochastic context. We present the first algorithm with provably vanishing regret for CBCR without restrictions on the policy space, whereas prior works were restri… ▽ More

    Submitted 28 February, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: ICLR 2023

  7. arXiv:2210.02586  [pdf, other

    cs.GT cs.CY

    Implementing Fairness Constraints in Markets Using Taxes and Subsidies

    Authors: Alexander Peysakhovich, Christian Kroer, Nicolas Usunier

    Abstract: Fisher markets are those where buyers with budgets compete for scarce items, a natural model for many real world markets including online advertising. A market equilibrium is a set of prices and allocations of items such that supply meets demand. We show how market designers can use taxes or subsidies in Fisher markets to ensure that market equilibrium outcomes fall within certain constraints. We… ▽ More

    Submitted 13 March, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

  8. arXiv:2209.13019  [pdf, other

    cs.IR cs.AI cs.CY cs.LG

    Fast online ranking with fairness of exposure

    Authors: Nicolas Usunier, Virginie Do, Elvis Dohmatob

    Abstract: As recommender systems become increasingly central for sorting and prioritizing the content available online, they have a growing impact on the opportunities or revenue of their items producers. For instance, they influence which recruiter a resume is recommended to, or to whom and how much a music track, video or news article is being exposed. This calls for recommendation approaches that not onl… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: FAccT 2022

  9. arXiv:2207.09960  [pdf, other

    stat.ML cs.CY cs.LG

    Measuring and signing fairness as performance under multiple stakeholder distributions

    Authors: David Lopez-Paz, Diane Bouchacourt, Levent Sagun, Nicolas Usunier

    Abstract: As learning machines increase their influence on decisions concerning human lives, analyzing their fairness properties becomes a subject of central importance. Yet, our best tools for measuring the fairness of learning systems are rigid fairness metrics encapsulated as mathematical one-liners, offer limited power to the stakeholders involved in the prediction task, and are easy to manipulate when… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

  10. arXiv:2204.06521  [pdf, other

    cs.IR cs.AI cs.CY

    Optimizing generalized Gini indices for fairness in rankings

    Authors: Virginie Do, Nicolas Usunier

    Abstract: There is growing interest in designing recommender systems that aim at being fair towards item producers or their least satisfied users. Inspired by the domain of inequality measurement in economics, this paper explores the use of generalized Gini welfare functions (GGFs) as a means to specify the normative criterion that recommender systems should optimize for. GGFs weight individuals depending o… ▽ More

    Submitted 28 March, 2023; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: Accepted to SIGIR 2022

  11. arXiv:2202.07603  [pdf, other

    cs.CV cs.AI cs.CY

    Fairness Indicators for Systematic Assessments of Visual Feature Extractors

    Authors: Priya Goyal, Adriana Romero Soriano, Caner Hazirbas, Levent Sagun, Nicolas Usunier

    Abstract: Does everyone equally benefit from computer vision systems? Answers to this question become more and more important as computer vision systems are deployed at large scale, and can spark major concerns when they exhibit vast performance discrepancies between people from various demographic and social backgrounds. Systematic diagnosis of fairness, harms, and biases of computer vision systems is an i… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  12. arXiv:2110.15781  [pdf, other

    cs.IR cs.AI cs.CY cs.LG

    Two-sided fairness in rankings via Lorenz dominance

    Authors: Virginie Do, Sam Corbett-Davies, Jamal Atif, Nicolas Usunier

    Abstract: We consider the problem of generating rankings that are fair towards both users and item producers in recommender systems. We address both usual recommendation (e.g., of music or movies) and reciprocal recommendation (e.g., dating). Following concepts of distributive justice in welfare economics, our notion of fairness aims at increasing the utility of the worse-off individuals, which we formalize… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Journal ref: NeurIPS 2021

  13. arXiv:2110.10809  [pdf, other

    cs.LG cs.AI cs.RO

    Hierarchical Skills for Efficient Exploration

    Authors: Jonas Gehring, Gabriel Synnaeve, Andreas Krause, Nicolas Usunier

    Abstract: In reinforcement learning, pre-trained low-level skills have the potential to greatly facilitate exploration. However, prior knowledge of the downstream task is required to strike the right balance between generality (fine-grained control) and specificity (faster learning) in skill design. In previous work on continuous control, the sensitivity of methods to this trade-off has not been addressed e… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: To appear in 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  14. arXiv:2105.09295  [pdf, other

    cs.AI cs.CY cs.LG

    Online Selection of Diverse Committees

    Authors: Virginie Do, Jamal Atif, Jérôme Lang, Nicolas Usunier

    Abstract: Citizens' assemblies need to represent subpopulations according to their proportions in the general population. These large committees are often constructed in an online fashion by contacting people, asking for the demographic features of the volunteers, and deciding to include them or not. This raises a trade-off between the number of people contacted (and the incurring cost) and the representati… ▽ More

    Submitted 3 December, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

    Comments: Proceedings of IJCAI 2021

  15. arXiv:2104.14527  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Online certification of preference-based fairness for personalized recommender systems

    Authors: Virginie Do, Sam Corbett-Davies, Jamal Atif, Nicolas Usunier

    Abstract: Recommender systems are facing scrutiny because of their growing impact on the opportunities we have access to. Current audits for fairness are limited to coarse-grained parity assessments at the level of sensitive groups. We propose to audit for envy-freeness, a more granular criterion aligned with individual preferences: every user should prefer their recommendations to those of other users. Sin… ▽ More

    Submitted 4 January, 2022; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: AAAI 2022

  16. arXiv:2104.09937  [pdf, other

    cs.LG stat.ML

    Gradient Matching for Domain Generalization

    Authors: Yuge Shi, Jeffrey Seely, Philip H. S. Torr, N. Siddharth, Awni Hannun, Nicolas Usunier, Gabriel Synnaeve

    Abstract: Machine learning systems typically assume that the distributions of training and test sets match closely. However, a critical requirement of such systems in the real world is their ability to generalize to unseen domains. Here, we propose an inter-domain gradient matching objective that targets domain generalization by maximizing the inner product between gradients from different domains. Since di… ▽ More

    Submitted 13 July, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

  17. arXiv:2104.08492  [pdf, other

    cs.AI cs.LG

    A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings

    Authors: Eltayeb Ahmed, Luisa Zintgraf, Christian A. Schroeder de Witt, Nicolas Usunier

    Abstract: In this work we explore an auxiliary loss useful for reinforcement learning in environments where strong performing agents are required to be able to navigate a spatial environment. The auxiliary loss proposed is to minimize the classification error of a neural network classifier that predicts whether or not a pair of states sampled from the agents current episode trajectory are in order. The clas… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

  18. arXiv:2005.12872  [pdf, other

    cs.CV

    End-to-End Object Detection with Transformers

    Authors: Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko

    Abstract: We present a new method that views object detection as a direct set prediction problem. Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components like a non-maximum suppression procedure or anchor generation that explicitly encode our prior knowledge about the task. The main ingredients of the new framework, called DEtection TRansformer or DET… ▽ More

    Submitted 28 May, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

  19. arXiv:2005.02934  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization

    Authors: Pierre-Alexandre Kamienny, Matteo Pirotta, Alessandro Lazaric, Thibault Lavril, Nicolas Usunier, Ludovic Denoyer

    Abstract: We study the problem of learning exploration-exploitation strategies that effectively adapt to dynamic environments, where the task may change over time. While RNN-based policies could in principle represent such strategies, in practice their training time is prohibitive and the learning process often converges to poor solutions. In this paper, we consider the case where the agent has access to a… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: 18 pages

    MSC Class: 68T99

  20. arXiv:2004.04926  [pdf, ps, other

    stat.ML cs.LG

    Tensor Decompositions for temporal knowledge base completion

    Authors: Timothée Lacroix, Guillaume Obozinski, Nicolas Usunier

    Abstract: Most algorithms for representation learning and link prediction in relational data have been designed for static data. However, the data they are applied to usually evolves with time, such as friend graphs in social networks or user interactions with items in recommender systems. This is also the case for knowledge bases, which contain facts such as (US, has president, B. Obama, [2009-2017]) that… ▽ More

    Submitted 10 April, 2020; originally announced April 2020.

  21. arXiv:2003.02395  [pdf, other

    stat.ML cs.LG

    A Simple Convergence Proof of Adam and Adagrad

    Authors: Alexandre Défossez, Léon Bottou, Francis Bach, Nicolas Usunier

    Abstract: We provide a simple proof of convergence covering both the Adam and Adagrad adaptive optimization algorithms when applied to smooth (possibly non-convex) objective functions with bounded gradients. We show that in expectation, the squared norm of the objective gradient averaged over the trajectory has an upper-bound which is explicit in the constants of the problem, parameters of the optimizer, th… ▽ More

    Submitted 17 October, 2022; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: final TMLR version

  22. arXiv:1911.13254  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Music Source Separation in the Waveform Domain

    Authors: Alexandre Défossez, Nicolas Usunier, Léon Bottou, Francis Bach

    Abstract: Source separation for music is the task of isolating contributions, or stems, from different instruments recorded individually and arranged together to form a song. Such components include voice, bass, drums and any other accompaniments.Contrarily to many audio synthesis tasks where the best performances are achieved by models that directly generate the waveform, the state-of-the-art in source… ▽ More

    Submitted 28 April, 2021; v1 submitted 27 November, 2019; originally announced November 2019.

  23. arXiv:1910.08809  [pdf, other

    cs.LG cs.MA stat.ML

    A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning

    Authors: Nicolas Carion, Gabriel Synnaeve, Alessandro Lazaric, Nicolas Usunier

    Abstract: Effective coordination is crucial to solve multi-agent collaborative (MAC) problems. While centralized reinforcement learning methods can optimally solve small MAC instances, they do not scale to large problems and they fail to generalize to scenarios different from those seen during training. In this paper, we consider MAC problems with some intrinsic notion of locality (e.g., geographic proximit… ▽ More

    Submitted 19 October, 2019; originally announced October 2019.

    Journal ref: NeurIPS 2019

  24. arXiv:1909.01174  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Demucs: Deep Extractor for Music Sources with extra unlabeled data remixed

    Authors: Alexandre Défossez, Nicolas Usunier, Léon Bottou, Francis Bach

    Abstract: We study the problem of source separation for music using deep learning with four known sources: drums, bass, vocals and other accompaniments. State-of-the-art approaches predict soft masks over mixture spectrograms while methods working on the waveform are lagging behind as measured on the standard MusDB benchmark. Our contribution is two fold. (i) We introduce a simple convolutional and recurren… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

  25. arXiv:1906.12266  [pdf, other

    cs.LG cs.AI stat.ML

    Growing Action Spaces

    Authors: Gregory Farquhar, Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve

    Abstract: In complex tasks, such as those with large combinatorial action spaces, random exploration may be too inefficient to achieve meaningful learning progress. In this work, we use a curriculum of progressively growing action spaces to accelerate learning. We assume the environment is out of our control, but that the agent may set an internal curriculum by initially restricting its action space. Our ap… ▽ More

    Submitted 28 June, 2019; originally announced June 2019.

  26. arXiv:1812.06864  [pdf, other

    cs.CL

    Fully Convolutional Speech Recognition

    Authors: Neil Zeghidour, Qiantong Xu, Vitaliy Liptchinsky, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert

    Abstract: Current state-of-the-art speech recognition systems build on recurrent neural networks for acoustic and/or language modeling, and rely on feature extraction pipelines to extract mel-filterbanks or cepstral coefficients. In this paper we present an alternative approach based solely on convolutional neural networks, leveraging recent advances in acoustic models from the raw waveform and language mod… ▽ More

    Submitted 9 April, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

  27. arXiv:1812.03483  [pdf, ps, other

    cs.LG cs.CL cs.SD eess.AS stat.ML

    To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition

    Authors: Yossi Adi, Neil Zeghidour, Ronan Collobert, Nicolas Usunier, Vitaliy Liptchinsky, Gabriel Synnaeve

    Abstract: Transcribed datasets typically contain speaker identity for each instance in the data. We investigate two ways to incorporate this information during training: Multi-Task Learning and Adversarial Learning. In multi-task learning, the goal is speaker prediction; we expect a performance improvement with this joint training if the two tasks of speech recognition and speaker recognition share a common… ▽ More

    Submitted 14 February, 2019; v1 submitted 9 December, 2018; originally announced December 2018.

  28. arXiv:1812.00054  [pdf, other

    cs.LG cs.AI

    Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger

    Authors: Gabriel Synnaeve, Zeming Lin, Jonas Gehring, Dan Gant, Vegard Mella, Vasil Khalidov, Nicolas Carion, Nicolas Usunier

    Abstract: We formulate the problem of defogging as state estimation and future state prediction from previous, partial observations in the context of real-time strategy games. We propose to employ encoder-decoder neural networks for this task, and introduce proxy tasks and baselines for evaluation to assess their ability of capturing basic game rules and high-level dynamics. By combining convolutional neura… ▽ More

    Submitted 30 November, 2018; originally announced December 2018.

    Journal ref: Advances in Neural Information Processing Systems 31 (2018) 10759-10770

  29. arXiv:1811.08568  [pdf, other

    cs.LG stat.ML

    High-Level Strategy Selection under Partial Observability in StarCraft: Brood War

    Authors: Jonas Gehring, Da Ju, Vegard Mella, Daniel Gant, Nicolas Usunier, Gabriel Synnaeve

    Abstract: We consider the problem of high-level strategy selection in the adversarial setting of real-time strategy games from a reinforcement learning perspective, where taking an action corresponds to switching to the respective strategy. Here, a good strategy successfully counters the opponent's current and possible future strategies which can only be estimated using partial observations. We investigate… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

  30. arXiv:1810.09785  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    SING: Symbol-to-Instrument Neural Generator

    Authors: Alexandre Défossez, Neil Zeghidour, Nicolas Usunier, Léon Bottou, Francis Bach

    Abstract: Recent progress in deep learning for audio synthesis opens the way to models that directly produce the waveform, shifting away from the traditional paradigm of relying on vocoders or MIDI synthesizers for speech or music generation. Despite their successes, current state-of-the-art neural audio synthesizers such as WaveNet and SampleRNN suffer from prohibitive training and inference times because… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

    Journal ref: Conference on Neural Information Processing Systems (NIPS), Dec 2018, Montr{é}al, Canada

  31. arXiv:1806.07297  [pdf, other

    stat.ML cs.AI cs.LG cs.SI

    Canonical Tensor Decomposition for Knowledge Base Completion

    Authors: Timothée Lacroix, Nicolas Usunier, Guillaume Obozinski

    Abstract: The problem of Knowledge Base Completion can be framed as a 3rd-order binary tensor completion problem. In this light, the Canonical Tensor Decomposition (CP) (Hitchcock, 1927) seems like a natural solution; however, current implementations of CP on standard Knowledge Base Completion benchmarks are lagging behind their competitors. In this work, we attempt to understand the limits of CP for knowle… ▽ More

    Submitted 19 June, 2018; originally announced June 2018.

  32. arXiv:1806.07098  [pdf, other

    cs.CL cs.SD eess.AS

    End-to-End Speech Recognition From the Raw Waveform

    Authors: Neil Zeghidour, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert, Emmanuel Dupoux

    Abstract: State-of-the-art speech recognition systems rely on fixed, hand-crafted features such as mel-filterbanks to preprocess the waveform before the training pipeline. In this paper, we study end-to-end systems trained directly from the raw waveform, building on two alternatives for trainable replacements of mel-filterbanks that use a convolutional architecture. The first one is inspired by gammatone fi… ▽ More

    Submitted 21 June, 2018; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: Accepted for presentation at Interspeech 2018

  33. arXiv:1805.11199  [pdf, other

    cs.AI cs.LG

    Value Propagation Networks

    Authors: Nantas Nardelli, Gabriel Synnaeve, Zeming Lin, Pushmeet Kohli, Philip H. S. Torr, Nicolas Usunier

    Abstract: We present Value Propagation (VProp), a set of parameter-efficient differentiable planning modules built on Value Iteration which can successfully be trained using reinforcement learning to solve unseen tasks, has the capability to generalize to larger map sizes, and can learn to navigate in dynamic environments. We show that the modules enable learning to plan when the environment also includes s… ▽ More

    Submitted 25 March, 2019; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: Updated to match ICLR 2019 OpenReview's version

  34. arXiv:1711.01161  [pdf, other

    cs.CL

    Learning Filterbanks from Raw Speech for Phone Recognition

    Authors: Neil Zeghidour, Nicolas Usunier, Iasonas Kokkinos, Thomas Schatz, Gabriel Synnaeve, Emmanuel Dupoux

    Abstract: We train a bank of complex filters that operates on the raw waveform and is fed into a convolutional neural network for end-to-end phone recognition. These time-domain filterbanks (TD-filterbanks) are initialized as an approximation of mel-filterbanks, and then fine-tuned jointly with the remaining convolutional architecture. We perform phone recognition experiments on TIMIT and show that for seve… ▽ More

    Submitted 4 April, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Accepted at ICASSP 2018

  35. arXiv:1706.00409  [pdf, other

    cs.CV

    Fader Networks: Manipulating Images by Sliding Attributes

    Authors: Guillaume Lample, Neil Zeghidour, Nicolas Usunier, Antoine Bordes, Ludovic Denoyer, Marc'Aurelio Ranzato

    Abstract: This paper introduces a new encoder-decoder architecture that is trained to reconstruct images by disentangling the salient information of the image and the values of attributes directly in the latent space. As a result, after training, our model can generate different realistic versions of an input image by varying the attribute values. By using continuous attribute values, we can choose how much… ▽ More

    Submitted 28 January, 2018; v1 submitted 1 June, 2017; originally announced June 2017.

    Comments: NIPS 2017

  36. arXiv:1704.08847  [pdf, other

    stat.ML cs.AI cs.CR cs.LG

    Parseval Networks: Improving Robustness to Adversarial Examples

    Authors: Moustapha Cisse, Piotr Bojanowski, Edouard Grave, Yann Dauphin, Nicolas Usunier

    Abstract: We introduce Parseval networks, a form of deep neural networks in which the Lipschitz constant of linear, convolutional and aggregation layers is constrained to be smaller than 1. Parseval networks are empirically and theoretically motivated by an analysis of the robustness of the predictions made by deep neural networks when their input is subject to an adversarial perturbation. The most importan… ▽ More

    Submitted 1 May, 2017; v1 submitted 28 April, 2017; originally announced April 2017.

    Comments: submitted

  37. arXiv:1612.04426  [pdf, other

    cs.CL cs.LG

    Improving Neural Language Models with a Continuous Cache

    Authors: Edouard Grave, Armand Joulin, Nicolas Usunier

    Abstract: We propose an extension to neural network language models to adapt their prediction to the recent history. Our model is a simplified version of memory augmented networks, which stores past hidden activations as memory and accesses them through a dot product with the current hidden activation. This mechanism is very efficient and scales to very large memory sizes. We also draw a link between the us… ▽ More

    Submitted 13 December, 2016; originally announced December 2016.

    Comments: Submitted to ICLR 2017

  38. arXiv:1611.00625  [pdf, other

    cs.LG cs.AI

    TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games

    Authors: Gabriel Synnaeve, Nantas Nardelli, Alex Auvolat, Soumith Chintala, Timothée Lacroix, Zeming Lin, Florian Richoux, Nicolas Usunier

    Abstract: We present TorchCraft, a library that enables deep learning research on Real-Time Strategy (RTS) games such as StarCraft: Brood War, by making it easier to control these games from a machine learning framework, here Torch. This white paper argues for using RTS games as a benchmark for AI research, and describes the design and components of TorchCraft.

    Submitted 3 November, 2016; v1 submitted 1 November, 2016; originally announced November 2016.

    ACM Class: I.2.1

  39. arXiv:1609.06753  [pdf, other

    cs.CV

    How should we evaluate supervised hashing?

    Authors: Alexandre Sablayrolles, Matthijs Douze, Hervé Jégou, Nicolas Usunier

    Abstract: Hashing produces compact representations for documents, to perform tasks like classification or retrieval based on these short codes. When hashing is supervised, the codes are trained using labels on the training data. This paper first shows that the evaluation protocols used in the literature for supervised hashing are not satisfactory: we show that a trivial solution that encodes the output of a… ▽ More

    Submitted 10 August, 2017; v1 submitted 21 September, 2016; originally announced September 2016.

  40. arXiv:1609.02993  [pdf, other

    cs.AI cs.LG

    Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks

    Authors: Nicolas Usunier, Gabriel Synnaeve, Zeming Lin, Soumith Chintala

    Abstract: We consider scenarios from the real-time strategy game StarCraft as new benchmarks for reinforcement learning algorithms. We propose micromanagement tasks, which present the problem of the short-term, low-level control of army members during a battle. From a reinforcement learning point of view, these scenarios are challenging because the state-action space is very large, and because there is no o… ▽ More

    Submitted 26 November, 2016; v1 submitted 9 September, 2016; originally announced September 2016.

    Comments: 18 pages, 1 figure (2 plots), 2 tables

    ACM Class: I.2.1; I.2.6

  41. arXiv:1506.02075  [pdf, ps, other

    cs.LG cs.CL

    Large-scale Simple Question Answering with Memory Networks

    Authors: Antoine Bordes, Nicolas Usunier, Sumit Chopra, Jason Weston

    Abstract: Training large-scale question answering systems is complicated because training sources usually cover a small portion of the range of possible questions. This paper studies the impact of multitask and transfer learning for simple question answering; a setting for which the reasoning required to answer is quite easy, as long as one can retrieve the correct evidence given a question, which can be di… ▽ More

    Submitted 5 June, 2015; originally announced June 2015.

  42. arXiv:1506.00999  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Combining Two And Three-Way Embeddings Models for Link Prediction in Knowledge Bases

    Authors: Alberto Garcia-Duran, Antoine Bordes, Nicolas Usunier, Yves Grandvalet

    Abstract: This paper tackles the problem of endogenous link prediction for Knowledge Base completion. Knowledge Bases can be represented as directed graphs whose nodes correspond to entities and edges to relationships. Previous attempts either consist of powerful systems with high capacity to model complex connectivity patterns, which unfortunately usually end up overfitting on rare relationships, or in app… ▽ More

    Submitted 2 June, 2015; originally announced June 2015.

    Comments: 26 pages

  43. arXiv:1505.00199  [pdf, other

    cs.LG

    Theory of Optimizing Pseudolinear Performance Measures: Application to F-measure

    Authors: Shameem A Puthiya Parambath, Nicolas Usunier, Yves Grandvalet

    Abstract: Non-linear performance measures are widely used for the evaluation of learning algorithms. For example, $F$-measure is a commonly used performance measure for classification problems in machine learning and information retrieval community. We study the theoretical properties of a subset of non-linear performance measures called pseudo-linear performance measures which includes $F$-measure, \emph{J… ▽ More

    Submitted 1 January, 2018; v1 submitted 1 May, 2015; originally announced May 2015.

    Comments: Extended Version of the NIPS 2014 Paper

  44. arXiv:1404.4326  [pdf, other

    cs.CL cs.LG

    Open Question Answering with Weakly Supervised Embedding Models

    Authors: Antoine Bordes, Jason Weston, Nicolas Usunier

    Abstract: Building computers able to answer questions on any subject is a long standing goal of artificial intelligence. Promising progress has recently been achieved by methods that learn to map questions to logical forms or database queries. Such approaches can be effective but at the cost of either large amounts of human-labeled data or by defining lexicons and grammars tailored by practitioners. In this… ▽ More

    Submitted 16 April, 2014; originally announced April 2014.

  45. arXiv:1307.7973  [pdf, ps, other

    cs.CL cs.IR cs.LG

    Connecting Language and Knowledge Bases with Embedding Models for Relation Extraction

    Authors: Jason Weston, Antoine Bordes, Oksana Yakhnenko, Nicolas Usunier

    Abstract: This paper proposes a novel approach for relation extraction from free text which is trained to jointly use information from the text and from existing knowledge. Our model is based on two scoring functions that operate by learning low-dimensional embeddings of words and of entities and relationships from a knowledge base. We empirically show on New York Times articles aligned with Freebase relati… ▽ More

    Submitted 30 July, 2013; originally announced July 2013.

  46. arXiv:1304.7158  [pdf, ps, other

    cs.LG

    Irreflexive and Hierarchical Relations as Translations

    Authors: Antoine Bordes, Nicolas Usunier, Alberto Garcia-Duran, Jason Weston, Oksana Yakhnenko

    Abstract: We consider the problem of embedding entities and relations of knowledge bases in low-dimensional vector spaces. Unlike most existing approaches, which are primarily efficient for modeling equivalence relations, our approach is designed to explicitly model irreflexive relations, such as hierarchies, by interpreting them as translations operating on the low-dimensional embeddings of the entities. P… ▽ More

    Submitted 26 April, 2013; originally announced April 2013.

    Comments: Submitted at the ICML 2013 workshop "Structured Learning: Inferring Graphs from Structured and Unstructured Inputs"