Showing 1–14 of 14 results for author: Godin, F

  1. arXiv:2406.15612  [pdf, other]

    cs.LG q-fin.RM

    Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients

    Authors: Parisa Davar, Frédéric Godin, Jose Garrido

    Abstract: This paper tackles the problem of mitigating catastrophic risk (which is risk with very low frequency but very high severity) in the context of a sequential decision making process. This problem is particularly challenging due to the scarcity of observations in the far tail of the distribution of cumulative costs (negative rewards). A policy gradient algorithm is developed, that we call POTPG. It…

    Submitted 28 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: The Python code to replicate the various numerical experiments of this paper is available at https://github.com/parisadavar/EVT-policy-gradient-RL

  2. arXiv:2402.13326  [pdf, other]

    q-fin.CP cs.AI

    Deep Hedging with Market Impact

    Authors: Andrei Neagu, Frédéric Godin, Clarence Simard, Leila Kosseim

    Abstract: Dynamic hedging is the practice of periodically transacting financial instruments to offset the risk caused by an investment or a liability. Dynamic hedging optimization can be framed as a sequential decision problem; thus, Reinforcement Learning (RL) models were recently proposed to tackle this task. However, existing RL works for hedging do not consider market impact caused by the finite liquidi…

    Submitted 22 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures

  3. arXiv:2311.06549  [pdf, other]

    cs.CL

    Zero-Shot Cross-Lingual Sentiment Classification under Distribution Shift: an Exploratory Study

    Authors: Maarten De Raedt, Semere Kiros Bitew, Fréderic Godin, Thomas Demeester, Chris Develder

    Abstract: The brittleness of finetuned language model performance on out-of-distribution (OOD) test samples in unseen domains has been well-studied for English, yet is unexplored for multi-lingual models. Therefore, we study generalization to OOD test data specifically in zero-shot cross-lingual transfer settings, analyzing performance impacts of both language and domain shifts between train and test data.…

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: The 3rd Workshop on Multilingual Representation Learning (MRL@EMNLP2023)

  4. arXiv:2305.19783  [pdf, other]

    cs.CL

    IDAS: Intent Discovery with Abstractive Summarization

    Authors: Maarten De Raedt, Fréderic Godin, Thomas Demeester, Chris Develder

    Abstract: Intent discovery is the task of inferring latent intents from a set of unlabeled utterances, and is a useful step towards the efficient creation of new conversational agents. We show that recent competitive methods in intent discovery can be outperformed by clustering utterances based on abstractive summaries, i.e., "labels", that retain the core elements while removing non-essential information.…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: The 5th Workshop on NLP for Conversational AI (NLP4ConvAI@ACL)

  5. arXiv:2302.14141  [pdf, other]

    cs.LG

    Linear pretraining in recurrent mixture density networks

    Authors: Hubert Normandin-Taillon, Frédéric Godin, Chun Wang

    Abstract: We present a method for pretraining a recurrent mixture density network (RMDN). We also propose a slight modification to the architecture of the RMDN-GARCH proposed by Nikolaev et al. [2012]. The pretraining method helps the RMDN avoid bad local minima during training and improves its robustness to the persistent NaN problem, as defined by Guillaumes [2017], which is often encountered with mixture…

    Submitted 27 February, 2023; originally announced February 2023.

  6. arXiv:2210.11805  [pdf, other]

    cs.CL

    Robustifying Sentiment Classification by Maximally Exploiting Few Counterfactuals

    Authors: Maarten De Raedt, Fréderic Godin, Chris Develder, Thomas Demeester

    Abstract: For text classification tasks, finetuned language models perform remarkably well. Yet, they tend to rely on spurious patterns in training data, thus limiting their performance on out-of-distribution (OOD) test data. Among recent models aiming to avoid this spurious pattern problem, adding extra counterfactual samples to the training data has proven to be very effective. Yet, counterfactual data ge…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  7. arXiv:2109.13977  [pdf, other]

    cs.LG

    Risk averse non-stationary multi-armed bandits

    Authors: Leo Benac, Frédéric Godin

    Abstract: This paper tackles the risk averse multi-armed bandits problem when incurred losses are non-stationary. The conditional value-at-risk (CVaR) is used as the objective function. Two estimation methods are proposed for this objective function in the presence of non-stationary losses, one relying on a weighted empirical distribution of losses and another on the dual representation of the CVaR. Such es…

    Submitted 28 September, 2021; originally announced September 2021.

  8. arXiv:2104.03630  [pdf, other]

    cs.CL cs.LG

    A Simple Geometric Method for Cross-Lingual Linguistic Transformations with Pre-trained Autoencoders

    Authors: Maarten De Raedt, Fréderic Godin, Pieter Buteneers, Chris Develder, Thomas Demeester

    Abstract: Powerful sentence encoders trained for multiple languages are on the rise. These systems are capable of embedding a wide range of linguistic properties into vector representations. While explicit probing tasks can be used to verify the presence of specific linguistic properties, it is unclear whether the vector representations can be manipulated to indirectly steer such properties. For efficient l…

    Submitted 21 September, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 - Short Paper Track

  9. arXiv:1912.01718  [pdf, other]

    stat.ML cs.LG

    Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaR

    Authors: Dylan Troop, Frédéric Godin, Jia Yuan Yu

    Abstract: In a wide variety of sequential decision making problems, it can be important to estimate the impact of rare events in order to minimize risk exposure. A popular risk measure is the conditional value-at-risk (CVaR), which is commonly estimated by averaging observations that occur beyond a quantile at a given confidence level. When this confidence level is very high, this estimation method can exhi…

    Submitted 10 December, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

  10. arXiv:1902.10236  [pdf, ps, other]

    cs.CL

    Learning When Not to Answer: A Ternary Reward Structure for Reinforcement Learning based Question Answering

    Authors: Fréderic Godin, Anjishnu Kumar, Arpit Mittal

    Abstract: In this paper, we investigate the challenges of using reinforcement learning agents for question-answering over knowledge graphs for real-world applications. We examine the performance metrics used by state-of-the-art systems and determine that they are inadequate for such settings. More specifically, they do not evaluate the systems correctly for situations when there is no answer available and t…

    Submitted 3 April, 2019; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: Accepted at NAACL 2019. Version 1 was presented at NIPS 2018 workshop on Relational Representation Learning

  11. arXiv:1808.09551  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Explaining Character-Aware Neural Networks for Word-Level Prediction: Do They Discover Linguistic Rules?

    Authors: Fréderic Godin, Kris Demuynck, Joni Dambre, Wesley De Neve, Thomas Demeester

    Abstract: Character-level features are currently used in different neural network-based natural language processing algorithms. However, little is known about the character-level patterns those models learn. Moreover, models are often compared only quantitatively while a qualitative analysis is missing. In this paper, we investigate which character-level patterns neural networks learn and if those patterns…

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: Accepted at EMNLP 2018

  12. arXiv:1808.08720  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Predefined Sparseness in Recurrent Sequence Models

    Authors: Thomas Demeester, Johannes Deleu, Fréderic Godin, Chris Develder

    Abstract: Inducing sparseness while training neural networks has been shown to yield models with a lower memory footprint but similar effectiveness to dense models. However, sparseness is typically induced starting from a dense model, and thus this advantage does not hold during training. We propose techniques to enforce sparseness upfront in recurrent sequence models for NLP applications, to also benefit t…

    Submitted 27 August, 2018; originally announced August 2018.

    Comments: the SIGNLL Conference on Computational Natural Language Learning (CoNLL, 2018)

  13. Dual Rectified Linear Units (DReLUs): A Replacement for Tanh Activation Functions in Quasi-Recurrent Neural Networks

    Authors: Fréderic Godin, Jonas Degrave, Joni Dambre, Wesley De Neve

    Abstract: In this paper, we introduce a novel type of Rectified Linear Unit (ReLU), called a Dual Rectified Linear Unit (DReLU). A DReLU, which comes with an unbounded positive and negative image, can be used as a drop-in replacement for a tanh activation function in the recurrent step of Quasi-Recurrent Neural Networks (QRNNs) (Bradbury et al. (2017)). Similar to ReLUs, DReLUs are less prone to the vanishi…

    Submitted 31 October, 2017; v1 submitted 25 July, 2017; originally announced July 2017.

  14. arXiv:1707.06130  [pdf, ps, other]

    cs.CL

    Improving Language Modeling using Densely Connected Recurrent Neural Networks

    Authors: Fréderic Godin, Joni Dambre, Wesley De Neve

    Abstract: In this paper, we introduce the novel concept of densely connected layers into recurrent neural networks. We evaluate our proposed architecture on the Penn Treebank language modeling task. We show that we can obtain similar perplexity scores with six times fewer parameters compared to a standard stacked 2-layer LSTM model trained with dropout (Zaremba et al. 2014). In contrast with the current usa…

    Submitted 19 July, 2017; originally announced July 2017.

    Comments: Accepted at Workshop on Representation Learning, ACL2017