Skip to main content

Showing 1–23 of 23 results for author: Jean, S

.
  1. arXiv:2406.15173  [pdf, ps, other

    cs.IR cs.AI

    Évaluation des capacités de réponse de larges modèles de langage (LLM) pour des questions d'historiens

    Authors: Mathieu Chartier, Nabil Dakkoune, Guillaume Bourgeois, Stéphane Jean

    Abstract: Large Language Models (LLMs) like ChatGPT or Bard have revolutionized information retrieval and captivated the audience with their ability to generate custom responses in record time, regardless of the topic. In this article, we assess the capabilities of various LLMs in producing reliable, comprehensive, and sufficiently relevant responses about historical facts in French. To achieve this, we con… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: in French language

  2. arXiv:2305.15338  [pdf, other

    cs.AI cs.CL

    Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing

    Authors: Shufan Wang, Sebastien Jean, Sailik Sengupta, James Gung, Nikolaos Pappas, Yi Zhang

    Abstract: In executable task-oriented semantic parsing, the system aims to translate users' utterances in natural language to machine-interpretable programs (API calls) that can be executed according to pre-defined API specifications. With the popularity of Large Language Models (LLMs), in-context learning offers a strong baseline for such scenarios, especially in data-limited regimes. However, LLMs are kno… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  3. arXiv:2201.10446  [pdf, ps, other

    cond-mat.stat-mech nlin.SI

    Parametric resonance in a conservative system of coupled nonlinear oscillators

    Authors: Johann Maddi, Christophe Coste, Michel Saint Jean

    Abstract: We study a conservative system of two nonlinear coupled oscillators. The eigenmodes of the system are thus nonlinearly coupled, and one of them may induce a parametric amplification of the other, called an autoparametric resonance of the system. The parametric amplification implies two time scales, a fast one for the forcing and a slow one for the forced mode, thus a multiscale expansion is suitab… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

  4. arXiv:1910.14075  [pdf, other

    cs.CL

    Fill in the Blanks: Imputing Missing Sentences for Larger-Context Neural Machine Translation

    Authors: Sébastien Jean, Ankur Bapna, Orhan Firat

    Abstract: Most neural machine translation systems still translate sentences in isolation. To make further progress, a promising line of research additionally considers the surrounding context in order to provide the model potentially missing source-side information, as well as to maintain a coherent output. One difficulty in training such larger-context (i.e. document-level) machine translation systems is t… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

  5. arXiv:1909.06434  [pdf, other

    cs.LG cs.CL stat.ML

    Adaptive Scheduling for Multi-Task Learning

    Authors: Sébastien Jean, Orhan Firat, Melvin Johnson

    Abstract: To train neural machine translation models simultaneously on multiple tasks (languages), it is common to sample each task uniformly or in proportion to dataset sizes. As these methods offer little control over performance trade-offs, we explore different task scheduling approaches. We first consider existing non-adaptive techniques, then move on to adaptive schedules that over-sample tasks with po… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: Continual Learning Workshop at NeurIPS 2018

  6. arXiv:1903.04715  [pdf, other

    cs.CL

    Context-Aware Learning for Neural Machine Translation

    Authors: Sébastien Jean, Kyunghyun Cho

    Abstract: Interest in larger-context neural machine translation, including document-level and multi-modal translation, has been growing. Multiple works have proposed new network architectures or evaluation schemes, but potentially helpful context is still sometimes ignored by larger-context translation models. In this paper, we propose a novel learning algorithm that explicitly encourages a neural translati… ▽ More

    Submitted 11 March, 2019; originally announced March 2019.

  7. arXiv:1902.08295  [pdf, other

    cs.LG stat.ML

    Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

    Authors: Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob , et al. (66 additional authors not shown)

    Abstract: Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly w… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

  8. arXiv:1902.05620  [pdf

    physics.ins-det physics.app-ph

    Calculation and verification of neutron irradiation damage with differential cross sections

    Authors: Shengli Chen, David Bernard, Pierre Tamagno, Jean Tommasi, Stephane Bourganel, Gilles Noguere, Cyrille De Saint Jean

    Abstract: The Displacement per Atom (DPA) rate is conventionally computed with DPA cross sections in reactor applications. The method of direct calculation with energy-angular distributions given in the Center of Mass (CM) frame is proposed and recommended in the present work. The methods for refining and verifying the calculations of DPA cross sections are proposed: (i) Gauss-Legendre-Quadrature-based Piec… ▽ More

    Submitted 19 July, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: arXiv admin note: text overlap with arXiv:1902.04889

    Journal ref: Nuclear Instruments and Methods in Physics Research Section B: Beam Interactions with Materials and Atoms, Volume 456, 1 October 2019, Pages 120-132

  9. arXiv:1902.04889  [pdf

    nucl-th physics.app-ph

    Calculation and Verification of Irradiation Damage Cross Section with Energy-Angular Distribution

    Authors: Shengli Chen, David Bernard, Pierre Tamagno, Cyrille De Saint Jean

    Abstract: To complete the computation of Displacements per Atom (DPA) cross sections, the present work shows the methods of calculating DPA cross sections with the nuclear data of energy-angular distribution in both the laboratory and the Center-of-Mass (CM) frames. The method of direct calculation with data in the CM frame is proposed and recommended to decrease the computation burden and keep all informat… ▽ More

    Submitted 8 February, 2019; originally announced February 2019.

    Journal ref: Nuclear Inst. and Methods in Physics Research, B 456 (2019) pp. 120-132

  10. arXiv:1704.05135  [pdf, ps, other

    stat.ML cs.CL cs.LG

    Does Neural Machine Translation Benefit from Larger Context?

    Authors: Sebastien Jean, Stanislas Lauly, Orhan Firat, Kyunghyun Cho

    Abstract: We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when trained on small corpora, although this improvement largely disappears when trained with a larger corpus. We also discover that attention-based neural machine tran… ▽ More

    Submitted 17 April, 2017; originally announced April 2017.

  11. arXiv:1701.06547  [pdf, ps, other

    cs.CL

    Adversarial Learning for Neural Dialogue Generation

    Authors: Jiwei Li, Will Monroe, Tianlin Shi, Sébastien Jean, Alan Ritter, Dan Jurafsky

    Abstract: In this paper, drawing intuition from the Turing test, we propose using adversarial training for open-domain dialogue generation: the system is trained to produce sequences that are indistinguishable from human-generated dialogue utterances. We cast the task as a reinforcement learning (RL) problem where we jointly train two systems, a generative model to produce response sequences, and a discrimi… ▽ More

    Submitted 23 September, 2017; v1 submitted 23 January, 2017; originally announced January 2017.

  12. arXiv:1605.02688  [pdf, other

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  13. arXiv:1503.01800  [pdf, other

    cs.LG cs.CV

    EmoNets: Multimodal deep learning approaches for emotion recognition in video

    Authors: Samira Ebrahimi Kahou, Xavier Bouthillier, Pascal Lamblin, Caglar Gulcehre, Vincent Michalski, Kishore Konda, Sébastien Jean, Pierre Froumenty, Yann Dauphin, Nicolas Boulanger-Lewandowski, Raul Chandias Ferrari, Mehdi Mirza, David Warde-Farley, Aaron Courville, Pascal Vincent, Roland Memisevic, Christopher Pal, Yoshua Bengio

    Abstract: The task of the emotion recognition in the wild (EmotiW) Challenge is to assign one of seven emotions to short video clips extracted from Hollywood style movies. The videos depict acted-out emotions under realistic conditions with a large degree of variation in attributes such as pose and illumination, making it worthwhile to explore approaches which consider combinations of features from multiple… ▽ More

    Submitted 29 March, 2015; v1 submitted 5 March, 2015; originally announced March 2015.

  14. arXiv:1412.6448  [pdf, other

    cs.CL

    Embedding Word Similarity with Neural Machine Translation

    Authors: Felix Hill, Kyunghyun Cho, Sebastien Jean, Coline Devin, Yoshua Bengio

    Abstract: Neural language models learn word representations, or embeddings, that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models, a recently-developed class of neural language model. We show that embeddings from translation models outperform those learned by monolingual models at tasks that require knowledge of both conceptu… ▽ More

    Submitted 3 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: arXiv admin note: text overlap with arXiv:1410.0718

  15. arXiv:1412.2007  [pdf, other

    cs.CL

    On Using Very Large Target Vocabulary for Neural Machine Translation

    Authors: Sébastien Jean, Kyunghyun Cho, Roland Memisevic, Yoshua Bengio

    Abstract: Neural machine translation, a recently proposed approach to machine translation based purely on neural networks, has shown promising results compared to the existing approaches such as phrase-based statistical machine translation. Despite its recent success, neural machine translation has its limitation in handling a larger vocabulary, as training complexity as well as decoding complexity increase… ▽ More

    Submitted 18 March, 2015; v1 submitted 5 December, 2014; originally announced December 2014.

  16. arXiv:1410.0718  [pdf, other

    cs.CL

    Not All Neural Embeddings are Born Equal

    Authors: Felix Hill, KyungHyun Cho, Sebastien Jean, Coline Devin, Yoshua Bengio

    Abstract: Neural language models learn word representations that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models. We show that translation-based embeddings outperform those learned by cutting-edge monolingual models at single-language tasks requiring knowledge of conceptual similarity and/or syntactic role. The findings sugg… ▽ More

    Submitted 13 November, 2014; v1 submitted 2 October, 2014; originally announced October 2014.

    Comments: 4 pages plus 1 page of references

  17. arXiv:1004.3851  [pdf, ps, other

    cond-mat.stat-mech

    Single file diffusion of macroscopic charged particles

    Authors: Christophe Coste, Jean-Baptiste Delfau, Catherine Even, Michel Saint Jean

    Abstract: In this paper, we study a macroscopic system of electrically interacting metallic beads organized as a sequence along an annulus. A random mechanical shaking mimics the thermal excitation. We exhibit non Fickian diffusion (Single File Diffusion) at large time. We measure the mobility of the particles, and compare it to theoretical expectations. We show that our system cannot be accurately describe… ▽ More

    Submitted 22 April, 2010; originally announced April 2010.

    Comments: 26

  18. arXiv:0803.3157  [pdf, ps, other

    cond-mat.mes-hall cond-mat.stat-mech

    Local Symmetries and Order-Disorder Transitions in Small Macroscopic Wigner Islands

    Authors: Gwennou Coupier, Claudine Guthmann, Yves Noat, Michel Saint Jean

    Abstract: The influence of local order on the disordering scenario of small Wigner islands is discussed. A first disordering step is put in evidence by the time correlation functions and is linked to individual excitations resulting in configuration transitions, which are very sensitive to the local symmetries. This is followed by two other transitions, corresponding to orthoradial and radial diffusion, f… ▽ More

    Submitted 21 March, 2008; originally announced March 2008.

    Comments: 14 pages, 10 figures

    Journal ref: Phys. Rev. E 71, 046105 (2005)

  19. Enhancement of mobilities in a pinned multidomain crystal

    Authors: Gwennou Coupier, Michel Saint Jean, Claudine Guthmann

    Abstract: Mobility properties inside and around degenerate domains of an elastic lattice partially pinned on a square array of traps are explored by means of a fully controllable model system of macroscopic particles. We focus on the different configurations obtained for filling ratios equal to 1 or 2 when the pinning strength is lowered. These theoretically expected but never observed configurations are… ▽ More

    Submitted 21 March, 2008; v1 submitted 22 November, 2006; originally announced November 2006.

    Comments: 7 pages, 10 figures Version 2 : longer version

    Journal ref: Phys. Rev. B 75, 224103 (2007)

  20. Single File Diffusion enhancement in a fluctuating modulated 1D channel

    Authors: Gwennou Coupier, Michel Saint Jean, Claudine Guthmann

    Abstract: We show that the diffusion of a single file of particles moving in a fluctuating modulated 1D channel is enhanced with respect to the one in a bald pipe. This effect, induced by the fluctuations of the modulation, is favored by the incommensurability between the channel potential modulation and the moving file periodicity. This phenomenon could be of importance in order to optimize the critical… ▽ More

    Submitted 19 October, 2006; originally announced October 2006.

    Comments: 4 pages, 4 figures

    Journal ref: Europhys. Lett. 77, 60001 (2007)

  21. Single file diffusion in macroscopic Wigner rings

    Authors: Gwennou Coupier, Michel Saint Jean, Claudine Guthmann

    Abstract: The single file diffusion in a circular channel of millimetric charged balls is studied. The evolution in time of the mean square displacement is shown to be subdiffusive, but slower than the power-like $t^{1/2}$ behavior observed in circular colloidal systems or predicted in one-dimensional infinite systems.

    Submitted 2 March, 2006; originally announced March 2006.

    Comments: 7 pages, 7 figures

    Journal ref: Phys. Rev. E 73, 031112 (2006)

  22. arXiv:cond-mat/0602272  [pdf, ps, other

    cond-mat.mes-hall cond-mat.supr-con

    Determination of the interactions in confined macroscopic Wigner islands: theory and experiments

    Authors: P. Galatola, G. Coupier, M. Saint Jean, J. -B. Fournier, C. Guthmann

    Abstract: Macroscopic Wigner islands present an interesting complementary approach to explore the properties of two-dimensional confined particles systems. In this work, we characterize theoretically and experimentally the interaction between their basic components, viz., conducting spheres lying on the bottom electrode of a plane condenser. We show that the interaction energy can be approximately describ… ▽ More

    Submitted 2 March, 2006; v1 submitted 10 February, 2006; originally announced February 2006.

    Comments: 8 pages, 8 figures

    Journal ref: Eur. Phys. J. B 50, 549 (2006)

  23. arXiv:cond-mat/0101285  [pdf

    cond-mat.mes-hall cond-mat.supr-con

    Macroscopic 2D Wigner islands

    Authors: M. Saint Jean, C. Even, C. Guthmann

    Abstract: In this paper we present new versatile "2D macroscopic Wigner islands" useful to investigate the various behaviors observed in mesoscopic confined systems. Our "Wigner islands" consist of electrostatically-interacting charged balls with millimetric size. We have experimentally determined the ground configurations for systems of N particles (N=1-30) confined in a parabolic potential and checked t… ▽ More

    Submitted 18 January, 2001; originally announced January 2001.

    Comments: 8 pages, 4 figures