Search | arXiv e-print repository

Évaluation des capacités de réponse de larges modèles de langage (LLM) pour des questions d'historiens

Authors: Mathieu Chartier, Nabil Dakkoune, Guillaume Bourgeois, Stéphane Jean

Abstract: Large Language Models (LLMs) like ChatGPT or Bard have revolutionized information retrieval and captivated the audience with their ability to generate custom responses in record time, regardless of the topic. In this article, we assess the capabilities of various LLMs in producing reliable, comprehensive, and sufficiently relevant responses about historical facts in French. To achieve this, we con… ▽ More Large Language Models (LLMs) like ChatGPT or Bard have revolutionized information retrieval and captivated the audience with their ability to generate custom responses in record time, regardless of the topic. In this article, we assess the capabilities of various LLMs in producing reliable, comprehensive, and sufficiently relevant responses about historical facts in French. To achieve this, we constructed a testbed comprising numerous history-related questions of varying types, themes, and levels of difficulty. Our evaluation of responses from ten selected LLMs reveals numerous shortcomings in both substance and form. Beyond an overall insufficient accuracy rate, we highlight uneven treatment of the French language, as well as issues related to verbosity and inconsistency in the responses provided by LLMs. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: in French language

arXiv:2305.15338 [pdf, other]

Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing

Authors: Shufan Wang, Sebastien Jean, Sailik Sengupta, James Gung, Nikolaos Pappas, Yi Zhang

Abstract: In executable task-oriented semantic parsing, the system aims to translate users' utterances in natural language to machine-interpretable programs (API calls) that can be executed according to pre-defined API specifications. With the popularity of Large Language Models (LLMs), in-context learning offers a strong baseline for such scenarios, especially in data-limited regimes. However, LLMs are kno… ▽ More In executable task-oriented semantic parsing, the system aims to translate users' utterances in natural language to machine-interpretable programs (API calls) that can be executed according to pre-defined API specifications. With the popularity of Large Language Models (LLMs), in-context learning offers a strong baseline for such scenarios, especially in data-limited regimes. However, LLMs are known to hallucinate and therefore pose a formidable challenge in constraining generated content. Thus, it remains uncertain if LLMs can effectively perform task-oriented utterance-to-API generation where respecting API's structural and task-specific constraints is crucial. In this work, we seek to measure, analyze and mitigate such constraints violations. First, we identify the categories of various constraints in obtaining API-semantics from task-oriented utterances, and define fine-grained metrics that complement traditional ones. Second, we leverage these metrics to conduct a detailed error analysis of constraints violations seen in state-of-the-art LLMs, which motivates us to investigate two mitigation strategies: Semantic-Retrieval of Demonstrations (SRD) and API-aware Constrained Decoding (API-CD). Our experiments show that these strategies are effective at reducing constraints violations and improving the quality of the generated API calls, but require careful consideration given their implementation complexity and latency. △ Less

Submitted 24 May, 2023; originally announced May 2023.

arXiv:2201.10446 [pdf, ps, other]

doi 10.1103/PhysRevE.105.054208

Parametric resonance in a conservative system of coupled nonlinear oscillators

Authors: Johann Maddi, Christophe Coste, Michel Saint Jean

Abstract: We study a conservative system of two nonlinear coupled oscillators. The eigenmodes of the system are thus nonlinearly coupled, and one of them may induce a parametric amplification of the other, called an autoparametric resonance of the system. The parametric amplification implies two time scales, a fast one for the forcing and a slow one for the forced mode, thus a multiscale expansion is suitab… ▽ More We study a conservative system of two nonlinear coupled oscillators. The eigenmodes of the system are thus nonlinearly coupled, and one of them may induce a parametric amplification of the other, called an autoparametric resonance of the system. The parametric amplification implies two time scales, a fast one for the forcing and a slow one for the forced mode, thus a multiscale expansion is suitable to get amplitude equations describing the slow dynamics of the oscillators. We recall the parametric resonance in a dissipationless system, the parametrically forced Duffing oscillator, with emphasis on the energy transfer between the oscillator and the source that ensures the parametric forcing. Energy conservation is observed when averaging is done on the slow time scale relevant to parametric amplification,evidenced by a constant of the motion in the amplitude equation. Then we study a dimer in a periodic potential well, which is a conservative but non integrable system. When the dimer energy is such that it is trapped in neighboring potential wells, we derive coupled nonlinear differential equations for the eigenmodes amplitudes (center of mass motion and relative motion). We exhibit two constants of the motion, which demonstrates that the amplitude equations are integrable. We establish the conditions for autoparametric amplification of the relative motion by the center of mass motion, and describe the phase portraits of the system. In the opposite limit, when the dimer slides along the external potential so that the center of mass motion is basically a translation, we calculate the amplitude equation for the relative motion. In this latter case, we also exhibit autoparametric amplification of the relative motions of the dimer particles. In both cases, the comparison between numerical integration of the actual system and the asymptotic analysis evidences an excellent agreement. △ Less

Submitted 25 January, 2022; originally announced January 2022.

arXiv:1910.14075 [pdf, other]

Fill in the Blanks: Imputing Missing Sentences for Larger-Context Neural Machine Translation

Authors: Sébastien Jean, Ankur Bapna, Orhan Firat

Abstract: Most neural machine translation systems still translate sentences in isolation. To make further progress, a promising line of research additionally considers the surrounding context in order to provide the model potentially missing source-side information, as well as to maintain a coherent output. One difficulty in training such larger-context (i.e. document-level) machine translation systems is t… ▽ More Most neural machine translation systems still translate sentences in isolation. To make further progress, a promising line of research additionally considers the surrounding context in order to provide the model potentially missing source-side information, as well as to maintain a coherent output. One difficulty in training such larger-context (i.e. document-level) machine translation systems is that context may be missing from many parallel examples. To circumvent this issue, two-stage approaches, in which sentence-level translations are post-edited in context, have recently been proposed. In this paper, we instead consider the viability of filling in the missing context. In particular, we consider three distinct approaches to generate the missing context: using random contexts, applying a copy heuristic or generating it with a language model. In particular, the copy heuristic significantly helps with lexical coherence, while using completely random contexts hurts performance on many long-distance linguistic phenomena. We also validate the usefulness of tagged back-translation. In addition to improving BLEU scores as expected, using back-translated data helps larger-context machine translation systems to better capture long-range phenomena. △ Less

Submitted 30 October, 2019; originally announced October 2019.

arXiv:1909.06434 [pdf, other]

Adaptive Scheduling for Multi-Task Learning

Authors: Sébastien Jean, Orhan Firat, Melvin Johnson

Abstract: To train neural machine translation models simultaneously on multiple tasks (languages), it is common to sample each task uniformly or in proportion to dataset sizes. As these methods offer little control over performance trade-offs, we explore different task scheduling approaches. We first consider existing non-adaptive techniques, then move on to adaptive schedules that over-sample tasks with po… ▽ More To train neural machine translation models simultaneously on multiple tasks (languages), it is common to sample each task uniformly or in proportion to dataset sizes. As these methods offer little control over performance trade-offs, we explore different task scheduling approaches. We first consider existing non-adaptive techniques, then move on to adaptive schedules that over-sample tasks with poorer results compared to their respective baseline. As explicit schedules can be inefficient, especially if one task is highly over-sampled, we also consider implicit schedules, learning to scale learning rates or gradients of individual tasks instead. These techniques allow training multilingual models that perform better for low-resource language pairs (tasks with small amount of data), while minimizing negative effects on high-resource tasks. △ Less

Submitted 13 September, 2019; originally announced September 2019.

Comments: Continual Learning Workshop at NeurIPS 2018

arXiv:1903.04715 [pdf, other]

Context-Aware Learning for Neural Machine Translation

Authors: Sébastien Jean, Kyunghyun Cho

Abstract: Interest in larger-context neural machine translation, including document-level and multi-modal translation, has been growing. Multiple works have proposed new network architectures or evaluation schemes, but potentially helpful context is still sometimes ignored by larger-context translation models. In this paper, we propose a novel learning algorithm that explicitly encourages a neural translati… ▽ More Interest in larger-context neural machine translation, including document-level and multi-modal translation, has been growing. Multiple works have proposed new network architectures or evaluation schemes, but potentially helpful context is still sometimes ignored by larger-context translation models. In this paper, we propose a novel learning algorithm that explicitly encourages a neural translation model to take into account additional context using a multilevel pair-wise ranking loss. We evaluate the proposed learning algorithm with a transformer-based larger-context translation system on document-level translation. By comparing performance using actual and random contexts, we show that a model trained with the proposed algorithm is more sensitive to the additional context. △ Less

Submitted 11 March, 2019; originally announced March 2019.

arXiv:1902.08295 [pdf, other]

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Authors: Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob , et al. (66 additional authors not shown)

Abstract: Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly w… ▽ More Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly within the framework, and it contains existing implementations of a large number of utilities, helper functions, and the newest research ideas. Lingvo has been used in collaboration by dozens of researchers in more than 20 papers over the last two years. This document outlines the underlying design of Lingvo and serves as an introduction to the various pieces of the framework, while also offering examples of advanced features that showcase the capabilities of the framework. △ Less

Submitted 21 February, 2019; originally announced February 2019.

arXiv:1902.05620 [pdf]

doi 10.1016/j.nimb.2019.07.011

Calculation and verification of neutron irradiation damage with differential cross sections

Authors: Shengli Chen, David Bernard, Pierre Tamagno, Jean Tommasi, Stephane Bourganel, Gilles Noguere, Cyrille De Saint Jean

Abstract: The Displacement per Atom (DPA) rate is conventionally computed with DPA cross sections in reactor applications. The method of direct calculation with energy-angular distributions given in the Center of Mass (CM) frame is proposed and recommended in the present work. The methods for refining and verifying the calculations of DPA cross sections are proposed: (i) Gauss-Legendre-Quadrature-based Piec… ▽ More The Displacement per Atom (DPA) rate is conventionally computed with DPA cross sections in reactor applications. The method of direct calculation with energy-angular distributions given in the Center of Mass (CM) frame is proposed and recommended in the present work. The methods for refining and verifying the calculations of DPA cross sections are proposed: (i) Gauss-Legendre-Quadrature-based Piecewise Integration (GLQPI) for ensuring the numeric convergence of integral over emission angle due to the discontinuity of integrand; (ii) verification of the convergence for trapezoidal integration over the secondary energy; (iii) interpolation of double-differential cross sections. For 56Fe of JEFF-3.1.1, the current numeric integration over emission angle is shown not convergent, whereas the direct trapezoidal over the secondary energy and the direct interpolation of energy-angle-integrated damage are shown accurate. On the other hand, it is shown that the DPA cross sections are overestimated if isotropic angular distributions are assumed. However, the DPA cross section is not sensitive to the high-order Legendre polynomials because the former is an angle-integrated quantity. Numerical results of neutron elastic scattering show that 2 orders of Legendre polynomials can give the DPA rates of 56Fe within 0.5% overestimation for fission reactors, while 4 orders are required for fusion reactors. For neutron inelastic scatterings-induced DPA, the first order Legendre polynomial is sufficient for both fission and fusion reactors. △ Less

Submitted 19 July, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

Comments: arXiv admin note: text overlap with arXiv:1902.04889

Journal ref: Nuclear Instruments and Methods in Physics Research Section B: Beam Interactions with Materials and Atoms, Volume 456, 1 October 2019, Pages 120-132

arXiv:1902.04889 [pdf]

doi 10.1016/j.nimb.2019.07.011

Calculation and Verification of Irradiation Damage Cross Section with Energy-Angular Distribution

Authors: Shengli Chen, David Bernard, Pierre Tamagno, Cyrille De Saint Jean

Abstract: To complete the computation of Displacements per Atom (DPA) cross sections, the present work shows the methods of calculating DPA cross sections with the nuclear data of energy-angular distribution in both the laboratory and the Center-of-Mass (CM) frames. The method of direct calculation with data in the CM frame is proposed and recommended to decrease the computation burden and keep all informat… ▽ More To complete the computation of Displacements per Atom (DPA) cross sections, the present work shows the methods of calculating DPA cross sections with the nuclear data of energy-angular distribution in both the laboratory and the Center-of-Mass (CM) frames. The method of direct calculation with data in the CM frame is proposed and recommended to decrease the computation burden and keep all information. Theoretical analyses reveal that more than 7-point Gauss-Legendre Quadrature (GLQ) should be used to ensure the convergence of the angular integration for DPA computations. Numerical results show that 8-point GLQ is sufficient for the continuum inelastic neutron scattering, while 64-point GLQ is implemented in NJOY. Because the integrand over secondary energy is not derivable in the whole domain of the secondary energy, the trapezoidal integration is used to perform the numerical integration. The numerical calculations show that the trapezoidal integration is suitable to perform the integration over the secondary energy on the fine grid given by nuclear data files at least for 56Fe. The present work reveals that the direct interpolation of energy-angular-integrated damage can give the same results computed with standard interpolated energy-angular distributions. The DPA cross sections will be overestimated if isotropic angular distributions are assumed. However, the first-order Legendre polynomial can give DPA cross sections within 0.4% deviation, while 12 orders are required to describe the anisotropic angular distribution. △ Less

Submitted 8 February, 2019; originally announced February 2019.

Journal ref: Nuclear Inst. and Methods in Physics Research, B 456 (2019) pp. 120-132

arXiv:1704.05135 [pdf, ps, other]

Does Neural Machine Translation Benefit from Larger Context?

Authors: Sebastien Jean, Stanislas Lauly, Orhan Firat, Kyunghyun Cho

Abstract: We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when trained on small corpora, although this improvement largely disappears when trained with a larger corpus. We also discover that attention-based neural machine tran… ▽ More We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when trained on small corpora, although this improvement largely disappears when trained with a larger corpus. We also discover that attention-based neural machine translation is well suited for pronoun prediction and compares favorably with other approaches that were specifically designed for this task. △ Less

Submitted 17 April, 2017; originally announced April 2017.

arXiv:1701.06547 [pdf, ps, other]

Adversarial Learning for Neural Dialogue Generation

Authors: Jiwei Li, Will Monroe, Tianlin Shi, Sébastien Jean, Alan Ritter, Dan Jurafsky

Abstract: In this paper, drawing intuition from the Turing test, we propose using adversarial training for open-domain dialogue generation: the system is trained to produce sequences that are indistinguishable from human-generated dialogue utterances. We cast the task as a reinforcement learning (RL) problem where we jointly train two systems, a generative model to produce response sequences, and a discrimi… ▽ More In this paper, drawing intuition from the Turing test, we propose using adversarial training for open-domain dialogue generation: the system is trained to produce sequences that are indistinguishable from human-generated dialogue utterances. We cast the task as a reinforcement learning (RL) problem where we jointly train two systems, a generative model to produce response sequences, and a discriminator---analagous to the human evaluator in the Turing test--- to distinguish between the human-generated dialogues and the machine-generated ones. The outputs from the discriminator are then used as rewards for the generative model, pushing the system to generate dialogues that mostly resemble human dialogues. In addition to adversarial training we describe a model for adversarial {\em evaluation} that uses success in fooling an adversary as a dialogue evaluation metric, while avoiding a number of potential pitfalls. Experimental results on several metrics, including adversarial evaluation, demonstrate that the adversarially-trained system generates higher-quality responses than previous baselines. △ Less

Submitted 23 September, 2017; v1 submitted 23 January, 2017; originally announced January 2017.

arXiv:1605.02688 [pdf, other]

Theano: A Python framework for fast computation of mathematical expressions

Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, multiple frameworks have been built on top of it and it has been used to produce many state-of-the-art machine learning models. The present article is structured as follows. Section I provides an overview of the Theano software and its community. Section II presents the principal features of Theano and how to use them, and compares them with other similar projects. Section III focuses on recently-introduced functionalities and improvements. Section IV compares the performance of Theano against Torch7 and TensorFlow on several machine learning models. Section V discusses current limitations of Theano and potential ways of improving it. △ Less

Submitted 9 May, 2016; originally announced May 2016.

Comments: 19 pages, 5 figures

arXiv:1503.01800 [pdf, other]

EmoNets: Multimodal deep learning approaches for emotion recognition in video

Authors: Samira Ebrahimi Kahou, Xavier Bouthillier, Pascal Lamblin, Caglar Gulcehre, Vincent Michalski, Kishore Konda, Sébastien Jean, Pierre Froumenty, Yann Dauphin, Nicolas Boulanger-Lewandowski, Raul Chandias Ferrari, Mehdi Mirza, David Warde-Farley, Aaron Courville, Pascal Vincent, Roland Memisevic, Christopher Pal, Yoshua Bengio

Abstract: The task of the emotion recognition in the wild (EmotiW) Challenge is to assign one of seven emotions to short video clips extracted from Hollywood style movies. The videos depict acted-out emotions under realistic conditions with a large degree of variation in attributes such as pose and illumination, making it worthwhile to explore approaches which consider combinations of features from multiple… ▽ More The task of the emotion recognition in the wild (EmotiW) Challenge is to assign one of seven emotions to short video clips extracted from Hollywood style movies. The videos depict acted-out emotions under realistic conditions with a large degree of variation in attributes such as pose and illumination, making it worthwhile to explore approaches which consider combinations of features from multiple modalities for label assignment. In this paper we present our approach to learning several specialist models using deep learning techniques, each focusing on one modality. Among these are a convolutional neural network, focusing on capturing visual information in detected faces, a deep belief net focusing on the representation of the audio stream, a K-Means based "bag-of-mouths" model, which extracts visual features around the mouth region and a relational autoencoder, which addresses spatio-temporal aspects of videos. We explore multiple methods for the combination of cues from these modalities into one common classifier. This achieves a considerably greater accuracy than predictions from our strongest single-modality classifier. Our method was the winning submission in the 2013 EmotiW challenge and achieved a test set accuracy of 47.67% on the 2014 dataset. △ Less

Submitted 29 March, 2015; v1 submitted 5 March, 2015; originally announced March 2015.

arXiv:1412.6448 [pdf, other]

Embedding Word Similarity with Neural Machine Translation

Authors: Felix Hill, Kyunghyun Cho, Sebastien Jean, Coline Devin, Yoshua Bengio

Abstract: Neural language models learn word representations, or embeddings, that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models, a recently-developed class of neural language model. We show that embeddings from translation models outperform those learned by monolingual models at tasks that require knowledge of both conceptu… ▽ More Neural language models learn word representations, or embeddings, that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models, a recently-developed class of neural language model. We show that embeddings from translation models outperform those learned by monolingual models at tasks that require knowledge of both conceptual similarity and lexical-syntactic role. We further show that these effects hold when translating from both English to French and English to German, and argue that the desirable properties of translation embeddings should emerge largely independently of the source and target languages. Finally, we apply a new method for training neural translation models with very large vocabularies, and show that this vocabulary expansion algorithm results in minimal degradation of embedding quality. Our embedding spaces can be queried in an online demo and downloaded from our web page. Overall, our analyses indicate that translation-based embeddings should be used in applications that require concepts to be organised according to similarity and/or lexical function, while monolingual embeddings are better suited to modelling (nonspecific) inter-word relatedness. △ Less

Submitted 3 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

Comments: arXiv admin note: text overlap with arXiv:1410.0718

arXiv:1412.2007 [pdf, other]

On Using Very Large Target Vocabulary for Neural Machine Translation

Authors: Sébastien Jean, Kyunghyun Cho, Roland Memisevic, Yoshua Bengio

Abstract: Neural machine translation, a recently proposed approach to machine translation based purely on neural networks, has shown promising results compared to the existing approaches such as phrase-based statistical machine translation. Despite its recent success, neural machine translation has its limitation in handling a larger vocabulary, as training complexity as well as decoding complexity increase… ▽ More Neural machine translation, a recently proposed approach to machine translation based purely on neural networks, has shown promising results compared to the existing approaches such as phrase-based statistical machine translation. Despite its recent success, neural machine translation has its limitation in handling a larger vocabulary, as training complexity as well as decoding complexity increase proportionally to the number of target words. In this paper, we propose a method that allows us to use a very large target vocabulary without increasing training complexity, based on importance sampling. We show that decoding can be efficiently done even with the model having a very large target vocabulary by selecting only a small subset of the whole target vocabulary. The models trained by the proposed approach are empirically found to outperform the baseline models with a small vocabulary as well as the LSTM-based neural machine translation models. Furthermore, when we use the ensemble of a few models with very large target vocabularies, we achieve the state-of-the-art translation performance (measured by BLEU) on the English->German translation and almost as high performance as state-of-the-art English->French translation system. △ Less

Submitted 18 March, 2015; v1 submitted 5 December, 2014; originally announced December 2014.

arXiv:1410.0718 [pdf, other]

Not All Neural Embeddings are Born Equal

Authors: Felix Hill, KyungHyun Cho, Sebastien Jean, Coline Devin, Yoshua Bengio

Abstract: Neural language models learn word representations that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models. We show that translation-based embeddings outperform those learned by cutting-edge monolingual models at single-language tasks requiring knowledge of conceptual similarity and/or syntactic role. The findings sugg… ▽ More Neural language models learn word representations that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models. We show that translation-based embeddings outperform those learned by cutting-edge monolingual models at single-language tasks requiring knowledge of conceptual similarity and/or syntactic role. The findings suggest that, while monolingual models learn information about how concepts are related, neural-translation models better capture their true ontological status. △ Less

Submitted 13 November, 2014; v1 submitted 2 October, 2014; originally announced October 2014.

Comments: 4 pages plus 1 page of references

arXiv:1004.3851 [pdf, ps, other]

doi 10.1103/PhysRevE.81.051201

Single file diffusion of macroscopic charged particles

Authors: Christophe Coste, Jean-Baptiste Delfau, Catherine Even, Michel Saint Jean

Abstract: In this paper, we study a macroscopic system of electrically interacting metallic beads organized as a sequence along an annulus. A random mechanical shaking mimics the thermal excitation. We exhibit non Fickian diffusion (Single File Diffusion) at large time. We measure the mobility of the particles, and compare it to theoretical expectations. We show that our system cannot be accurately describe… ▽ More In this paper, we study a macroscopic system of electrically interacting metallic beads organized as a sequence along an annulus. A random mechanical shaking mimics the thermal excitation. We exhibit non Fickian diffusion (Single File Diffusion) at large time. We measure the mobility of the particles, and compare it to theoretical expectations. We show that our system cannot be accurately described by theories assuming only hard sphere interactions. Its behavior is qualitatively described by a theory extended to more realistic potentials [Kollmann, PRL {\bf 90} 180602, (2003)]. A correct quantitative agreement is shown, and we interpret the discrepancies by the violation of a key assumption of the theory, that of overdamped dynamics. We recast previous results on colloids with known interaction potentials, and compare them quantitatively to the theory. Focusing on the transition between ordinary and single file diffusion, we exhibit a dimensionless crossover time that is of order one both for colloids and our system, although the time and length scales differ by several orders of magnitude. △ Less

Submitted 22 April, 2010; originally announced April 2010.

Comments: 26

arXiv:0803.3157 [pdf, ps, other]

doi 10.1103/PhysRevE.71.046105

Local Symmetries and Order-Disorder Transitions in Small Macroscopic Wigner Islands

Authors: Gwennou Coupier, Claudine Guthmann, Yves Noat, Michel Saint Jean

Abstract: The influence of local order on the disordering scenario of small Wigner islands is discussed. A first disordering step is put in evidence by the time correlation functions and is linked to individual excitations resulting in configuration transitions, which are very sensitive to the local symmetries. This is followed by two other transitions, corresponding to orthoradial and radial diffusion, f… ▽ More The influence of local order on the disordering scenario of small Wigner islands is discussed. A first disordering step is put in evidence by the time correlation functions and is linked to individual excitations resulting in configuration transitions, which are very sensitive to the local symmetries. This is followed by two other transitions, corresponding to orthoradial and radial diffusion, for which both individual and collective excitations play a significant role. Finally, we show that, contrary to large systems, the focus that is commonly made on collective excitations for such small systems through the Lindemann criterion has to be made carefully in order to clearly identify the relative contributions in the whole disordering process. △ Less

Submitted 21 March, 2008; originally announced March 2008.

Comments: 14 pages, 10 figures

Journal ref: Phys. Rev. E 71, 046105 (2005)

arXiv:cond-mat/0611582 [pdf, ps, other]

doi 10.1103/PhysRevB.75.224103

Enhancement of mobilities in a pinned multidomain crystal

Authors: Gwennou Coupier, Michel Saint Jean, Claudine Guthmann

Abstract: Mobility properties inside and around degenerate domains of an elastic lattice partially pinned on a square array of traps are explored by means of a fully controllable model system of macroscopic particles. We focus on the different configurations obtained for filling ratios equal to 1 or 2 when the pinning strength is lowered. These theoretically expected but never observed configurations are… ▽ More Mobility properties inside and around degenerate domains of an elastic lattice partially pinned on a square array of traps are explored by means of a fully controllable model system of macroscopic particles. We focus on the different configurations obtained for filling ratios equal to 1 or 2 when the pinning strength is lowered. These theoretically expected but never observed configurations are degenerated, which implies the existence of a multidomain crystal. We show that the distinction between trapped and untrapped particles that is made in the case of strong pinning is not relevant for such a weaker pinning. Indeed, one ought to distinguish between particles inside or around the domains associated to positional degeneracies. The possible consequences on the depinning dynamics of the lattice are discussed. △ Less

Submitted 21 March, 2008; v1 submitted 22 November, 2006; originally announced November 2006.

Comments: 7 pages, 10 figures Version 2 : longer version

Journal ref: Phys. Rev. B 75, 224103 (2007)

arXiv:cond-mat/0610545 [pdf, ps, other]

doi 10.1209/0295-5075/77/60001

Single File Diffusion enhancement in a fluctuating modulated 1D channel

Authors: Gwennou Coupier, Michel Saint Jean, Claudine Guthmann

Abstract: We show that the diffusion of a single file of particles moving in a fluctuating modulated 1D channel is enhanced with respect to the one in a bald pipe. This effect, induced by the fluctuations of the modulation, is favored by the incommensurability between the channel potential modulation and the moving file periodicity. This phenomenon could be of importance in order to optimize the critical… ▽ More We show that the diffusion of a single file of particles moving in a fluctuating modulated 1D channel is enhanced with respect to the one in a bald pipe. This effect, induced by the fluctuations of the modulation, is favored by the incommensurability between the channel potential modulation and the moving file periodicity. This phenomenon could be of importance in order to optimize the critical current in superconductors, in particular in the case where mobile vortices move in 1D channels designed by adapted patterns of pinning sites. △ Less

Submitted 19 October, 2006; originally announced October 2006.

Comments: 4 pages, 4 figures

Journal ref: Europhys. Lett. 77, 60001 (2007)

arXiv:cond-mat/0603050 [pdf, ps, other]

doi 10.1103/PhysRevE.73.031112

Single file diffusion in macroscopic Wigner rings

Authors: Gwennou Coupier, Michel Saint Jean, Claudine Guthmann

Abstract: The single file diffusion in a circular channel of millimetric charged balls is studied. The evolution in time of the mean square displacement is shown to be subdiffusive, but slower than the power-like $t^{1/2}$ behavior observed in circular colloidal systems or predicted in one-dimensional infinite systems. The single file diffusion in a circular channel of millimetric charged balls is studied. The evolution in time of the mean square displacement is shown to be subdiffusive, but slower than the power-like $t^{1/2}$ behavior observed in circular colloidal systems or predicted in one-dimensional infinite systems. △ Less

Submitted 2 March, 2006; originally announced March 2006.

Comments: 7 pages, 7 figures

Journal ref: Phys. Rev. E 73, 031112 (2006)

arXiv:cond-mat/0602272 [pdf, ps, other]

doi 10.1140/epjb/e2006-00183-0

Determination of the interactions in confined macroscopic Wigner islands: theory and experiments

Authors: P. Galatola, G. Coupier, M. Saint Jean, J. -B. Fournier, C. Guthmann

Abstract: Macroscopic Wigner islands present an interesting complementary approach to explore the properties of two-dimensional confined particles systems. In this work, we characterize theoretically and experimentally the interaction between their basic components, viz., conducting spheres lying on the bottom electrode of a plane condenser. We show that the interaction energy can be approximately describ… ▽ More Macroscopic Wigner islands present an interesting complementary approach to explore the properties of two-dimensional confined particles systems. In this work, we characterize theoretically and experimentally the interaction between their basic components, viz., conducting spheres lying on the bottom electrode of a plane condenser. We show that the interaction energy can be approximately described by a decaying exponential as well as by a modified Bessel function of the second kind. In particular, this implies that the interactions in this system, whose characteristics are easily controllable, are the same as those between vortices in type-II superconductors. △ Less

Submitted 2 March, 2006; v1 submitted 10 February, 2006; originally announced February 2006.

Comments: 8 pages, 8 figures

Journal ref: Eur. Phys. J. B 50, 549 (2006)

arXiv:cond-mat/0101285 [pdf]

doi 10.1209/epl/i2001-00379-x

Macroscopic 2D Wigner islands

Authors: M. Saint Jean, C. Even, C. Guthmann

Abstract: In this paper we present new versatile "2D macroscopic Wigner islands" useful to investigate the various behaviors observed in mesoscopic confined systems. Our "Wigner islands" consist of electrostatically-interacting charged balls with millimetric size. We have experimentally determined the ground configurations for systems of N particles (N=1-30) confined in a parabolic potential and checked t… ▽ More In this paper we present new versatile "2D macroscopic Wigner islands" useful to investigate the various behaviors observed in mesoscopic confined systems. Our "Wigner islands" consist of electrostatically-interacting charged balls with millimetric size. We have experimentally determined the ground configurations for systems of N particles (N=1-30) confined in a parabolic potential and checked the influence of the confinement and interacting potentials. The results obtained are compared with the published numerical results. △ Less

Submitted 18 January, 2001; originally announced January 2001.

Comments: 8 pages, 4 figures

Showing 1–23 of 23 results for author: Jean, S