Skip to main content

Showing 1–25 of 25 results for author: Chronopoulou, A

.
  1. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2311.09344  [pdf, other

    cs.CL

    Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization

    Authors: Alexandra Chronopoulou, Jonas Pfeiffer, Joshua Maynez, Xinyi Wang, Sebastian Ruder, Priyanka Agrawal

    Abstract: Parameter-efficient fine-tuning (PEFT) using labeled task data can significantly improve the performance of large language models (LLMs) on the downstream task. However, there are 7000 languages in the world and many of these languages lack labeled data for real-world language generation tasks. In this paper, we propose to improve zero-shot cross-lingual transfer by composing language or task spec… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  3. arXiv:2305.17182  [pdf, other

    cs.CL cs.AI

    On the Copying Problem of Unsupervised NMT: A Training Schedule with a Language Discriminator Loss

    Authors: Yihong Liu, Alexandra Chronopoulou, Hinrich Schütze, Alexander Fraser

    Abstract: Although unsupervised neural machine translation (UNMT) has achieved success in many language pairs, the copying problem, i.e., directly copying some parts of the input sentence as the translation, is common among distant language pairs, especially when low-resource languages are involved. We find this issue is closely related to an unexpected copying behavior during online back-translation (BT).… ▽ More

    Submitted 4 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: IWSLT 2023

  4. arXiv:2305.13204  [pdf, other

    cs.CL cs.SD eess.AS

    Improving Isochronous Machine Translation with Target Factors and Auxiliary Counters

    Authors: Proyag Pal, Brian Thompson, Yogesh Virkar, Prashant Mathur, Alexandra Chronopoulou, Marcello Federico

    Abstract: To translate speech for automatic dubbing, machine translation needs to be isochronous, i.e. translated speech needs to be aligned with the source in terms of speech durations. We introduce target factors in a transformer model to predict durations jointly with target language phoneme sequences. We also introduce auxiliary counters to help the decoder to keep track of the timing information while… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at INTERSPEECH 2023

  5. arXiv:2305.12786  [pdf, other

    cs.CL

    Mitigating Data Imbalance and Representation Degeneration in Multilingual Machine Translation

    Authors: Wen Lai, Alexandra Chronopoulou, Alexander Fraser

    Abstract: Despite advances in multilingual neural machine translation (MNMT), we argue that there are still two major challenges in this area: data imbalance and representation degeneration. The data imbalance problem refers to the imbalance in the amount of parallel corpora for all language pairs, especially for long-tail languages (i.e., very low-resource languages). The representation degeneration proble… ▽ More

    Submitted 24 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of EMNLP 2023, add statistical significance tests. code available at https://github.com/lavine-lmu/Bi-ACL

  6. arXiv:2304.07661  [pdf, other

    stat.ME math.PR math.ST

    Optimal Sampling for Estimation of Fractional Brownian Motion

    Authors: Xiang Cui, Alexandra Chronopoulou

    Abstract: In this paper, we focus on multiple sampling problems for the estimation of the fractional Brownian motion when the maximum number of samples is limited, extending existing results in the literature in a non-Markovian framework. Two classes of sampling schemes are proposed: a deterministic scheme and a level-triggered scheme. For the deterministic sampling scheme, the sampling times are selected b… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

  7. arXiv:2302.12979  [pdf, other

    cs.CL cs.SD eess.AS

    Jointly Optimizing Translations and Speech Timing to Improve Isochrony in Automatic Dubbing

    Authors: Alexandra Chronopoulou, Brian Thompson, Prashant Mathur, Yogesh Virkar, Surafel M. Lakew, Marcello Federico

    Abstract: Automatic dubbing (AD) is the task of translating the original speech in a video into target language speech. The new target language speech should satisfy isochrony; that is, the new speech should be time aligned with the original video, including mouth movements, pauses, hand gestures, etc. In this paper, we propose training a model that directly optimizes both the translation as well as the spe… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 5 pages

  8. arXiv:2302.07027  [pdf, other

    cs.CL

    AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models

    Authors: Alexandra Chronopoulou, Matthew E. Peters, Alexander Fraser, Jesse Dodge

    Abstract: Pretrained language models (PLMs) are trained on massive corpora, but often need to specialize to specific domains. A parameter-efficient adaptation method suggests training an adapter for each domain on the task of language modeling. This leads to good in-domain scores but can be impractical for domain- or resource-restricted settings. A solution is to use a related-domain adapter for the novel d… ▽ More

    Submitted 28 March, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: Accepted at EACL 2023; camera-ready version; fixed typo in related work

  9. arXiv:2210.11912  [pdf, other

    cs.CL

    $m^4Adapter$: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter

    Authors: Wen Lai, Alexandra Chronopoulou, Alexander Fraser

    Abstract: Multilingual neural machine translation models (MNMT) yield state-of-the-art performance when evaluated on data from a domain and language pair seen at training time. However, when a MNMT model is used to translate under domain shift or to a new language pair, performance drops dramatically. We consider a very challenging scenario: adapting the MNMT model both to a new domain and to a new language… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted to Findings of EMNLP 2022

  10. arXiv:2209.15236  [pdf, other

    cs.CL

    Language-Family Adapters for Low-Resource Multilingual Neural Machine Translation

    Authors: Alexandra Chronopoulou, Dario Stojanovski, Alexander Fraser

    Abstract: Large multilingual models trained with self-supervision achieve state-of-the-art results in a wide range of natural language processing tasks. Self-supervised pretrained models are often fine-tuned on parallel data from one or multiple language pairs for machine translation. Multilingual fine-tuning improves performance on low-resource languages but requires modifying the entire model and can be p… ▽ More

    Submitted 29 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: LoResMT (@EACL 2023) camera-ready version

  11. arXiv:2112.08786  [pdf, other

    cs.CL

    Efficient Hierarchical Domain Adaptation for Pretrained Language Models

    Authors: Alexandra Chronopoulou, Matthew E. Peters, Jesse Dodge

    Abstract: The remarkable success of large language models has been driven by dense models trained on massive unlabeled, unstructured corpora. These corpora typically contain text from diverse, heterogeneous sources, but information about the source of the text is rarely used during training. Transferring their knowledge to a target domain is typically done by continuing training in-domain. In this paper, we… ▽ More

    Submitted 3 May, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: NAACL 2022 accepted paper camera ready version

  12. arXiv:2103.10531  [pdf, other

    cs.CL

    Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Translation

    Authors: Alexandra Chronopoulou, Dario Stojanovski, Alexander Fraser

    Abstract: Successful methods for unsupervised neural machine translation (UNMT) employ crosslingual pretraining via self-supervision, often in the form of a masked language modeling or a sequence generation task, which requires the model to align the lexical- and high-level representations of the two languages. While cross-lingual pretraining works for similar languages with abundant corpora, it performs po… ▽ More

    Submitted 14 April, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: Accepted at NAACL 2021

  13. arXiv:2010.13192  [pdf, other

    cs.CL

    The LMU Munich System for the WMT 2020 Unsupervised Machine Translation Shared Task

    Authors: Alexandra Chronopoulou, Dario Stojanovski, Viktor Hangya, Alexander Fraser

    Abstract: This paper describes the submission of LMU Munich to the WMT 2020 unsupervised shared task, in two language directions, German<->Upper Sorbian. Our core unsupervised neural machine translation (UNMT) system follows the strategy of Chronopoulou et al. (2020), using a monolingual pretrained language generation model (on German) and fine-tuning it on both German and Upper Sorbian, before initializing… ▽ More

    Submitted 25 October, 2020; originally announced October 2020.

    Comments: WMT Unsupervised Shared Task 2020

  14. arXiv:2009.13366  [pdf, other

    cs.LG stat.ML

    Domain Adversarial Fine-Tuning as an Effective Regularizer

    Authors: Giorgos Vernikos, Katerina Margatina, Alexandra Chronopoulou, Ion Androutsopoulos

    Abstract: In Natural Language Processing (NLP), pretrained language models (LMs) that are transferred to downstream tasks have been recently shown to achieve state-of-the-art results. However, standard fine-tuning can degrade the general-domain representations captured during pretraining. To address this issue, we introduce a new regularization technique, AFTER; domain Adversarial Fine-Tuning as an Effectiv… ▽ More

    Submitted 5 October, 2020; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: EMNLP 2020, Findings of EMNLP

  15. arXiv:2009.07610  [pdf, other

    cs.CL

    Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT

    Authors: Alexandra Chronopoulou, Dario Stojanovski, Alexander Fraser

    Abstract: Using a language model (LM) pretrained on two languages with large monolingual data in order to initialize an unsupervised neural machine translation (UNMT) system yields state-of-the-art results. When limited data is available for one language, however, this method leads to poor translations. We present an effective approach that reuses an LM that is pretrained only on the high-resource language.… ▽ More

    Submitted 6 October, 2020; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: EMNLP 2020, main conference

  16. arXiv:1902.10547  [pdf, other

    cs.CL cs.LG

    An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models

    Authors: Alexandra Chronopoulou, Christos Baziotis, Alexandros Potamianos

    Abstract: A growing number of state-of-the-art transfer learning methods employ language models pretrained on large generic corpora. In this paper we present a conceptually simple and effective transfer learning approach that addresses the problem of catastrophic forgetting. Specifically, we combine the task-specific optimization function with an auxiliary language model objective, which is adjusted during… ▽ More

    Submitted 31 May, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: NAACL 2019

  17. arXiv:1809.00717  [pdf, other

    cs.CL

    NTUA-SLP at IEST 2018: Ensemble of Neural Transfer Methods for Implicit Emotion Classification

    Authors: Alexandra Chronopoulou, Aikaterini Margatina, Christos Baziotis, Alexandros Potamianos

    Abstract: In this paper we present our approach to tackle the Implicit Emotion Shared Task (IEST) organized as part of WASSA 2018 at EMNLP 2018. Given a tweet, from which a certain word has been removed, we are asked to predict the emotion of the missing word. In this work, we experiment with neural Transfer Learning (TL) methods. Our models are based on LSTM networks, augmented with a self-attention mechan… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

  18. arXiv:1805.01603  [pdf, ps, other

    stat.AP

    A Customer Choice Model with HALO Effect

    Authors: Reza Yousefi Maragheh, Alexandra Chronopoulou, James Mario Davis

    Abstract: In this paper, we propose an extension to the multinomial logit (MNL) model, the Halo MNL, that takes into account the interaction effects among products in an assortment. In particular, this model incorporates pairwise interactions of items in an effort to describe positive/negative effects among products that are present/absent in the assortment. Furthermore, we are interested in establishing su… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

  19. arXiv:1804.06658  [pdf, other

    cs.CL

    NTUA-SLP at SemEval-2018 Task 1: Predicting Affective Content in Tweets with Deep Attentive RNNs and Transfer Learning

    Authors: Christos Baziotis, Nikos Athanasiou, Alexandra Chronopoulou, Athanasia Kolovou, Georgios Paraskevopoulos, Nikolaos Ellinas, Shrikanth Narayanan, Alexandros Potamianos

    Abstract: In this paper we present deep-learning models that submitted to the SemEval-2018 Task~1 competition: "Affect in Tweets". We participated in all subtasks for English tweets. We propose a Bi-LSTM architecture equipped with a multi-layer self attention mechanism. The attention mechanism improves the model performance and allows us to identify salient words in tweets, as well as gain insight into the… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.

    Comments: Semeval 2018, Task 1 "Affect in Tweets"

  20. arXiv:1508.02651  [pdf, other

    stat.ME math.ST stat.CO

    Sequential Monte Carlo for fractional Stochastic Volatility Models

    Authors: Alexandra Chronopoulou, Konstantinos Spiliopoulos

    Abstract: In this paper we consider a fractional stochastic volatility model, that is a model in which the volatility may exhibit a long-range dependent or a rough/antipersistent behavior. We propose a dynamic sequential Monte Carlo methodology that is applicable to both long memory and antipersistent processes in order to estimate the volatility as well as the unknown parameters of the model. We establish… ▽ More

    Submitted 25 February, 2017; v1 submitted 11 August, 2015; originally announced August 2015.

  21. arXiv:1301.6413  [pdf, ps, other

    math.ST math.PR

    Maximum likelihood estimation for small noise multiscale diffusions

    Authors: Konstantinos Spiliopoulos, Alexandra Chronopoulou

    Abstract: We study the problem of parameter estimation for stochastic differential equations with small noise and fast oscillating parameters. Depending on how fast the intensity of the noise goes to zero relative to the homogenization parameter, we consider three different regimes. For each regime, we construct the maximum likelihood estimator and we study its consistency and asymptotic normality propertie… ▽ More

    Submitted 18 February, 2015; v1 submitted 27 January, 2013; originally announced January 2013.

    MSC Class: 62M05; 62M86; 60F05; 60G99

    Journal ref: Statistical Inference for Stochastic Processes,Volume 16, Issue 3, 2013, pp. 237-266

  22. arXiv:1104.3966  [pdf, ps, other

    math.PR math.ST

    On inference for fractional differential equations

    Authors: Alexandra Chronopoulou, Samy Tindel

    Abstract: Based on Malliavin calculus tools and approximation results, we show how to compute a maximum likelihood type estimator for a rather general differential equation driven by a fractional Brownian motion with Hurst parameter H>1/2. Rates of convergence for the approximation task are provided, and numerical experiments show that our procedure leads to good results in terms of estimation.

    Submitted 20 April, 2011; originally announced April 2011.

    Comments: 33 pages, 2 figures

    Journal ref: Stat. Inference Stoch. Process. 16, 1 (2013) 29-61

  23. arXiv:1102.0598  [pdf, ps, other

    math.ST stat.ME

    Optimal sequential change-detection for fractional diffusion-type processes

    Authors: Alexandra Chronopoulou, Georgios Fellouris

    Abstract: We consider the problem of detecting an abrupt change in the distribution of a sequentially observed stochastic process. We establish the optimality of the CUSUM test with respect to a modified version of Lorden's criterion for arbitrary processes with continuous paths and apply this general result to the special case of fractional diffusion-type processes. As a by-product, we show that the CUSUM… ▽ More

    Submitted 12 July, 2012; v1 submitted 2 February, 2011; originally announced February 2011.

    Comments: 23 pages

    MSC Class: 60G35; 60G22 (Primary) 60L10; 60G40 (Secondary)

  24. arXiv:0912.3148  [pdf, ps, other

    math.PR math.ST

    Variations and Hurst index estimation for a Rosenblatt process using longer filters

    Authors: Alexandra Chronopoulou, Ciprian Tudor, Frederi Viens

    Abstract: The Rosenblatt process is a self-similar non-Gaussian process which lives in second Wiener chaos, and occurs as the limit of correlated random sequences in so-called \textquotedblleft non-central limit theorems\textquotedblright. It shares the same covariance as fractional Brownian motion. We study the asymptotic distribution of the quadratic variations of the Rosenblatt process based on long fi… ▽ More

    Submitted 16 December, 2009; originally announced December 2009.

    Comments: To appear in Electronic Journal of Statistics

    Journal ref: Electronic Journal of Statistics 3 (2009) 1393-1435

  25. arXiv:0807.1208  [pdf, ps, other

    math.PR math.ST

    Self-similarity parameter estimation and reproduction property for non-Gaussian Hermite processes

    Authors: Alexandra Chronopoulou, Frederi Viens, Ciprian Tudor

    Abstract: We consider the class of all the Hermite processes $(Z_{t}^{(q,H)})_{t\in \lbrack 0,1]}$ of order $q\in \mathbf{N}^{\ast}$ and with Hurst parameter $% H\in (\frac{1}{2},1)$. The process $Z^{(q,H)}$ is $H$-selfsimilar, it has stationary increments and it exhibits long-range dependence identical to that of fractional Brownian motion (fBm). For $q=1$, $Z^{(1,H)}$ is fBm, which is Gaussian; for $q=2$,… ▽ More

    Submitted 18 June, 2010; v1 submitted 8 July, 2008; originally announced July 2008.

    Comments: To appear in "Communications on Stochastic Analysis"