Skip to main content

Showing 1–15 of 15 results for author: Łancucki, A

.
  1. arXiv:2403.09636  [pdf, other

    cs.CL

    Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference

    Authors: Piotr Nawrot, Adrian Łańcucki, Marcin Chochowski, David Tarjan, Edoardo M. Ponti

    Abstract: Transformers have emerged as the backbone of large language models (LLMs). However, generation remains inefficient due to the need to store in memory a cache of key-value representations for past tokens, whose size scales linearly with the input sequence length and batch size. As a solution, we propose Dynamic Memory Compression (DMC), a method for on-line key-value cache compression at inference… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  2. Efficient Transformers with Dynamic Token Pooling

    Authors: Piotr Nawrot, Jan Chorowski, Adrian Łańcucki, Edoardo M. Ponti

    Abstract: Transformers achieve unrivalled performance in modelling language, but remain inefficient in terms of memory and time complexity. A possible remedy is to reduce the sequence length in the intermediate layers by pooling fixed-length segments of tokens. Nevertheless, natural units of meaning, such as words or phrases, display varying sizes. To address this mismatch, we equip language models with a d… ▽ More

    Submitted 24 May, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Journal ref: Proceedings of the 61st (Toronto 2023) Annual Meeting of the Association for Computational Linguistics (Volume 1 Long Papers) Pages 6403 to 6417

  3. arXiv:2206.02211  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.NE eess.AS

    Variable-rate hierarchical CPC leads to acoustic unit discovery in speech

    Authors: Santiago Cuervo, Adrian Łańcucki, Ricard Marxer, Paweł Rychlikowski, Jan Chorowski

    Abstract: The success of deep learning comes from its ability to capture the hierarchical structure of data by learning high-level representations defined in terms of low-level ones. In this paper we explore self-supervised learning of hierarchical representations of speech by applying multiple levels of Contrastive Predictive Coding (CPC). We observe that simply stacking two CPC models does not yield signi… ▽ More

    Submitted 4 December, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: Accepted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

    Journal ref: Advances in Neural Information Processing Systems, 2022

  4. arXiv:2110.15909  [pdf, other

    cs.LG cs.SD eess.AS

    Contrastive prediction strategies for unsupervised segmentation and categorization of phonemes and words

    Authors: Santiago Cuervo, Maciej Grabias, Jan Chorowski, Grzegorz Ciesielski, Adrian Łańcucki, Paweł Rychlikowski, Ricard Marxer

    Abstract: We investigate the performance on phoneme categorization and phoneme and word segmentation of several self-supervised learning (SSL) methods based on Contrastive Predictive Coding (CPC). Our experiments show that with the existing algorithms there is a trade off between categorization and segmentation performance. We investigate the source of this conflict and conclude that the use of context buil… ▽ More

    Submitted 25 February, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

  5. arXiv:2108.10447  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    One TTS Alignment To Rule Them All

    Authors: Rohan Badlani, Adrian Łancucki, Kevin J. Shih, Rafael Valle, Wei **, Bryan Catanzaro

    Abstract: Speech-to-text alignment is a critical component of neural textto-speech (TTS) models. Autoregressive TTS models typically use an attention mechanism to learn these alignments on-line. However, these alignments tend to be brittle and often fail to generalize to long utterances and out-of-domain text, leading to missing or repeating words. Most non-autoregressive endto-end TTS models rely on durati… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

  6. arXiv:2106.11603  [pdf, ps, other

    cs.LG cs.SD eess.AS

    Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw

    Authors: Jan Chorowski, Grzegorz Ciesielski, Jarosław Dzikowski, Adrian Łańcucki, Ricard Marxer, Mateusz Opala, Piotr Pusz, Paweł Rychlikowski, Michał Stypułkowski

    Abstract: We present a number of low-resource approaches to the tasks of the Zero Resource Speech Challenge 2021. We build on the unsupervised representations of speech proposed by the organizers as a baseline, derived from CPC and clustered with the k-means algorithm. We demonstrate that simple methods of refining those representations can narrow the gap, or even improve upon the solutions which use a high… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: Published in Interspeech 2021

  7. arXiv:2104.13456  [pdf, other

    cs.CL

    Named Entity Recognition and Linking Augmented with Large-Scale Structured Data

    Authors: Paweł Rychlikowski, Bartłomiej Najdecki, Adrian Łańcucki, Adam Kaczmarek

    Abstract: In this paper we describe our submissions to the 2nd and 3rd SlavNER Shared Tasks held at BSNLP 2019 and BSNLP 2021, respectively. The tasks focused on the analysis of Named Entities in multilingual Web documents in Slavic languages with rich inflection. Our solution takes advantage of large collections of both unstructured and structured documents. The former serve as data for unsupervised traini… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

  8. arXiv:2104.11946  [pdf, other

    cs.LG cs.SD eess.AS

    Aligned Contrastive Predictive Coding

    Authors: Jan Chorowski, Grzegorz Ciesielski, Jarosław Dzikowski, Adrian Łańcucki, Ricard Marxer, Mateusz Opala, Piotr Pusz, Paweł Rychlikowski, Michał Stypułkowski

    Abstract: We investigate the possibility of forcing a self-supervised model trained using a contrastive predictive loss to extract slowly varying latent representations. Rather than producing individual predictions for each of the future representations, the model emits a sequence of predictions shorter than that of the upcoming representations to which they will be aligned. In this way, the prediction netw… ▽ More

    Submitted 22 June, 2021; v1 submitted 24 April, 2021; originally announced April 2021.

    Comments: Published in Interspeech 2021

  9. arXiv:2006.06873  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    FastPitch: Parallel Text-to-speech with Pitch Prediction

    Authors: Adrian Łańcucki

    Abstract: We present FastPitch, a fully-parallel text-to-speech model based on FastSpeech, conditioned on fundamental frequency contours. The model predicts pitch contours during inference. By altering these predictions, the generated speech can be more expressive, better match the semantic of the utterance, and in the end more engaging to the listener. Uniformly increasing or decreasing pitch with FastPitc… ▽ More

    Submitted 16 February, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted to ICASSP 2021

  10. arXiv:2006.02547  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning

    Authors: Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, Jan Chorowski, Adrian Lancucki, Ricard Marxer, James Glass

    Abstract: Probabilistic Latent Variable Models (LVMs) provide an alternative to self-supervised learning approaches for linguistic representation learning from speech. LVMs admit an intuitive probabilistic interpretation where the latent structure shapes the information extracted from the signal. Even though LVMs have recently seen a renewed interest due to the introduction of Variational Autoencoders (VAEs… ▽ More

    Submitted 8 September, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Proceedings of Interspeech, 2020

  11. arXiv:2005.08520  [pdf, other

    cs.LG cs.CL stat.ML

    Robust Training of Vector Quantized Bottleneck Models

    Authors: Adrian Łańcucki, Jan Chorowski, Guillaume Sanchez, Ricard Marxer, Nanxin Chen, Hans J. G. A. Dolfing, Sameer Khurana, Tanel Alumäe, Antoine Laurent

    Abstract: In this paper we demonstrate methods for reliable and efficient training of discrete representation using Vector-Quantized Variational Auto-Encoder models (VQ-VAEs). Discrete latent variable models have been shown to learn nontrivial representations of speech, applicable to unsupervised voice conversion and reaching state-of-the-art performance on unit discovery tasks. For unsupervised representat… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: Published at IJCNN 2020

  12. arXiv:1901.04379  [pdf, other

    cs.CL

    Towards Using Context-Dependent Symbols in CTC Without State-Tying Decision Trees

    Authors: Jan Chorowski, Adrian Lancucki, Bartosz Kostka, Michal Zapotoczny

    Abstract: Deep neural acoustic models benefit from context-dependent (CD) modeling of output symbols. We consider direct training of CTC networks with CD outputs, and identify two issues. The first one is frame-level normalization of probabilities in CTC, which induces strong language modeling behavior that leads to overfitting and interference with external language models. The second one is poor generaliz… ▽ More

    Submitted 23 April, 2019; v1 submitted 14 January, 2019; originally announced January 2019.

  13. arXiv:1808.01160  [pdf, other

    cs.CL

    Efficient Purely Convolutional Text Encoding

    Authors: Szymon Malik, Adrian Lancucki, Jan Chorowski

    Abstract: In this work, we focus on a lightweight convolutional architecture that creates fixed-size vector embeddings of sentences. Such representations are useful for building NLP systems, including conversational agents. Our work derives from a recently proposed recursive convolutional architecture for auto-encoding text paragraphs at byte level. We propose alternations that significantly reduce training… ▽ More

    Submitted 3 August, 2018; originally announced August 2018.

    Comments: As accepted to: LaCATODA Workshop, ICML 2018

  14. arXiv:1805.08032  [pdf, ps, other

    cs.CL

    A Talker Ensemble: the University of Wrocław's Entry to the NIPS 2017 Conversational Intelligence Challenge

    Authors: Jan Chorowski, Adrian Łańcucki, Szymon Malik, Maciej Pawlikowski, Paweł Rychlikowski, Paweł Zykowski

    Abstract: We present Poetwannabe, a chatbot submitted by the University of Wrocław to the NIPS 2017 Conversational Intelligence Challenge, in which it ranked first ex-aequo. It is able to conduct a conversation with a user in a natural language. The primary functionality of our dialogue system is context-aware question answering (QA), while its secondary function is maintaining user engagement. The chatbot… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: To appear in NIPS 2017 Competition track Springer Proceedings

  15. arXiv:1401.5813  [pdf, ps, other

    cs.AI

    GGP with Advanced Reasoning and Board Knowledge Discovery

    Authors: Adrian Łańcucki

    Abstract: Quality of General Game Playing (GGP) matches suffers from slow state-switching and weak knowledge modules. Instantiation and Propositional Networks offer great performance gains over Prolog-based reasoning, but do not scale well. In this publication mGDL, a variant of GDL stripped of function constants, has been defined as a basis for simple reasoning machines. mGDL allows to easily map rules to… ▽ More

    Submitted 22 January, 2014; originally announced January 2014.