Showing 1–21 of 21 results for author: Vechtomova, O

  1. arXiv:2311.09559

    cs.CL cs.AI

    Prompt-based Pseudo-labeling Strategy for Sample-Efficient Semi-Supervised Extractive Summarization

    Authors: Gaurav Sahu, Olga Vechtomova, Issam H. Laradji

    Abstract: Semi-supervised learning (SSL) is a widely used technique in scenarios where labeled data is scarce and unlabeled data is abundant. While SSL is popular for image and text classification, it is relatively underexplored for the task of extractive text summarization. Standard SSL methods follow a teacher-student paradigm to first train a classification model and then use the classifier's confidence…

    Submitted 1 July, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages, 6 figures, 3 tables

  2. arXiv:2310.14192

    cs.CL cs.AI

    PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation

    Authors: Gaurav Sahu, Olga Vechtomova, Dzmitry Bahdanau, Issam H. Laradji

    Abstract: Data augmentation is a widely used technique to address the problem of text classification when there is a limited amount of training data. Recent work often tackles this problem using large language models (LLMs) like GPT3 that can generate new examples given already available ones. In this work, we propose a method to generate more helpful augmented data by utilizing the LLM's abilities to follo…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Long paper)

  3. arXiv:2212.09947

    cs.CL cs.AI cs.LG

    Future Sight: Dynamic Story Generation with Large Pretrained Language Models

    Authors: Brian D. Zimmerman, Gaurav Sahu, Olga Vechtomova

    Abstract: Recent advances in deep learning research, such as transformers, have bolstered the ability for automated agents to generate creative texts similar to those that a human would write. By default, transformer decoders can only generate new text with respect to previously generated text. The output distribution of candidate tokens at any position is conditioned on previously selected tokens using a s…

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 9 pages, 1 figure, 4 tables

  4. arXiv:2210.15638

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    LyricJam Sonic: A Generative System for Real-Time Composition and Musical Improvisation

    Authors: Olga Vechtomova, Gaurav Sahu

    Abstract: Electronic music artists and sound designers have unique workflow practices that necessitate specialized approaches for developing music information retrieval and creativity support tools. Furthermore, electronic music instruments, such as modular synthesizers, have near-infinite possibilities for sound creation and can be combined to create unique and complex audio paths. The process of discoveri…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 15 pages, 9 figures, 2 tables

  5. arXiv:2106.01960

    cs.SD cs.AI cs.CL cs.LG eess.AS

    LyricJam: A system for generating lyrics for live instrumental music

    Authors: Olga Vechtomova, Gaurav Sahu, Dhruv Kumar

    Abstract: We describe a real-time system that receives a live audio stream from a jam session and generates lyric lines that are congruent with the live music being played. Two novel approaches are proposed to align the learned latent spaces of audio and text representations that allow the system to generate novel lyric lines matching live instrumental music. One approach is based on adversarial alignment o…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: Accepted to International Conference on Computational Creativity (ICCC) 2021 [Oral]

  6. arXiv:2105.01129

    cs.AI cs.CL cs.CV cs.MA

    Towards A Multi-agent System for Online Hate Speech Detection

    Authors: Gaurav Sahu, Robin Cohen, Olga Vechtomova

    Abstract: This paper envisions a multi-agent system for detecting the presence of hate speech in online social media platforms such as Twitter and Facebook. We introduce a novel framework employing deep learning techniques to coordinate the channels of textual and image processing. Our experimental results aim to demonstrate the effectiveness of our methods for classifying online content, training the prop…

    Submitted 3 May, 2021; originally announced May 2021.

    Comments: Accepted to the 2nd International Workshop on Autonomous Agents for Social Good (AASG), AAMAS, 2021

  7. arXiv:2011.00416

    cs.CL cs.AI cs.LG

    Deep Learning for Text Style Transfer: A Survey

    Authors: Di Jin, Zhijing Jin, Zhiting Hu, Olga Vechtomova, Rada Mihalcea

    Abstract: Text style transfer is an important task in natural language generation, which aims to control certain attributes in the generated text, such as politeness, emotion, humor, and many others. It has a long history in the field of natural language processing, and recently has re-gained significant attention thanks to the promising performance brought by deep neural models. In this paper, we present a…

    Submitted 16 December, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: Computational Linguistics Journal 2022

  8. arXiv:2009.14375

    cs.CL

    Generation of lyrics lines conditioned on music audio clips

    Authors: Olga Vechtomova, Gaurav Sahu, Dhruv Kumar

    Abstract: We present a system for generating novel lyrics lines conditioned on music audio. A bimodal neural network model learns to generate lines conditioned on any given short audio clip. The model consists of a spectrogram variational autoencoder (VAE) and a text VAE. Both automatic and human evaluations demonstrate effectiveness of our model in generating lines that have an emotional impact matching a…

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: Accepted to First Workshop on NLP for Music and Audio (NLP4MusA) at ISMIR 2020

  9. arXiv:2006.09639

    cs.CL

    Iterative Edit-Based Unsupervised Sentence Simplification

    Authors: Dhruv Kumar, Lili Mou, Lukasz Golab, Olga Vechtomova

    Abstract: We present a novel iterative, edit-based approach to unsupervised sentence simplification. Our model is guided by a scoring function involving fluency, simplicity, and meaning preservation. Then, we iteratively perform word and phrase-level edits on the complex sentence. Compared with previous approaches, our model does not require a parallel training set, but is more controllable and interpretabl…

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: The paper has been accepted to ACL 2020

  10. arXiv:2005.01791

    cs.CL

    Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction

    Authors: Raphael Schumann, Lili Mou, Yao Lu, Olga Vechtomova, Katja Markert

    Abstract: Automatic sentence summarization produces a shorter version of a sentence, while preserving its most important information. A good summary is characterized by language fluency and high information overlap with the source sentence. We model these two aspects in an unsupervised objective function, consisting of language modeling and semantic similarity metrics. We search for a high-scoring summary b…

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: Accepted at ACL 2020

  11. arXiv:2004.10809

    cs.CL

    Polarized-VAE: Proximity Based Disentangled Representation Learning for Text Generation

    Authors: Vikash Balasubramanian, Ivan Kobyzev, Hareesh Bahuleyan, Ilya Shapiro, Olga Vechtomova

    Abstract: Learning disentangled representations of real-world data is a challenging open problem. Most previous methods have focused on either supervised approaches which use attribute labels or unsupervised approaches that manipulate the factorization in the latent space of models such as the variational autoencoder (VAE) by training with task-specific losses. In this work, we propose polarized-VAE, an app…

    Submitted 24 January, 2021; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: Camera Ready for EACL 2021

  12. arXiv:1911.03828

    cs.CL

    Stylized Text Generation Using Wasserstein Autoencoders with a Mixture of Gaussian Prior

    Authors: Amirpasha Ghabussi, Lili Mou, Olga Vechtomova

    Abstract: Wasserstein autoencoders are effective for text generation. They do not however provide any control over the style and topic of the generated sentences if the dataset has multiple classes and includes different topics. In this work, we present a semi-supervised approach for generating stylized sentences. Our model is trained on a multi-class dataset and learns the latent representation of the sent…

    Submitted 9 November, 2019; originally announced November 2019.

  13. arXiv:1911.03821

    cs.CL cs.CV cs.LG eess.AS

    Adaptive Fusion Techniques for Multimodal Data

    Authors: Gaurav Sahu, Olga Vechtomova

    Abstract: Effective fusion of data from multiple modalities, such as video, speech, and text, is challenging due to the heterogeneous nature of multimodal data. In this paper, we propose adaptive fusion techniques that aim to model context from different modalities effectively. Instead of defining a deterministic fusion operation, such as concatenation, for the network, we let the network decide "how" to co…

    Submitted 26 January, 2021; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: Camera-ready version for EACL 2021

  14. arXiv:1911.03817

    cs.CL

    Adversarial Learning on the Latent Space for Diverse Dialog Generation

    Authors: Kashif Khan, Gaurav Sahu, Vikash Balasubramanian, Lili Mou, Olga Vechtomova

    Abstract: Generating relevant responses in a dialog is challenging, and requires not only proper modeling of context in the conversation but also being able to generate fluent sentences during inference. In this paper, we propose a two-step framework based on generative adversarial nets for generating conditioned responses. Our model first learns a meaningful representation of sentences by autoencoding and…

    Submitted 3 November, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: Accepted to COLING 2020

  15. arXiv:1907.05789

    cs.CL

    Generating Sentences from Disentangled Syntactic and Semantic Spaces

    Authors: Yu Bao, Hao Zhou, Shujian Huang, Lei Li, Lili Mou, Olga Vechtomova, Xinyu Dai, Jiajun Chen

    Abstract: Variational auto-encoders (VAEs) are widely used in natural language generation due to the regularization of the latent space. However, generating sentences from the continuous latent space does not explicitly model the syntactic information. In this paper, we propose to generate sentences from disentangled syntactic and semantic spaces. Our proposed method explicitly models syntactic information…

    Submitted 6 July, 2019; originally announced July 2019.

    Comments: 11 pages, accepted in ACL-2019

  16. arXiv:1903.12136

    cs.CL cs.LG

    Distilling Task-Specific Knowledge from BERT into Simple Neural Networks

    Authors: Raphael Tang, Yao Lu, Linqing Liu, Lili Mou, Olga Vechtomova, Jimmy Lin

    Abstract: In the natural language processing literature, neural networks are becoming increasingly deeper and complex. The recent poster child of this trend is the deep language representation model, which includes BERT, ELMo, and GPT. These developments have led to the conviction that previous-generation, shallower neural networks for language understanding are obsolete. In this paper, however, we demonstr…

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: 8 pages, 2 figures; first three authors contributed equally

  17. arXiv:1812.08318

    cs.CL cs.SD eess.AS

    Generating lyrics with variational autoencoder and multi-modal artist embeddings

    Authors: Olga Vechtomova, Hareesh Bahuleyan, Amirpasha Ghabussi, Vineet John

    Abstract: We present a system for generating song lyrics lines conditioned on the style of a specified artist. The system uses a variational autoencoder with artist embeddings. We propose the pre-training of artist embeddings with the representations learned by a CNN classifier, which is trained to predict artists based on MEL spectrograms of their song clips. This work is the first step towards combining a…

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: 5 pages, 5 tables, 1 figure

  18. arXiv:1808.04339

    cs.CL

    Disentangled Representation Learning for Non-Parallel Text Style Transfer

    Authors: Vineet John, Lili Mou, Hareesh Bahuleyan, Olga Vechtomova

    Abstract: This paper tackles the problem of disentangling the latent variables of style and content in language models. We propose a simple yet effective approach, which incorporates auxiliary multi-task and adversarial objectives, for label prediction and bag-of-words prediction, respectively. We show, both qualitatively and quantitatively, that the style and content are indeed disentangled in the latent s…

    Submitted 10 September, 2018; v1 submitted 13 August, 2018; originally announced August 2018.

    Comments: 11 pages, 7 figures, 6 tables; Preliminary work rejected by EMNLP-18

    MSC Class: 68T50 ACM Class: I.2.7

  19. arXiv:1806.08462

    cs.CL cs.LG stat.ML

    Stochastic Wasserstein Autoencoder for Probabilistic Sentence Generation

    Authors: Hareesh Bahuleyan, Lili Mou, Hao Zhou, Olga Vechtomova

    Abstract: The variational autoencoder (VAE) imposes a probabilistic distribution (typically Gaussian) on the latent space and penalizes the Kullback–Leibler (KL) divergence between the posterior and prior. In NLP, VAEs are extremely difficult to train due to the problem of KL collapsing to zero. One has to implement various heuristics such as KL weight annealing and word dropout in a carefully engineered m…

    Submitted 12 April, 2019; v1 submitted 21 June, 2018; originally announced June 2018.

    Comments: Accepted by NAACL-HLT 2019

  20. arXiv:1712.08207

    cs.CL

    Variational Attention for Sequence-to-Sequence Models

    Authors: Hareesh Bahuleyan, Lili Mou, Olga Vechtomova, Pascal Poupart

    Abstract: The variational encoder-decoder (VED) encodes source information as a set of random variables using a neural network, which in turn is decoded into target data using another neural network. In natural language processing, sequence-to-sequence (Seq2Seq) models typically serve as encoder-decoder networks. When combined with a traditional (deterministic) attention mechanism, the variational latent sp…

    Submitted 21 June, 2018; v1 submitted 21 December, 2017; originally announced December 2017.

    Comments: In Proceedings of COLING 2018. Also accepted by TADGM Workshop@ICML 2018 for presentation

  21. arXiv:1707.09448

    cs.CL

    Sentiment Analysis on Financial News Headlines using Training Dataset Augmentation

    Authors: Vineet John, Olga Vechtomova

    Abstract: This paper discusses the approach taken by the UWaterloo team to arrive at a solution for the Fine-Grained Sentiment Analysis problem posed by Task 5 of SemEval 2017. The paper describes the document vectorization and sentiment score prediction techniques used, as well as the design and implementation decisions taken while building the system for this task. The system uses text vectorization model…

    Submitted 28 July, 2017; originally announced July 2017.

    Comments: 5 pages

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: Association for Computational Linguistics, Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), 869-873