Skip to main content

Showing 1–25 of 25 results for author: Sahu, G

.
  1. arXiv:2311.11462  [pdf, other

    cs.CL cs.AI

    LLM aided semi-supervision for Extractive Dialog Summarization

    Authors: Nishant Mishra, Gaurav Sahu, Iacer Calixto, Ameen Abu-Hanna, Issam H. Laradji

    Abstract: Generating high-quality summaries for chat dialogs often requires large labeled datasets. We propose a method to efficiently use unlabeled data for extractive summarization of customer-agent dialogs. In our method, we frame summarization as a question-answering problem and use state-of-the-art large language models (LLMs) to generate pseudo-labels for a dialog. We then use these pseudo-labels to f… ▽ More

    Submitted 23 November, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

    Comments: to be published in EMNLP Findings

  2. arXiv:2311.09559  [pdf, other

    cs.CL cs.AI

    Prompt-based Pseudo-labeling Strategy for Sample-Efficient Semi-Supervised Extractive Summarization

    Authors: Gaurav Sahu, Olga Vechtomova, Issam H. Laradji

    Abstract: Semi-supervised learning (SSL) is a widely used technique in scenarios where labeled data is scarce and unlabeled data is abundant. While SSL is popular for image and text classification, it is relatively underexplored for the task of extractive text summarization. Standard SSL methods follow a teacher-student paradigm to first train a classification model and then use the classifier's confidence… ▽ More

    Submitted 1 July, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages, 6 figures, 3 tables

  3. arXiv:2310.14192  [pdf, other

    cs.CL cs.AI

    PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation

    Authors: Gaurav Sahu, Olga Vechtomova, Dzmitry Bahdanau, Issam H. Laradji

    Abstract: Data augmentation is a widely used technique to address the problem of text classification when there is a limited amount of training data. Recent work often tackles this problem using large language models (LLMs) like GPT3 that can generate new examples given already available ones. In this work, we propose a method to generate more helpful augmented data by utilizing the LLM's abilities to follo… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Long paper)

  4. arXiv:2307.09312  [pdf, other

    cs.CL cs.LG cs.MM cs.SI

    Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media

    Authors: Liam Hebert, Gaurav Sahu, Yuxuan Guo, Nanda Kishore Sreenivas, Lukasz Golab, Robin Cohen

    Abstract: We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions. In contrast to traditional comment-only methods, our approach to labelling a comment as hate speech involves a holistic analysis of text and images grounded in the discussion context. This is done by leveraging graph transformers to capture the cont… ▽ More

    Submitted 22 February, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted to AAAI 2024 (AI for Social Impact Track)

  5. arXiv:2212.09947  [pdf, other

    cs.CL cs.AI cs.LG

    Future Sight: Dynamic Story Generation with Large Pretrained Language Models

    Authors: Brian D. Zimmerman, Gaurav Sahu, Olga Vechtomova

    Abstract: Recent advances in deep learning research, such as transformers, have bolstered the ability for automated agents to generate creative texts similar to those that a human would write. By default, transformer decoders can only generate new text with respect to previously generated text. The output distribution of candidate tokens at any position is conditioned on previously selected tokens using a s… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 9 pages, 1 figure, 4 tables

  6. arXiv:2210.15638  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    LyricJam Sonic: A Generative System for Real-Time Composition and Musical Improvisation

    Authors: Olga Vechtomova, Gaurav Sahu

    Abstract: Electronic music artists and sound designers have unique workflow practices that necessitate specialized approaches for develo** music information retrieval and creativity support tools. Furthermore, electronic music instruments, such as modular synthesizers, have near-infinite possibilities for sound creation and can be combined to create unique and complex audio paths. The process of discoveri… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 15 pages, 9 figures, 2 tables

  7. arXiv:2204.01959  [pdf, other

    cs.CL cs.AI

    Data Augmentation for Intent Classification with Off-the-shelf Large Language Models

    Authors: Gaurav Sahu, Pau Rodriguez, Issam H. Laradji, Parmida Atighehchian, David Vazquez, Dzmitry Bahdanau

    Abstract: Data augmentation is a widely employed technique to alleviate the problem of data scarcity. In this work, we propose a prompting-based approach to generate labelled training data for intent classification with off-the-shelf language models (LMs) such as GPT-3. An advantage of this method is that no task-specific LM-fine-tuning for data generation is required; hence the method requires no hyper-par… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Accepted to 4th Workshop on NLP for Conversational AI, ACL 2022

  8. arXiv:2111.06440  [pdf, other

    cs.SI cs.AI

    Personalized multi-faceted trust modeling to determine trust links in social media and its potential for misinformation management

    Authors: Alexandre Parmentier, Robin Cohen, Xueguang Ma, Gaurav Sahu, Queenie Chen

    Abstract: In this paper, we present an approach for predicting trust links between peers in social media, one that is grounded in the artificial intelligence area of multiagent trust modeling. In particular, we propose a data-driven multi-faceted trust modeling which incorporates many distinct features for a comprehensive analysis. We focus on demonstrating how clustering of similar users enables a critical… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: 28 pages

  9. arXiv:2106.01960  [pdf, other

    cs.SD cs.AI cs.CL cs.LG eess.AS

    LyricJam: A system for generating lyrics for live instrumental music

    Authors: Olga Vechtomova, Gaurav Sahu, Dhruv Kumar

    Abstract: We describe a real-time system that receives a live audio stream from a jam session and generates lyric lines that are congruent with the live music being played. Two novel approaches are proposed to align the learned latent spaces of audio and text representations that allow the system to generate novel lyric lines matching live instrumental music. One approach is based on adversarial alignment o… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: Accepted to International Conference on Computational Creativity (ICCC) 2021 [Oral]

  10. arXiv:2105.01129  [pdf, other

    cs.AI cs.CL cs.CV cs.MA

    Towards A Multi-agent System for Online Hate Speech Detection

    Authors: Gaurav Sahu, Robin Cohen, Olga Vechtomova

    Abstract: This paper envisions a multi-agent system for detecting the presence of hate speech in online social media platforms such as Twitter and Facebook. We introduce a novel framework employing deep learning techniques to coordinate the channels of textual and im-age processing. Our experimental results aim to demonstrate the effectiveness of our methods for classifying online content, training the prop… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

    Comments: Accepted to the 2nd International Workshop on Autonomous Agents for Social Good (AASG), AAMAS, 2021

  11. arXiv:2009.14375  [pdf, other

    cs.CL

    Generation of lyrics lines conditioned on music audio clips

    Authors: Olga Vechtomova, Gaurav Sahu, Dhruv Kumar

    Abstract: We present a system for generating novel lyrics lines conditioned on music audio. A bimodal neural network model learns to generate lines conditioned on any given short audio clip. The model consists of a spectrogram variational autoencoder (VAE) and a text VAE. Both automatic and human evaluations demonstrate effectiveness of our model in generating lines that have an emotional impact matching a… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: Accepted to First Workshop on NLP for Music and Audio (NLP4MusA) at ISMIR 2020

  12. arXiv:1911.03821  [pdf, other

    cs.CL cs.CV cs.LG eess.AS

    Adaptive Fusion Techniques for Multimodal Data

    Authors: Gaurav Sahu, Olga Vechtomova

    Abstract: Effective fusion of data from multiple modalities, such as video, speech, and text, is challenging due to the heterogeneous nature of multimodal data. In this paper, we propose adaptive fusion techniques that aim to model context from different modalities effectively. Instead of defining a deterministic fusion operation, such as concatenation, for the network, we let the network decide "how" to co… ▽ More

    Submitted 26 January, 2021; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: Camera-ready version for EACL 2021

  13. arXiv:1911.03817  [pdf, other

    cs.CL

    Adversarial Learning on the Latent Space for Diverse Dialog Generation

    Authors: Kashif Khan, Gaurav Sahu, Vikash Balasubramanian, Lili Mou, Olga Vechtomova

    Abstract: Generating relevant responses in a dialog is challenging, and requires not only proper modeling of context in the conversation but also being able to generate fluent sentences during inference. In this paper, we propose a two-step framework based on generative adversarial nets for generating conditioned responses. Our model first learns a meaningful representation of sentences by autoencoding and… ▽ More

    Submitted 3 November, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: Accepted to COLING 2020

  14. arXiv:1904.06022  [pdf, other

    cs.LG cs.CL stat.ML

    Multimodal Speech Emotion Recognition and Ambiguity Resolution

    Authors: Gaurav Sahu

    Abstract: Identifying emotion from speech is a non-trivial task pertaining to the ambiguous definition of emotion itself. In this work, we adopt a feature-engineering based approach to tackle the task of speech emotion recognition. Formalizing our problem as a multi-class classification problem, we compare the performance of two categories of models. For both, we extract eight hand-crafted features from the… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: 9 pages

  15. Non-Extensive Statistics in Free-Electron Metals and Thermal Effective Mass

    Authors: Arvind Khuntia, Gayatri Sahu, Raghunath Sahoo, Durga P. Mahapatra, Niranjan Barik

    Abstract: We have applied the non-extensive statistical mechanics to free electrons in several metals to calculate the electronic specific heat at low temperature. In this case, the Fermi-Dirac (FD) function is modified from its Boltzmann-Gibbs (BG) form, with the exponential part going to a $q$-exponential, in its non-extensive form. In most cases, the non-extensive parameter, $q$, is found to be greater t… ▽ More

    Submitted 8 April, 2019; v1 submitted 21 September, 2018; originally announced September 2018.

    Comments: Final Published version

    Journal ref: Physica A 523 (2019) 852

  16. arXiv:1809.01446  [pdf, other

    cs.CL

    Free as in Free Word Order: An Energy Based Model for Word Segmentation and Morphological Tagging in Sanskrit

    Authors: Amrith Krishna, Bishal Santra, Sasi Prasanth Bandaru, Gaurav Sahu, Vishnu Dutt Sharma, Pavankumar Satuluri, Pawan Goyal

    Abstract: The configurational information in sentences of a free word order language such as Sanskrit is of limited use. Thus, the context of the entire sentence will be desirable even for basic processing tasks such as word segmentation. We propose a structured prediction framework that jointly solves the word segmentation and morphological tagging tasks in Sanskrit. We build an energy based model where we… ▽ More

    Submitted 25 October, 2018; v1 submitted 5 September, 2018; originally announced September 2018.

    Comments: version 2: Corrected typo in Table1, page7 | Accepted in EMNLP 2018. Supplementary material can be found at - http://cse.iitkgp.ac.in/~amrithk/1080_supp.pdf

  17. arXiv:1408.4314  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Effect of silicon resistivity on its porosification using metal induced chemical etching

    Authors: Shailendra K Saxena, Gayatri Sahu, P. K. Sahoo, Pankaj R. Sagdeo, Rajesh Kumar

    Abstract: A comparison of porous structures formed from silicon (Si) wafers with different resistivities has been reported here based on the morphological studies carried out using scanning electron microscope (SEM). The porous Si samples have been prepared using metal induced etching (MIE) technique from two different Si wafers having two different resistivities. It is observed that porous Si containing we… ▽ More

    Submitted 19 August, 2014; originally announced August 2014.

    Comments: 9 Pages, 5 Figures

    Journal ref: Material Research Express, Vol. 2, 036501, 2015

  18. arXiv:1403.6269  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Comparison of porous silicon prepared using metal-induced etching (MIE) and laser-induced etching (LIE)

    Authors: Shailendra K. Saxena, Vivek Kumar, Hari M. Rai, Gayatri Sahu, Ravi K. Late, Kapil Saxena, A. K. Shukla, Pankaj R. Sagdeo, Rajesh Kumar

    Abstract: Porous silicon (p-Si), prepared by two routes (metal induced etching (MIE) and laser induced etching (LIE)) have been studied by comparing the observed surface morphologies using SEM. A uniformly distributed smaller (submicron sized) pores are formed when MIE technique is used because the pore formation is driven by uniformly distributed metal (silver in present case) nanoparticles, deposited prio… ▽ More

    Submitted 25 March, 2014; originally announced March 2014.

  19. arXiv:1309.5180  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Evolution of Asymmetric Raman line-shape from nano-structures

    Authors: Rajesh Kumar, Gayatri Sahu, Shailendra K. Saxena, Hari M. Rai, Pankaj R. Sagdeo

    Abstract: A step-by-step evolution of an asymmetric Raman line-shape function from a Lorentzian line-shape is presented here for low dimensional semiconductors. The evolution reported here is based on the phonon confinement model which is successfully used in literature to explain the asymmetric Raman line-shape from semiconductor nano-structures. Physical significance of different terms in the theoretical… ▽ More

    Submitted 20 September, 2013; originally announced September 2013.

    Journal ref: Silicon, Vol. 6, Page 117, Year 2014

  20. arXiv:1302.3402  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Fabrication of silicon nanocrystals using sequential Au ion implantation

    Authors: Gayatri Sahu, Rajesh Kumar, D. P. Mahapatra

    Abstract: Silicon nanocrystals are produced using a two-stage gold ion implantation technique. First stage implantation using low energy ions leads to the formation of an amorphous Si (a-Si) layer. A subsequent high energy Au irradiation in the second stage is found to produce strained Si NCs. An annealing at a temperature as low as 500$^o$C is seen to result in strain free NCs showing quantum confinement e… ▽ More

    Submitted 14 February, 2013; originally announced February 2013.

    Comments: PDFLATEX, 10 Pages, 6 Figures

    Journal ref: Silicon, Vol. 6, Page 65-71, Year 2014

  21. Narrow band UV emission from direct band gap Si nanoclusters embedded in bulk Si

    Authors: G. Sahu, H. P. Lenka, D. P. Mahapatra, Karol Grycginski, A. K. Singh, Jianyou Li, B. Rout, F. D. McDaniel, Arup Neogi

    Abstract: This paper has been withdrawn kee** in view of publication elsewhere with some appropriate modifications.

    Submitted 24 December, 2009; v1 submitted 15 October, 2009; originally announced October 2009.

    Comments: This paper has been withdrawn

  22. arXiv:0811.0806  [pdf, ps, other

    cond-mat.mtrl-sci

    The mechanism of ion induced amorphization in Si

    Authors: H. P. Lenka, U. M. Bhatta, P. K. Kuiri, G. Sahu, B. Joseph, B. Satpati, D. P. Mahapatra

    Abstract: Some results on damage build up in, and amorphization of, Si, induced by 25-30 keV Al$_5^-$, Si$_5^-$ and Cs$^-$ ions, at room temperature, are reported. We show that at low energy, amorphization is a nucleation and growth process, based on the direct impact mechanism. With an Avrami exponent $\sim 1.6$, the growth towards amorphization seems to be diffusion limited. A transition to a completely… ▽ More

    Submitted 16 February, 2009; v1 submitted 5 November, 2008; originally announced November 2008.

    Comments: 4 pages, 5 figures

  23. arXiv:0811.0122  [pdf, other

    cond-mat.mtrl-sci

    Enhanced UV Light emission from Silicon nanoparticles induced by Au ion implantation

    Authors: Akhilesh Singh, Karol G. Grycznski, Bibhu Rout, Jianyou Li, Floyd McDaniel, Arup Neogi, Gayatri Sahu, Durga P. Mahapatra

    Abstract: Study of light emitting silicon fabricated by ion implantation.

    Submitted 1 November, 2008; originally announced November 2008.

    Comments: G-COE Conference - Kyoto 2008 abstract

  24. arXiv:0805.0066  [pdf, ps, other

    cond-mat.mtrl-sci cond-mat.other

    Study of low energy Si$_5^-$ and Cs$^-$ implantation induced amorphization effects in Si(100)

    Authors: H. P. Lenka, B. Joseph, P. K. Kuiri, G. Sahu, P. Mishra, D. Ghose, D. P. Mahapatra

    Abstract: The damage growth and surface modifications in Si(100), induced by 25 keV Si$_5^-$ cluster ions, as a function of fluence, $φ$, has been studied using atomic force microscopy (AFM) and channeling Rutherford backscattering spectrometry (CRBS). CRBS results indicate a nonlinear growth in damage from which it has been possible to get a threshold fluence, $φ_0$, for amorphization as… ▽ More

    Submitted 4 June, 2008; v1 submitted 1 May, 2008; originally announced May 2008.

    Comments: 7 pages, 4 figures

  25. arXiv:0802.2494  [pdf, ps, other

    cond-mat.mtrl-sci cond-mat.stat-mech

    Observation of a Universal Aggregation Mechanism and a Possible Phase Transition in Au Sputtered by Swift Heavy Ions

    Authors: P. K. Kuiri, B. Joseph, H. P. Lenka, G. Sahu, J. Ghatak, D. Kanjilal, D. P. Mahapatra

    Abstract: Two exponents, $δ$, for size distribution of $n$-atom clusters, $Y(n)\sim n^{-δ}$, have been found in Au clusters sputtered from embedded Au nanoparticles under swift heavy ion irradiation. For small clusters, below 12.5 nm in size, $δ$ has been found to be 3/2, which can be rationalized as occurring from a steady state aggregation process with size independent aggregation. For larger clusters,… ▽ More

    Submitted 16 October, 2009; v1 submitted 17 February, 2008; originally announced February 2008.

    Comments: 4 pages, 3 figures

    Journal ref: Phys Rev Lett 100, 245501 (2008)