Skip to main content

Showing 1–7 of 7 results for author: Gosangi, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2104.08962  [pdf, other

    cs.CL cs.AI cs.LG

    On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles

    Authors: Rakesh Gosangi, Ravneet Arora, Mohsen Gheisarieha, Debanjan Mahata, Haimin Zhang

    Abstract: In this paper, we study the importance of context in predicting the citation worthiness of sentences in scholarly articles. We formulate this problem as a sequence labeling task solved using a hierarchical BiLSTM model. We contribute a new benchmark dataset containing over two million sentences and their corresponding labels. We preserve the sentence order in this dataset and perform document-leve… ▽ More

    Submitted 18 April, 2021; originally announced April 2021.

    Comments: To be published in the proceedings of NAACL 2021

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2104.08578  [pdf, other

    cs.CL cs.AI cs.CY

    GupShup: An Annotated Corpus for Abstractive Summarization of Open-Domain Code-Switched Conversations

    Authors: Laiba Mehnaz, Debanjan Mahata, Rakesh Gosangi, Uma Sushmitha Gunturi, Riya Jain, Gauri Gupta, Amardeep Kumar, Isabelle Lee, Anish Acharya, Rajiv Ratn Shah

    Abstract: Code-switching is the communication phenomenon where speakers switch between different languages during a conversation. With the widespread adoption of conversational agents and chat platforms, code-switching has become an integral part of written conversations in many multi-lingual communities worldwide. This makes it essential to develop techniques for summarizing and understanding these convers… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

  3. arXiv:2009.02619  [pdf, other

    cs.CL

    MIDAS at SemEval-2020 Task 10: Emphasis Selection using Label Distribution Learning and Contextual Embeddings

    Authors: Sarthak Anand, Pradyumna Gupta, Hemant Yadav, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah

    Abstract: This paper presents our submission to the SemEval 2020 - Task 10 on emphasis selection in written text. We approach this emphasis selection problem as a sequence labeling task where we represent the underlying text with various contextual embedding models. We also employ label distribution learning to account for annotator disagreements. We experiment with the choice of model architectures, traina… ▽ More

    Submitted 5 September, 2020; originally announced September 2020.

  4. arXiv:1912.06927  [pdf, other

    cs.CL cs.SI

    #MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement

    Authors: Akash Gautam, Puneet Mathur, Rakesh Gosangi, Debanjan Mahata, Ramit Sawhney, Rajiv Ratn Shah

    Abstract: In this paper, we present a dataset containing 9,973 tweets related to the MeToo movement that were manually annotated for five different linguistic aspects: relevance, stance, hate speech, sarcasm, and dialogue acts. We present a detailed account of the data collection and annotation processes. The annotations have a very high inter-annotator agreement (0.79 to 0.93 k-alpha) due to the domain exp… ▽ More

    Submitted 20 April, 2020; v1 submitted 14 December, 2019; originally announced December 2019.

    Comments: Preprint of paper accepted at ICWSM 2020

  5. arXiv:1910.08840  [pdf, other

    cs.CL

    Keyphrase Extraction from Scholarly Articles as Sequence Labeling using Contextualized Embeddings

    Authors: Dhruva Sahrawat, Debanjan Mahata, Mayank Kulkarni, Haimin Zhang, Rakesh Gosangi, Amanda Stent, Agniv Sharma, Yaman Kumar, Rajiv Ratn Shah, Roger Zimmermann

    Abstract: In this paper, we formulate keyphrase extraction from scholarly articles as a sequence labeling task solved using a BiLSTM-CRF, where the words in the input text are represented using deep contextualized embeddings. We evaluate the proposed architecture using both contextualized and fixed word embedding models on three different benchmark datasets (Inspec, SemEval 2010, SemEval 2017) and compare w… ▽ More

    Submitted 19 October, 2019; originally announced October 2019.

  6. arXiv:1909.12229  [pdf, other

    cs.CL cs.IR cs.LG

    Keyphrase Generation for Scientific Articles using GANs

    Authors: Avinash Swaminathan, Raj Kuwar Gupta, Haimin Zhang, Debanjan Mahata, Rakesh Gosangi, Rajiv Ratn Shah

    Abstract: In this paper, we present a keyphrase generation approach using conditional Generative Adversarial Networks (GAN). In our GAN model, the generator outputs a sequence of keyphrases based on the title and abstract of a scientific article. The discriminator learns to distinguish between machine-generated and human-curated keyphrases. We evaluate this approach on standard benchmark datasets. Our model… ▽ More

    Submitted 23 September, 2019; originally announced September 2019.

    Comments: 2 pages, 1 fig, 8 references, 2 tables

  7. arXiv:1908.01821  [pdf, other

    cs.CL cs.LG stat.ML

    Dialogue Act Classification in Group Chats with DAG-LSTMs

    Authors: Ozan İrsoy, Rakesh Gosangi, Haimin Zhang, Mu-Hsin Wei, Peter Lund, Duccio Pappadopulo, Brendan Fahy, Neophytos Nephytou, Camilo Ortiz

    Abstract: Dialogue act (DA) classification has been studied for the past two decades and has several key applications such as workflow automation and conversation analytics. Researchers have used, to address this problem, various traditional machine learning models, and more recently deep neural network models such as hierarchical convolutional neural networks (CNNs) and long short-term memory (LSTM) networ… ▽ More

    Submitted 2 August, 2019; originally announced August 2019.

    Comments: Appeared in SIGIR 2019 Workshop on Conversational Interaction Systems