Skip to main content

Showing 1–9 of 9 results for author: Sundararaman, D

.
  1. arXiv:2210.09132  [pdf, other

    cs.CL

    Pseudo-OOD training for robust language models

    Authors: Dhanasekar Sundararaman, Nikhil Mehta, Lawrence Carin

    Abstract: While pre-trained large-scale deep models have garnered attention as an important topic for many downstream natural language processing (NLP) tasks, such models often make unreliable predictions on out-of-distribution (OOD) inputs. As such, OOD detection is a key component of a reliable machine-learning model for any industry-scale application. Common approaches often assume access to additional O… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Work in progress

  2. arXiv:2208.01755  [pdf, ps, other

    cs.CL cs.IR

    Debiasing Gender Bias in Information Retrieval Models

    Authors: Dhanasekar Sundararaman, Vivek Subramanian

    Abstract: Biases in culture, gender, ethnicity, etc. have existed for decades and have affected many areas of human social interaction. These biases have been shown to impact machine learning (ML) models, and for natural language processing (NLP), this can have severe consequences for downstream tasks. Mitigating gender bias in information retrieval (IR) is important to avoid propagating stereotypes. In thi… ▽ More

    Submitted 20 September, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: Updated title to be reflective of the methods

  3. arXiv:2205.03559  [pdf, other

    cs.CL cs.LG

    Improving Downstream Task Performance by Treating Numbers as Entities

    Authors: Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Liyan Xu, Lawrence Carin

    Abstract: Numbers are essential components of text, like any other word tokens, from which natural language processing (NLP) models are built and deployed. Though numbers are typically not accounted for distinctly in most NLP tasks, there is still an underlying amount of numeracy already exhibited by NLP models. In this work, we attempt to tap this potential of state-of-the-art NLP models and transfer their… ▽ More

    Submitted 18 September, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: Accepted to CIKM 2022

  4. arXiv:2201.00075  [pdf, other

    cs.CL cs.LG

    How do lexical semantics affect translation? An empirical study

    Authors: Vivek Subramanian, Dhanasekar Sundararaman

    Abstract: Neural machine translation (NMT) systems aim to map text from one language into another. While there are a wide variety of applications of NMT, one of the most important is translation of natural language. A distinguishing factor of natural language is that words are typically ordered according to the rules of the grammar of a given language. Although many advances have been made in develo** NMT… ▽ More

    Submitted 31 December, 2021; originally announced January 2022.

  5. arXiv:1911.06156  [pdf, other

    cs.CL cs.LG stat.ML

    Syntax-Infused Transformer and BERT models for Machine Translation and Natural Language Understanding

    Authors: Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Shi**g Si, Dinghan Shen, Dong Wang, Lawrence Carin

    Abstract: Attention-based models have shown significant improvement over traditional algorithms in several NLP tasks. The Transformer, for instance, is an illustrative example that generates abstract representations of tokens inputted to an encoder based on their relationships to all tokens in a sequence. Recent studies have shown that although such models are capable of learning syntactic features purely b… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

  6. arXiv:1906.08340  [pdf, other

    cs.CL cs.LG

    Learning Compressed Sentence Representations for On-Device Text Processing

    Authors: Dinghan Shen, Pengyu Cheng, Dhanasekar Sundararaman, Xinyuan Zhang, Qian Yang, Meng Tang, Asli Celikyilmaz, Lawrence Carin

    Abstract: Vector representations of sentences, trained on massive text corpora, are widely used as generic sentence embeddings across a variety of NLP problems. The learned representations are generally assumed to be continuous and real-valued, giving rise to a large memory footprint and slow retrieval speed, which hinders their applicability to low-resource (memory and computation) platforms, such as mobil… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

    Comments: To appear at ACL 2019

  7. arXiv:1711.10002  [pdf

    cs.SI

    TweetIT- Analyzing Topics for Twitter Users to garner Maximum Attention

    Authors: Dhanasekar Sundararaman, Priya Arora, Vishwanath Seshagiri

    Abstract: Twitter, a microblogging service, is todays most popular platform for communication in the form of short text messages, called Tweets. Users use Twitter to publish their content either for expressing concerns on information news or views on daily conversations. When this expression emerges, they are experienced by the worldwide distribution network of users and not only by the interlocutor(s). Dep… ▽ More

    Submitted 27 November, 2017; originally announced November 2017.

  8. arXiv:1711.06970  [pdf

    cs.CY cs.LG

    How much is my car worth? A methodology for predicting used cars prices using Random Forest

    Authors: Nabarun Pal, Priya Arora, Dhanasekar Sundararaman, Puneet Kohli, Sai Sumanth Palakurthy

    Abstract: Cars are being sold more than ever. Develo** countries adopt the lease culture instead of buying a new car due to affordability. Therefore, the rise of used cars sales is exponentially increasing. Car sellers sometimes take advantage of this scenario by listing unrealistic prices owing to the demand. Therefore, arises a need for a model that can assign a price for a vehicle by evaluating its fea… ▽ More

    Submitted 19 November, 2017; originally announced November 2017.

    Comments: FICC Camera Ready

  9. arXiv:1706.05361  [pdf

    cs.SI cs.IR

    Twigraph: Discovering and Visualizing Influential Words between Twitter Profiles

    Authors: Dhanasekar Sundararaman, Sudharshan Srinivasan

    Abstract: The social media craze is on an ever increasing spree, and people are connected with each other like never before, but these vast connections are visually unexplored. We propose a methodology Twigraph to explore the connections between persons using their Twitter profiles. First, we propose a hybrid approach of recommending social media profiles, articles, and advertisements to a user.The profiles… ▽ More

    Submitted 29 June, 2017; v1 submitted 16 June, 2017; originally announced June 2017.