Showing 1–14 of 14 results for author: Dhanasekar

  1. arXiv:2210.09132  [pdf, other]

    cs.CL

    Pseudo-OOD training for robust language models

    Authors: Dhanasekar Sundararaman, Nikhil Mehta, Lawrence Carin

    Abstract: While pre-trained large-scale deep models have garnered attention as an important topic for many downstream natural language processing (NLP) tasks, such models often make unreliable predictions on out-of-distribution (OOD) inputs. As such, OOD detection is a key component of a reliable machine-learning model for any industry-scale application. Common approaches often assume access to additional O…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Work in progress

  2. arXiv:2208.01755  [pdf, ps, other]

    cs.CL cs.IR

    Debiasing Gender Bias in Information Retrieval Models

    Authors: Dhanasekar Sundararaman, Vivek Subramanian

    Abstract: Biases in culture, gender, ethnicity, etc. have existed for decades and have affected many areas of human social interaction. These biases have been shown to impact machine learning (ML) models, and for natural language processing (NLP), this can have severe consequences for downstream tasks. Mitigating gender bias in information retrieval (IR) is important to avoid propagating stereotypes. In thi…

    Submitted 20 September, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: Updated title to be reflective of the methods

  3. arXiv:2205.03559  [pdf, other]

    cs.CL cs.LG

    Improving Downstream Task Performance by Treating Numbers as Entities

    Authors: Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Liyan Xu, Lawrence Carin

    Abstract: Numbers are essential components of text, like any other word tokens, from which natural language processing (NLP) models are built and deployed. Though numbers are typically not accounted for distinctly in most NLP tasks, there is still an underlying amount of numeracy already exhibited by NLP models. In this work, we attempt to tap this potential of state-of-the-art NLP models and transfer their…

    Submitted 18 September, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: Accepted to CIKM 2022

  4. arXiv:2201.00075  [pdf, other]

    cs.CL cs.LG

    How do lexical semantics affect translation? An empirical study

    Authors: Vivek Subramanian, Dhanasekar Sundararaman

    Abstract: Neural machine translation (NMT) systems aim to map text from one language into another. While there are a wide variety of applications of NMT, one of the most important is translation of natural language. A distinguishing factor of natural language is that words are typically ordered according to the rules of the grammar of a given language. Although many advances have been made in developing NMT…

    Submitted 31 December, 2021; originally announced January 2022.

  5. arXiv:2110.12345  [pdf]

    eess.SY

    Quantitative Analysis of Demand Response Using Thermostatically Controlled Loads

    Authors: Praveen Dhanasekar, Cunzhi Zhao, Xingpeng Li

    Abstract: The flexible power consumption feature of thermostatically controlled loads (TCLs) such as heating, ventilation, and air-conditioning (HVAC) systems makes them attractive targets for demand response (DR). TCLs possess a brief period where their power utilization can be altered without any significant impact on customer comfort level. This indicates TCLs are hidden potentials for providing ancillar…

    Submitted 23 October, 2021; originally announced October 2021.

  6. arXiv:1911.06156  [pdf, other]

    cs.CL cs.LG stat.ML

    Syntax-Infused Transformer and BERT models for Machine Translation and Natural Language Understanding

    Authors: Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Shijing Si, Dinghan Shen, Dong Wang, Lawrence Carin

    Abstract: Attention-based models have shown significant improvement over traditional algorithms in several NLP tasks. The Transformer, for instance, is an illustrative example that generates abstract representations of tokens inputted to an encoder based on their relationships to all tokens in a sequence. Recent studies have shown that although such models are capable of learning syntactic features purely b…

    Submitted 9 November, 2019; originally announced November 2019.

  7. arXiv:1911.01562  [pdf, other]

    cs.LG cs.AI cs.RO

    DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning

    Authors: Bharathan Balaji, Sunil Mallya, Sahika Genc, Saurabh Gupta, Leo Dirac, Vineet Khare, Gourav Roy, Tao Sun, Yunzhe Tao, Brian Townsend, Eddie Calleja, Sunil Muralidhara, Dhanasekar Karuppasamy

    Abstract: DeepRacer is a platform for end-to-end experimentation with RL and can be used to systematically investigate the key challenges in developing intelligent control systems. Using the platform, we demonstrate how a 1/18th scale car can learn to drive autonomously using RL with a monocular camera. It is trained in simulation with no additional tuning in physical world and demonstrates: 1) formulation…

    Submitted 4 November, 2019; originally announced November 2019.

  8. arXiv:1906.08340  [pdf, other]

    cs.CL cs.LG

    Learning Compressed Sentence Representations for On-Device Text Processing

    Authors: Dinghan Shen, Pengyu Cheng, Dhanasekar Sundararaman, Xinyuan Zhang, Qian Yang, Meng Tang, Asli Celikyilmaz, Lawrence Carin

    Abstract: Vector representations of sentences, trained on massive text corpora, are widely used as generic sentence embeddings across a variety of NLP problems. The learned representations are generally assumed to be continuous and real-valued, giving rise to a large memory footprint and slow retrieval speed, which hinders their applicability to low-resource (memory and computation) platforms, such as mobil…

    Submitted 19 June, 2019; originally announced June 2019.

    Comments: To appear at ACL 2019

  9. arXiv:1806.01104  [pdf]

    cs.DC

    Consolidating the innovative concepts towards Exascale computing for Co-Design of Co-Applications ll: Co-Design Automation - Workload Characterization

    Authors: Dhanasekar, Anirudh Seshadri, Sudharshan Srinivasan, Suryanarayanan, Akash Sridhar

    Abstract: Many-core co-design is a complex task in which application complexity design space, heterogeneous many-core architecture design space, parallel programming language design space, simulator design space and optimizer design space should get integrated through a binding process and these design spaces, an ensemble of what is called many-core co-design spaces. It is indispensable to build a co-design…

    Submitted 29 April, 2018; originally announced June 2018.

    Comments: Revised Submission 2

  10. arXiv:1711.10002  [pdf]

    cs.SI

    TweetIT- Analyzing Topics for Twitter Users to garner Maximum Attention

    Authors: Dhanasekar Sundararaman, Priya Arora, Vishwanath Seshagiri

    Abstract: Twitter, a microblogging service, is today's most popular platform for communication in the form of short text messages, called Tweets. Users use Twitter to publish their content either for expressing concerns on information news or views on daily conversations. When this expression emerges, they are experienced by the worldwide distribution network of users and not only by the interlocutor(s). Dep…

    Submitted 27 November, 2017; originally announced November 2017.

  11. arXiv:1711.09737  [pdf]

    cs.CY

    Rating the online review rating system using Yelp

    Authors: Dhanasekar S, Balaji

    Abstract: The impact of ratings on a restaurant plays a major role in attracting future customers to that restaurant. The word of mouth has been systematically replaced with the online reviews. It gives a sense of satisfaction for people to know beforehand about the number of average stars the restaurant has acquired before stepping into a restaurant. However, these ratings are indirectly biased based on th…

    Submitted 10 May, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: Version 1

  12. arXiv:1711.06970  [pdf]

    cs.CY cs.LG

    How much is my car worth? A methodology for predicting used cars prices using Random Forest

    Authors: Nabarun Pal, Priya Arora, Dhanasekar Sundararaman, Puneet Kohli, Sai Sumanth Palakurthy

    Abstract: Cars are being sold more than ever. Developing countries adopt the lease culture instead of buying a new car due to affordability. Therefore, the rise of used cars sales is exponentially increasing. Car sellers sometimes take advantage of this scenario by listing unrealistic prices owing to the demand. Therefore, arises a need for a model that can assign a price for a vehicle by evaluating its fea…

    Submitted 19 November, 2017; originally announced November 2017.

    Comments: FICC Camera Ready

  13. arXiv:1706.05361  [pdf]

    cs.SI cs.IR

    Twigraph: Discovering and Visualizing Influential Words between Twitter Profiles

    Authors: Dhanasekar Sundararaman, Sudharshan Srinivasan

    Abstract: The social media craze is on an ever increasing spree, and people are connected with each other like never before, but these vast connections are visually unexplored. We propose a methodology Twigraph to explore the connections between persons using their Twitter profiles. First, we propose a hybrid approach of recommending social media profiles, articles, and advertisements to a user. The profiles…

    Submitted 29 June, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

  14. arXiv:1509.07543  [pdf, other]

    cs.HC cs.CV

    On Optimizing Human-Machine Task Assignments

    Authors: Andreas Veit, Michael Wilber, Rajan Vaish, Serge Belongie, James Davis, Vishal Anand, Anshu Aviral, Prithvijit Chakrabarty, Yash Chandak, Sidharth Chaturvedi, Chinmaya Devaraj, Ankit Dhall, Utkarsh Dwivedi, Sanket Gupte, Sharath N. Sridhar, Karthik Paga, Anuj Pahuja, Aditya Raisinghani, Ayush Sharma, Shweta Sharma, Darpana Sinha, Nisarg Thakkar, K. Bala Vignesh, Utkarsh Verma, Kanniganti Abhishek, et al. (26 additional authors not shown)

    Abstract: When crowdsourcing systems are used in combination with machine inference systems in the real world, they benefit the most when the machine system is deeply integrated with the crowd workers. However, if researchers wish to integrate the crowd with "off-the-shelf" machine classifiers, this deep integration is not always possible. This work explores two strategies to increase accuracy and decrease…

    Submitted 24 September, 2015; originally announced September 2015.

    Comments: HCOMP 2015 Work in Progress