Showing 1–19 of 19 results for author: Sai, A

Searching in archive cs.
  1. arXiv:2406.03893  [pdf, other]

    cs.CL

    How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?

    Authors: Anushka Singh, Ananya B. Sai, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Mitesh M Khapra

    Abstract: While machine translation evaluation has been studied primarily for high-resource languages, there has been a recent interest in evaluation for low-resource languages due to the increasing availability of data and models. In this paper, we focus on a zero-shot evaluation setting focusing on low-resource Indian languages, namely Assamese, Kannada, Maithili, and Punjabi. We collect sufficient Multi-…

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2404.09664  [pdf, other]

    cs.LG cs.CY

    Closing the Gap in the Trade-off between Fair Representations and Accuracy

    Authors: Biswajit Rout, Ananya B. Sai, Arun Rajkumar

    Abstract: The rapid developments of various machine learning models and their deployments in several applications has led to discussions around the importance of looking beyond the accuracies of these models. Fairness of such models is one such aspect that is deservedly gaining more attention. In this work, we analyse the natural language representations of documents and sentences (i.e., encodings) for any…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: DAI-24

  3. arXiv:2307.03322  [pdf, other]

    cs.CL

    BiPhone: Modeling Inter Language Phonetic Influences in Text

    Authors: Abhirut Gupta, Ananya B. Sai, Richard Sproat, Yuri Vasilevski, James S. Ren, Ambarish Jash, Sukhdeep S. Sodhi, Aravindan Raghuveer

    Abstract: A large number of people are forced to use the Web in a language they have low literacy in due to technology asymmetries. Written text in the second language (L2) from such users often contains a large number of errors that are influenced by their native language (L1). We propose a method to mine phoneme confusions (sounds in L2 that an L1 speaker is likely to conflate) for pairs of L1 and L2. The…

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted at ACL 2023

  4. arXiv:2306.04366  [pdf, other]

    cs.SI cs.AI cs.HC cs.LG

    Enhancing Worker Recruitment in Collaborative Mobile Crowdsourcing: A Graph Neural Network Trust Evaluation Approach

    Authors: Zhongwei Zhan, Yingjie Wang, Peiyong Duan, Akshita Maradapu Vera Venkata Sai, Zhaowei Liu, Chaocan Xiang, Xiangrong Tong, Weilong Wang, Zhipeng Cai

    Abstract: Collaborative Mobile Crowdsourcing (CMCS) allows platforms to recruit worker teams to collaboratively execute complex sensing tasks. The efficiency of such collaborations could be influenced by trust relationships among workers. To obtain the asymmetric trust values among all workers in the social network, the Trust Reinforcement Evaluation Framework (TREF) based on Graph Convolutional Neural Netw…

    Submitted 21 March, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: The article has been accepted by IEEE TMC, and its DOI is 10.1109/TMC.2024.3373469

  5. arXiv:2212.10180  [pdf, other]

    cs.CL

    IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages

    Authors: Ananya B. Sai, Vignesh Nagarajan, Tanay Dixit, Raj Dabre, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra

    Abstract: The rapid growth of machine translation (MT) systems has necessitated comprehensive studies to meta-evaluate evaluation metrics being used, which enables a better selection of metrics that best reflect MT quality. Unfortunately, most of the research focuses on high-resource languages, mainly English, the observations for which may not always apply to other languages. Indian languages, having over…

    Submitted 3 July, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023 long paper

  6. arXiv:2210.11664  [pdf, other]

    cs.CY cs.DC

    Promoting Rigour in Blockchains Energy & Environmental Footprint Research: A Systematic Literature Review

    Authors: Ashish Rajendra Sai, Harald Vranken

    Abstract: There is a growing interest in understanding the energy and environmental footprint of digital currencies, specifically in cryptocurrencies such as Bitcoin and Ethereum. These cryptocurrencies are operated by a geographically distributed network of computing nodes, making it hard to accurately estimate their energy consumption. Existing studies, both in academia and industry, attempt to model th…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: This article is currently under peer review

  7. arXiv:2112.02721  [pdf, other]

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo, et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split…

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  8. arXiv:2109.05771  [pdf, other]

    cs.CL

    Perturbation CheckLists for Evaluating NLG Evaluation Metrics

    Authors: Ananya B. Sai, Tanay Dixit, Dev Yashpal Sheth, Sreyas Mohan, Mitesh M. Khapra

    Abstract: Natural Language Generation (NLG) evaluation is a multifaceted task requiring assessment of multiple desirable criteria, e.g., fluency, coherency, coverage, relevance, adequacy, overall quality, etc. Across existing datasets for 6 NLG tasks, we observe that the human evaluation scores on these multiple criteria are often not correlated. For example, there is a very low correlation between human sc…

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021. See https://iitmnlp.github.io/EvalEval/ for our templates and code

  9. arXiv:2108.13599  [pdf, other]

    cs.RO

    Through the Looking Glass: Diminishing Occlusions in Robot Vision Systems with Mirror Reflections

    Authors: Kentaro Yoshioka, Hidenori Okuni, Tuan Thanh Ta, Akihide Sai

    Abstract: The quality of robot vision greatly affects the performance of automation systems, where occlusions stand as one of the biggest challenges. If the target is occluded from the sensor, detecting and grasping such objects become very challenging. For example, when multiple robot arms cooperate in a single workplace, occlusions will be created under the robot arm itself and hide objects underneath. Wh…

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: Accepted to IROS 2021

  10. arXiv:2108.12901  [pdf, other]

    cs.SE

    BoostNSift: A Query Boosting and Code Sifting Technique for Method Level Bug Localization

    Authors: Abdul Razzaq, Jim Buckley, James Vincent Patten, Muslim Chochlov, Ashish Rajendra Sai

    Abstract: Locating bugs is an important, but effort-intensive and time-consuming task, when dealing with large-scale systems. To address this, Information Retrieval (IR) techniques are increasingly being used to suggest potential buggy source code locations, for given bug reports. While IR techniques are very scalable, in practice their effectiveness in accurately localizing bugs in a software system remain…

    Submitted 29 August, 2021; originally announced August 2021.

  11. arXiv:2009.12542  [pdf, other]

    cs.CY cs.CR

    Taxonomy of Centralization in Public Blockchain Systems: A Systematic Literature Review

    Authors: Ashish Rajendra Sai, Jim Buckley, Brian Fitzgerald, Andrew Le Gear

    Abstract: Bitcoin introduced delegation of control over a monetary system from a select few to all who participate in that system. This delegation is known as the decentralization of controlling power and is a powerful security mechanism for the ecosystem. After the introduction of Bitcoin, the field of cryptocurrency has seen widespread attention from industry and academia, so much so that the original nov…

    Submitted 26 September, 2020; originally announced September 2020.

    Comments: Currently under review at ELS Information Processing and Management

  12. arXiv:2009.11321  [pdf, other]

    cs.CL

    Improving Dialog Evaluation with a Multi-reference Adversarial Dataset and Large Scale Pretraining

    Authors: Ananya B. Sai, Akash Kumar Mohankumar, Siddhartha Arora, Mitesh M. Khapra

    Abstract: There is an increasing focus on model-based dialog evaluation metrics such as ADEM, RUBER, and the more recent BERT-based metrics. These models aim to assign a high score to all relevant responses and a low score to all irrelevant responses. Ideally, such models should be trained using multiple relevant and irrelevant responses for any given context. However, no such data is publicly available, an…

    Submitted 23 September, 2020; originally announced September 2020.

    Comments: Accepted for publication in TACL

  13. arXiv:2008.12009  [pdf, other]

    cs.CL

    A Survey of Evaluation Metrics Used for NLG Systems

    Authors: Ananya B. Sai, Akash Kumar Mohankumar, Mitesh M. Khapra

    Abstract: The success of Deep Learning has created a surge in interest in a wide range of Natural Language Generation (NLG) tasks. Deep Learning has not only pushed the state of the art in several existing NLG tasks but has also facilitated researchers to explore various newer NLG tasks such as image captioning. Such rapid progress in NLG has necessitated the development of accurate automatic evaluation m…

    Submitted 5 October, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: A condensed version of this paper is submitted to ACM CSUR

  14. arXiv:2007.08222  [pdf, other]

    cs.SE cs.PL

    Inheritance software metrics on smart contracts

    Authors: Ashish Rajendra Sai, Conor Holmes, Jim Buckley, Andrew Le Gear

    Abstract: Blockchain systems have gained substantial traction recently, partly due to the potential of decentralized immutable mediation of economic activities. Ethereum is a prominent example that has the provision for executing stateful computing scripts known as Smart Contracts. These smart contracts resemble traditional programs, but with immutability being the core differentiating factor. Given their i…

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: Accepted by International Conference on Program Comprehension (ICPC 2020)

  15. arXiv:2007.05611  [pdf, other]

    cs.LG cs.AI stat.ML

    Deep Contextual Clinical Prediction with Reverse Distillation

    Authors: Rohan S. Kodialam, Rebecca Boiarsky, Justin Lim, Neil Dixit, Aditya Sai, David Sontag

    Abstract: Healthcare providers are increasingly using machine learning to predict patient outcomes to make meaningful interventions. However, despite innovations in this area, deep learning models often struggle to match performance of shallow linear models in predicting these outcomes, making it difficult to leverage such techniques in practice. In this work, motivated by the task of clinical prediction fr…

    Submitted 16 December, 2020; v1 submitted 10 July, 2020; originally announced July 2020.

    Comments: To appear in AAAI 2021

  16. arXiv:1904.02665  [pdf, ps, other]

    cs.CL

    Frustratingly Poor Performance of Reading Comprehension Models on Non-adversarial Examples

    Authors: Soham Parikh, Ananya B. Sai, Preksha Nema, Mitesh M. Khapra

    Abstract: When humans learn to perform a difficult task (say, reading comprehension (RC) over longer passages), it is typically the case that their performance improves significantly on an easier version of this task (say, RC over shorter passages). Ideally, we would want an intelligent agent to also exhibit such a behavior. However, on experimenting with state of the art RC models using the standard RACE d…

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: 8 pages

  17. arXiv:1904.02651  [pdf, other]

    cs.CL

    ElimiNet: A Model for Eliminating Options for Reading Comprehension with Multiple Choice Questions

    Authors: Soham Parikh, Ananya B. Sai, Preksha Nema, Mitesh M. Khapra

    Abstract: The task of Reading Comprehension with Multiple Choice Questions, requires a human (or machine) to read a given passage, question pair and select one of the n given options. The current state of the art model for this task first computes a question-aware representation for the passage and then selects the option which has the maximum similarity with this representation. However, when humans perfor…

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: IJCAI-18

    Journal ref: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (2018) Main track. Pages 4272-4278

  18. arXiv:1902.08832  [pdf, other]

    cs.CL

    Re-evaluating ADEM: A Deeper Look at Scoring Dialogue Responses

    Authors: Ananya B. Sai, Mithun Das Gupta, Mitesh M. Khapra, Mukundhan Srinivasan

    Abstract: Automatically evaluating the quality of dialogue responses for unstructured domains is a challenging problem. ADEM (Lowe et al. 2017) formulated the automatic evaluation of dialogue systems as a learning problem and showed that such a model was able to predict responses which correlate significantly with human judgements, both at utterance and system level. Their system was shown to have beaten wor…

    Submitted 23 February, 2019; originally announced February 2019.

    Comments: Accepted as a long paper in the proceedings of AAAI-2019

  19. arXiv:1808.09335  [pdf]

    cs.OH

    PhaseMAC: A 14 TOPS/W 8bit GRO based Phase Domain MAC Circuit for In-Sensor-Computed Deep Learning Accelerators

    Authors: Kentaro Yoshioka, Yosuke Toyama, Koichiro Ban, Daisuke Yashima, Shigeru Maya, Akihide Sai, Kohei Onizuka

    Abstract: PhaseMAC (PMAC), a phase domain Gated-Ring-Oscillator (GRO) based 8bit MAC circuit, is proposed to minimize both area and power consumption of deep learning accelerators. PMAC composes of only digital cells and consumes significantly smaller power than standard digital designs, owing to its efficient analog accumulation nature. It occupies 26.6 times smaller area than conventional analog designs,…

    Submitted 23 August, 2018; originally announced August 2018.

    Comments: Presented at Symp. VLSI 2018