Showing 1–32 of 32 results for author: Tadepalli, P

  1. arXiv:2401.16032  [pdf, other]

    astro-ph.SR astro-ph.HE astro-ph.IM physics.space-ph

    A Modelling Investigation for Solar Flare X-ray Stereoscopy with Solar Orbiter/STIX and Earth Orbiting Missions

    Authors: Natasha L. S. Jeffrey, Säm Krucker, Morgan Stores, Eduard P. Kontar, Pascal Saint-Hilaire, Andrea F. Battaglia, Laura Hayes, Hannah Collier, Astrid Veronig, Yang Su, Srikar Paavan Tadepalli, Fanxiaoyu Xia

    Abstract: The Spectrometer/Telescope for Imaging X-rays (STIX) on board Solar Orbiter (SolO) provides a unique opportunity to systematically perform stereoscopic X-ray observations of solar flares with current and upcoming X-ray missions at Earth. These observations will produce the first reliable measurements of hard X-ray (HXR) directivity in decades, providing a new diagnostic of the flare-accelerated el…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in ApJ (January 2024)

  2. arXiv:2310.05308  [pdf, other]

    cs.LG cs.DS stat.ML

    Adversarial Attacks on Combinatorial Multi-Armed Bandits

    Authors: Rishab Balasubramanian, Jiawei Li, Prasad Tadepalli, Huazheng Wang, Qingyun Wu, Haoyu Zhao

    Abstract: We study reward poisoning attacks on Combinatorial Multi-armed Bandits (CMAB). We first provide a sufficient and necessary condition for the attackability of CMAB, a notion to capture the vulnerability and robustness of CMAB. The attackability condition depends on the intrinsic properties of the corresponding CMAB instance such as the reward distributions of super arms and outcome distributions of…

    Submitted 3 June, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: 28 pages, Accepted to ICML 2024

  3. arXiv:2307.13692  [pdf, other]

    cs.CL cs.LG

    ARB: Advanced Reasoning Benchmark for Large Language Models

    Authors: Tomohiro Sawada, Daniel Paleka, Alexander Havrilla, Pranav Tadepalli, Paula Vidas, Alexander Kranias, John J. Nay, Kshitij Gupta, Aran Komatsuzaki

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance on various quantitative reasoning and knowledge benchmarks. However, many of these benchmarks are losing utility as LLMs get increasingly high scores, despite not yet reaching expert performance in these domains. We introduce ARB, a novel benchmark composed of advanced reasoning problems in multiple fields. ARB presents a more c…

    Submitted 27 July, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Submitted to NeurIPS Datasets and Benchmarks Track

  4. arXiv:2206.13477  [pdf, other]

    cs.AI

    Parametrically Retargetable Decision-Makers Tend To Seek Power

    Authors: Alexander Matt Turner, Prasad Tadepalli

    Abstract: If capable AI agents are generally incentivized to seek power in service of the objectives we specify for them, then these systems will pose enormous risks, in addition to enormous benefits. In fully observable environments, most reward functions have an optimal policy which seeks power by keeping options open and staying alive. However, the real world is neither fully observable, nor must trained…

    Submitted 11 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: 10-page main paper, 36 pages total, poster at NeurIPS 2022

  5. arXiv:2206.11812  [pdf, other]

    cs.AI

    Formalizing the Problem of Side Effect Regularization

    Authors: Alexander Matt Turner, Aseem Saxena, Prasad Tadepalli

    Abstract: AI objectives are often hard to specify properly. Some approaches tackle this problem by regularizing the AI's side effects: Agents must weigh off "how much of a mess they make" with an imperfectly specified proxy objective. We propose a formal criterion for side effect regularization via the assistance game framework. In these games, the agent solves a partially observable Markov decision process…

    Submitted 8 November, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: 14 pages, accepted to ML Safety Workshop at NeurIPS 2022. Alexander Turner and Aseem Saxena contributed equally

  6. arXiv:2206.07904  [pdf, other]

    cs.LG

    Explainable Models via Compression of Tree Ensembles

    Authors: Siwen Yan, Sriraam Natarajan, Saket Joshi, Roni Khardon, Prasad Tadepalli

    Abstract: Ensemble models (bagging and gradient-boosting) of relational decision trees have proved to be one of the most effective learning methods in the area of probabilistic logic models (PLMs). While effective, they lose one of the most important aspects of PLMs -- interpretability. In this paper we consider the problem of compressing a large set of learned trees into a single explainable model. To this…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 24 pages, 14 figures

  7. arXiv:2110.08318  [pdf, other]

    cs.AI

    Dynamic probabilistic logic models for effective abstractions in RL

    Authors: Harsha Kokel, Arjun Manoharan, Sriraam Natarajan, Balaraman Ravindran, Prasad Tadepalli

    Abstract: State abstraction enables sample-efficient learning and better task transfer in complex reinforcement learning environments. Recently, we proposed RePReL (Kokel et al. 2021), a hierarchical framework that leverages a relational planner to provide useful state abstractions for learning. We present a brief overview of this framework and the use of a dynamic probabilistic logic model to design these…

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: Accepted at StarAI 2021 (held in conjunction with IJCLR 2021)

  8. arXiv:2109.06365  [pdf, other]

    cs.CV cs.LG

    From Heatmaps to Structural Explanations of Image Classifiers

    Authors: Li Fuxin, Zhongang Qi, Saeed Khorram, Vivswan Shitole, Prasad Tadepalli, Minsuk Kahng, Alan Fern

    Abstract: This paper summarizes our endeavors in the past few years in terms of explaining image classifiers, with the aim of including negative results and insights we have gained. The paper starts with describing the explainable neural network (XNN), which attempts to extract and visualize several high-level concepts purely from the deep network, without relying on human linguistic concepts. This helps us…

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: Submitted to Applied AI Letters

    Journal ref: Applied AI Letters. 2021;2:e46

  9. arXiv:2109.04778  [pdf, other]

    cs.CL cs.AI

    Improving Multilingual Translation by Representation and Gradient Regularization

    Authors: Yilin Yang, Akiko Eriguchi, Alexandre Muzio, Prasad Tadepalli, Stefan Lee, Hany Hassan

    Abstract: Multilingual Neural Machine Translation (NMT) enables one model to serve all translation directions, including ones that are unseen during training, i.e. zero-shot translation. Despite being theoretically attractive, current models often produce low quality translations -- commonly failing to even produce outputs in the right target language. In this work, we observe that off-target translation is…

    Submitted 18 January, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 (Oral). Code and data: https://github.com/yilinyang7/fairseq_multi_fix

  10. Imaging and Spectral Observations of a Type-II Radio Burst Revealing the Section of the CME-Driven Shock that Accelerates Electrons

    Authors: Satabdwa Majumdar, Srikar Paavan Tadepalli, Samriddhi Sankar Maity, Ketaki Deshpande, Anshu Kumari, Ritesh Patel, Nat Gopalswamy

    Abstract: We report on a multi-wavelength analysis of the 26 January 2014 solar eruption involving a coronal mass ejection (CME) and a Type-II radio burst, performed by combining data from various space- and ground-based instruments. An increasing standoff distance with height shows the presence of a strong shock, which further manifests itself in the continuation of the metric Type-II burst into the decamet…

    Submitted 17 March, 2021; originally announced March 2021.

    Comments: 19 pages, 7 figures, 1 table; accepted for publication in Solar Physics

  11. arXiv:2011.06733  [pdf, other]

    cs.CV cs.LG

    One Explanation is Not Enough: Structured Attention Graphs for Image Classification

    Authors: Vivswan Shitole, Li Fuxin, Minsuk Kahng, Prasad Tadepalli, Alan Fern

    Abstract: Attention maps are a popular way of explaining the decisions of convolutional networks for image classification. Typically, for each image of interest, a single attention map is produced, which assigns weights to pixels based on their importance to the classification. A single attention map, however, provides an incomplete understanding since there are often many other maps that explain a classifi…

    Submitted 7 November, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 26 pages, 25 figures

    Journal ref: NeurIPS 2021

  12. arXiv:2010.08891  [pdf, other]

    cs.LG cs.AI stat.ML

    DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs

    Authors: Aayam Shrestha, Stefan Lee, Prasad Tadepalli, Alan Fern

    Abstract: We study an approach to offline reinforcement learning (RL) based on optimally solving finitely-represented MDPs derived from a static dataset of experience. This approach can be applied on top of any learned representation and has the potential to easily support multiple solution objectives as well as zero-shot adjustment to changing environments and goals. Our main contribution is to introduce t…

    Submitted 17 October, 2020; originally announced October 2020.

    Comments: Preprint. Under review at ICLR 2021

  13. arXiv:2010.02648  [pdf, other]

    cs.CL cs.AI

    On the Sub-Layer Functionalities of Transformer Decoder

    Authors: Yilin Yang, Longyue Wang, Shuming Shi, Prasad Tadepalli, Stefan Lee, Zhaopeng Tu

    Abstract: There have been significant efforts to interpret the encoder of Transformer-based encoder-decoder architectures for neural machine translation (NMT); meanwhile, the decoder remains largely unexamined despite its critical role. During translation, the decoder must predict output tokens by considering both the source-language text from the encoder and the target-language prefix produced in previous…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Findings of the 2020 Conference on Empirical Methods in Natural Language Processing (Long)

  14. arXiv:2006.06547  [pdf, other]

    cs.AI

    Avoiding Side Effects in Complex Environments

    Authors: Alexander Matt Turner, Neale Ratzlaff, Prasad Tadepalli

    Abstract: Reward function specification can be difficult. Rewarding the agent for making a widget may be easy, but penalizing the multitude of possible negative side effects is hard. In toy environments, Attainable Utility Preservation (AUP) avoided side effects by penalizing shifts in the ability to achieve randomly generated goals. We scale this approach to large, randomly generated environments based on…

    Submitted 22 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted as spotlight paper at NeurIPS 2020. 10 pages main paper; 19 pages with appendices

  15. arXiv:2005.14271  [pdf, other]

    cs.IR

    Relation Extraction with Explanation

    Authors: Hamed Shahbazi, Xiaoli Z. Fern, Reza Ghaeini, Prasad Tadepalli

    Abstract: Recent neural models for relation extraction with distant supervision alleviate the impact of irrelevant sentences in a bag by learning importance weights for the sentences. Efforts thus far have focused on improving extraction accuracy but little is known about their explainability. In this work we annotate a test set with ground-truth sentence-level explanations to evaluate the quality of explan…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: Accepted by ACL 2020

    Journal ref: ACL 2020

  16. arXiv:1912.01683  [pdf, other]

    cs.AI

    Optimal Policies Tend to Seek Power

    Authors: Alexander Matt Turner, Logan Smith, Rohin Shah, Andrew Critch, Prasad Tadepalli

    Abstract: Some researchers speculate that intelligent reinforcement learning (RL) agents would be incentivized to seek resources and power in pursuit of their objectives. Other researchers point out that RL agents need not have human-like power-seeking instincts. To clarify this discussion, we develop the first formal theory of the statistical tendencies of optimal policies. In the context of Markov decisio…

    Submitted 28 January, 2023; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: Accepted to NeurIPS 2021 as spotlight paper. 12 pages, 44 pages with appendices. Since the 2021 acceptance, we updated the paper to point out that optimal policies can be qualitatively divorced from real-world learned policies

  17. arXiv:1910.00614  [pdf, other]

    cs.AI

    The Choice Function Framework for Online Policy Improvement

    Authors: Murugeswari Issakkimuthu, Alan Fern, Prasad Tadepalli

    Abstract: There are notable examples of online search improving over hand-coded or learned policies (e.g. AlphaZero) for sequential decision making. It is not clear, however, whether or not policy improvement is guaranteed for many of these approaches, even when given a perfect evaluation function and transition model. Indeed, simple counterexamples show that seemingly reasonable online search procedures c…

    Submitted 7 October, 2019; v1 submitted 1 October, 2019; originally announced October 2019.

  18. arXiv:1908.05762  [pdf, ps, other]

    cs.CL cs.IR cs.LG stat.ML

    Entity-aware ELMo: Learning Contextual Entity Representation for Entity Disambiguation

    Authors: Hamed Shahbazi, Xiaoli Z. Fern, Reza Ghaeini, Rasha Obeidat, Prasad Tadepalli

    Abstract: We present a new local entity disambiguation system. The key to our system is a novel approach for learning entity representations. In our approach we learn an entity aware extension of Embedding for Language Model (ELMo) which we call Entity-ELMo (E-ELMo). Given a paragraph containing one or more named entity mentions, each mention is first defined as a function of the entire paragraph (including…

    Submitted 22 August, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

  19. Conservative Agency via Attainable Utility Preservation

    Authors: Alexander Matt Turner, Dylan Hadfield-Menell, Prasad Tadepalli

    Abstract: Reward functions are easy to misspecify; although designers can make corrections after observing mistakes, an agent pursuing a misspecified reward function can irreversibly change the state of its environment. If that change precludes optimization of the correctly specified reward function, then correction is futile. For example, a robotic factory assistant could break expensive equipment due to a…

    Submitted 10 June, 2020; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: Published in AI, Ethics, and Society 2020

  20. arXiv:1902.08649  [pdf, other]

    cs.CL cs.AI cs.LG

    Saliency Learning: Teaching the Model Where to Pay Attention

    Authors: Reza Ghaeini, Xiaoli Z. Fern, Hamed Shahbazi, Prasad Tadepalli

    Abstract: Deep learning has emerged as a compelling solution to many NLP tasks with remarkable performance. However, due to their opacity, such models are hard to interpret and trust. Recent work on explaining deep models has introduced approaches to provide insights toward the model's behaviour and predictions, which are helpful for assessing the reliability of the model's predictions. However, such metho…

    Submitted 4 April, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: Accepted as a short paper at NAACL 2019. 10 pages, 2 figures, 6 tables

    Journal ref: NAACL 2019

  21. arXiv:1812.07150  [pdf, other]

    cs.LG cs.CV stat.ML

    Interactive Naming for Explaining Deep Neural Networks: A Formative Study

    Authors: Mandana Hamidi-Haines, Zhongang Qi, Alan Fern, Fuxin Li, Prasad Tadepalli

    Abstract: We consider the problem of explaining the decisions of deep neural networks for image recognition in terms of human-recognizable visual concepts. In particular, given a test set of images, we aim to explain each classification in terms of a small number of image regions, or activation maps, which have been associated with semantic concepts by a human annotator. This allows for generating summary v…

    Submitted 20 December, 2018; v1 submitted 17 December, 2018; originally announced December 2018.

  22. arXiv:1809.03680  [pdf, other]

    cs.CL

    Learning Scripts as Hidden Markov Models

    Authors: J. Walker Orr, Prasad Tadepalli, Janardhan Rao Doppa, Xiaoli Fern, Thomas G. Dietterich

    Abstract: Scripts have been proposed to model the stereotypical event sequences found in narratives. They can be applied to make a variety of inferences including filling gaps in the narratives and resolving ambiguous references. This paper proposes the first formal framework for scripts based on Hidden Markov Models (HMMs). Our framework supports robust inference and learning algorithms, which are lacking…

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: 7 pages, AAAI 2014

  23. arXiv:1809.03051  [pdf, other]

    cs.CL cs.AI cs.LG

    Attentional Multi-Reading Sarcasm Detection

    Authors: Reza Ghaeini, Xiaoli Z. Fern, Prasad Tadepalli

    Abstract: Recognizing sarcasm often requires a deep understanding of multiple sources of information, including the utterance, the conversational context, and real world facts. Most of the current sarcasm detection systems consider only the utterance in isolation. There are some limited attempts toward taking into account the conversational context. In this paper, we propose an interpretable end-to-end mode…

    Submitted 9 September, 2018; originally announced September 2018.

  24. arXiv:1808.08504  [pdf, other]

    cs.CL

    Event Detection with Neural Networks: A Rigorous Empirical Evaluation

    Authors: J. Walker Orr, Prasad Tadepalli, Xiaoli Fern

    Abstract: Detecting events and classifying them into predefined types is an important step in knowledge extraction from natural language texts. While the neural network models have generally led the state-of-the-art, the differences in performance between different architectures have not been rigorously studied. In this paper we present a novel GRU-based model that combines syntactic information along with…

    Submitted 26 August, 2018; originally announced August 2018.

    Comments: 5 pages, EMNLP 2018

  25. arXiv:1808.03894  [pdf, other]

    cs.CL cs.AI cs.LG

    Interpreting Recurrent and Attention-Based Neural Models: a Case Study on Natural Language Inference

    Authors: Reza Ghaeini, Xiaoli Z. Fern, Prasad Tadepalli

    Abstract: Deep learning models have achieved remarkable success in natural language inference (NLI) tasks. While these models are widely explored, they are hard to interpret and it is often unclear how and why they actually work. In this paper, we take a step toward explaining such deep learning based models through a case study on a popular neural model for NLI. In particular, we propose to interpret the i…

    Submitted 12 August, 2018; originally announced August 2018.

    Comments: 11 pages, 11 figures, accepted as a short paper at EMNLP 2018

    Journal ref: EMNLP 2018

  26. arXiv:1806.07495  [pdf, other]

    cs.CL cs.AI

    Joint Neural Entity Disambiguation with Output Space Search

    Authors: Hamed Shahbazi, Xiaoli Z. Fern, Reza Ghaeini, Chao Ma, Rasha Obeidat, Prasad Tadepalli

    Abstract: In this paper, we present a novel model for entity disambiguation that combines both local contextual information and global evidence through Limited Discrepancy Search (LDS). Given an input document, we start from a complete solution constructed by a local model and conduct a search in the space of possible corrections to improve the local solution from a global viewpoint. Our search utilizes a…

    Submitted 19 June, 2018; originally announced June 2018.

    Comments: Accepted as a long paper at COLING 2018, 11 pages

    Journal ref: Proceedings of COLING 2018

  27. arXiv:1805.10528  [pdf, other]

    cs.CL cs.AI

    Dependent Gated Reading for Cloze-Style Question Answering

    Authors: Reza Ghaeini, Xiaoli Z. Fern, Hamed Shahbazi, Prasad Tadepalli

    Abstract: We present a novel deep learning architecture to address the cloze-style question answering task. Existing approaches employ reading mechanisms that do not fully exploit the interdependency between the document and the query. In this paper, we propose a novel dependent gated reading bidirectional GRU network (DGR) to efficiently model the relationship between the document and the query duri…

    Submitted 1 June, 2018; v1 submitted 26 May, 2018; originally announced May 2018.

    Comments: Accepted as a long paper at COLING 2018, 16 pages, 12 figures

    Journal ref: COLING 2018

  28. arXiv:1802.05672  [pdf, other]

    cs.CL

    Event Nugget Detection with Forward-Backward Recurrent Neural Networks

    Authors: Reza Ghaeini, Xiaoli Z. Fern, Liang Huang, Prasad Tadepalli

    Abstract: Traditional event detection methods heavily rely on manually engineered rich features. Recent deep learning approaches alleviate this problem by automatic feature engineering. But such efforts, like traditional methods, have so far only focused on single-token event mentions, whereas in practice events can also be a phrase. We instead use forward-backward recurrent neural networks (FBRNNs) to detect…

    Submitted 15 February, 2018; originally announced February 2018.

    Comments: Published as a short paper at ACL 2016. The main purpose of this submission is to add this paper to arXiv

    Report number: http://www.aclweb.org/anthology/P16-2060

    Journal ref: ACL 2016

  29. arXiv:1404.5511  [pdf, other]

    cs.LG

    Coactive Learning for Locally Optimal Problem Solving

    Authors: Robby Goetschalckx, Alan Fern, Prasad Tadepalli

    Abstract: Coactive learning is an online problem solving setting where the solutions provided by a solver are interactively improved by a domain expert, which in turn drives learning. In this paper we extend the study of coactive learning to problems where obtaining a globally optimal or near-optimal solution may be intractable or where an expert can only be expected to make small, local improvements to a c…

    Submitted 18 April, 2014; originally announced April 2014.

    Comments: AAAI 2014 paper, including appendices

  30. arXiv:1306.6302  [pdf, other]

    cs.AI cs.LG

    Solving Relational MDPs with Exogenous Events and Additive Rewards

    Authors: S. Joshi, R. Khardon, P. Tadepalli, A. Raghavan, A. Fern

    Abstract: We formalize a simple but natural subclass of service domains for relational planning problems with object-centered, independent exogenous events and additive rewards capturing, for example, problems in inventory control. Focusing on this subclass, we present a new symbolic planning algorithm which is the first algorithm that has explicit performance guarantees for relational MDPs with exogenous e…

    Submitted 27 June, 2013; v1 submitted 26 June, 2013; originally announced June 2013.

    Comments: This is an extended version of our ECML/PKDD 2013 paper including all proofs. (v2 corrects typos and updates ref [10] to cite this report as the full version)

  31. arXiv:1206.6460  [pdf]

    cs.LG cs.AI stat.ML

    Output Space Search for Structured Prediction

    Authors: Janardhan Rao Doppa, Alan Fern, Prasad Tadepalli

    Abstract: We consider a framework for structured prediction based on search in the space of complete structured outputs. Given a structured input, an output is produced by running a time-bounded search procedure guided by a learned cost function, and then returning the least cost output uncovered during the search. This framework can be instantiated for a wide range of search spaces and search procedures, a…

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  32. arXiv:cs/9605105  [pdf, ps]

    cs.AI

    A Formal Framework for Speedup Learning from Problems and Solutions

    Authors: P. Tadepalli, B. K. Natarajan

    Abstract: Speedup learning seeks to improve the computational efficiency of problem solving with experience. In this paper, we develop a formal framework for learning efficient problem solving from random problems and their solutions. We apply this framework to two different representations of learned knowledge, namely control rules and macro-operators, and prove theorems that identify sufficient conditio…

    Submitted 30 April, 1996; originally announced May 1996.

    Comments: See http://www.jair.org/ for any accompanying files

    Journal ref: Journal of Artificial Intelligence Research, Vol 4, (1996), 445-475