Showing 1–10 of 10 results for author: Consul, S

  1. arXiv:2406.17642

    cs.CL cs.AI

    Banishing LLM Hallucinations Requires Rethinking Generalization

    Authors: Johnny Li, Saksham Consul, Eda Zhou, James Wong, Naila Farooqui, Yuxin Ye, Nithyashree Manohar, Zhuxiaona Wei, Tian Wu, Ben Echols, Sharon Zhou, Gregory Diamos

    Abstract: Despite their powerful chat, coding, and reasoning abilities, Large Language Models (LLMs) frequently hallucinate. Conventional wisdom suggests that hallucinations are a consequence of a balance between creativity and factuality, which can be mitigated, but not eliminated, by grounding the LLM in external knowledge sources. Through extensive systematic experiments, we show that these traditional a…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2308.14769

    q-bio.GN cs.LG

    XVir: A Transformer-Based Architecture for Identifying Viral Reads from Cancer Samples

    Authors: Shorya Consul, John Robertson, Haris Vikalo

    Abstract: It is estimated that approximately 15% of cancers worldwide can be linked to viral infections. The viruses that can cause or increase the risk of cancer include human papillomavirus, hepatitis B and C viruses, Epstein-Barr virus, and human immunodeficiency virus, to name a few. The computational analysis of the massive amounts of tumor DNA data, whose collection is enabled by the recent advancemen…

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 8 pages

    ACM Class: I.2.1; J.3

  3. arXiv:2307.10573

    cs.AI

    Invalid Logic, Equivalent Gains: The Bizarreness of Reasoning in Language Model Prompting

    Authors: Rylan Schaeffer, Kateryna Pistunova, Samar Khanna, Sarthak Consul, Sanmi Koyejo

    Abstract: Language models can be prompted to reason through problems in a manner that significantly improves performance. However, why such prompting improves performance is unclear. Recent work showed that using logically invalid Chain-of-Thought (CoT) prompting improves performance almost as much as logically valid CoT prompting, and that editing CoT prompts to replace problem-s…

    Submitted 22 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: ICML 2023 Workshop: Knowledge and Logical Reasoning in the Era of Data-driven Learning

  4. arXiv:2302.02785

    cs.AI

    An intelligent tutor for planning in large partially observable environments

    Authors: Lovis Heindrich, Saksham Consul, Falk Lieder

    Abstract: AI can not only outperform people in many planning tasks, but it can also teach them how to plan better. A recent and promising approach to improving human decision-making is to create intelligent tutors that utilize AI to discover and teach optimal planning strategies automatically. Prior work has shown that this approach can improve planning in artificial, fully observable planning tasks. Unlike…

    Submitted 6 June, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

  5. arXiv:2109.06505

    cs.AI

    Optimal To-Do List Gamification for Long Term Planning

    Authors: Saksham Consul, Jugoslav Stojcheski, Valkyrie Felso, Falk Lieder

    Abstract: Most people struggle with prioritizing work. While inexact heuristics have been developed over time, there is still no tractable principled algorithm for deciding which of the many possible tasks one should tackle in any given day, month, week, or year. Additionally, some people suffer from cognitive biases such as the present bias, leading to prioritization of their immediate experience over long…

    Submitted 15 September, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

  6. arXiv:2102.00521

    cs.AI

    Improving Human Decision-Making by Discovering Efficient Strategies for Hierarchical Planning

    Authors: Saksham Consul, Lovis Heindrich, Jugoslav Stojcheski, Falk Lieder

    Abstract: To make good decisions in the real world people need efficient planning strategies because their computational resources are limited. Knowing which planning strategies would work best for people in different situations would be very useful for understanding and improving human decision-making. But our ability to compute those strategies used to be limited to very small and very simple planning tas…

    Submitted 31 January, 2021; originally announced February 2021.

  7. arXiv:2009.07842

    cs.LG math.OC stat.ML

    Lower Bounds for Policy Iteration on Multi-action MDPs

    Authors: Kumar Ashutosh, Sarthak Consul, Bhishma Dedhia, Parthasarathi Khirwadkar, Sahil Shah, Shivaram Kalyanakrishnan

    Abstract: Policy Iteration (PI) is a classical family of algorithms to compute an optimal policy for any given Markov Decision Problem (MDP). The basic idea in PI is to begin with some initial policy and to repeatedly update the policy to one from an improving set, until an optimal policy is reached. Different variants of PI result from the (switching) rule used for improvement. An important theoretical que…

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: 8 pages, 3 diagrams, 2 tables. Paper in IEEE CDC 2020

  8. arXiv:2006.08795

    cs.LG stat.ML

    Balance is key: Private median splits yield high-utility random trees

    Authors: Shorya Consul, Sinead A. Williamson

    Abstract: Random forests are a popular method for classification and regression due to their versatility. However, this flexibility can come at the cost of user privacy, since training random forests requires multiple data queries, often on small, identifiable subsets of the training data. Privatizing these queries typically comes at a high utility cost, in large part because we are privatizing queries on s…

    Submitted 19 February, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 17 pages

  9. A Compressed Sensing Approach to Pooled RT-PCR Testing for COVID-19 Detection

    Authors: Sabyasachi Ghosh, Rishi Agarwal, Mohammad Ali Rehan, Shreya Pathak, Pratyush Agrawal, Yash Gupta, Sarthak Consul, Nimay Gupta, Ritika, Ritesh Goenka, Ajit Rajwade, Manoj Gopalkrishnan

    Abstract: We propose 'Tapestry', a novel approach to pooled testing with application to COVID-19 testing with quantitative Reverse Transcription Polymerase Chain Reaction (RT-PCR) that can result in shorter testing time and conservation of reagents and testing kits. Tapestry combines ideas from compressed sensing and combinatorial group testing with a novel noise model for RT-PCR used for generation of synt…

    Submitted 29 April, 2021; v1 submitted 16 May, 2020; originally announced May 2020.

    Comments: Accepted for publication at IEEE Open Journal of Signal Processing

  10. arXiv:1911.12842

    cs.LG math.OC stat.ML

    Analysis of Lower Bounds for Simple Policy Iteration

    Authors: Sarthak Consul, Bhishma Dedhia, Kumar Ashutosh, Parthasarathi Khirwadkar

    Abstract: Policy iteration is a family of algorithms that are used to find an optimal policy for a given Markov Decision Problem (MDP). Simple Policy iteration (SPI) is a type of policy iteration where the strategy is to change the policy at exactly one improvable state at every step. Melekopoglou and Condon [1990] showed an exponential lower bound on the number of iterations taken by SPI for a 2 action MDP…

    Submitted 28 November, 2019; originally announced November 2019.