Skip to main content

Showing 1–27 of 27 results for author: Raghavan, P

.
  1. arXiv:2404.00386  [pdf, other

    cs.CL

    Jetsons at FinNLP 2024: Towards Understanding the ESG Impact of a News Article using Transformer-based Models

    Authors: Parag Pravin Dakle, Alolika Gon, Sihan Zha, Liang Wang, SaiKrishna Rallabandi, Preethi Raghavan

    Abstract: In this paper, we describe the different approaches explored by the Jetsons team for the Multi-Lingual ESG Impact Duration Inference (ML-ESG-3) shared task. The shared task focuses on predicting the duration and type of the ESG impact of a news article. The shared task dataset consists of 2,059 news titles and articles in English, French, Korean, and Japanese languages. For the impact duration cla… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  2. arXiv:2402.17882  [pdf, other

    cs.CL

    BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra

    Authors: Parker Glenn, Parag Pravin Dakle, Liang Wang, Preethi Raghavan

    Abstract: Many existing end-to-end systems for hybrid question answering tasks can often be boiled down to a "prompt-and-pray" paradigm, where the user has limited control and insight into the intermediate reasoning steps used to achieve the final result. Additionally, due to the context size limitation of many transformer-based LLMs, it is often not reasonable to expect that the full structured and unstruc… ▽ More

    Submitted 10 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: For associated codebase, see https://github.com/parkervg/blendsql

  3. arXiv:2402.16882  [pdf, other

    physics.chem-ph cs.AI cs.LG q-bio.BM

    Substrate Scope Contrastive Learning: Repurposing Human Bias to Learn Atomic Representations

    Authors: Wenhao Gao, Priyanka Raghavan, Ron Shprints, Connor W. Coley

    Abstract: Learning molecular representation is a critical step in molecular machine learning that significantly influences modeling success, particularly in data-scarce situations. The concept of broadly pre-training neural networks has advanced fields such as computer vision, natural language processing, and protein engineering. However, similar approaches for small organic molecules have not achieved comp… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  4. arXiv:2312.01143  [pdf, other

    cs.CL

    Towards leveraging LLMs for Conditional QA

    Authors: Syed-Amad Hussain, Parag Pravin Dakle, SaiKrishna Rallabandi, Preethi Raghavan

    Abstract: This study delves into the capabilities and limitations of Large Language Models (LLMs) in the challenging domain of conditional question-answering. Utilizing the Conditional Question Answering (CQA) dataset and focusing on generative models like T5 and UL2, we assess the performance of LLMs across diverse question types. Our findings reveal that fine-tuned LLMs can surpass the state-of-the-art (S… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  5. arXiv:2309.08777  [pdf, other

    cs.CL

    Self-training Strategies for Sentiment Analysis: An Empirical Study

    Authors: Haochen Liu, Sai Krishna Rallabandi, Yi**g Wu, Parag Pravin Dakle, Preethi Raghavan

    Abstract: Sentiment analysis is a crucial task in natural language processing that involves identifying and extracting subjective sentiment from text. Self-training has recently emerged as an economical and efficient technique for develo** sentiment analysis models by leveraging a small amount of labeled data and a large amount of unlabeled data. However, given a set of training data, how to utilize them… ▽ More

    Submitted 3 February, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted by EACL Findings 2024

  6. arXiv:2305.19974  [pdf, other

    cs.CL

    Correcting Semantic Parses with Natural Language through Dynamic Schema Encoding

    Authors: Parker Glenn, Parag Pravin Dakle, Preethi Raghavan

    Abstract: In addressing the task of converting natural language to SQL queries, there are several semantic and syntactic challenges. It becomes increasingly important to understand and remedy the points of failure as the performance of semantic parsing systems improve. We explore semantic parse correction with natural language feedback, proposing a new solution built on the success of autoregressive decoder… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: ACL 2023 Workshop on NLP for Conversational AI

  7. arXiv:2304.13689  [pdf, other

    cs.CL cs.AI

    HeySQuAD: A Spoken Question Answering Dataset

    Authors: Yi**g Wu, SaiKrishna Rallabandi, Ravisutha Srinivasamurthy, Parag Pravin Dakle, Alolika Gon, Preethi Raghavan

    Abstract: Spoken question answering (SQA) systems are critical for digital assistants and other real-world use cases, but evaluating their performance is a challenge due to the importance of human-spoken questions. This study presents a new large-scale community-shared SQA dataset called HeySQuAD, which includes 76k human-spoken questions, 97k machine-generated questions, and their corresponding textual ans… ▽ More

    Submitted 27 February, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

  8. arXiv:2211.14865  [pdf, other

    cs.CL

    Understanding BLOOM: An empirical study on diverse NLP tasks

    Authors: Parag Pravin Dakle, SaiKrishna Rallabandi, Preethi Raghavan

    Abstract: We view the landscape of large language models (LLMs) through the lens of the recently released BLOOM model to understand the performance of BLOOM and other decoder-only LLMs compared to BERT-style encoder-only models. We achieve this by evaluating the smaller BLOOM model variants (\textit{350m/560m} and \textit{1b3/1b7}) on several NLP benchmark datasets and popular leaderboards. We make the foll… ▽ More

    Submitted 14 March, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

  9. arXiv:2206.02696  [pdf, other

    cs.CL

    Learning to Ask Like a Physician

    Authors: Eric Lehman, Vladislav Lialin, Katelyn Y. Legaspi, Anne Janelle R. Sy, Patricia Therese S. Pile, Nicole Rose I. Alberto, Richard Raymund R. Ragasa, Corinna Victoria M. Puyat, Isabelle Rose I. Alberto, Pia Gabrielle I. Alfonso, Marianne Taliño, Dana Moukheiber, Byron C. Wallace, Anna Rumshisky, Jenifer J. Liang, Preethi Raghavan, Leo Anthony Celi, Peter Szolovits

    Abstract: Existing question answering (QA) datasets derived from electronic health records (EHR) are artificially generated and consequently fail to capture realistic physician information needs. We present Discharge Summary Clinical Questions (DiSCQ), a newly curated question dataset composed of 2,000+ questions paired with the snippets of text (triggers) that prompted each question. The questions are gene… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  10. arXiv:2106.07059  [pdf, other

    cs.DC cs.DS

    Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints

    Authors: Lucas Perotin, Hongyang Sun, Padma Raghavan

    Abstract: The scheduling literature has traditionally focused on a single type of resource (e.g., computing nodes). However, scientific applications in modern High-Performance Computing (HPC) systems process large amounts of data, hence have diverse requirements on different types of resources (e.g., cores, cache, memory, I/O). All of these resources could potentially be exploited by the runtime scheduler t… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.

  11. arXiv:2007.00271  [pdf, other

    cs.LG stat.ML

    TransINT: Embedding Implication Rules in Knowledge Graphs with Isomorphic Intersections of Linear Subspaces

    Authors: So Yeon Min, Preethi Raghavan, Peter Szolovits

    Abstract: Knowledge Graphs (KG), composed of entities and relations, provide a structured representation of knowledge. For easy access to statistical approaches on relational data, multiple methods to embed a KG into f(KG) $\in$ R^d have been introduced. We propose TransINT, a novel and interpretable KG embedding method that isomorphically preserves the implication ordering among relations in the embedding… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: Conference Paper published in the proceedings of AKBC (Automated Knowledge Base Construction) 2020 (https://openreview.net/forum?id=shkmWLRBXH)

  12. arXiv:2005.06587  [pdf, other

    cs.AI cs.CL cs.LG

    Entity-Enriched Neural Models for Clinical Question Answering

    Authors: Bhanu Pratap Singh Rawat, Wei-Hung Weng, So Yeon Min, Preethi Raghavan, Peter Szolovits

    Abstract: We explore state-of-the-art neural models for question answering on electronic medical records and improve their ability to generalize better on previously unseen (paraphrased) questions at test time. We enable this by learning to predict logical forms as an auxiliary task along with the main task of answer span detection. The predicted logical forms also serve as a rationale for the answer. Furth… ▽ More

    Submitted 19 February, 2021; v1 submitted 13 May, 2020; originally announced May 2020.

    Journal ref: BioNLP Workshop, ACL'2020

  13. arXiv:1911.03322  [pdf, other

    physics.app-ph cond-mat.mes-hall

    High temperature annealing enhanced diamond 13C hyperpolarization at room temperature

    Authors: M. Gierth, V. Krespach, A. I. Shames, P. Raghavan, E. Druga, N. Nunn, M. Torelli, R. Nirodi, S. Le, R. Zhao, A. Aguilar, X. Lv, M. Shen, C. A. Meriles, J. A. Reimer, A. Zaitsev, A. Pines, O. Shenderova, A. Ajoy

    Abstract: Methods of optical dynamic nuclear polarization (DNP) open the door to the replenishable hyperpolarization of nuclear spins, boosting their NMR/MRI signature by orders of magnitude. Nanodiamond powder rich in negatively charged Nitrogen Vacancy (NV) defect centers has recently emerged as one such promising platform, wherein 13C nuclei can be hyperpolarized through the optically pumped defects comp… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 10+4 pages

    Journal ref: Advanced Quantum Technologies 2000050 (2020)

  14. arXiv:1907.09146  [pdf, other

    cs.GR cs.HC

    Motion Browser: Visualizing and Understanding Complex Upper Limb Movement Under Obstetrical Brachial Plexus Injuries

    Authors: Gromit Yeuk-Yin Chan, Luis Gustavo Nonato, Alice Chu, Preeti Raghavan, Viswanath Aluru, Claudio T. Silva

    Abstract: The brachial plexus is a complex network of peripheral nerves that enables sensing from and control of the movements of the arms and hand. Nowadays, the coordination between the muscles to generate simple movements is still not well understood, hindering the knowledge of how to best treat patients with this type of peripheral nerve injury. To acquire enough information for medical data analysis, p… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: IEEE Transactions on Visualization and Computer Graphics (VAST 2019, to appear)

  15. arXiv:1902.06204  [pdf, other

    quant-ph cond-mat.mes-hall

    Hyperpolarized relaxometry based nuclear T1 noise spectroscopy in hybrid diamond quantum registers

    Authors: Ashok Ajoy, Ben Safvati, Raffi Nazaryan, J. T. Oon, Ben Han, Priyanka Raghavan, Ruhee Nirodi, Alessandra Aguilar, Kristina Liu, Xiao Cai, Xudong Lv, Emanuel Druga, Chandrasekhar Ramanathan, Jeffrey A. Reimer, Carlos A. Meriles, Dieter Suter, Alexander Pines

    Abstract: Understanding the origins of spin lifetimes in hybrid quantum systems is a matter of current importance in several areas of quantum information and sensing. Methods that spectrally map spin relaxation processes provide insight into their origin and can motivate methods to mitigate them. In this paper, using a combination of hyperpolarization and precision field cycling over a wide range (1mT-7T),… ▽ More

    Submitted 16 February, 2019; originally announced February 2019.

    Comments: Contains supplementary info

  16. arXiv:1809.00732  [pdf, other

    cs.CL

    emrQA: A Large Corpus for Question Answering on Electronic Medical Records

    Authors: Anusri Pampari, Preethi Raghavan, Jennifer Liang, Jian Peng

    Abstract: We propose a novel methodology to generate domain-specific large-scale question answering (QA) datasets by re-purposing existing annotations for other NLP tasks. We demonstrate an instance of this methodology in generating a large-scale QA dataset for electronic medical records by leveraging existing expert annotations on clinical notes for various NLP tasks from the community shared i2b2 datasets… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: Accepted at Conference on Empirical Methods in Natural Language Processing (EMNLP) 2018

  17. arXiv:1806.09812  [pdf, other

    physics.app-ph physics.atom-ph quant-ph

    Orientation independent room-temperature optical 13C hyperpolarization in powdered diamond

    Authors: A. Ajoy, K. Liu, R. Nazaryan, X. Lv, P. R. Zangara, B. Safvati, G. Wang, D. Arnold, G. Li, A. Lin, P. Raghavan, E. Druga, S. Dhomkar, D. Pagliero, J. A. Reimer, D. Suter, C. A. Meriles, A. Pines

    Abstract: Dynamic nuclear polarization via contact with electronic spins has emerged as an attractive route to enhance the sensitivity of nuclear magnetic resonance (NMR) beyond the traditional limits imposed by magnetic field strength and temperature. Among the various alternative implementations, the use of nitrogen vacancy (NV) centers in diamond - a paramagnetic point defect whose spin can be optically… ▽ More

    Submitted 26 June, 2018; originally announced June 2018.

    Comments: Contains supplementary info

    Journal ref: Science Advances 18 May 2018: Vol. 4, no. 5, eaar5492

  18. arXiv:1805.06816  [pdf

    cs.CL cs.CY

    Annotating Electronic Medical Records for Question Answering

    Authors: Preethi Raghavan, Siddharth Patwardhan, Jennifer J. Liang, Murthy V. Devarakonda

    Abstract: Our research is in the relatively unexplored area of question answering technologies for patient-specific questions over their electronic health records. A large dataset of human expert curated question and answer pairs is an important pre-requisite for develo**, training and evaluating any question answering system that is powered by machine learning. In this paper, we describe a process for cr… ▽ More

    Submitted 17 May, 2018; originally announced May 2018.

    Comments: 10 pages, 2016

  19. arXiv:1612.02170  [pdf, other

    cs.ET cond-mat.mes-hall

    Non-volatile spin wave majority gate at the nanoscale

    Authors: Odysseas Zografos, Sourav Dutta, Mauricio Manfrini, Adrien Vaysset, Bart Sorée, Azad Naeemi, Praveen Raghavan, Rudy Lauwereins, Iuliana P. Radu

    Abstract: A spin wave majority fork-like structure with feature size of 40\,nm, is presented and investigated, through micromagnetic simulations. The structure consists of three merging out-of-plane magnetization spin wave buses and four magneto-electric cells serving as three inputs and an output. The information of the logic signals is encoded in the phase of the transmitted spin waves and subsequently st… ▽ More

    Submitted 7 December, 2016; originally announced December 2016.

    Journal ref: AIP Advances, Volume 7, Issue 5, 2017

  20. arXiv:1610.02608  [pdf, other

    cs.CE math.HO stat.OT

    Research and Education in Computational Science and Engineering

    Authors: Ulrich Rüde, Karen Willcox, Lois Curfman McInnes, Hans De Sterck, George Biros, Hans Bungartz, James Corones, Evin Cramer, James Crowley, Omar Ghattas, Max Gunzburger, Michael Hanke, Robert Harrison, Michael Heroux, Jan Hesthaven, Peter Jimack, Chris Johnson, Kirk E. Jordan, David E. Keyes, Rolf Krause, Vipin Kumar, Stefan Mayer, Juan Meza, Knut Martin Mørken, J. Tinsley Oden , et al. (8 additional authors not shown)

    Abstract: Over the past two decades the field of computational science and engineering (CSE) has penetrated both basic and applied research in academia, industry, and laboratories to advance discovery, optimize systems, support decision-makers, and educate the scientific and engineering workforce. Informed by centuries of theory and experiment, CSE performs computational experiments to answer questions that… ▽ More

    Submitted 31 December, 2017; v1 submitted 8 October, 2016; originally announced October 2016.

    Comments: Major revision, to appear in SIAM Review

    Report number: Argonne National Laboratory Preprint ANL/MCS-P6054-0916 MSC Class: 00A72; 62-07; 68U20; 68W01; 68W10; 97A99; 97M10; 97N80; 97R20; 97R30 ACM Class: G.0; G.4; I.6; J.0; J.2; J.3; J.4; J.6; J.7; K.3.2

  21. arXiv:1607.04263  [pdf, other

    cs.SI physics.soc-ph

    The Limits of Popularity-Based Recommendations, and the Role of Social Ties

    Authors: Marco Bressan, Stefano Leucci, Alessandro Panconesi, Prabhakar Raghavan, Erisa Terolli

    Abstract: In this paper we introduce a mathematical model that captures some of the salient features of recommender systems that are based on popularity and that try to exploit social ties among the users. We show that, under very general conditions, the market always converges to a steady state, for which we are able to give an explicit form. Thanks to this we can tell rather precisely how much a market is… ▽ More

    Submitted 14 July, 2016; originally announced July 2016.

    Comments: 10 pages, 9 figures, KDD 2016

  22. arXiv:1606.02638  [pdf, other

    cs.CL

    Addressing Limited Data for Textual Entailment Across Domains

    Authors: Chaitanya Shivade, Preethi Raghavan, Siddharth Patwardhan

    Abstract: We seek to address the lack of labeled data (and high cost of annotation) for textual entailment in some domains. To that end, we first create (for experimental purposes) an entailment dataset for the clinical domain, and a highly competitive supervised entailment system, ENT, that is effective (out of the box) on two domains. We then explore self-training and active learning strategies to address… ▽ More

    Submitted 8 June, 2016; originally announced June 2016.

  23. arXiv:1606.00803  [pdf, other

    math.NA

    Locality-Aware Laplacian Mesh Smoothing

    Authors: Guillaume Aupy, JeongHyung Park, Padma Raghavan

    Abstract: In this paper, we propose a novel reordering scheme to improve the performance of a Laplacian Mesh Smoothing (LMS). While the Laplacian smoothing algorithm is well optimized and studied, we show how a simple reordering of the vertices of the mesh can greatly improve the execution time of the smoothing algorithm. The idea of our reordering is based on (i) the postulate that cache misses are a very… ▽ More

    Submitted 2 June, 2016; originally announced June 2016.

    Comments: Accepted to ICPP'16

  24. arXiv:1602.03855  [pdf, other

    stat.AP

    A Statistical Framework for Single Subject Design with an Application in Post-stroke Rehabilitation

    Authors: Ying Lu, Marc Scott, Preeti Raghavan

    Abstract: This paper proposes a practical yet novel solution to a longstanding statistical testing problem regarding single subject design. In particular, we aim to resolve an important clinical question: does a new patient behave the same as one from a healthy population? This question cannot be answered using the traditional single subject design when only test subject information is used, nor can it be s… ▽ More

    Submitted 11 February, 2016; originally announced February 2016.

    Comments: 31 pages, 3 figures, 2 tables

    MSC Class: 62C05

  25. arXiv:1502.04049  [pdf

    cs.CY cs.AI cs.CL

    How essential are unstructured clinical narratives and information fusion to clinical trial recruitment?

    Authors: Preethi Raghavan, James L. Chen, Eric Fosler-Lussier, Albert M. Lai

    Abstract: Electronic health records capture patient information using structured controlled vocabularies and unstructured narrative text. While structured data typically encodes lab values, encounters and medication lists, unstructured data captures the physician's interpretation of the patient's condition, prognosis, and response to therapeutic intervention. In this paper, we demonstrate that information e… ▽ More

    Submitted 13 February, 2015; originally announced February 2015.

    Comments: AMIA TBI 2014, 6 pages

  26. arXiv:1304.7793  [pdf, other

    cs.DS cs.DC

    Co-Scheduling Algorithms for High-Throughput Workload Execution

    Authors: Guillaume Aupy, Manu Shantharam, Anne Benoit, Yves Robert, Padma Raghavan

    Abstract: This paper investigates co-scheduling algorithms for processing a set of parallel applications. Instead of executing each application one by one, using a maximum degree of parallelism for each of them, we aim at scheduling several applications concurrently. We partition the original application set into a series of packs, which are executed one by one. A pack comprises several applications, each o… ▽ More

    Submitted 29 April, 2013; originally announced April 2013.

    Report number: INRIA RR-8293

  27. arXiv:math/9409223  [pdf, ps

    math.CO cs.CC

    On the minimum latency problem

    Authors: Avrim Blum, Prasad Chalasani, Don Coppersmith, Bill Pulleyblank, Prabhakar Raghavan, Madhu Sudan

    Abstract: We are given a set of points $p_1,\ldots , p_n$ and a symmetric distance matrix $(d_{ij})$ giving the distance between $p_i$ and $p_j$. We wish to construct a tour that minimizes $\sum_{i=1}^n \ell(i)$, where $\ell(i)$ is the {\em latency} of $p_i$, defined to be the distance traveled before first visiting $p_i$. This problem is also known in the literature as the {\em deliveryman problem} or th… ▽ More

    Submitted 20 September, 1994; originally announced September 1994.

    Comments: 9 pages

    Report number: LACES 68Q-94-18 MSC Class: 68Q25