Skip to main content

Showing 1–16 of 16 results for author: Sridhar, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.16234  [pdf, other

    cs.CR

    DAEDALUS: Defense Against Firmware ROP Exploits Using Stochastic Software Diversity

    Authors: Islam Obaidat, Meera Sridhar, Fatemeh Tavakoli

    Abstract: This paper presents DAEDALUS, a software diversity-based framework designed to resist ROP attacks on Linux-based IoT devices. DAEDALUS generates unique, semantically equivalent but syntactically different rewrites of IoT firmware, disrupting large-scale replication of ROP attacks. DAEDALUS employs STOKE, a stochastic optimizer for x86 binaries, as its core diversity engine but introduces significa… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  2. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2301.09809  [pdf, other

    cs.CL

    Low-Resource Compositional Semantic Parsing with Concept Pretraining

    Authors: Subendhu Rongali, Mukund Sridhar, Haidar Khan, Konstantine Arkoudas, Wael Hamza, Andrew McCallum

    Abstract: Semantic parsing plays a key role in digital voice assistants such as Alexa, Siri, and Google Assistant by map** natural language to structured meaning representations. When we want to improve the capabilities of a voice assistant by adding a new domain, the underlying semantic parsing model needs to be retrained using thousands of annotated examples from the new domain, which is time-consuming… ▽ More

    Submitted 30 January, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: EACL 2023

  4. arXiv:2208.01448  [pdf, other

    cs.CL cs.LG

    AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model

    Authors: Saleh Soltan, Shankar Ananthakrishnan, Jack FitzGerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan

    Abstract: In this work, we demonstrate that multilingual large-scale sequence-to-sequence (seq2seq) models, pre-trained on a mixture of denoising and Causal Language Modeling (CLM) tasks, are more efficient few-shot learners than decoder-only models on various tasks. In particular, we train a 20 billion parameter multilingual seq2seq model called Alexa Teacher Model (AlexaTM 20B) and show that it achieves s… ▽ More

    Submitted 3 August, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

  5. arXiv:2206.07808  [pdf, other

    cs.CL cs.AI cs.LG

    Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems

    Authors: Jack FitzGerald, Shankar Ananthakrishnan, Konstantine Arkoudas, Davide Bernardi, Abhishek Bhagia, Claudio Delli Bovi, ** Cao, Rakesh Chada, Amit Chauhan, Luoxin Chen, Anurag Dwarakanath, Satyam Dwivedi, Turan Gojayev, Karthik Gopalakrishnan, Thomas Gueudre, Dilek Hakkani-Tur, Wael Hamza, Jonathan Hueser, Kevin Martin Jose, Haidar Khan, Beiye Liu, Jianhua Lu, Alessandro Manzotti, Pradeep Natarajan, Karolina Owczarzak , et al. (16 additional authors not shown)

    Abstract: We present results from a large-scale experiment on pretraining encoders with non-embedding parameter counts ranging from 700M to 9.3B, their subsequent distillation into smaller models ranging from 17M-170M parameters, and their application to the Natural Language Understanding (NLU) component of a virtual assistant system. Though we train using 70% spoken-form data, our teacher models perform co… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: KDD 2022

    ACM Class: I.2.7

    Journal ref: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

  6. arXiv:2204.13796  [pdf, other

    cs.CL cs.AI

    Instilling Type Knowledge in Language Models via Multi-Task QA

    Authors: Shuyang Li, Mukund Sridhar, Chandana Satya Prakash, ** Cao, Wael Hamza, Julian McAuley

    Abstract: Understanding human language often necessitates understanding entities and their place in a taxonomy of knowledge -- their types. Previous methods to learn entity types rely on training classifiers on datasets with coarse, noisy, and incomplete labels. We introduce a method to instill fine-grained type knowledge in language models with text-to-text pre-training on type-centric questions leveraging… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: Findings of NAACL 2022; dataset link: https://github.com/amazon-research/wikiwiki-dataset

  7. arXiv:2110.14694  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Realistic Single-Task Continuous Learning Research for NER

    Authors: Justin Payan, Yuval Merhav, He Xie, Satyapriya Krishna, Anil Ramakrishna, Mukund Sridhar, Rahul Gupta

    Abstract: There is an increasing interest in continuous learning (CL), as data privacy is becoming a priority for real-world machine learning applications. Meanwhile, there is still a lack of academic NLP benchmarks that are applicable for realistic CL settings, which is a major challenge for the advancement of the field. In this paper we discuss some of the unrealistic data characteristics of public datase… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: 11 pages, 2 figures, Findings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP) (short paper), November 2021

  8. arXiv:2101.08333  [pdf, other

    cs.CL

    Zero-shot Generalization in Dialog State Tracking through Generative Question Answering

    Authors: Shuyang Li, ** Cao, Mukund Sridhar, Henghui Zhu, Shang-Wen Li, Wael Hamza, Julian McAuley

    Abstract: Dialog State Tracking (DST), an integral part of modern dialog systems, aims to track user preferences and constraints (slots) in task-oriented dialogs. In real-world settings with constantly changing services, DST systems must generalize to new domains and unseen slot types. Existing methods for DST do not generalize well to new slot names and many require known ontologies of slot types and value… ▽ More

    Submitted 20 January, 2021; originally announced January 2021.

    Comments: Accepted as a Long Paper at EACL 2021

  9. arXiv:2006.01208  [pdf, other

    cs.CL cs.IR cs.SD eess.AS

    Automatic Discovery of Novel Intents & Domains from Text Utterances

    Authors: Nikhita Vedula, Rahul Gupta, Aman Alok, Mukund Sridhar

    Abstract: One of the primary tasks in Natural Language Understanding (NLU) is to recognize the intents as well as domains of users' spoken and written language utterances. Most existing research formulates this as a supervised classification problem with a closed-world assumption, i.e. the domains or intents to be identified are pre-defined or known beforehand. Real-world applications however increasingly e… ▽ More

    Submitted 21 May, 2020; originally announced June 2020.

    Comments: 11 pages, 5 figures

  10. arXiv:1811.08811  [pdf, other

    cs.DS

    $k$-Cut: A Simple Approximately-Uniform Method for Sampling Ballots in Post-Election Audits

    Authors: Mayuri Sridhar, Ronald L. Rivest

    Abstract: We present an approximate sampling framework and discuss how risk-limiting audits can compensate for these approximations, while maintaining their "risk-limiting" properties. Our framework is general and can compensate for counting mistakes made during audits. Moreover, we present and analyze a simple approximate sampling method,"$k$-cut", for picking a ballot randomly from a stack, without coun… ▽ More

    Submitted 2 January, 2019; v1 submitted 21 November, 2018; originally announced November 2018.

  11. Software Reuse in Cardiology Related Medical Database Using K-Means Clustering Technique

    Authors: M. Bhanu Sridhar, Y. Srinivas, M. H. M. Krishna Prasad

    Abstract: Software technology based on reuse is identified as a process of designing software for the reuse purpose. The software reuse is a process in which the existing software is used to build new software. A metric is a quantitative indicator of an attribute of an item or thing. Reusability is the likelihood for a segment of source code that can be used again to add new functionalities with slight or n… ▽ More

    Submitted 5 November, 2013; originally announced November 2013.

    Comments: 5 pages. arXiv admin note: text overlap with arXiv:1212.0312

    Journal ref: Journal of Software Engineering and Applications, 2012, 5, 682-686

  12. Software Reuse in Medical Database for Cardiac Patients using Pearson Family Equations

    Authors: M. Bhanu Sridhar, Y. Srinivas, M. H. M. Krishna Prasad

    Abstract: Software reuse is a subfield of software engineering that is used to adopt the existing software for similar purposes. Reuse Metrics determine the extent to which an existing software component is reused in new software with an objective to minimize the errors and cost of the new project. In this paper, medical database related to cardiology is considered. The Pearson Type I Distribution is used t… ▽ More

    Submitted 3 December, 2012; originally announced December 2012.

    Comments: (0975 8887)

    Journal ref: International Journal of Computer Applications Volume 58, No.14, 2012, 12-19

  13. arXiv:1208.2121  [pdf, other

    cs.IT

    On the Sum Rate of a 2 x 2 Interference Network

    Authors: Murali Sridhar, Srikrishna Bhashyam

    Abstract: In an M x N interference network, there are M transmitters and N receivers with each transmitter having independent messages for each of the 2^N -1 possible non-empty subsets of the receivers. We consider the 2 x 2 interference network with 6 possible messages, of which the 2 x 2 interference channel and X channel are special cases obtained by using only 2 and 4 messages respectively. Starting fro… ▽ More

    Submitted 23 December, 2016; v1 submitted 10 August, 2012; originally announced August 2012.

    Comments: Theorem 4 in previous version removed. See https://arxiv.longhoe.net/abs/1505.06317 for updated results

  14. arXiv:1005.4290  [pdf

    cs.OH

    E-Speed Governors For Public Transport Vehicles

    Authors: C. S. Sridhar, R. ShashiKumar, S. Madhava Kumar, Manjula Sridhar, Varun. D

    Abstract: An accident is unexpected, unusual, unintended and identifiable external event which occurs at any place and at any time. The major concern faced by the government and traffic officials is over speeding at limited speed zones like hospitals, schools or residential places leading to causalities and more deaths on the roads. Hence the speed of the vehicles is to be regulated and confined to the limi… ▽ More

    Submitted 24 May, 2010; originally announced May 2010.

    Comments: IEEE Publication format, International Journal of Computer Science and Information Security, IJCSIS, Vol. 8 No. 1, April 2010, USA. ISSN 1947 5500, http://sites.google.com/site/ijcsis/

  15. arXiv:q-bio/0611054  [pdf

    q-bio.QM cs.DC

    Grid enabled virtual screening against malaria

    Authors: N. Jacq, J. Salzemann, F. Jacq, Y. Legré, E. Medernach, J. Montagnat, A. Maass, M. Reichstadt, H. Schwichtenberg, M. Sridhar, V. Kasam, M. Zimmermann, M. Hofmann, V. Breton

    Abstract: WISDOM is an international initiative to enable a virtual screening pipeline on a grid infrastructure. Its first attempt was to deploy large scale in silico docking on a public grid infrastructure. Protein-ligand docking is about computing the binding energy of a protein target to a library of potential drugs using a scoring algorithm. Previous deployments were either limited to one cluster, to… ▽ More

    Submitted 17 November, 2006; originally announced November 2006.

    Comments: 34 pages, 5 figures, 3 tables, to appear in Journal of Grid Computing

    Journal ref: Journal of Grid Computing 6 (2008) 29-43

  16. arXiv:cs/0611084  [pdf

    cs.DC q-bio.QM

    Large Scale In Silico Screening on Grid Infrastructures

    Authors: N. Jacq, V. Breton, H. -Y. Chen, L. -Y. Ho, M. Hofmann, H. -C. Lee, Y. Legré, S. -C. Lin, A. Maass, E. Medernach, I. Merelli, L. Milanesi, G. Rastelli, M. Reichstadt, J. Salzemann, H. Schwichtenberg, M. Sridhar, V. Kasam, Y. -T. Wu, M. Zimmermann

    Abstract: Large-scale grid infrastructures for in silico drug discovery open opportunities of particular interest to neglected and emerging diseases. In 2005 and 2006, we have been able to deploy large scale in silico docking within the framework of the WISDOM initiative against Malaria and Avian Flu requiring about 105 years of CPU on the EGEE, Auvergrid and TWGrid infrastructures. These achievements dem… ▽ More

    Submitted 17 November, 2006; originally announced November 2006.

    Comments: 14 pages, 2 figures, 2 tables, The Third International Life Science Grid Workshop, LSGrid 2006, Yokohama, Japan, 13-14 october 2006, to appear in the proceedings