Skip to main content

Showing 1–21 of 21 results for author: Ramaswamy, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.07954  [pdf

    cs.IR cs.CL cs.LG

    An efficient domain-independent approach for supervised keyphrase extraction and ranking

    Authors: Sriraghavendra Ramaswamy

    Abstract: We present a supervised learning approach for automatic extraction of keyphrases from single documents. Our solution uses simple to compute statistical and positional features of candidate phrases and does not rely on any external knowledge base or on pre-trained language models or word embeddings. The ranking component of our proposed solution is a fairly lightweight ensemble model. Evaluation on… ▽ More

    Submitted 24 March, 2024; originally announced April 2024.

    ACM Class: H.3.1; I.2.7

  2. arXiv:2207.00706  [pdf, other

    eess.AS cs.CL cs.LG

    UserLibri: A Dataset for ASR Personalization Using Only Text

    Authors: Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey

    Abstract: Personalization of speech models on mobile devices (on-device personalization) is an active area of research, but more often than not, mobile devices have more text-only data than paired audio-text data. We explore training a personalized language model on text-only data, used during inference to improve speech recognition performance for that user. We experiment on a user-clustered LibriSpeech co… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted for publication in Interspeech 2022. 9 total pages with appendix, 9 total tables, 5 total figures

  3. arXiv:2112.00193  [pdf, other

    cs.LG cs.CR

    Public Data-Assisted Mirror Descent for Private Model Training

    Authors: Ehsan Amid, Arun Ganesh, Rajiv Mathews, Swaroop Ramaswamy, Shuang Song, Thomas Steinke, Vinith M. Suriyakumar, Om Thakkar, Abhradeep Thakurta

    Abstract: In this paper, we revisit the problem of using in-distribution public data to improve the privacy/utility trade-offs for differentially private (DP) model training. (Here, public data refers to auxiliary data sets that have no privacy concerns.) We design a natural variant of DP mirror descent, where the DP gradients of the private/sensitive data act as the linear term, and the loss generated by t… ▽ More

    Submitted 27 March, 2022; v1 submitted 30 November, 2021; originally announced December 2021.

    Comments: 20 pages, 8 figures, 3 tables

  4. arXiv:2111.00556  [pdf, other

    cs.LG cs.CL cs.CR

    Revealing and Protecting Labels in Distributed Training

    Authors: Trung Dang, Om Thakkar, Swaroop Ramaswamy, Rajiv Mathews, Peter Chin, Françoise Beaufays

    Abstract: Distributed learning paradigms such as federated learning often involve transmission of model updates, or gradients, over a network, thereby avoiding transmission of private data. However, it is possible for sensitive information about the training data to be revealed from such gradients. Prior works have demonstrated that labels can be revealed analytically from the last layer of certain models (… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

  5. arXiv:2105.05956  [pdf

    cs.ET cond-mat.dis-nn cond-mat.mtrl-sci

    2022 Roadmap on Neuromorphic Computing and Engineering

    Authors: Dennis V. Christensen, Regina Dittmann, Bernabé Linares-Barranco, Abu Sebastian, Manuel Le Gallo, Andrea Redaelli, Stefan Slesazeck, Thomas Mikolajick, Sabina Spiga, Stephan Menzel, Ilia Valov, Gianluca Milano, Carlo Ricciardi, Shi-Jun Liang, Feng Miao, Mario Lanza, Tyler J. Quill, Scott T. Keene, Alberto Salleo, Julie Grollier, Danijela Marković, Alice Mizrahi, Peng Yao, J. Joshua Yang, Giacomo Indiveri , et al. (34 additional authors not shown)

    Abstract: Modern computation based on the von Neumann architecture is today a mature cutting-edge science. In the Von Neumann architecture, processing and memory units are implemented as separate blocks interchanging data intensively and continuously. This data transfer is responsible for a large part of the power consumption. The next generation computer technology is expected to solve problems at the exas… ▽ More

    Submitted 13 January, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Journal ref: Neuromorph. Comput. Eng. 2 022501 (2022)

  6. arXiv:2104.07815  [pdf, other

    cs.CL cs.CR cs.LG

    A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It

    Authors: Trung Dang, Om Thakkar, Swaroop Ramaswamy, Rajiv Mathews, Peter Chin, Françoise Beaufays

    Abstract: End-to-end Automatic Speech Recognition (ASR) models are commonly trained over spoken utterances using optimization methods like Stochastic Gradient Descent (SGD). In distributed settings like Federated Learning, model training requires transmission of gradients over a network. In this work, we design the first method for revealing the identity of the speaker of a training utterance with access on… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  7. arXiv:2009.10031  [pdf, other

    cs.LG cs.CR stat.ML

    Training Production Language Models without Memorizing User Data

    Authors: Swaroop Ramaswamy, Om Thakkar, Rajiv Mathews, Galen Andrew, H. Brendan McMahan, Françoise Beaufays

    Abstract: This paper presents the first consumer-scale next-word prediction (NWP) model trained with Federated Learning (FL) while leveraging the Differentially Private Federated Averaging (DP-FedAvg) technique. There has been prior work on building practical FL infrastructure, including work demonstrating the feasibility of training language models on mobile devices using such infrastructure. It has also b… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

  8. arXiv:2007.09186  [pdf, other

    cs.IR

    AWS CORD-19 Search: A Neural Search Engine for COVID-19 Literature

    Authors: Parminder Bhatia, Lan Liu, Kristjan Arumae, Nima Pourdamghani, Suyog Deshpande, Ben Snively, Mona Mona, Colby Wise, George Price, Shyam Ramaswamy, Xiaofei Ma, Ramesh Nallapati, Zhiheng Huang, Bing Xiang, Taha Kass-Hout

    Abstract: Coronavirus disease (COVID-19) has been declared as a pandemic by WHO with thousands of cases being reported each day. Numerous scientific articles are being published on the disease raising the need for a service which can organize, and query them in a reliable fashion. To support this cause we present AWS CORD-19 Search (ACS), a public, COVID-19 specific, neural search engine that is powered by… ▽ More

    Submitted 7 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

  9. arXiv:2006.07490  [pdf, other

    cs.LG cs.CL stat.ML

    Understanding Unintended Memorization in Federated Learning

    Authors: Om Thakkar, Swaroop Ramaswamy, Rajiv Mathews, Françoise Beaufays

    Abstract: Recent works have shown that generative sequence models (e.g., language models) have a tendency to memorize rare or unique sequences in the training data. Since useful models are often trained on sensitive data, to ensure the privacy of the training data it is critical to identify and mitigate such unintended memorization. Federated Learning (FL) has emerged as a novel framework for large-scale di… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  10. arXiv:1911.06679  [pdf, other

    cs.LG stat.ML

    Generative Models for Effective ML on Private, Decentralized Datasets

    Authors: Sean Augenstein, H. Brendan McMahan, Daniel Ramage, Swaroop Ramaswamy, Peter Kairouz, Mingqing Chen, Rajiv Mathews, Blaise Aguera y Arcas

    Abstract: To improve real-world applications of machine learning, experienced modelers develop intuition about their datasets, their models, and how the two interact. Manual inspection of raw data - of representative samples, of outliers, of misclassifications - is an essential tool in a) identifying and fixing problems in the data, b) generating new modeling hypotheses, and c) assigning or refining human-p… ▽ More

    Submitted 4 February, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: 26 pages, 8 figures. Camera-ready ICLR 2020 version

  11. arXiv:1909.06182  [pdf, other

    cs.DB

    DBPal: Weak Supervision for Learning a Natural Language Interface to Databases

    Authors: Nathaniel Weir, Andrew Crotty, Alex Galakatos, Amir Ilkhechi, Shekar Ramaswamy, Rohin Bhushan, Ugur Cetintemel, Prasetya Utama, Nadja Geisler, Benjamin Hättasch, Steffen Eger, Carsten Binnig

    Abstract: This paper describes DBPal, a new system to translate natural language utterances into SQL statements using a neural machine translation model. While other recent approaches use neural machine translation to implement a Natural Language Interface to Databases (NLIDB), existing techniques rely on supervised learning with manually curated training data, which results in substantial overhead for supp… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: arXiv admin note: text overlap with arXiv:1804.00401

  12. arXiv:1906.04329  [pdf, other

    cs.CL cs.LG

    Federated Learning for Emoji Prediction in a Mobile Keyboard

    Authors: Swaroop Ramaswamy, Rajiv Mathews, Kanishka Rao, Françoise Beaufays

    Abstract: We show that a word-level recurrent neural network can predict emoji from text typed on a mobile keyboard. We demonstrate the usefulness of transfer learning for predicting emoji by pretraining the model using a language modeling task. We also propose mechanisms to trigger emoji and tune the diversity of candidates. The model is trained using a distributed on-device learning framework called feder… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

  13. arXiv:1905.03871  [pdf, other

    cs.LG stat.ML

    Differentially Private Learning with Adaptive Clip**

    Authors: Galen Andrew, Om Thakkar, H. Brendan McMahan, Swaroop Ramaswamy

    Abstract: Existing approaches for training neural networks with user-level differential privacy (e.g., DP Federated Averaging) in federated learning (FL) settings involve bounding the contribution of each user's model update by clip** it to some constant value. However there is no good a priori setting of the clip** norm across tasks and learning settings: the update norm distribution depends on the mod… ▽ More

    Submitted 9 May, 2022; v1 submitted 9 May, 2019; originally announced May 2019.

    Comments: Accepted to NeurIPS, 2021

  14. arXiv:1901.01342  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection

    Authors: Joseph Roth, Sourish Chaudhuri, Ondrej Klejch, Radhika Marvin, Andrew Gallagher, Liat Kaver, Sharadh Ramaswamy, Arkadiusz Stopczynski, Cordelia Schmid, Zhonghua Xi, Caroline Pantofaru

    Abstract: Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings, speech enhancement, and human-robot interaction. The absence of a large, carefully labeled audio-visual dataset for this task has constrained algorithm evaluations with respect to data diversity, environments, and accuracy. This has made com… ▽ More

    Submitted 24 May, 2019; v1 submitted 4 January, 2019; originally announced January 2019.

  15. arXiv:1811.03604  [pdf, other

    cs.CL

    Federated Learning for Mobile Keyboard Prediction

    Authors: Andrew Hard, Kanishka Rao, Rajiv Mathews, Swaroop Ramaswamy, Françoise Beaufays, Sean Augenstein, Hubert Eichner, Chloé Kiddon, Daniel Ramage

    Abstract: We train a recurrent neural network language model using a distributed, on-device learning framework called federated learning for the purpose of next-word prediction in a virtual keyboard for smartphones. Server-based training using stochastic gradient descent is compared with training on client devices using the Federated Averaging algorithm. The federated algorithm, which enables training on a… ▽ More

    Submitted 28 February, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

    Comments: 7 pages, 4 figures

  16. arXiv:1804.00401  [pdf, other

    cs.DB cs.CL cs.HC

    An End-to-end Neural Natural Language Interface for Databases

    Authors: Prasetya Utama, Nathaniel Weir, Fuat Basik, Carsten Binnig, Ugur Cetintemel, Benjamin Hättasch, Amir Ilkhechi, Shekar Ramaswamy, Arif Usta

    Abstract: The ability to extract insights from new data sets is critical for decision making. Visual interactive tools play an important role in data exploration since they provide non-technical users with an effective way to visually compose queries and comprehend the results. Natural language has recently gained traction as an alternative query interface to databases with the potential to enable non-exper… ▽ More

    Submitted 2 April, 2018; originally announced April 2018.

  17. Analysis of Software Binaries for Reengineering-Driven Product Line ArchitectureâAn Industrial Case Study

    Authors: Ian D. Peake, Jan Olaf Blech, Lasith Fernando, Divyasheel Sharma, Srini Ramaswamy, Mallikarjun Kande

    Abstract: This paper describes a method for the recovering of software architectures from a set of similar (but unrelated) software products in binary form. One intention is to drive refactoring into software product lines and combine architecture recovery with run time binary analysis and existing clustering methods. Using our runtime binary analysis, we create graphs that capture the dependencies betwee… ▽ More

    Submitted 14 April, 2015; originally announced April 2015.

    Comments: In Proceedings FMSPLE 2015, arXiv:1504.03014

    ACM Class: D.2.5; D.2.m

    Journal ref: EPTCS 182, 2015, pp. 71-82

  18. Towards a Consistent, Sound and Complete Conceptual Knowledge

    Authors: Gowri Shankar Ramaswamy, F Sagayaraj Francis

    Abstract: Knowledge is only good if it is sound, consistent and complete. The same holds true for conceptual knowledge, which holds knowledge about concepts and its association. Conceptual knowledge no matter what format they are represented in, must be consistent, sound and complete in order to realise its practical use. This paper discusses consistency, soundness and completeness in the ambit of conceptua… ▽ More

    Submitted 24 November, 2014; originally announced December 2014.

    Journal ref: International Journal of Computer Trends and Technology (IJCTT) V17(2):61-63, Nov 2014

  19. arXiv:1301.0957  [pdf, other

    cs.IT

    On Large Scale Distributed Compression and Dispersive Information Routing for Networks

    Authors: Kumar Viswanatha, Sharadh Ramaswamy, Ankur Saxena, Emrah Akyol, Kenneth Rose

    Abstract: This paper considers the problem of distributed source coding for a large network. A major obstacle that poses an existential threat to practical deployment of conventional approaches to distributed coding is the exponential growth of the decoder complexity with the number of sources and the encoding rates. This growth in complexity renders many traditional approaches impractical even for moderate… ▽ More

    Submitted 5 January, 2013; originally announced January 2013.

  20. arXiv:1208.4016  [pdf

    cs.DB

    Concept driven framework for Latent Table Discovery

    Authors: Gowri Shankar Ramaswamy, F Sagayaraj Francis

    Abstract: Database systems have to cater to the growing demands of the information age. The growth of the new age information retrieval powerhouses like search engines has thrown a challenge to the data management community to come up with novel mechanisms for feeding information to end users. The burgeoning use of natural language query interfaces compels system designers to present meaningful and customis… ▽ More

    Submitted 20 August, 2012; originally announced August 2012.

    Journal ref: JOURNAL OF COMPUTING, VOLUME 4, ISSUE 7, JULY 2012, ISSN (Online) 2151-9617

  21. arXiv:1104.1311  [pdf

    cs.DB

    Latent table discovery by semantic relationship extraction between unrelated sets of entity sets of structured data sources

    Authors: Gowri Shankar Ramaswamy, F Sagayaraj Francis

    Abstract: Querying is one of the basic functionality expected from a database system. Query efficiency is adversely affected by increase in the number of participating tables. Also, querying based on syntax largely limits the gamut of queries a database system can process. Syntactic queries rely on the database table structure, which is a cause of concern for large organisations due to incompatibility betwe… ▽ More

    Submitted 7 April, 2011; originally announced April 2011.

    Journal ref: IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 2, March 2011