Skip to main content

Showing 1–27 of 27 results for author: Seshadri, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15053  [pdf, other

    cs.CL

    PARIKSHA : A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data

    Authors: Ishaan Watts, Varun Gumma, Aditya Yadavalli, Vivek Seshadri, Manohar Swaminathan, Sunayana Sitaram

    Abstract: Evaluation of multilingual Large Language Models (LLMs) is challenging due to a variety of factors -- the lack of benchmarks with sufficient linguistic diversity, contamination of popular benchmarks into LLM pre-training data and the lack of local, cultural nuances in translated benchmarks. In this work, we study human and LLM-based evaluation in a multilingual, multi-cultural setting. We evaluate… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Work in progress

  2. arXiv:2405.06346  [pdf, other

    cs.CL

    Akal Badi ya Bias: An Exploratory Study of Gender Bias in Hindi Language Technology

    Authors: Rishav Hada, Safiya Husain, Varun Gumma, Harshita Diddee, Aditya Yadavalli, Agrima Seth, Nidhi Kulkarni, Ujwal Gadiraju, Aditya Vashistha, Vivek Seshadri, Kalika Bali

    Abstract: Existing research in measuring and mitigating gender bias predominantly centers on English, overlooking the intricate challenges posed by non-English languages and the Global South. This paper presents the first comprehensive study delving into the nuanced landscape of gender bias in Hindi, the third most spoken language globally. Our study employs diverse mining techniques, computational models,… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted to FAccT 2024

  3. arXiv:2401.15579  [pdf, other

    cs.CL cs.SD eess.AS

    MunTTS: A Text-to-Speech System for Mundari

    Authors: Varun Gumma, Rishav Hada, Aditya Yadavalli, Pamir Gogoi, Ishani Mondal, Vivek Seshadri, Kalika Bali

    Abstract: We present MunTTS, an end-to-end text-to-speech (TTS) system specifically for Mundari, a low-resource Indian language of the Austo-Asiatic family. Our work addresses the gap in linguistic technology for underrepresented languages by collecting and processing data to build a speech synthesis system. We begin our study by gathering a substantial dataset of Mundari text and speech and train end-to-en… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted to ComputEL-7

  4. arXiv:2306.17674  [pdf, other

    cs.CL

    X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents

    Authors: Mehrad Moradshahi, Tianhao Shen, Kalika Bali, Monojit Choudhury, Gaƫl de Chalendar, Anmol Goel, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Nasredine Semmar, Sina J. Semnani, Jiwon Seo, Vivek Seshadri, Manish Shrivastava, Michael Sun, Aditya Yadavalli, Chaobin You, Deyi Xiong, Monica S. Lam

    Abstract: Task-oriented dialogue research has mainly focused on a few popular languages like English and Chinese, due to the high dataset creation cost for a new language. To reduce the cost, we apply manual editing to automatically translated data. We create a new multilingual benchmark, X-RiSAWOZ, by translating the Chinese RiSAWOZ to 4 languages: English, French, Hindi, Korean; and a code-mixed English-H… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL 2023 Findings

  5. arXiv:2210.16556  [pdf, other

    cs.LG cs.PL

    MinUn: Accurate ML Inference on Microcontrollers

    Authors: Shikhar Jaiswal, Rahul Kiran Kranti Goli, Aayan Kumar, Vivek Seshadri, Rahul Sharma

    Abstract: Running machine learning inference on tiny devices, known as TinyML, is an emerging research area. This task requires generating inference code that uses memory frugally, a task that standard ML frameworks are ill-suited for. A deployment framework for TinyML must be a) parametric in the number representation to take advantage of the emerging representations like posits, b) carefully assign high-p… ▽ More

    Submitted 30 November, 2022; v1 submitted 29 October, 2022; originally announced October 2022.

  6. arXiv:2206.12931  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Annotated Speech Corpus for Low Resource Indian Languages: Awadhi, Bhojpuri, Braj and Magahi

    Authors: Ritesh Kumar, Siddharth Singh, Shyam Ratan, Mohit Raj, Sonal Sinha, Bornini Lahiri, Vivek Seshadri, Kalika Bali, Atul Kr. Ojha

    Abstract: In this paper we discuss an in-progress work on the development of a speech corpus for four low-resource Indo-Aryan languages -- Awadhi, Bhojpuri, Braj and Magahi using the field methods of linguistic data collection. The total size of the corpus currently stands at approximately 18 hours (approx. 4-5 hours each language) and it is transcribed and annotated with grammatical information such as par… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: Speech for Social Good Workshop, 2022, Interspeech 2022

  7. Rightsizing Clusters for Time-Limited Tasks

    Authors: Venkatesan T. Chakaravarthy, Padmanabha V. Seshadri, Pooja Aggarwal, Anamitra R. Choudhury, Ashok Pon Kumar, Yogish Sabharwal, Amith Singhee

    Abstract: In conventional public clouds, designing a suitable initial cluster for a given application workload is important in reducing the computational foot-print during run-time. In edge or on-premise clouds, cold-start rightsizing the cluster at the time of installation is crucial in avoiding the recurrent capital expenditure. In both these cases, rightsizing has to balance cost-performance trade-off fo… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: An abridged version appears in IEEE Cloud 2021

  8. arXiv:2107.03653  [pdf, ps, other

    cs.AR cs.DC cs.LG cs.PL

    MAFIA: Machine Learning Acceleration on FPGAs for IoT Applications

    Authors: Nikhil Pratap Ghanathe, Vivek Seshadri, Rahul Sharma, Steve Wilton, Aayan Kumar

    Abstract: Recent breakthroughs in ML have produced new classes of models that allow ML inference to run directly on milliwatt-powered IoT devices. On one hand, existing ML-to-FPGA compilers are designed for deep neural-networks on large FPGAs. On the other hand, general-purpose HLS tools fail to exploit properties specific to ML inference, thereby resulting in suboptimal performance. We propose MAFIA, a too… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: Accepted at The International Conference on Field-Programmable Logic and Applications (FPL), 2021

  9. Multilingual and code-switching ASR challenges for low resource Indian languages

    Authors: Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, Ankita Singh, Srinivasa Raghavan, Shreya Khare, Vinit Unni, Saurabh Vyas, Akash Rajpuria, Chiranjeevi Yarra, Ashish Mittal, Prasanta Kumar Ghosh, Preethi Jyothi, Kalika Bali, Vivek Seshadri, Sunayana Sitaram, Samarth Bharadwaj, Jai Nanavati, Raoul Nanavati, Karthik Sankaranarayanan, Tejaswi Seeram, Basil Abraham

    Abstract: Recently, there is increasing interest in multilingual automatic speech recognition (ASR) where a speech recognition system caters to multiple low resource languages by taking advantage of low amounts of labeled corpora in multiple languages. With multilingualism becoming common in today's world, there has been increasing interest in code-switching ASR as well. In code-switching, multiple language… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

    Comments: 6 pages

  10. arXiv:2005.09748  [pdf, other

    cs.AR

    The Virtual Block Interface: A Flexible Alternative to the Conventional Virtual Memory Framework

    Authors: Nastaran Ha**azar, Pratyush Patel, Minesh Patel, Konstantinos Kanellopoulos, Saugata Ghose, Rachata Ausavarungnirun, Geraldo Francisco de Oliveira Jr., Jonathan Appavoo, Vivek Seshadri, Onur Mutlu

    Abstract: Computers continue to diversify with respect to system designs, emerging memory technologies, and application memory demands. Unfortunately, continually adapting the conventional virtual memory framework to each possible system configuration is challenging, and often results in performance loss or requires non-trivial workarounds. To address these challenges, we propose a new virtual memory framew… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

  11. arXiv:1905.09822  [pdf, other

    cs.AR cs.PF

    In-DRAM Bulk Bitwise Execution Engine

    Authors: Vivek Seshadri, Onur Mutlu

    Abstract: Many applications heavily use bitwise operations on large bitvectors as part of their computation. In existing systems, performing such bulk bitwise operations requires the processor to transfer a large amount of data on the memory channel, thereby consuming high latency, memory bandwidth, and energy. In this paper, we describe Ambit, a recently-proposed mechanism to perform bulk bitwise operation… ▽ More

    Submitted 5 April, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1605.06483, arXiv:1610.09603, arXiv:1611.09988

  12. arXiv:1902.07344  [pdf, other

    cs.CR

    Dataplant: Enhancing System Security with Low-Cost In-DRAM Value Generation Primitives

    Authors: Lois Orosa, Yaohua Wang, Ivan Puddu, Mohammad Sadrosadati, Kaveh Razavi, Juan GĆ³mez-Luna, Hasan Hassan, Nika Mansouri-Ghiasi, Arash Tavakkol, Minesh Patel, Jeremie Kim, Vivek Seshadri, Uksong Kang, Saugata Ghose, Rodolfo Azevedo, Onur Mutlu

    Abstract: DRAM manufacturers have been prioritizing memory capacity, yield, and bandwidth for years, while trying to keep the design complexity as simple as possible. DRAM chips do not carry out any computation or other important functions, such as security. Processors implement most of the existing security mechanisms that protect the system against security threats, because 1) executing security mechanism… ▽ More

    Submitted 5 November, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

  13. arXiv:1806.03377  [pdf, other

    cs.DC

    PipeDream: Fast and Efficient Pipeline Parallel DNN Training

    Authors: Aaron Harlap, Deepak Narayanan, Amar Phanishayee, Vivek Seshadri, Nikhil Devanur, Greg Ganger, Phil Gibbons

    Abstract: PipeDream is a Deep Neural Network(DNN) training system for GPUs that parallelizes computation by pipelining execution across multiple machines. Its pipeline parallel computing model avoids the slowdowns faced by data-parallel training when large models and/or limited network bandwidth induce high communication-to-computation ratios. PipeDream reduces communication by up to 95% for large DNNs rela… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

  14. arXiv:1805.05926  [pdf, other

    cs.AR

    Predictable Performance and Fairness Through Accurate Slowdown Estimation in Shared Main Memory Systems

    Authors: Lavanya Subramanian, Vivek Seshadri, Yoongu Kim, Ben Jaiyen, Onur Mutlu

    Abstract: This paper summarizes the ideas and key concepts in MISE (Memory Interference-induced Slowdown Estimation), which was published in HPCA 2013 [97], and examines the work's significance and future potential. Applications running concurrently on a multicore system interfere with each other at the main memory. This interference can slow down different applications differently. Accurately estimating th… ▽ More

    Submitted 15 May, 2018; originally announced May 2018.

    Journal ref: IPSI BgD Transactions on Advanced Research (TAR), July 2018, Volume 14, Number 2, ISSN 1820 - 4511

  15. arXiv:1805.03969  [pdf, other

    cs.AR

    Exploiting Row-Level Temporal Locality in DRAM to Reduce the Memory Access Latency

    Authors: Hasan Hassan, Gennady Pekhimenko, Nandita Vijaykumar, Vivek Seshadri, Donghyuk Lee, Oguz Ergin, Onur Mutlu

    Abstract: This paper summarizes the idea of ChargeCache, which was published in HPCA 2016 [51], and examines the work's significance and future potential. DRAM latency continues to be a critical bottleneck for system performance. In this work, we develop a low-cost mechanism, called ChargeCache, that enables faster access to recently-accessed rows in DRAM, with no modifications to DRAM chips. Our mechanism… ▽ More

    Submitted 8 May, 2018; originally announced May 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1609.07234

  16. arXiv:1805.03502  [pdf, other

    cs.AR

    RowClone: Accelerating Data Movement and Initialization Using DRAM

    Authors: Vivek Seshadri, Yoongu Kim, Chris Fallin, Donghyuk Lee, Rachata Ausavarungnirun, Gennady Pekhimenko, Yixin Luo, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry

    Abstract: In existing systems, to perform any bulk data movement operation (copy or initialization), the data has to first be read into the on-chip processor, all the way into the L1 cache, and the result of the operation must be written back to main memory. This is despite the fact that these operations do not involve any actual computation. RowClone exploits the organization and operation of commodity DRA… ▽ More

    Submitted 7 May, 2018; originally announced May 2018.

    Comments: arXiv admin note: text overlap with arXiv:1605.06483

  17. arXiv:1805.03048  [pdf, other

    cs.AR

    Tiered-Latency DRAM: Enabling Low-Latency Main Memory at Low Cost

    Authors: Donghyuk Lee, Yoongu Kim, Vivek Seshadri, Jamie Liu, Lavanya Subramanian, Onur Mutlu

    Abstract: This paper summarizes the idea of Tiered-Latency DRAM (TL-DRAM), which was published in HPCA 2013, and examines the work's significance and future potential. The capacity and cost-per-bit of DRAM have historically scaled to satisfy the needs of increasingly large and complex computer systems. However, DRAM latency has remained almost constant, making memory latency the performance bottleneck in to… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1601.06903

  18. arXiv:1805.03047  [pdf, other

    cs.AR

    Adaptive-Latency DRAM: Reducing DRAM Latency by Exploiting Timing Margins

    Authors: Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Khan, Vivek Seshadri, Kevin Chang, Onur Mutlu

    Abstract: This paper summarizes the idea of Adaptive-Latency DRAM (AL-DRAM), which was published in HPCA 2015, and examines the work's significance and future potential. AL-DRAM is a mechanism that optimizes DRAM latency based on the DRAM module and the operating temperature, by exploiting the extra margin that is built into the DRAM timing parameters. DRAM manufacturers provide a large margin for the timin… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1603.08454

  19. arXiv:1805.01966  [pdf, other

    cs.AR

    Exploiting the DRAM Microarchitecture to Increase Memory-Level Parallelism

    Authors: Yoongu Kim, Vivek Seshadri, Donghyuk Lee, Jamie Liu, Onur Mutlu

    Abstract: This paper summarizes the idea of Subarray-Level Parallelism (SALP) in DRAM, which was published in ISCA 2012, and examines the work's significance and future potential. Modern DRAMs have multiple banks to serve multiple memory requests in parallel. However, when two requests go to the same bank, they have to be served serially, exacerbating the high latency of on-chip memory. Adding more banks to… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

  20. arXiv:1611.09988  [pdf, other

    cs.AR

    Buddy-RAM: Improving the Performance and Efficiency of Bulk Bitwise Operations Using DRAM

    Authors: Vivek Seshadri, Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali Boroumand, Jeremie Kim, Michael A. Kozuch, Onur Mutlu, Phillip B. Gibbons, Todd C. Mowry

    Abstract: Bitwise operations are an important component of modern day programming. Many widely-used data structures (e.g., bitmap indices in databases) rely on fast bitwise operations on large bit vectors to achieve high performance. Unfortunately, in existing systems, regardless of the underlying architecture (e.g., CPU, GPU, FPGA), the throughput of such bulk bitwise operations is limited by the available… ▽ More

    Submitted 29 November, 2016; originally announced November 2016.

    Comments: arXiv admin note: text overlap with arXiv:1605.06483

  21. arXiv:1610.09604  [pdf, other

    cs.AR

    Understanding and Exploiting Design-Induced Latency Variation in Modern DRAM Chips

    Authors: Donghyuk Lee, Samira Khan, Lavanya Subramanian, Saugata Ghose, Rachata Ausavarungnirun, Gennady Pekhimenko, Vivek Seshadri, Onur Mutlu

    Abstract: Variation has been shown to exist across the cells within a modern DRAM chip. We empirically demonstrate a new form of variation that exists within a real DRAM chip, induced by the design and placement of different components in the DRAM chip. Our goals are to understand design-induced variation that exists in real, state-of-the-art DRAM chips, exploit it to develop low-cost mechanisms that can dy… ▽ More

    Submitted 15 May, 2017; v1 submitted 30 October, 2016; originally announced October 2016.

    Comments: This paper is a two column version of the paper, D. Lee et al., "Design-Induced Latency Variation in Modern DRAM Chips: Characterization, Analysis, and Latency Reduction Mechanisms", SIGMETRICS 2017

  22. arXiv:1610.09603  [pdf, other

    cs.AR

    The Processing Using Memory Paradigm:In-DRAM Bulk Copy, Initialization, Bitwise AND and OR

    Authors: Vivek Seshadri, Onur Mutlu

    Abstract: In existing systems, the off-chip memory interface allows the memory controller to perform only read or write operations. Therefore, to perform any operation, the processor must first read the source data and then write the result back to memory after performing the operation. This approach consumes high latency, bandwidth, and energy for operations that work on a large amount of data. Several wor… ▽ More

    Submitted 30 October, 2016; originally announced October 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1605.06483

  23. arXiv:1605.06483  [pdf, other

    cs.AR

    Simple DRAM and Virtual Memory Abstractions to Enable Highly Efficient Memory Systems

    Authors: Vivek Seshadri

    Abstract: In most modern systems, the memory subsystem is managed and accessed at multiple different granularities at various resources. We observe that such multi-granularity management results in significant inefficiency in the memory subsystem. Specifically, we observe that 1) page-granularity virtual memory unnecessarily triggers large memory operations, and 2) existing cache-line granularity memory int… ▽ More

    Submitted 20 May, 2016; originally announced May 2016.

    Report number: CMU-CS-16-106

  24. arXiv:1603.08454  [pdf, other

    cs.AR

    Adaptive-Latency DRAM (AL-DRAM)

    Authors: Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Khan, Vivek Seshadri, Kevin Chang, Onur Mutlu

    Abstract: This paper summarizes the idea of Adaptive-Latency DRAM (AL-DRAM), which was published in HPCA 2015. The key goal of AL-DRAM is to exploit the extra margin that is built into the DRAM timing parameters to reduce DRAM latency. The key observation is that the timing parameters are dictated by the worst-case temperatures and worst-case DRAM cells, both of which lead to small amount of charge storage… ▽ More

    Submitted 28 March, 2016; originally announced March 2016.

    Comments: This is a summary of the original paper, entitled "Adaptive-Latency DRAM: Optimizing DRAM Timing for the Common-Case" which appears in HPCA 2015

    Report number: SAFARI 2016-003

  25. arXiv:1601.06903  [pdf, other

    cs.AR

    Tiered-Latency DRAM (TL-DRAM)

    Authors: Donghyuk Lee, Yoongu Kim, Vivek Seshadri, Jamie Liu, Lavanya Subramanian, Onur Mutlu

    Abstract: This paper summarizes the idea of Tiered-Latency DRAM, which was published in HPCA 2013. The key goal of TL-DRAM is to provide low DRAM latency at low cost, a critical problem in modern memory systems. To this end, TL-DRAM introduces heterogeneity into the design of a DRAM subarray by segmenting the bitlines, thereby creating a low-latency, low-energy, low-capacity portion in the subarray (called… ▽ More

    Submitted 26 January, 2016; originally announced January 2016.

    Comments: This is a summary of the original paper, entitled "Tiered-Latency DRAM: A Low Latency and Low Cost DRAM Architecture" which appears in HPCA 2013

  26. arXiv:1504.00390  [pdf, other

    cs.DC

    The Blacklisting Memory Scheduler: Balancing Performance, Fairness and Complexity

    Authors: Lavanya Subramanian, Donghyuk Lee, Vivek Seshadri, Harsha Rastogi, Onur Mutlu

    Abstract: In a multicore system, applications running on different cores interfere at main memory. This inter-application interference degrades overall system performance and unfairly slows down applications. Prior works have developed application-aware memory schedulers to tackle this problem. State-of-the-art application-aware memory schedulers prioritize requests of applications that are vulnerable to in… ▽ More

    Submitted 1 April, 2015; originally announced April 2015.

    Report number: SAFARI Technical Report No. 2015-004

  27. arXiv:1211.2063  [pdf

    cs.NI

    Mobile-to-Mobile Video Recommendation

    Authors: Padmanabha Venkatagiri Seshadri, Mun Choon Chan, Wei Tsang Ooi

    Abstract: Mobile device users can now easily capture and socially share video clips in a timely manner by uploading them wirelessly to a server. When attending crowded events, such as an exhibition or the Olympic Games, however, timely sharing of videos becomes difficult due to choking bandwidth in the network infrastructure, preventing like-minded attendees from easily sharing videos with each other throug… ▽ More

    Submitted 9 November, 2012; originally announced November 2012.