Showing 1–31 of 31 results for author: Sivaraman, A

  1. arXiv:2406.09617  [pdf, other]

    cs.CL cs.HC eess.AS

    Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection

    Authors: Shruti Palaskar, Oggi Rudovic, Sameer Dharur, Florian Pesce, Gautam Krishna, Aswin Sivaraman, Jack Berkowitz, Ahmed Hussen Abdelaziz, Saurabh Adya, Ahmed Tewfik

    Abstract: Although Large Language Models (LLMs) have shown promise for human-like conversations, they are primarily pre-trained on text data. Incorporating audio or video improves performance, but collecting large-scale multimodal data and pre-training multimodal LLMs is challenging. To this end, we propose a Fusion Low Rank Adaptation (FLoRA) technique that efficiently adapts a pre-trained unimodal LLM to…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  2. arXiv:2402.09527  [pdf, other]

    cs.NI

    Jasper: Scalable and Fair Multicast for Financial Exchanges in the Cloud

    Authors: Muhammad Haseeb, Jinkun Geng, Ulysses Butler, Xiyu Hao, Daniel Duclos-Cavalcanti, Anirudh Sivaraman

    Abstract: Financial exchanges have recently shown an interest in migrating to the public cloud for scalability, elasticity, and cost savings. However, financial exchanges often have strict network requirements that can be difficult to meet on the cloud. Notably, market participants (MPs) trade based on market data about different activities in the market. Exchanges often use switch multicast to disseminate…

    Submitted 2 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  3. arXiv:2312.04857  [pdf, other]

    cs.NI cs.AR

    Application-Defined Receive Side Dispatching on the NIC

    Authors: Tao Wang, Jinkun Lin, Gianni Antichi, Aurojit Panda, Anirudh Sivaraman

    Abstract: Recently, some application (L7) processing has been moved to the network stack (including proxies) as a way to provide a common and application-agnostic interface for security policies, simplify service management, etc. This paper looks at whether L7 network functionality can be offloaded to SmartNICs to improve performance and reduce overheads. We investigate this question by examining how to off…

    Submitted 8 December, 2023; originally announced December 2023.

  4. arXiv:2309.14647  [pdf, other]

    cs.NI

    State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing

    Authors: Qiongwen Xu, Sebastiano Miano, Xiangyu Gao, Tao Wang, Adithya Murugadass, Songyuan Zhang, Anirudh Sivaraman, Gianni Antichi, Srinivas Narayana

    Abstract: With the slowdown of Moore's law, CPU-oriented packet processing in software will be significantly outpaced by emerging line speeds of network interface cards (NICs). Single-core packet-processing throughput has saturated. We consider the problem of high-speed packet processing with multiple CPU cores. The key challenge is state--memory that multiple packets must read and update. The prevailing…

    Submitted 16 June, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  5. arXiv:2211.15300  [pdf, other]

    cs.NI cs.SC cs.SE

    P4Testgen: An Extensible Test Oracle For P4

    Authors: Fabian Ruffy, Jed Liu, Prathima Kotikalapudi, Vojtěch Havel, Hanneli Tavante, Rob Sherwood, Vladyslav Dubina, Volodymyr Peschanenko, Anirudh Sivaraman, Nate Foster

    Abstract: We present P4Testgen, a test oracle for the P4$_{16}$ language. P4Testgen supports automatic test generation for any P4 target and is designed to be extensible to many P4 targets. It models the complete semantics of the target's packet-processing pipeline including the P4 language, architectures and externs, and target-specific extensions. To handle non-deterministic behaviors and complex externs…

    Submitted 6 August, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Journal ref: ACM SIGCOMM 2023 Conference (ACM SIGCOMM '23)

  6. arXiv:2211.07493  [pdf, ps, other]

    eess.AS cs.SD

    The Potential of Neural Speech Synthesis-based Data Augmentation for Personalized Speech Enhancement

    Authors: Anastasia Kuznetsova, Aswin Sivaraman, Minje Kim

    Abstract: With the advances in deep learning, speech enhancement systems benefited from large neural network architectures and achieved state-of-the-art quality. However, speaker-agnostic methods are not always desirable, both in terms of quality and their complexity, when they are to be used in a resource-constrained environment. One promising way is personalized speech enhancement (PSE), which is a smalle…

    Submitted 14 November, 2022; originally announced November 2022.

  7. arXiv:2211.06475  [pdf, other]

    cs.NI

    High-Level Synthesis for Packet-Processing Pipelines

    Authors: Xiangyu Gao, Divya Raghunathan, Ruijie Fang, Tao Wang, Xiaotong Zhu, Anirudh Sivaraman, Srinivas Narayana, Aarti Gupta

    Abstract: Compiling high-level programs to target high-speed packet-processing pipelines is a challenging combinatorial optimization problem. The compiler must configure the pipeline's resources to match the high-level semantics of the program, while packing all of the program's computation into the pipeline's limited resources. State of the art approaches tackle individual aspects of this problem. Yet, the…

    Submitted 18 November, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

  8. arXiv:2206.03285  [pdf, other]

    cs.DC cs.DB cs.NI

    Nezha: Deployable and High-Performance Consensus Using Synchronized Clocks

    Authors: Jinkun Geng, Anirudh Sivaraman, Balaji Prabhakar, Mendel Rosenblum

    Abstract: This paper presents a high-performance consensus protocol, Nezha, which can be deployed by cloud tenants without any support from their cloud provider. Nezha bridges the gap between protocols such as Multi-Paxos and Raft, which can be readily deployed and protocols such as NOPaxos and Speculative Paxos, that provide better performance, but require access to technologies such as programmable switch…

    Submitted 24 March, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Accepted by 49th International Conference on Very Large Data Bases (VLDB 2023)

    Report number: Volume 16, Issue 4, pp 629--642

    ACM Class: C.2.1; C.2.4; C.4

    Journal ref: Proc. VLDB Endow, Volume 16, Issue 4 (December 2022), pp. 629-642

  9. arXiv:2206.00553  [pdf, other]

    cs.LG cs.CY

    FETA: Fairness Enforced Verifying, Training, and Predicting Algorithms for Neural Networks

    Authors: Kiarash Mohammadi, Aishwarya Sivaraman, Golnoosh Farnadi

    Abstract: Algorithmic decision making driven by neural networks has become very prominent in applications that directly affect people's quality of life. In this paper, we study the problem of verifying, training, and guaranteeing individual fairness of neural network models. A popular approach for enforcing fairness is to translate a fairness notion into constraints over the parameters of the model. However…

    Submitted 30 January, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

  10. arXiv:2112.14845  [pdf, other]

    cs.DC eess.SY

    Collective Autoscaling for Cloud Microservices

    Authors: Vighnesh Sachidananda, Anirudh Sivaraman

    Abstract: As cloud applications shift from monoliths to loosely coupled microservices, application developers must decide how many compute resources (e.g., number of replicated containers) to assign to each microservice within an application. This decision affects both (1) the dollar cost to the application developer and (2) the end-to-end latency perceived by the application user. Today, individual microse…

    Submitted 7 August, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  11. arXiv:2110.10739  [pdf, other]

    cs.SD eess.AS

    Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training

    Authors: Aswin Sivaraman, Scott Wisdom, Hakan Erdogan, John R. Hershey

    Abstract: The recently-proposed mixture invariant training (MixIT) is an unsupervised method for training single-channel sound separation models in the sense that it does not require ground-truth isolated reference sources. In this paper, we investigate using MixIT to adapt a separation model on real far-field overlapping reverberant and noisy speech data from the AMI Corpus. The models are tested on real A…

    Submitted 20 October, 2021; originally announced October 2021.

  12. arXiv:2107.06402  [pdf, other]

    cs.SE

    Mining Idioms in the Wild

    Authors: Aishwarya Sivaraman, Rui Abreu, Andrew Scott, Tobi Akomolede, Satish Chandra

    Abstract: Existing code repositories contain numerous instances of code patterns that are idiomatic ways of accomplishing a particular programming task. Sometimes, the programming language in use supports specific operators or APIs that can express the same idiomatic imperative code much more succinctly. However, those code patterns linger in repositories because the developers may be unaware of the new API…

    Submitted 13 July, 2021; originally announced July 2021.

  13. arXiv:2106.14347  [pdf, other]

    cs.DC cs.LG

    Revelio: ML-Generated Debugging Queries for Distributed Systems

    Authors: Pradeep Dogga, Karthik Narasimhan, Anirudh Sivaraman, Shiv Kumar Saini, George Varghese, Ravi Netravali

    Abstract: A major difficulty in debugging distributed systems lies in manually determining which of the many available debugging tools to use and how to query its logs. Our own study of a production debugging workflow confirms the magnitude of this burden. This paper explores whether a machine-learning model can assist developers in distributed systems debugging. We present Revelio, a debugging assistant wh…

    Submitted 27 June, 2021; originally announced June 2021.

  14. arXiv:2105.03542  [pdf, other]

    eess.AS cs.LG cs.SD

    Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection

    Authors: Aswin Sivaraman, Minje Kim

    Abstract: This paper presents a novel zero-shot learning approach towards personalized speech enhancement through the use of a sparsely active ensemble model. Optimizing speech denoising systems towards a particular test-time speaker can improve performance and reduce run-time complexity. However, test-time model adaptation may be challenging if collecting data from the test-time speaker is not possible. To…

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: 5 pages, 3 figures, submitted to 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

  15. arXiv:2104.02018  [pdf, other]

    eess.AS cs.LG cs.SD

    Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification

    Authors: Aswin Sivaraman, Sunwoo Kim, Minje Kim

    Abstract: Training personalized speech enhancement models is innately a no-shot learning problem due to privacy constraints and limited access to noise-free speech from the target user. If there is an abundance of unlabeled noisy speech from the test-time user, a personalized speech enhancement model can be trained using self-supervised learning. One straightforward approach to model personalization is to u…

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: 5 pages, 3 figures, under review

  16. arXiv:2104.02017  [pdf, other]

    eess.AS cs.LG cs.SD

    Efficient Personalized Speech Enhancement through Self-Supervised Learning

    Authors: Aswin Sivaraman, Minje Kim

    Abstract: This work presents self-supervised learning methods for developing monaural speaker-specific (i.e., personalized) speech enhancement models. While generalist models must broadly address many speakers, specialist models can adapt their enhancement function towards a particular speaker's voice, expecting to solve a narrower problem. Hence, specialists are capable of achieving more optimal performanc…

    Submitted 27 July, 2022; v1 submitted 5 April, 2021; originally announced April 2021.

    Comments: 15 pages, 9 figures, published in IEEE JSTSP 2022

  17. arXiv:2103.02585  [pdf, other]

    cs.CL

    Detecting Extraneous Content in Podcasts

    Authors: Sravana Reddy, Yongze Yu, Aasish Pappu, Aswin Sivaraman, Rezvaneh Rezapour, Rosie Jones

    Abstract: Podcast episodes often contain material extraneous to the main content, such as advertisements, interleaved within the audio and the written descriptions. We present classifiers that leverage both textual and listening patterns in order to detect such content in podcast descriptions and audio transcripts. We demonstrate that our models are effective by evaluating them on the downstream task of pod…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: EACL 2021

  18. arXiv:2103.00022  [pdf, other]

    cs.NI

    Synthesizing Safe and Efficient Kernel Extensions for Packet Processing

    Authors: Qiongwen Xu, Michael D. Wong, Tanvi Wagle, Srinivas Narayana, Anirudh Sivaraman

    Abstract: Extended Berkeley Packet Filter (BPF) has emerged as a powerful method to extend packet-processing functionality in the Linux operating system. BPF allows users to write code in high-level languages (like C or Rust) and execute them at specific hooks in the kernel, such as the network device driver. To ensure safe execution of a user-developed BPF program in kernel context, Linux uses an in-kernel…

    Submitted 14 July, 2021; v1 submitted 26 February, 2021; originally announced March 2021.

  19. arXiv:2102.04911  [pdf, other]

    cs.NI eess.SY

    The case for model-driven interpretability of delay-based congestion control protocols

    Authors: Muhammad Khan, Yasir Zaki, Shiva Iyer, Talal Ahamd, Thomas Pötsch, Jay Chen, Anirudh Sivaraman, Lakshmi Subramanian

    Abstract: Analyzing and interpreting the exact behavior of new delay-based congestion control protocols with complex non-linear control loops is exceptionally difficult in highly variable networks such as cellular networks. This paper proposes a Model-Driven Interpretability (MDI) congestion control framework, which derives a model version of a delay-based protocol by simplifying a congestion control protoc…

    Submitted 9 February, 2021; originally announced February 2021.

  20. arXiv:2101.12691  [pdf, other]

    cs.NI cs.AR

    Isolation mechanisms for high-speed packet-processing pipelines

    Authors: Tao Wang, Xiangrui Yang, Gianni Antichi, Anirudh Sivaraman, Aurojit Panda

    Abstract: Data-plane programmability is now mainstream. As we find more use cases, deployments need to be able to run multiple packet-processing modules in a single device. These are likely to be developed by independent teams, either within the same organization or from multiple organizations. Therefore, we need isolation mechanisms to ensure that modules on the same device do not interfere with each other…

    Submitted 2 March, 2022; v1 submitted 29 January, 2021; originally announced January 2021.

    Journal ref: The 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI '22), 2022

  21. arXiv:2011.03426   

    eess.AS cs.LG cs.SD

    Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement

    Authors: Aswin Sivaraman, Minje Kim

    Abstract: This work explores how self-supervised learning can be universally used to discover speaker-specific features towards enabling personalized speech enhancement models. We specifically address the few-shot learning scenario where access to clean recordings of a test-time speaker is limited to a few seconds, but noisy recordings of the speaker are abundant. We develop a simple contrastive learning…

    Submitted 9 August, 2022; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: This work has been superseded by article 2104.02017

  22. arXiv:2006.08852  [pdf, ps, other]

    cs.LG stat.ML

    Counterexample-Guided Learning of Monotonic Neural Networks

    Authors: Aishwarya Sivaraman, Golnoosh Farnadi, Todd Millstein, Guy Van den Broeck

    Abstract: The widespread adoption of deep learning is often attributed to its automatic feature construction with minimal inductive bias. However, in many real-world tasks, the learned function is intended to satisfy domain-specific constraints. We focus on monotonicity constraints, which are common and require that the function's output increases with increasing values of specific input features. We develo…

    Submitted 15 June, 2020; originally announced June 2020.

  23. arXiv:2006.01074  [pdf, other]

    cs.NI cs.SE

    Gauntlet: Finding Bugs in Compilers for Programmable Packet Processing

    Authors: Fabian Ruffy, Tao Wang, Anirudh Sivaraman

    Abstract: Programmable packet-processing devices such as programmable switches and network interface cards are becoming mainstream. These devices are configured in a domain-specific language such as P4, using a compiler to translate packet-processing programs into instructions for different targets. As networks with programmable devices become widespread, it is critical that these compilers be dependable.…

    Submitted 25 October, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Journal ref: Proceedings of the USENIX Symposium on Operating Systems Design and Implementation (OSDI). November, 2020

  24. arXiv:2005.08128  [pdf, other]

    eess.AS cs.LG cs.SD

    Sparse Mixture of Local Experts for Efficient Speech Enhancement

    Authors: Aswin Sivaraman, Minje Kim

    Abstract: In this paper, we investigate a deep learning approach for speech denoising through an efficient ensemble of specialist neural networks. By splitting up the speech denoising task into non-overlapping subproblems and introducing a classifier, we are able to improve denoising performance while also reducing computational complexity. More specifically, the proposed model incorporates a gating network…

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: 5 pages, 5 figures

    Journal ref: Published in Interspeech 2020

  25. arXiv:2005.02310  [pdf, other]

    cs.AR

    Testing Compilers for Programmable Switches Through Switch Hardware Simulation

    Authors: Michael D. Wong, Aatish Kishan Varma, Anirudh Sivaraman

    Abstract: Programmable switches have emerged as powerful and flexible alternatives to fixed-function forwarding devices. But because of the unique hardware constraints of network switches, the design and implementation of compilers targeting these devices is tedious and error prone. Despite the important role that compilers play in software development, there is a dearth of tools for testing compilers for p…

    Submitted 27 October, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: 7 pages, 4 figures

    ACM Class: B.4.4; C.2.0; D.2.5; D.3.4

  26. arXiv:1902.00956  [pdf, ps, other]

    cs.SD cs.LG eess.AS stat.ML

    Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances

    Authors: Sanna Wager, George Tzanetakis, Cheng-i Wang, Lijiang Guo, Aswin Sivaraman, Minje Kim

    Abstract: We describe a machine-learning approach to pitch correcting a solo singing performance in a karaoke setting, where the solo voice and accompaniment are on separate tracks. The proposed approach addresses the situation where no musical score of the vocals nor the accompaniment exists: It predicts the amount of correction from the relationship between the spectral contents of the vocal and accompani…

    Submitted 3 February, 2019; originally announced February 2019.

  27. arXiv:1812.05265  [pdf, other]

    cs.SE

    Active Inductive Logic Programming for Code Search

    Authors: Aishwarya Sivaraman, Tianyi Zhang, Guy Van den Broeck, Miryung Kim

    Abstract: Modern search techniques either cannot efficiently incorporate human feedback to refine search results or to express structural or semantic properties of desired code. The key insight of our interactive code search technique ALICE is that user feedback could be actively incorporated to allow users to easily express and refine search queries. We design a query language to model the structure and se…

    Submitted 23 February, 2019; v1 submitted 13 December, 2018; originally announced December 2018.

  28. arXiv:1805.02603  [pdf, ps, other]

    cs.SD eess.AS

    A Data-Driven Approach to Smooth Pitch Correction for Singing Voice in Pop Music

    Authors: Sanna Wager, Lijiang Guo, Aswin Sivaraman, Minje Kim

    Abstract: In this paper, we present a machine-learning approach to pitch correction for voice in a karaoke setting, where the vocals and accompaniment are on separate tracks and time-aligned. The network takes as input the time-frequency representation of the two tracks and predicts the amount of pitch-shifting in cents required to make the voice sound in-tune with the accompaniment. It is trained on exampl…

    Submitted 7 May, 2018; originally announced May 2018.

  29. arXiv:1801.09774  [pdf, other]

    cs.SD eess.AS

    On Psychoacoustically Weighted Cost Functions Towards Resource-Efficient Deep Neural Networks for Speech Denoising

    Authors: Kai Zhen, Aswin Sivaraman, Jongmo Sung, Minje Kim

    Abstract: We present a psychoacoustically enhanced cost function to balance network complexity and perceptual performance of deep neural networks for speech denoising. While training the network, we utilize perceptual weights added to the ordinary mean-squared error to emphasize contribution from frequency bins which are most audible while ignoring error from inaudible bins. To generate the weights, we empl…

    Submitted 29 January, 2018; originally announced January 2018.

    Comments: 5 pages, 4 figures

  30. arXiv:1602.06045  [pdf, other]

    cs.NI

    Programmable Packet Scheduling

    Authors: Anirudh Sivaraman, Suvinay Subramanian, Anurag Agrawal, Sharad Chole, Shang-Tse Chuang, Tom Edsall, Mohammad Alizadeh, Sachin Katti, Nick McKeown, Hari Balakrishnan

    Abstract: Switches today provide a small set of scheduling algorithms. While we can tweak scheduling parameters, we cannot modify algorithmic logic, or add a completely new algorithm, after the switch has been designed. This paper presents a design for a programmable packet scheduler, which allows scheduling algorithms---potentially algorithms that are unknown today---to be programmed into a switch without…

    Submitted 18 February, 2016; originally announced February 2016.

    Comments: 14 pages

  31. arXiv:1512.05023  [pdf, other]

    cs.NI

    Packet Transactions: High-level Programming for Line-Rate Switches

    Authors: Anirudh Sivaraman, Mihai Budiu, Alvin Cheung, Changhoon Kim, Steve Licking, George Varghese, Hari Balakrishnan, Mohammad Alizadeh, Nick McKeown

    Abstract: Many algorithms for congestion control, scheduling, network measurement, active queue management, security, and load balancing require custom processing of packets as they traverse the data plane of a network switch. To run at line rate, these data-plane algorithms must be in hardware. With today's switch hardware, algorithms cannot be changed, nor new algorithms installed, after a switch has been…

    Submitted 29 January, 2016; v1 submitted 15 December, 2015; originally announced December 2015.

    Comments: 16 pages