Skip to main content

Showing 1–8 of 8 results for author: Singhvi, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.13382  [pdf, ps, other

    cs.CL cs.AI cs.PL

    DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines

    Authors: Arnav Singhvi, Manish Shetty, Shangyin Tan, Christopher Potts, Koushik Sen, Matei Zaharia, Omar Khattab

    Abstract: Chaining language model (LM) calls as composable modules is fueling a new way of programming, but ensuring LMs adhere to important constraints requires heuristic "prompt engineering". We introduce LM Assertions, a programming construct for expressing computational constraints that LMs should satisfy. We integrate our constructs into the recent DSPy programming model for LMs, and present new strate… ▽ More

    Submitted 2 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Arnav*, Manish*, Shangyin* contributed equally to this work

  2. arXiv:2310.03714  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

    Authors: Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, Christopher Potts

    Abstract: The ML community is rapidly exploring techniques for prompting language models (LMs) and for stacking them into pipelines that solve complex tasks. Unfortunately, existing LM pipelines are typically implemented using hard-coded "prompt templates", i.e. lengthy strings discovered via trial and error. Toward a more systematic approach for develo** and optimizing LM pipelines, we introduce DSPy, a… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  3. arXiv:2305.00538  [pdf, other

    cs.NI

    SFC: Near-Source Congestion Signaling and Flow Control

    Authors: Yanfang Le, Jeongkeun Lee, Jeremias Blendin, Jiayi Chen, Georgios Nikolaidis, Rong Pan, Robert Soule, Aditya Akella, Pedro Yebenes Segura, Arjun singhvi, Yuliang Li, Qingkai Meng, Changhoon Kim, Serhat Arslan

    Abstract: State-of-the-art congestion control algorithms for data centers alone do not cope well with transient congestion and high traffic bursts. To help with these, we revisit the concept of direct \emph{backward} feedback from switches and propose Back-to-Sender (BTS) signaling to many concurrent incast senders. Combining it with our novel approach to in-network caching, we achieve near-source sub-RTT c… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  4. arXiv:2209.06346  [pdf

    cs.LG

    Prediction of the outcome of a Twenty-20 Cricket Match : A Machine Learning Approach

    Authors: Ashish V Shenoy, Arjun Singhvi, Shruthi Racha, Srinivas Tunuguntla

    Abstract: Twenty20 cricket, sometimes written Twenty-20, and often abbreviated to T20, is a short form of cricket. In a Twenty20 game the two teams of 11 players have a single innings each, which is restricted to a maximum of 20 overs. This version of cricket is especially unpredictable and is one of the reasons it has gained popularity over recent times. However, in this paper we try four different machine… ▽ More

    Submitted 22 July, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: Machine Learning Applications, Sports, Cricket Outcome Prediction

  5. arXiv:1911.09849  [pdf, other

    cs.DC

    Archipelago: A Scalable Low-Latency Serverless Platform

    Authors: Arjun Singhvi, Kevin Houck, Arjun Balasubramanian, Mohammed Danish Shaikh, Shivaram Venkataraman, Aditya Akella

    Abstract: The increased use of micro-services to build web applications has spurred the rapid growth of Function-as-a-Service (FaaS) or serverless computing platforms. While FaaS simplifies provisioning and scaling for application developers, it introduces new challenges in resource management that need to be handled by the cloud provider. Our analysis of popular serverless workloads indicates that schedule… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: 14 pages

  6. arXiv:1910.07700  [pdf, other

    cs.DC

    SNF: Serverless Network Functions

    Authors: Arjun Singhvi, Junaid Khalid, Aditya Akella, Sujata Banerjee

    Abstract: It is increasingly common to outsource network functions (NFs) to the cloud. However, no cloud providers offer NFs-as-a-Service (NFaaS) that allows users to run custom NFs. Our work addresses how a cloud provider can offer NFaaS. We use the emerging serverless computing paradigm as it has the right building blocks - usage-based billing, convenient event-driven programming model and automatic compu… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: 14 pages

  7. arXiv:1907.01484  [pdf, other

    cs.DC

    Themis: Fair and Efficient GPU Cluster Scheduling

    Authors: Kshiteej Mahajan, Arjun Balasubramanian, Arjun Singhvi, Shivaram Venkataraman, Aditya Akella, Amar Phanishayee, Shuchi Chawla

    Abstract: Modern distributed machine learning (ML) training workloads benefit significantly from leveraging GPUs. However, significant contention ensues when multiple such workloads are run atop a shared cluster of GPUs. A key question is how to fairly apportion GPUs across workloads. We find that established cluster scheduling disciplines are a poor fit because of ML workloads' unique attributes: ML jobs h… ▽ More

    Submitted 29 October, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

  8. arXiv:1703.10272  [pdf, other

    cs.DC

    Whiz: A Fast and Flexible Data Analytics System

    Authors: Robert Grandl, Arjun Singhvi, Raajay Viswanathan, Aditya Akella

    Abstract: Today's data analytics frameworks are compute-centric, with analytics execution almost entirely dependent on the pre-determined physical structure of the high-level computation. Relegating intermediate data to a second class entity in this manner hurts flexibility, performance, and efficiency. We present Whiz, a new analytics framework that cleanly separates computation from intermediate data. It… ▽ More

    Submitted 21 June, 2019; v1 submitted 29 March, 2017; originally announced March 2017.

    Comments: 16 pages