Skip to main content

Showing 1–2 of 2 results for author: Mahalingam, N

.
  1. arXiv:2308.12908  [pdf, other

    cs.DC cs.AR cs.LG

    POLCA: Power Oversubscription in LLM Cloud Providers

    Authors: Pratyush Patel, Esha Choukse, Chaojie Zhang, Íñigo Goiri, Brijesh Warrier, Nithish Mahalingam, Ricardo Bianchini

    Abstract: Recent innovation in large language models (LLMs), and their myriad use-cases have rapidly driven up the compute capacity demand for datacenter GPUs. Several cloud providers and other enterprises have made substantial plans of growth in their datacenters to support these new workloads. One of the key bottleneck resources in datacenters is power, and given the increasing model sizes of LLMs, they a… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  2. arXiv:2010.15388  [pdf, other

    cs.DC

    Prediction-Based Power Oversubscription in Cloud Platforms

    Authors: Alok Kumbhare, Reza Azimi, Ioannis Manousakis, Anand Bonde, Felipe Frujeri, Nithish Mahalingam, Pulkit Misra, Seyyed Ahmad Javadi, Bianca Schroeder, Marcus Fontoura, Ricardo Bianchini

    Abstract: Datacenter designers rely on conservative estimates of IT equipment power draw to provision resources. This leaves resources underutilized and requires more datacenters to be built. Prior work has used power cap** to shave the rare power peaks and add more servers to the datacenter, thereby oversubscribing its resources and lowering capital costs. This works well when the workloads and their ser… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.