Showing 1–4 of 4 results for author: Narayanaswami, C

  1. arXiv:2407.00047  [pdf, other]

    cs.DC cs.CL cs.LG

    One Queue Is All You Need: Resolving Head-of-Line Blocking in Large Language Model Serving

    Authors: Archit Patke, Dhemath Reddy, Saurabh Jha, Haoran Qiu, Christian Pinto, Shengkun Cui, Chandra Narayanaswami, Zbigniew Kalbarczyk, Ravishankar Iyer

    Abstract: Large language models (LLMs) have become an increasingly important workload for cloud providers catering to both enterprise and consumer applications. LLM inference requests from these applications have end-to-end latency SLOs that must be adhered to in production settings. However, existing LLM serving systems focus on optimization objectives such as request serving throughput or request execu…

    Submitted 5 June, 2024; originally announced July 2024.

  2. arXiv:2310.12183  [pdf, other]

    math.OC cs.AI

    An Optimistic-Robust Approach for Dynamic Positioning of Omnichannel Inventories

    Authors: Pavithra Harsha, Shivaram Subramanian, Ali Koc, Mahesh Ramakrishna, Brian Quanz, Dhruv Shah, Chandra Narayanaswami

    Abstract: We introduce a new class of data-driven and distribution-free optimistic-robust bimodal inventory optimization (BIO) strategy to effectively allocate inventory across a retail chain to meet time-varying, uncertain omnichannel demand. While prior Robust optimization (RO) methods emphasize the downside, i.e., worst-case adversarial demand, BIO also considers the upside to remain resilient like RO wh…

    Submitted 17 October, 2023; originally announced October 2023.

  3. Hierarchical Proxy Modeling for Improved HPO in Time Series Forecasting

    Authors: Arindam Jati, Vijay Ekambaram, Shaonli Pal, Brian Quanz, Wesley M. Gifford, Pavithra Harsha, Stuart Siegel, Sumanta Mukherjee, Chandra Narayanaswami

    Abstract: Selecting the right set of hyperparameters is crucial in time series forecasting. The classical temporal cross-validation framework for hyperparameter optimization (HPO) often leads to poor test performance because of a possible mismatch between validation and test periods. To address this test-validation mismatch, we propose a novel technique, H-Pro, to drive HPO via test proxies by exploiting dat…

    Submitted 2 November, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

  4. arXiv:1510.08210  [pdf]

    cs.CY cs.HC

    Enabling Multiple QR Codes in Close Proximity

    Authors: Mercan Topkara, Thomas Erickson, Umut Topkara, Chandrasekhar Narayanaswami

    Abstract: Quick response codes - 2D patterns that can be scanned to access online resources - are being used in a variety of industrial and consumer applications. However, it is problematic to use multiple QR codes in close proximity: scans can fail or result in access to the wrong resource. While this problem is, strictly speaking, due to the design of the scanning software, the very large number of extant…

    Submitted 28 October, 2015; originally announced October 2015.