Skip to main content

Showing 1–4 of 4 results for author: Subramanya, S J

.
  1. arXiv:2205.03763  [pdf, other

    cs.LG cs.DB cs.DS cs.PF

    Results of the NeurIPS'21 Challenge on Billion-Scale Approximate Nearest Neighbor Search

    Authors: Harsha Vardhan Simhadri, George Williams, Martin Aumüller, Matthijs Douze, Artem Babenko, Dmitry Baranchuk, Qi Chen, Lucas Hosseini, Ravishankar Krishnaswamy, Gopal Srinivasa, Suhas Jayaram Subramanya, **gdong Wang

    Abstract: Despite the broad range of algorithms for Approximate Nearest Neighbor Search, most empirical evaluations of algorithms have focused on smaller datasets, typically of 1 million points~\citep{Benchmark}. However, deploying recent advances in embedding based techniques for search, recommendation and ranking at scale require ANNS indices at billion, trillion or larger scale. Barring a few recent pape… ▽ More

    Submitted 7 May, 2022; originally announced May 2022.

  2. arXiv:2105.09613  [pdf, other

    cs.IR

    FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search

    Authors: Aditi Singh, Suhas Jayaram Subramanya, Ravishankar Krishnaswamy, Harsha Vardhan Simhadri

    Abstract: Approximate nearest neighbor search (ANNS) is a fundamental building block in information retrieval with graph-based indices being the current state-of-the-art and widely used in the industry. Recent advances in graph-based indices have made it possible to index and search billion-point datasets with high recall and millisecond-level latency on a single commodity machine with an SSD. However, ex… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: 19 pages, 22 figures

    ACM Class: H.3.3

  3. arXiv:2103.08191  [pdf, other

    cs.DC

    PACEMAKER: Avoiding HeART attacks in storage clusters with disk-adaptive redundancy

    Authors: Saurabh Kadekodi, Francisco Maturana, Suhas Jayaram Subramanya, Juncheng Yang, K. V. Rashmi, Gregory R. Ganger

    Abstract: Data redundancy provides resilience in large-scale storage clusters, but imposes significant cost overhead. Substantial space-savings can be realized by tuning redundancy schemes to observed disk failure rates. However, prior design proposals for such tuning are unusable in real-world clusters, because the IO load of transitions between schemes overwhelms the storage infrastructure (termed transit… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: Published in USENIX Symposium on Operating Systems Design and Implementation (OSDI) 2020

    ACM Class: B.8.1; C.4; D.4.2; D.4.5

    Journal ref: 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020, (pp. 369-385)

  4. arXiv:2008.12260  [pdf, other

    cs.DC cs.LG

    Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning

    Authors: Aurick Qiao, Sang Keun Choe, Suhas Jayaram Subramanya, Willie Neiswanger, Qirong Ho, Hao Zhang, Gregory R. Ganger, Eric P. Xing

    Abstract: Pollux improves scheduling performance in deep learning (DL) clusters by adaptively co-optimizing inter-dependent factors both at the per-job level and at the cluster-wide level. Most existing schedulers expect users to specify the number of resources for each job, often leading to inefficient resource use. Some recent schedulers choose job resources for users, but do so without awareness of how D… ▽ More

    Submitted 26 May, 2021; v1 submitted 27 August, 2020; originally announced August 2020.