Showing 1–9 of 9 results for author: Citovsky, G

  1. arXiv:2405.18754  [pdf, other]

    cs.DS cs.LG

    GIST: Greedy Independent Set Thresholding for Diverse Data Summarization

    Authors: Matthew Fahrbach, Srikumar Ramalingam, Morteza Zadimoghaddam, Sara Ahmadian, Gui Citovsky, Giulia DeSalvo

    Abstract: We propose a novel subset selection task called min-distance diverse data summarization ($\textsf{MDDS}$), which has a wide variety of applications in machine learning, e.g., data sampling and feature selection. Given a set of points in a metric space, the goal is to maximize an objective that combines the total utility of the points and a diversity term that captures the minimum distance between…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 15 pages, 1 figure

  2. arXiv:2401.13160  [pdf, other]

    cs.LG cs.CL

    SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced Token Detection

    Authors: Ke Ye, Heinrich Jiang, Afshin Rostamizadeh, Ayan Chakrabarti, Giulia DeSalvo, Jean-François Kagy, Lazaros Karydas, Gui Citovsky, Sanjiv Kumar

    Abstract: Pre-training large language models is known to be extremely resource intensive and often times inefficient, under-utilizing the information encapsulated in the training text sequences. In this paper, we present SpacTor, a new training procedure consisting of (1) a hybrid objective combining span corruption (SC) and token replacement detection (RTD), and (2) a two-stage curriculum that optimizes th…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 9+13 pages, 5 figures

  3. arXiv:2301.12052  [pdf, other]

    cs.LG cs.AI stat.ML

    Leveraging Importance Weights in Subset Selection

    Authors: Gui Citovsky, Giulia DeSalvo, Sanjiv Kumar, Srikumar Ramalingam, Afshin Rostamizadeh, Yunjuan Wang

    Abstract: We present a subset selection algorithm designed to work with arbitrary model families in a practical batch setting. In such a setting, an algorithm can sample examples one at a time but, in order to limit overhead costs, is only able to update its state (i.e. further train model weights) once a large enough batch of examples is selected. Our algorithm, IWeS, selects examples by importance samplin…

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: ICLR 2023

  4. arXiv:2107.14263  [pdf, other]

    cs.LG cs.AI

    Batch Active Learning at Scale

    Authors: Gui Citovsky, Giulia DeSalvo, Claudio Gentile, Lazaros Karydas, Anand Rajagopalan, Afshin Rostamizadeh, Sanjiv Kumar

    Abstract: The ability to train complex and highly effective models often requires an abundance of training data, which can easily become a bottleneck in cost, time, and computational resources. Batch active learning, which adaptively issues batched queries to a labeling oracle, is a common approach for addressing this problem. The practical benefits of batch sampling come with the downside of less adaptivit…

    Submitted 29 July, 2021; originally announced July 2021.

  5. arXiv:2105.11653  [pdf, other]

    cs.LG

    Scaling Hierarchical Agglomerative Clustering to Billion-sized Datasets

    Authors: Baris Sumengen, Anand Rajagopalan, Gui Citovsky, David Simcha, Olivier Bachem, Pradipta Mitra, Sam Blasiak, Mason Liang, Sanjiv Kumar

    Abstract: Hierarchical Agglomerative Clustering (HAC) is one of the oldest but still most widely used clustering methods. However, HAC is notoriously hard to scale to large data sets as the underlying complexity is at least quadratic in the number of data points and many algorithms to solve HAC are inherently sequential. In this paper, we propose Reciprocal Agglomerative Clustering (RAC), a distributed al…

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: 19 pages, 3 figures

  6. arXiv:2101.10639  [pdf, other]

    cs.DS

    Hierarchical Clustering via Sketches and Hierarchical Correlation Clustering

    Authors: Danny Vainstein, Vaggos Chatziafratis, Gui Citovsky, Anand Rajagopalan, Mohammad Mahdian, Yossi Azar

    Abstract: Recently, Hierarchical Clustering (HC) has been considered through the lens of optimization. In particular, two maximization objectives have been defined. Moseley and Wang defined the \emph{Revenue} objective to handle similarity information given by a weighted graph on the data points (w.l.o.g., $[0,1]$ weights), while Cohen-Addad et al. defined the \emph{Dissimilarity} objective to handle dissim…

    Submitted 26 January, 2021; originally announced January 2021.

  7. arXiv:1909.09667  [pdf, other]

    cs.LG stat.ML

    Online Hierarchical Clustering Approximations

    Authors: Aditya Krishna Menon, Anand Rajagopalan, Baris Sumengen, Gui Citovsky, Qin Cao, Sanjiv Kumar

    Abstract: Hierarchical clustering is a widely used approach for clustering datasets at multiple levels of granularity. Despite its popularity, existing algorithms such as hierarchical agglomerative clustering (HAC) are limited to the offline setting, and thus require the entire dataset to be available. This prohibits their use on large datasets commonly encountered in modern learning applications. In this p…

    Submitted 20 September, 2019; originally announced September 2019.

    Comments: 17 pages, 3 figures

  8. arXiv:1710.00876  [pdf, other]

    cs.CG

    Network Optimization on Partitioned Pairs of Points

    Authors: Esther M. Arkin, Aritra Banik, Paz Carmi, Gui Citovsky, Su Jia, Matthew J. Katz, Tyler Mayer, Joseph S. B. Mitchell

    Abstract: Given $n$ pairs of points, $\mathcal{S} = \{\{p_1, q_1\}, \{p_2, q_2\}, \dots, \{p_n, q_n\}\}$, in some metric space, we study the problem of two-coloring the points within each pair, red and blue, to optimize the cost of a pair of node-disjoint networks, one over the red points and one over the blue points. In this paper we consider our network structures to be spanning trees, traveling salesman…

    Submitted 2 October, 2017; originally announced October 2017.

  9. TSP With Locational Uncertainty: The Adversarial Model

    Authors: Gui Citovsky, Tyler Mayer, Joseph S. B. Mitchell

    Abstract: In this paper we study a natural special case of the Traveling Salesman Problem (TSP) with point-locational-uncertainty which we will call the {\em adversarial TSP} problem (ATSP). Given a metric space $(X, d)$ and a set of subsets $R = \{R_1, R_2, ... , R_n\} : R_i \subseteq X$, the goal is to devise an ordering of the regions, $σ_R$, that the tour will visit such that when a single point is chos…

    Submitted 17 May, 2017; originally announced May 2017.

    Comments: To appear, International Symposium on Computational Geometry (SoCG 2017)

    ACM Class: F.2.2; G.2.2