Showing 1–27 of 27 results for author: Gottlieb, L

Searching in archive cs.

  1. arXiv:2312.11512  [pdf, other]

    cs.HC cs.AI

    Path Signature Representation of Patient-Clinician Interactions as a Predictor for Neuropsychological Tests Outcomes in Children: A Proof of Concept

    Authors: Giulio Falcioni, Alexandra Georgescu, Emilia Molimpakis, Lev Gottlieb, Taylor Kuhn, Stefano Goria

    Abstract: This research report presents a proof-of-concept study on the application of machine learning techniques to video and speech data collected during diagnostic cognitive assessments of children with a neurodevelopmental disorder. The study utilised a dataset of 39 video recordings, capturing extensive sessions where clinicians administered, among other things, four cognitive assessment tests. From t…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: accepted in IEEE MedAI 2023 conference proceedings

  2. arXiv:2310.15951  [pdf, other]

    cs.LG

    Weighted Distance Nearest Neighbor Condensing

    Authors: Lee-Ad Gottlieb, Timor Sharabi, Roi Weiss

    Abstract: The problem of nearest neighbor condensing has enjoyed a long history of study, both in its theoretical and practical aspects. In this paper, we introduce the problem of weighted distance nearest neighbor condensing, where one assigns weights to each point of the condensed set, and then new points are labeled based on their weighted distance nearest neighbor in the condensed set. We study the th…

    Submitted 24 October, 2023; originally announced October 2023.

  3. arXiv:2305.07791  [pdf, other]

    cs.LG cs.AI

    Using Deepfake Technologies for Word Emphasis Detection

    Authors: Eran Kaufman, Lee-Ad Gottlieb

    Abstract: In this work, we consider the task of automated emphasis detection for spoken language. This problem is challenging in that emphasis is affected by the particularities of speech of the subject, for example the subject's accent, dialect or voice. To address this task, we propose to utilize deepfake technology to produce emphasis-devoid speech for this speaker. This requires extracting the text of…

    Submitted 12 May, 2023; originally announced May 2023.

  4. arXiv:2007.06283  [pdf, ps, other]

    math.ST cs.LG math.PR

    Functions with average smoothness: structure, algorithms, and learning

    Authors: Yair Ashlagi, Lee-Ad Gottlieb, Aryeh Kontorovich

    Abstract: We initiate a program of average smoothness analysis for efficiently learning real-valued functions on metric spaces. Rather than using the Lipschitz constant as the regularizer, we define a local slope at each point and gauge the function complexity as the average of these values. Since the mean can be dramatically smaller than the maximum, this complexity measure can yield considerably sharper g…

    Submitted 8 November, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

  5. arXiv:2002.07727  [pdf, other]

    cs.DS cs.CG

    Faster Algorithms for Orienteering and $k$-TSP

    Authors: Lee-Ad Gottlieb, Robert Krauthgamer, Havana Rika

    Abstract: We consider the rooted orienteering problem in Euclidean space: Given $n$ points $P$ in $\mathbb R^d$, a root point $s\in P$ and a budget $\mathcal B>0$, find a path that starts from $s$, has total length at most $\mathcal B$, and visits as many points of $P$ as possible. This problem is known to be NP-hard, hence we study $(1-δ)$-approximation algorithms. The previous Polynomial-Time Approximatio…

    Submitted 21 April, 2022; v1 submitted 18 February, 2020; originally announced February 2020.

  6. arXiv:2002.01999  [pdf, other]

    cs.LG stat.ML

    Nested Barycentric Coordinate System as an Explicit Feature Map

    Authors: Lee-Ad Gottlieb, Eran Kaufman, Aryeh Kontorovich, Gabriel Nivasch, Ofir Pele

    Abstract: We propose a new embedding method which is particularly well-suited for settings where the sample size greatly exceeds the ambient dimension. Our technique consists of partitioning the space into simplices and then embedding the data points into features corresponding to the simplices' barycentric coordinates. We then train a linear classifier in the rich feature space obtained from the simplices.…

    Submitted 5 February, 2020; originally announced February 2020.

  7. arXiv:2002.01408  [pdf, other]

    cs.LG stat.ML

    Apportioned Margin Approach for Cost Sensitive Large Margin Classifiers

    Authors: Lee-Ad Gottlieb, Eran Kaufman, Aryeh Kontorovich

    Abstract: We consider the problem of cost sensitive multiclass classification, where we would like to increase the sensitivity of an important class at the expense of a less important one. We adopt an apportioned margin framework to address this problem, which enables an efficient margin shift between classes that share the same boundary. The decision boundary between all pairs of classes divides the…

    Submitted 4 February, 2020; originally announced February 2020.

  8. arXiv:1909.09969  [pdf, other]

    cs.LG stat.ML

    Classification in asymmetric spaces via sample compression

    Authors: Lee-Ad Gottlieb, Shira Ozeri

    Abstract: We initiate the rigorous study of classification in quasi-metric spaces. These are point sets endowed with a distance function that is non-negative and also satisfies the triangle inequality, but is asymmetric. We develop and refine a learning algorithm for quasi-metrics based on sample compression and nearest neighbor, and prove that it has favorable statistical properties.

    Submitted 22 September, 2019; originally announced September 2019.

  9. arXiv:1907.06857  [pdf, other]

    cs.DS cs.CG

    Labelings vs. Embeddings: On Distributed Representations of Distances

    Authors: Arnold Filtser, Lee-Ad Gottlieb, Robert Krauthgamer

    Abstract: We investigate for which metric spaces the performance of distance labeling and of $\ell_\infty$-embeddings differ, and how significant this difference can be. Recall that a distance labeling is a distributed representation of distances in a metric space $(X,d)$, where each point $x\in X$ is assigned a succinct label, such that the distance between any two points $x,y \in X$ can be approximated gi…

    Submitted 20 September, 2023; v1 submitted 16 July, 2019; originally announced July 2019.

  10. arXiv:1904.03611  [pdf, ps, other]

    cs.CG

    Near-linear time approximation schemes for Steiner tree and forest in low-dimensional spaces

    Authors: Lee-Ad Gottlieb, Yair Bartal

    Abstract: We give an algorithm that computes a $(1+ε)$-approximate Steiner forest in near-linear time $n \cdot 2^{(1/ε)^{O(ddim^2)} (\log \log n)^2}$. This is a dramatic improvement upon the best previous result due to Chan et al., who gave a runtime of $n^{2^{O(ddim)}} \cdot 2^{(ddim/ε)^{O(ddim)} \sqrt{\log n}}$. For Steiner tree our methods achieve an even better runtime…

    Submitted 7 April, 2019; originally announced April 2019.

  11. arXiv:1805.09719  [pdf, other]

    cs.LG cs.CC cs.CG stat.ML

    Learning convex polyhedra with margin

    Authors: Lee-Ad Gottlieb, Eran Kaufman, Aryeh Kontorovich, Gabriel Nivasch

    Abstract: We present an improved algorithm for quasi-properly learning convex polyhedra in the realizable PAC setting from data with a margin. Our learning algorithm constructs a consistent polyhedron as an intersection of about $t \log t$ halfspaces with constant-size margins in time polynomial in $t$ (where $t$ is the number of halfspaces forming an optimal polyhedron). We also identify distinct gen…

    Submitted 2 November, 2021; v1 submitted 24 May, 2018; originally announced May 2018.

  12. arXiv:1512.01775  [pdf, ps, other]

    cs.CG cs.DS

    Approximate nearest neighbor search for $\ell_p$-spaces ($2 < p < \infty$) via embeddings

    Authors: Yair Bartal, Lee-Ad Gottlieb

    Abstract: While the problem of approximate nearest neighbor search has been well-studied for Euclidean space and $\ell_1$, few non-trivial algorithms are known for $\ell_p$ when $2 < p < \infty$. In this paper, we revisit this fundamental problem and present approximate nearest-neighbor search algorithms which give the first non-trivial approximation factor guarantees in this setting.

    Submitted 6 December, 2015; originally announced December 2015.

    Comments: arXiv admin note: substantial text overlap with arXiv:1408.1789

  13. arXiv:1505.03681  [pdf, ps, other]

    cs.CG

    A light metric spanner

    Authors: Lee-Ad Gottlieb

    Abstract: It has long been known that $d$-dimensional Euclidean point sets admit $(1+ε)$-stretch spanners with lightness $W_E = ε^{-O(d)}$, that is, total edge weight at most $W_E$ times the weight of the minimum spanning tree of the set [DHN93]. Whether or not a similar result holds for metric spaces with low doubling dimension has remained an important open problem, and has resisted numerous attempts at res…

    Submitted 14 May, 2015; originally announced May 2015.

  14. arXiv:1503.04250  [pdf, other]

    cs.MM cs.CL

    The YLI-MED Corpus: Characteristics, Procedures, and Plans

    Authors: Julia Bernd, Damian Borth, Benjamin Elizalde, Gerald Friedland, Heather Gallagher, Luke Gottlieb, Adam Janin, Sara Karabashlieva, Jocelyn Takahashi, Jennifer Won

    Abstract: The YLI Multimedia Event Detection corpus is a public-domain index of videos with annotations and computed features, specialized for research in multimedia event detection (MED), i.e., automatically identifying what's happening in a video by analyzing the audio and visual content. The videos indexed in the YLI-MED corpus are a subset of the larger YLI feature corpus, which is being developed by th…

    Submitted 13 March, 2015; originally announced March 2015.

    Comments: 47 pages; 3 figures; 25 tables. Also published as ICSI Technical Report TR-15-001

    Report number: TR-15-001

  15. arXiv:1502.06208  [pdf, ps, other]

    cs.LG cs.CC cs.DS

    Nearly optimal classification for semimetrics

    Authors: Lee-Ad Gottlieb, Aryeh Kontorovich

    Abstract: We initiate the rigorous study of classification in semimetric spaces, which are point sets with a distance function that is non-negative and symmetric, but need not satisfy the triangle inequality. For metric spaces, the doubling dimension essentially characterizes both the runtime and sample complexity of classification algorithms, yet we show that this is not the case for semimetrics. Instea…

    Submitted 22 February, 2015; originally announced February 2015.

    MSC Class: 51F99; 51K05; 90C27; 90C48;

  16. arXiv:1408.1789  [pdf, ps, other]

    cs.CG

    Dimension reduction techniques for $\ell_p$, $1 \le p \le 2$, with applications

    Authors: Yair Bartal, Lee-Ad Gottlieb

    Abstract: For Euclidean space ($\ell_2$), there exists the powerful dimension reduction transform of Johnson and Lindenstrauss, with a host of known applications. Here, we consider the problem of dimension reduction for all $\ell_p$ spaces $1 \le p \le 2$. Although strong lower bounds are known for dimension reduction in $\ell_1$, Ostrovsky and Rabani successfully circumvented these by presenting an…

    Submitted 6 December, 2015; v1 submitted 8 August, 2014; originally announced August 2014.

  17. arXiv:1406.2107  [pdf, other]

    cs.DS cs.CG

    Optimizing Budget Allocation in Graphs

    Authors: Boaz Ben-Moshe, Michael Elkin, Lee-Ad Gottlieb, Eran Omri

    Abstract: In the classical facility location problem we consider a graph $G$ with fixed weights on the edges of $G$. The goal is then to find an optimal positioning for a set of facilities on the graph with respect to some objective function. We introduce a new framework for facility location problems, where the weights on the graph edges are not fixed, but rather should be assigned. The goal is to find a v…

    Submitted 9 June, 2014; originally announced June 2014.

  18. arXiv:1404.3368  [pdf, other]

    cs.LG cs.CC

    Near-optimal sample compression for nearest neighbors

    Authors: Lee-Ad Gottlieb, Aryeh Kontorovich, Pinhas Nisnevitch

    Abstract: We present the first sample compression algorithm for nearest neighbors with non-trivial performance guarantees. We complement these guarantees by demonstrating almost matching hardness lower bounds, which show that our bound is nearly optimal. Our result yields new insight into margin-based nearest neighbor classification in metric spaces and allows us to significantly sharpen and simplify existi…

    Submitted 26 March, 2018; v1 submitted 13 April, 2014; originally announced April 2014.

  19. arXiv:1401.5014  [pdf, ps, other]

    cs.CG cs.DS

    Light spanners for snowflake metrics

    Authors: Lee-Ad Gottlieb, Shay Solomon

    Abstract: A classic result in the study of spanners is the existence of light low-stretch spanners for Euclidean spaces. These spanners have arbitrarily low stretch, and weight only a constant factor greater than that of the minimum spanning tree of the points (with dependence on the stretch and Euclidean dimension). A central open problem in this field asks whether other spaces admit low weight spanners as w…

    Submitted 20 January, 2014; originally announced January 2014.

  20. arXiv:1308.4996  [pdf, other]

    cs.CG

    On the Impossibility of Dimension Reduction for Doubling Subsets of $\ell_p$, $p>2$

    Authors: Yair Bartal, Lee-Ad Gottlieb, Ofer Neiman

    Abstract: A major open problem in the field of metric embedding is the existence of dimension reduction for $n$-point subsets of Euclidean space, such that both distortion and dimension depend only on the doubling constant of the pointset, and not on its cardinality. In this paper, we negate this possibility for $\ell_p$ spaces with $p>2$. In particular, we introduce an $n$-point subset of $\ell_p$ wi…

    Submitted 22 August, 2013; originally announced August 2013.

  21. arXiv:1306.2547  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Efficient Classification for Metric Data

    Authors: Lee-Ad Gottlieb, Aryeh Kontorovich, Robert Krauthgamer

    Abstract: Recent advances in large-margin classification of data residing in general metric spaces (rather than Hilbert spaces) enable classification under various natural metrics, such as string edit and earthmover distance. A general framework developed for this purpose by von Luxburg and Bousquet [JMLR, 2004] left open the questions of computational efficiency and of providing direct bounds on generaliza…

    Submitted 10 July, 2014; v1 submitted 11 June, 2013; originally announced June 2013.

    Comments: This is the full version of an extended abstract that appeared in Proceedings of the 23rd COLT, 2010

  22. arXiv:1302.2752  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Adaptive Metric Dimensionality Reduction

    Authors: Lee-Ad Gottlieb, Aryeh Kontorovich, Robert Krauthgamer

    Abstract: We study adaptive data-dependent dimensionality reduction in the context of supervised learning in general metric spaces. Our main statistical contribution is a generalization bound for Lipschitz functions in metric spaces that are doubling, or nearly doubling. On the algorithmic front, we describe an analogue of PCA for metric spaces: namely an efficient procedure that approximates the data's int…

    Submitted 25 March, 2015; v1 submitted 12 February, 2013; originally announced February 2013.

  23. arXiv:1112.0699  [pdf, ps, other]

    cs.CC cs.DS

    The Traveling Salesman Problem: Low-Dimensionality Implies a Polynomial Time Approximation Scheme

    Authors: Yair Bartal, Lee-Ad Gottlieb, Robert Krauthgamer

    Abstract: The Traveling Salesman Problem (TSP) is among the most famous NP-hard optimization problems. We design for this problem a randomized polynomial-time algorithm that computes a (1+eps)-approximation to the optimal tour, for any fixed eps>0, in TSP instances that form an arbitrary metric space with bounded intrinsic dimension. The celebrated results of Arora (A-98) and Mitchell (M-99) prove that th…

    Submitted 9 April, 2015; v1 submitted 3 December, 2011; originally announced December 2011.

  24. arXiv:1111.4470  [pdf, ps, other]

    cs.LG

    Efficient Regression in Metric Spaces via Approximate Lipschitz Extension

    Authors: Lee-Ad Gottlieb, Aryeh Kontorovich, Robert Krauthgamer

    Abstract: We present a framework for performing efficient regression in general metric spaces. Roughly speaking, our regressor predicts the value at a new point by computing a Lipschitz extension (the smoothest function consistent with the observed data) after performing structural risk minimization to avoid overfitting. We obtain finite-sample risk bounds with minimal structural and noise assumptions…

    Submitted 24 April, 2017; v1 submitted 18 November, 2011; originally announced November 2011.

  25. arXiv:1008.1498  [pdf, ps, other]

    cs.CC math.NA

    Matrix sparsification and the sparse null space problem

    Authors: Lee-Ad Gottlieb, Tyler Neylon

    Abstract: We revisit the matrix problems sparse null space and matrix sparsification, and show that they are equivalent. We then proceed to seek algorithms for these problems: We prove the hardness of approximation of these problems, and also give a powerful tool to extend algorithms and heuristics for sparse approximation theory to these problems.

    Submitted 9 August, 2010; originally announced August 2010.

    Comments: A preliminary version appeared in Approx '10

  26. arXiv:1008.1480  [pdf, ps, other]

    cs.DS

    Fast, precise and dynamic distance queries

    Authors: Yair Bartal, Lee-Ad Gottlieb, Tsvi Kopelowitz, Moshe Lewenstein, Liam Roditty

    Abstract: We present an approximate distance oracle for a point set S with n points and doubling dimension λ. For every ε>0, the oracle supports (1+ε)-approximate distance queries in (universal) constant time, occupies space [ε^{-O(λ)} + 2^{O(λ log λ)}]n, and can be constructed in [2^{O(λ)} log^3 n + ε^{-O(λ)} + 2^{O(λ log λ)}]n expected time. This improves upon the best previously known constructions, prese…

    Submitted 9 August, 2010; originally announced August 2010.

  27. arXiv:0907.5477  [pdf, ps, other]

    cs.CG cs.DS math.FA math.MG

    A Nonlinear Approach to Dimension Reduction

    Authors: Lee-Ad Gottlieb, Robert Krauthgamer

    Abstract: The $l_2$ flattening lemma of Johnson and Lindenstrauss [JL84] is a powerful tool for dimension reduction. It has been conjectured that the target dimension bounds can be refined and bounded in terms of the intrinsic dimensionality of the data set (for example, the doubling dimension). One such problem was proposed by Lang and Plaut [LP01] (see also [GKL03,MatousekProblems07,ABN08,CGT10]), and is…

    Submitted 14 May, 2015; v1 submitted 31 July, 2009; originally announced July 2009.