Skip to main content

Showing 1–23 of 23 results for author: Negahban, S

.
  1. arXiv:2006.01662  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Tree-Projected Gradient Descent for Estimating Gradient-Sparse Parameters on Graphs

    Authors: Sheng Xu, Zhou Fan, Sahand Negahban

    Abstract: We study estimation of a gradient-sparse parameter vector $\boldsymbolθ^* \in \mathbb{R}^p$, having strong gradient-sparsity $s^*:=\|\nabla_G \boldsymbolθ^*\|_0$ on an underlying graph $G$. Given observations $Z_1,\ldots,Z_n$ and a smooth, convex loss function $\mathcal{L}$ for which $\boldsymbolθ^*$ minimizes the population risk $\mathbb{E}[\mathcal{L}(\boldsymbolθ;Z_1,\ldots,Z_n)]$, we propose t… ▽ More

    Submitted 31 May, 2020; originally announced June 2020.

  2. arXiv:1912.01417  [pdf, other

    math.ST stat.ML

    Distributed Machine Learning with Sparse Heterogeneous Data

    Authors: Dominic Richards, Sahand N. Negahban, Patrick Rebeschini

    Abstract: Motivated by distributed machine learning settings such as Federated Learning, we consider the problem of fitting a statistical model across a distributed collection of heterogeneous data sets whose similarity structure is encoded by a graph topology. Precisely, we analyse the case where each node is associated with fitting a sparse linear model, and edges join two nodes if the difference of their… ▽ More

    Submitted 27 November, 2021; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: NeurIPS 2021 camera ready

  3. arXiv:1901.00301  [pdf, other

    cs.LG stat.ML

    Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback

    Authors: Chicheng Zhang, Alekh Agarwal, Hal Daumé III, John Langford, Sahand N Negahban

    Abstract: We investigate the feasibility of learning from a mix of both fully-labeled supervised data and contextual bandit data. We specifically consider settings in which the underlying learning signal may be different between these two data sources. Theoretically, we state and prove no-regret algorithms for learning that is robust to misaligned cost distributions between the two sources. Empirically, we… ▽ More

    Submitted 21 June, 2019; v1 submitted 2 January, 2019; originally announced January 2019.

    Comments: 42 pages, 21 figures, ICML 2019

  4. arXiv:1810.09401  [pdf, other

    cs.IR cs.LG stat.ML

    Alternating Linear Bandits for Online Matrix-Factorization Recommendation

    Authors: Hamid Dadkhahi, Sahand Negahban

    Abstract: We consider the problem of online collaborative filtering in the online setting, where items are recommended to the users over time. At each time step, the user (selected by the environment) consumes an item (selected by the agent) and provides a rating of the selected item. In this paper, we propose a novel algorithm for online matrix factorization recommendation that combines linear bandits and… ▽ More

    Submitted 22 October, 2018; originally announced October 2018.

  5. arXiv:1810.04247  [pdf, other

    cs.LG stat.ML

    Feature Selection using Stochastic Gates

    Authors: Yutaro Yamada, Ofir Lindenbaum, Sahand Negahban, Yuval Kluger

    Abstract: Feature selection problems have been extensively studied for linear estimation, for instance, Lasso, but less emphasis has been placed on feature selection for non-linear functions. In this study, we propose a method for feature selection in high-dimensional non-linear function estimation problems. The new procedure is based on minimizing the $\ell_0$ norm of the vector of indicator variables that… ▽ More

    Submitted 26 July, 2020; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: Published in ICML 2020

    Journal ref: Proceedings of Machine Learning and Systems 2020, pages 8952--8963

  6. arXiv:1710.07006  [pdf, ps, other

    stat.ML

    Minimax Estimation of Bandable Precision Matrices

    Authors: Addison Hu, Sahand Negahban

    Abstract: The inverse covariance matrix provides considerable insight for understanding statistical models in the multivariate setting. In particular, when the distribution over variables is assumed to be multivariate normal, the sparsity pattern in the inverse covariance matrix, commonly referred to as the precision matrix, corresponds to the adjacency matrix representation of the Gauss-Markov graph, which… ▽ More

    Submitted 19 October, 2017; originally announced October 2017.

  7. arXiv:1704.07228  [pdf, other

    stat.ML cs.LG

    Learning from Comparisons and Choices

    Authors: Sahand Negahban, Sewoong Oh, Kiran K. Thekumparampil, Jiaming Xu

    Abstract: When tracking user-specific online activities, each user's preference is revealed in the form of choices and comparisons. For example, a user's purchase history is a record of her choices, i.e. which item was chosen among a subset of offerings. A user's preferences can be observed either explicitly as in movie ratings or implicitly as in viewing times of news articles. Given such individualized or… ▽ More

    Submitted 30 December, 2018; v1 submitted 24 April, 2017; originally announced April 2017.

    Comments: 77 pages, 12 figures; added new experiments and references. arXiv admin note: substantial text overlap with arXiv:1506.07947

  8. arXiv:1703.02723  [pdf, other

    stat.ML cs.IT cs.LG

    Scalable Greedy Feature Selection via Weak Submodularity

    Authors: Rajiv Khanna, Ethan Elenberg, Alexandros G. Dimakis, Sahand Negahban, Joydeep Ghosh

    Abstract: Greedy algorithms are widely used for problems in machine learning such as feature selection and set function optimization. Unfortunately, for large datasets, the running time of even greedy algorithms can be quite high. This is because for each greedy step we need to refit a model or calculate a function using the previously selected choices and the new candidate. Two algorithms that are faster… ▽ More

    Submitted 8 March, 2017; originally announced March 2017.

    Comments: To appear in AISTATS 2017

  9. arXiv:1703.02721  [pdf, other

    stat.ML cs.IT cs.LG

    On Approximation Guarantees for Greedy Low Rank Optimization

    Authors: Rajiv Khanna, Ethan Elenberg, Alexandros G. Dimakis, Sahand Negahban

    Abstract: We provide new approximation guarantees for greedy low rank matrix estimation under standard assumptions of restricted strong convexity and smoothness. Our novel analysis also uncovers previously unknown connections between the low rank estimation and combinatorial optimization, so much so that our bounds are reminiscent of corresponding approximation bounds in submodular maximization. Additionall… ▽ More

    Submitted 8 March, 2017; originally announced March 2017.

  10. arXiv:1612.00804  [pdf, other

    stat.ML cs.IT cs.LG

    Restricted Strong Convexity Implies Weak Submodularity

    Authors: Ethan R. Elenberg, Rajiv Khanna, Alexandros G. Dimakis, Sahand Negahban

    Abstract: We connect high-dimensional subset selection and submodular maximization. Our results extend the work of Das and Kempe (2011) from the setting of linear regression to arbitrary objective functions. For greedy feature selection, this connection allows us to obtain strong multiplicative performance bounds on several methods without statistical modeling assumptions. We also derive recovery guarantees… ▽ More

    Submitted 12 October, 2017; v1 submitted 2 December, 2016; originally announced December 2016.

  11. arXiv:1610.09600  [pdf, other

    stat.ML

    Super-resolution estimation of cyclic arrival rates

    Authors: Ningyuan Chen, Donald K. K. Lee, Sahand Negahban

    Abstract: Exploiting the fact that most arrival processes exhibit cyclic behaviour, we propose a simple procedure for estimating the intensity of a nonhomogeneous Poisson process. The estimator is the super-resolution analogue to Shao 2010 and Shao & Lii 2011, which is a sum of $p$ sinusoids where $p$ and the frequency, amplitude, and phase of each wave are not known and need to be estimated. This results i… ▽ More

    Submitted 27 February, 2019; v1 submitted 29 October, 2016; originally announced October 2016.

    Comments: 32 pages, 5 figures

    MSC Class: 62M15; 90B22; 60G55

    Journal ref: Annals of Statistics 47:3:1754-1775 (2019)

  12. Understanding Adversarial Training: Increasing Local Stability of Neural Nets through Robust Optimization

    Authors: Uri Shaham, Yutaro Yamada, Sahand Negahban

    Abstract: We propose a general framework for increasing local stability of Artificial Neural Nets (ANNs) using Robust Optimization (RO). We achieve this through an alternating minimization-maximization procedure, in which the loss of the network is minimized over perturbed examples that are generated at each parameter update. We show that adversarial training of ANNs is in fact robustification of the networ… ▽ More

    Submitted 16 January, 2016; v1 submitted 17 November, 2015; originally announced November 2015.

  13. arXiv:1410.0860  [pdf, ps, other

    stat.ML

    Individualized Rank Aggregation using Nuclear Norm Regularization

    Authors: Yu Lu, Sahand N. Negahban

    Abstract: In recent years rank aggregation has received significant attention from the machine learning community. The goal of such a problem is to combine the (partially revealed) preferences over objects of a large population into a single, relatively consistent ordering of those objects. However, in many cases, we might not want a single ranking and instead opt for individual rankings. We study a version… ▽ More

    Submitted 3 October, 2014; originally announced October 2014.

  14. arXiv:1209.3775  [pdf, other

    astro-ph.IM stat.AP

    Using Machine Learning for Discovery in Synoptic Survey Imaging

    Authors: Henrik Brink, Joseph W. Richards, Dovi Poznanski, Joshua S. Bloom, John Rice, Sahand Negahban, Martin Wainwright

    Abstract: Modern time-domain surveys continuously monitor large swaths of the sky to look for astronomical variability. Astrophysical discovery in such data sets is complicated by the fact that detections of real transient and variable sources are highly outnumbered by bogus detections caused by imperfect subtractions, atmospheric effects and detector artefacts. In this work we present a machine learning (M… ▽ More

    Submitted 17 September, 2012; originally announced September 2012.

    Comments: 16 pages, 14 figures

  15. arXiv:1209.1688  [pdf, other

    cs.LG stat.ML

    Rank Centrality: Ranking from Pair-wise Comparisons

    Authors: Sahand Negahban, Sewoong Oh, Devavrat Shah

    Abstract: The question of aggregating pair-wise comparisons to obtain a global ranking over a collection of objects has been of interest for a very long time: be it ranking of online gamers (e.g. MSR's TrueSkill system) and chess players, aggregating social opinions, or deciding which product to sell based on transactions. In most settings, in addition to obtaining a ranking, finding `scores' for each objec… ▽ More

    Submitted 12 November, 2015; v1 submitted 8 September, 2012; originally announced September 2012.

    Comments: 45 pages, 3 figures

  16. arXiv:1208.1860  [pdf, other

    cs.DB cs.LG

    Scaling Multiple-Source Entity Resolution using Statistically Efficient Transfer Learning

    Authors: Sahand Negahban, Benjamin I. P. Rubinstein, Jim Gemmell

    Abstract: We consider a serious, previously-unexplored challenge facing almost all approaches to scaling up entity resolution (ER) to multiple data sources: the prohibitive cost of labeling training data for supervised learning of similarity scores for each pair of sources. While there exists a rich literature describing almost all aspects of pairwise ER, this new challenge is arising now due to the unprece… ▽ More

    Submitted 9 August, 2012; originally announced August 2012.

    Comments: Short version to appear in CIKM'2012; 10 pages, 7 figures

    ACM Class: H.2; I.2.6; I.5.4

  17. arXiv:1207.4421  [pdf, ps, other

    stat.ML cs.LG math.OC

    Stochastic optimization and sparse statistical recovery: An optimal algorithm for high dimensions

    Authors: Alekh Agarwal, Sahand Negahban, Martin J. Wainwright

    Abstract: We develop and analyze stochastic optimization algorithms for problems in which the expected loss is strongly convex, and the optimum is (approximately) sparse. Previous approaches are able to exploit only one of these two structures, yielding an $\order(\pdim/T)$ convergence rate for strongly convex objectives in $\pdim$ dimensions, and an $\order(\sqrt{(\spindex \log \pdim)/T})$ convergence rate… ▽ More

    Submitted 18 July, 2012; originally announced July 2012.

    Comments: 2 figures

  18. arXiv:1104.4824  [pdf, ps, other

    stat.ML cs.IT

    Fast global convergence of gradient methods for high-dimensional statistical recovery

    Authors: Alekh Agarwal, Sahand N. Negahban, Martin J. Wainwright

    Abstract: Many statistical $M$-estimators are based on convex optimization problems formed by the combination of a data-dependent loss function with a norm-based regularizer. We analyze the convergence rates of projected gradient and composite gradient methods for solving such problems, working within a high-dimensional framework that allows the data dimension $\pdim$ to grow with (and possibly exceed) the… ▽ More

    Submitted 25 July, 2012; v1 submitted 25 April, 2011; originally announced April 2011.

  19. arXiv:1102.4807  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Noisy matrix decomposition via convex relaxation: Optimal rates in high dimensions

    Authors: Alekh Agarwal, Sahand N. Negahban, Martin J. Wainwright

    Abstract: We analyze a class of estimators based on convex relaxation for solving high-dimensional matrix decomposition problems. The observations are noisy realizations of a linear transformation $\mathfrak{X}$ of the sum of an approximately) low rank matrix $Θ^\star$ with a second matrix $Γ^\star$ endowed with a complementary form of low-dimensional structure; this set-up includes many statistical models… ▽ More

    Submitted 6 March, 2012; v1 submitted 23 February, 2011; originally announced February 2011.

    Comments: 41 pages, 2 figures

    Report number: IMS-AOS-AOS1000 MSC Class: 62F30; 62F30 (Primary) 62H12 (Secondary)

    Journal ref: Annals of Statistics 2012, Vol. 40, No. 2, 1171-1197

  20. arXiv:1010.2731  [pdf, ps, other

    math.ST cs.IT stat.ME

    A Unified Framework for High-Dimensional Analysis of M-Estimators with Decomposable Regularizers

    Authors: Sahand N. Negahban, Pradeep Ravikumar, Martin J. Wainwright, Bin Yu

    Abstract: High-dimensional statistical inference deals with models in which the the number of parameters p is comparable to or larger than the sample size n. Since it is usually impossible to obtain consistent procedures unless $p/n\rightarrow0$, a line of recent work has studied models with various types of low-dimensional structure, including sparse vectors, sparse and structured matrices, low-rank matric… ▽ More

    Submitted 12 March, 2013; v1 submitted 13 October, 2010; originally announced October 2010.

    Comments: Published in at http://dx.doi.org/10.1214/12-STS400 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS400

    Journal ref: Statistical Science 2012, Vol. 27, No. 4, 538-557

  21. arXiv:1009.2118  [pdf, ps, other

    cs.IT math.ST

    Restricted strong convexity and weighted matrix completion: Optimal bounds with noise

    Authors: Sahand Negahban, Martin J. Wainwright

    Abstract: We consider the matrix completion problem under a form of row/column weighted entrywise sampling, including the case of uniform entrywise sampling as a special case. We analyze the associated random observation operator, and prove that with high probability, it satisfies a form of restricted strong convexity with respect to weighted Frobenius norm. Using this property, we obtain as corollaries a n… ▽ More

    Submitted 15 May, 2011; v1 submitted 10 September, 2010; originally announced September 2010.

  22. arXiv:0912.5100  [pdf, ps, other

    math.ST

    Estimation of (near) low-rank matrices with noise and high-dimensional scaling

    Authors: Sahand Negahban, Martin J. Wainwright

    Abstract: High-dimensional inference refers to problems of statistical estimation in which the ambient dimension of the data may be comparable to or possibly even larger than the sample size. We study an instance of high-dimensional inference in which the goal is to estimate a matrix $Θ^* \in \real^{k \times p}$ on the basis of $N$ noisy observations, and the unknown matrix $Θ^*$ is assumed to be either e… ▽ More

    Submitted 27 December, 2009; originally announced December 2009.

    Comments: Appeared as Stat. technical report, UC Berkeley

  23. arXiv:0905.0642  [pdf, ps, other

    math.ST cs.IT

    Simultaneous support recovery in high dimensions: Benefits and perils of block $\ell_1/\ell_\infty$-regularization

    Authors: S. Negahban, M. J. Wainwright

    Abstract: Consider the use of $\ell_{1}/\ell_{\infty}$-regularized regression for joint estimation of a $\pdim \times \numreg$ matrix of regression coefficients. We analyze the high-dimensional scaling of $\ell_1/\ell_\infty$-regularized quadratic programming, considering both consistency in $\ell_\infty$-norm, and variable selection. We begin by establishing bounds on the $\ell_\infty$-error as well suff… ▽ More

    Submitted 5 May, 2009; originally announced May 2009.

    Comments: Presented in part at NIPS 2008 conference, Vancouver, Canada, December 2008