Skip to main content

Showing 51–91 of 91 results for author: Pilanci, M

.
  1. arXiv:2107.05680  [pdf, other

    cs.LG cs.CV eess.IV math.OC stat.ML

    Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions

    Authors: Arda Sahiner, Tolga Ergen, Batu Ozturkler, Burak Bartan, John Pauly, Morteza Mardani, Mert Pilanci

    Abstract: Generative Adversarial Networks (GANs) are commonly used for modeling complex distributions of data. Both the generators and discriminators of GANs are often modeled by neural networks, posing a non-transparent optimization problem which is non-convex and non-concave over the generator and discriminator, respectively. Such networks are often heuristically optimized with gradient descent-ascent (GD… ▽ More

    Submitted 21 March, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Published as paper in ICLR 2022. First two authors contributed equally to this work; 34 pages, 11 figures

  2. arXiv:2105.07291  [pdf, other

    math.OC cs.LG

    Adaptive Newton Sketch: Linear-time Optimization with Quadratic Convergence and Effective Hessian Dimensionality

    Authors: Jonathan Lacotte, Yifei Wang, Mert Pilanci

    Abstract: We propose a randomized algorithm with quadratic convergence rate for convex optimization problems with a self-concordant, composite, strongly convex objective function. Our method is based on performing an approximate Newton step using a random projection of the Hessian. Our first contribution is to show that, at each iteration, the embedding dimension (or sketch size) can be as small as the effe… ▽ More

    Submitted 15 May, 2021; originally announced May 2021.

  3. arXiv:2105.01420  [pdf, ps, other

    cs.LG stat.ML

    Training Quantized Neural Networks to Global Optimality via Semidefinite Programming

    Authors: Burak Bartan, Mert Pilanci

    Abstract: Neural networks (NNs) have been extremely successful across many tasks in machine learning. Quantization of NN weights has become an important topic due to its impact on their energy efficiency, inference time and deployment on hardware. Although post-training quantization is well-studied, training optimal quantized NNs involves combinatorial non-convex optimization problems which appear intractab… ▽ More

    Submitted 5 May, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: v2: Minor edits in the text. The results are unchanged

  4. arXiv:2104.14101  [pdf, other

    cs.LG

    Fast Convex Quadratic Optimization Solvers with Adaptive Sketching-based Preconditioners

    Authors: Jonathan Lacotte, Mert Pilanci

    Abstract: We consider least-squares problems with quadratic regularization and propose novel sketching-based iterative methods with an adaptive sketch size. The sketch size can be as small as the effective dimension of the data matrix to guarantee linear convergence. However, a major difficulty in choosing the sketch size in terms of the effective dimension lies in the fact that the latter is usually unknow… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  5. arXiv:2103.07578  [pdf, other

    cs.LG cs.IT math.OC

    Efficient Randomized Subspace Embeddings for Distributed Optimization under a Communication Budget

    Authors: Rajarshi Saha, Mert Pilanci, Andrea J. Goldsmith

    Abstract: We study first-order optimization algorithms under the constraint that the descent direction is quantized using a pre-specified budget of $R$-bits per dimension, where $R \in (0 ,\infty)$. We propose computationally efficient optimization algorithms with convergence rates matching the information-theoretic performance lower bounds for: (i) Smooth and Strongly-Convex objectives with access to an Ex… ▽ More

    Submitted 15 August, 2022; v1 submitted 12 March, 2021; originally announced March 2021.

    Comments: 41 pages, 26 figures, 1 table. This work has been accepted for publication in the IEEE Journal on Selected Areas in Information Theory (JSAIT), Spl. issue on Distributed Coding and Computation

  6. arXiv:2103.01499  [pdf, other

    cs.LG math.OC stat.ML

    Demystifying Batch Normalization in ReLU Networks: Equivalent Convex Optimization Models and Implicit Regularization

    Authors: Tolga Ergen, Arda Sahiner, Batu Ozturkler, John Pauly, Morteza Mardani, Mert Pilanci

    Abstract: Batch Normalization (BN) is a commonly used technique to accelerate and stabilize training of deep neural networks. Despite its empirical success, a full theoretical understanding of BN is yet to be developed. In this work, we analyze BN through the lens of convex optimization. We introduce an analytic framework based on convex duality to obtain exact convex representations of weight-decay regular… ▽ More

    Submitted 21 March, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted to ICLR 2022. First two authors contributed equally to this work; 36 pages, 13 figures

  7. arXiv:2102.03088  [pdf, other

    cs.LG

    Boost AI Power: Data Augmentation Strategies with unlabelled Data and Conformal Prediction, a Case in Alternative Herbal Medicine Discrimination with Electronic Nose

    Authors: Li Liu, Xianghao Zhan, Rumeng Wu, Xiaoqing Guan, Zhan Wang, Wei Zhang, Mert Pilanci, You Wang, Zhiyuan Luo, Guang Li

    Abstract: Electronic nose has been proven to be effective in alternative herbal medicine classification, but due to the nature of supervised learning, previous research heavily relies on the labelled training data, which are time-costly and labor-intensive to collect. To alleviate the critical dependency on the training data in real-world applications, this study aims to improve classification accuracy via… ▽ More

    Submitted 17 July, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

  8. arXiv:2101.02429  [pdf, other

    cs.LG cs.CC math.OC stat.ML

    Neural Spectrahedra and Semidefinite Lifts: Global Convex Optimization of Polynomial Activation Neural Networks in Fully Polynomial-Time

    Authors: Burak Bartan, Mert Pilanci

    Abstract: The training of two-layer neural networks with nonlinear activation functions is an important non-convex optimization problem with numerous applications and promising performance in layerwise deep learning. In this paper, we develop exact convex optimization formulations for two-layer neural networks with second degree polynomial activations based on semidefinite programming. Remarkably, we show t… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

  9. arXiv:2012.13329  [pdf, other

    cs.LG cs.AI cs.CC stat.ML

    Vector-output ReLU Neural Network Problems are Copositive Programs: Convex Analysis of Two Layer Networks and Polynomial-time Algorithms

    Authors: Arda Sahiner, Tolga Ergen, John Pauly, Mert Pilanci

    Abstract: We describe the convex semi-infinite dual of the two-layer vector-output ReLU neural network training problem. This semi-infinite dual admits a finite dimensional representation, but its support is over a convex set which is difficult to characterize. In particular, we demonstrate that the non-convex neural network training problem is equivalent to a finite-dimensional convex copositive program. O… ▽ More

    Submitted 20 December, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

    Comments: 25 pages, 6 figures

  10. arXiv:2012.07054  [pdf, other

    cs.IT cs.LG

    Adaptive and Oblivious Randomized Subspace Methods for High-Dimensional Optimization: Sharp Analysis and Lower Bounds

    Authors: Jonathan Lacotte, Mert Pilanci

    Abstract: We propose novel randomized optimization methods for high-dimensional convex problems based on restrictions of variables to random subspaces. We consider oblivious and data-adaptive subspaces and study their approximation properties via convex duality and Fenchel conjugates. A suitable adaptive subspace can be generated by sampling a correlated random matrix whose second order statistics mirror th… ▽ More

    Submitted 13 December, 2020; originally announced December 2020.

  11. arXiv:2012.05169  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Convex Regularization Behind Neural Reconstruction

    Authors: Arda Sahiner, Morteza Mardani, Batu Ozturkler, Mert Pilanci, John Pauly

    Abstract: Neural networks have shown tremendous potential for reconstructing high-resolution images in inverse problems. The non-convex and opaque nature of neural networks, however, hinders their utility in sensitive applications such as medical imaging. To cope with this challenge, this paper advocates a convex duality framework that makes a two-layer fully-convolutional ReLU denoising network amenable to… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

  12. arXiv:2011.09709  [pdf, other

    cs.IT math.NA

    Approximate Weighted $CR$ Coded Matrix Multiplication

    Authors: Neophytos Charalambides, Mert Pilanci, Alfred Hero

    Abstract: One of the most common, but at the same time expensive operations in linear algebra, is multiplying two matrices $A$ and $B$. With the rapid development of machine learning and increases in data volume, performing fast matrix intensive multiplications has become a major hurdle. Two different approaches to overcoming this issue are, 1) to approximate the product; and 2) to perform the multiplicatio… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: 5 pages, 1 figure, conference

    MSC Class: 65Fxx; 65Dxx ACM Class: E.4; H.1.1

  13. Linear Predictive Coding for Acute Stress Prediction from Computer Mouse Movements

    Authors: Lawrence H. Kim, Rahul Goel, Jia Liang, Mert Pilanci, Pablo E. Paredes

    Abstract: Prior work demonstrated the potential of using the Linear Predictive Coding (LPC) filter to approximate muscle stiffness and dam** from computer mouse movements to predict acute stress levels of users. Theoretically, muscle stiffness and dam** in the arm can be estimated using a mass-spring-damper (MSD) biomechanical model. However, the dam** frequency (i.e., stiffness) and dam** ratio val… ▽ More

    Submitted 15 December, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: The first three authors contributed equally. 5 pages, 6 figures, 2 tables, published at EMBC'21

    Journal ref: 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  14. arXiv:2007.01327  [pdf, other

    cs.LG math.OC stat.ML

    Debiasing Distributed Second Order Optimization with Surrogate Sketching and Scaled Regularization

    Authors: Michał Dereziński, Burak Bartan, Mert Pilanci, Michael W. Mahoney

    Abstract: In distributed second order optimization, a standard strategy is to average many local estimates, each of which is based on a small sketch or batch of the data. However, the local estimates on each machine are typically biased, relative to the full solution on all of the data, and this can limit the effectiveness of averaging. Here, we introduce a new technique for debiasing the local estimates, w… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

  15. arXiv:2006.14798  [pdf, other

    cs.LG cs.CC stat.ML

    Implicit Convex Regularizers of CNN Architectures: Convex Optimization of Two- and Three-Layer Networks in Polynomial Time

    Authors: Tolga Ergen, Mert Pilanci

    Abstract: We study training of Convolutional Neural Networks (CNNs) with ReLU activations and introduce exact convex optimization formulations with a polynomial complexity with respect to the number of data samples, the number of neurons, and data dimension. More specifically, we develop a convex analytic framework utilizing semi-infinite duality to obtain equivalent convex optimization problems for several… ▽ More

    Submitted 18 March, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: Accepted for Spotlight Presentation at ICLR 2021

    Journal ref: International Conference on Learning Representations (ICLR), 2021

  16. arXiv:2006.08160  [pdf, other

    math.OC cs.IT stat.ML

    Lower Bounds and a Near-Optimal Shrinkage Estimator for Least Squares using Random Projections

    Authors: Srivatsan Sridhar, Mert Pilanci, Ayfer Özgür

    Abstract: In this work, we consider the deterministic optimization using random projections as a statistical estimation problem, where the squared distance between the predictions from the estimator and the true solution is the error metric. In approximately solving a large scale least squares problem using Gaussian sketches, we show that the sketched solution has a conditional Gaussian distribution with th… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: This work has been submitted to the IEEE Journal on Selected Areas in Information Theory (JSAIT) - Special Issue on Estimation and Inference, and is awaiting review. This document contains 37 pages and 14 figures

  17. arXiv:2006.05900  [pdf, other

    cs.LG stat.ML

    The Hidden Convex Optimization Landscape of Two-Layer ReLU Neural Networks: an Exact Characterization of the Optimal Solutions

    Authors: Yifei Wang, Jonathan Lacotte, Mert Pilanci

    Abstract: We prove that finding all globally optimal two-layer ReLU neural networks can be performed by solving a convex optimization program with cone constraints. Our analysis is novel, characterizes all optimal solutions, and does not leverage duality-based analysis which was recently used to lift neural network training into convex spaces. Given the set of solutions of our convex optimization program, w… ▽ More

    Submitted 13 March, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

  18. arXiv:2006.05874  [pdf, other

    cs.LG stat.ML

    Effective Dimension Adaptive Sketching Methods for Faster Regularized Least-Squares Optimization

    Authors: Jonathan Lacotte, Mert Pilanci

    Abstract: We propose a new randomized algorithm for solving L2-regularized least-squares problems based on sketching. We consider two of the most popular random embeddings, namely, Gaussian embeddings and the Subsampled Randomized Hadamard Transform (SRHT). While current randomized solvers for least-squares optimization prescribe an embedding dimension at least greater than the data dimension, we show that… ▽ More

    Submitted 23 October, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

  19. arXiv:2005.10848  [pdf, other

    cs.LG cs.IT stat.ML

    Global Multiclass Classification and Dataset Construction via Heterogeneous Local Experts

    Authors: Surin Ahn, Ayfer Ozgur, Mert Pilanci

    Abstract: In the domains of dataset construction and crowdsourcing, a notable challenge is to aggregate labels from a heterogeneous set of labelers, each of whom is potentially an expert in some subset of tasks (and less reliable in others). To reduce costs of hiring human labelers or training automated labeling systems, it is of interest to minimize the number of labelers while ensuring the reliability of… ▽ More

    Submitted 5 January, 2021; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: 27 pages, 8 figures, to be published in IEEE Journal on Selected Areas in Information Theory (JSAIT) - Special Issue on Estimation and Inference

  20. arXiv:2003.02948  [pdf, other

    math.NA cs.IT

    Straggler Robust Distributed Matrix Inverse Approximation

    Authors: Neophytos Charalambides, Mert Pilanci, Alfred O. Hero III

    Abstract: A cumbersome operation in numerical analysis and linear algebra, optimization, machine learning and engineering algorithms; is inverting large full-rank matrices which appears in various processes and applications. This has both numerical stability and complexity issues, as well as high expected time to compute. We address the latter issue, by proposing an algorithm which uses a black-box least sq… ▽ More

    Submitted 22 June, 2022; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: 4 pages, 1 figure, conference

    MSC Class: 65F05 (Primary); 94B60 (Secondary)

  21. arXiv:2002.11219  [pdf, ps, other

    cs.LG stat.ML

    Convex Geometry and Duality of Over-parameterized Neural Networks

    Authors: Tolga Ergen, Mert Pilanci

    Abstract: We develop a convex analytic approach to analyze finite width two-layer ReLU networks. We first prove that an optimal solution to the regularized training problem can be characterized as extreme points of a convex set, where simple solutions are encouraged via its convex geometrical properties. We then leverage this characterization to show that an optimal set of parameters yield linear spline int… ▽ More

    Submitted 30 August, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Accepted to the Journal of Machine Learning Research (JMLR)

  22. arXiv:2002.10674  [pdf, other

    cs.NE cs.LG eess.SP

    Separating the Effects of Batch Normalization on CNN Training Speed and Stability Using Classical Adaptive Filter Theory

    Authors: Elaina Chai, Mert Pilanci, Boris Murmann

    Abstract: Batch Normalization (BatchNorm) is commonly used in Convolutional Neural Networks (CNNs) to improve training speed and stability. However, there is still limited consensus on why this technique is effective. This paper uses concepts from the traditional adaptive filter domain to provide insight into the dynamics and inner workings of BatchNorm. First, we show that the convolution weight updates ha… ▽ More

    Submitted 1 June, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Presented at Asilomar Conference on Signals, Systems, and Computers, 2020

  23. arXiv:2002.10553  [pdf, other

    cs.LG cs.CC stat.ML

    Neural Networks are Convex Regularizers: Exact Polynomial-time Convex Optimization Formulations for Two-layer Networks

    Authors: Mert Pilanci, Tolga Ergen

    Abstract: We develop exact representations of training two-layer neural networks with rectified linear units (ReLUs) in terms of a single convex program with number of variables polynomial in the number of training samples and the number of hidden neurons. Our theory utilizes semi-infinite duality and minimum norm regularization. We show that ReLU networks trained with standard weight decay are equivalent t… ▽ More

    Submitted 15 August, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

  24. arXiv:2002.09773  [pdf, other

    cs.LG stat.ML

    Revealing the Structure of Deep Neural Networks via Convex Duality

    Authors: Tolga Ergen, Mert Pilanci

    Abstract: We study regularized deep neural networks (DNNs) and introduce a convex analytic framework to characterize the structure of the hidden layers. We show that a set of optimal hidden layer weights for a norm regularized DNN training problem can be explicitly found as the extreme points of a convex set. For the special case of deep linear networks, we prove that each optimal weight matrix aligns with… ▽ More

    Submitted 11 June, 2021; v1 submitted 22 February, 2020; originally announced February 2020.

    Comments: Accepted to ICML 2021

  25. arXiv:2002.09488  [pdf, other

    math.OC cs.LG

    Optimal Randomized First-Order Methods for Least-Squares Problems

    Authors: Jonathan Lacotte, Mert Pilanci

    Abstract: We provide an exact analysis of a class of randomized algorithms for solving overdetermined least-squares problems. We consider first-order methods, where the gradients are pre-conditioned by an approximation of the Hessian, based on a subspace embedding of the data matrix. This class of algorithms encompasses several randomized methods among the fastest solvers for least-squares problems. We focu… ▽ More

    Submitted 25 February, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: arXiv admin note: text overlap with arXiv:2002.00864

  26. arXiv:2002.06540  [pdf, other

    stat.ML cs.DC cs.LG

    Distributed Averaging Methods for Randomized Second Order Optimization

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We consider distributed optimization problems where forming the Hessian is computationally challenging and communication is a significant bottleneck. We develop unbiased parameter averaging methods for randomized second order optimization that employ sampling and sketching of the Hessian. Existing works do not take the bias of the estimators into consideration, which limits their application to ma… ▽ More

    Submitted 16 February, 2020; originally announced February 2020.

  27. arXiv:2002.06538  [pdf, other

    cs.DC cs.CR cs.LG

    Distributed Sketching Methods for Privacy Preserving Regression

    Authors: Burak Bartan, Mert Pilanci

    Abstract: In this work, we study distributed sketching methods for large scale regression problems. We leverage multiple randomized sketches for reducing the problem dimensions as well as preserving privacy and improving straggler resilience in asynchronous distributed systems. We derive novel approximation guarantees for classical sketching methods and analyze the accuracy of parameter averaging for distri… ▽ More

    Submitted 19 June, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

  28. arXiv:2002.02291  [pdf, other

    cs.IT

    Weighted Gradient Coding with Leverage Score Sampling

    Authors: Neophytos Charalambides, Mert Pilanci, Alfred O. Hero III

    Abstract: A major hurdle in machine learning is scalability to massive datasets. Approaches to overcome this hurdle include compression of the data matrix and distributing the computations. \textit{Leverage score sampling} provides a compressed approximation of a data matrix using an importance weighted subset. \textit{Gradient coding} has been recently proposed in distributed optimization to compute the gr… ▽ More

    Submitted 15 September, 2020; v1 submitted 30 January, 2020; originally announced February 2020.

    Comments: 4 pages, 2 figures, 2 tables, conference

    MSC Class: 94Bxx; 94B60

  29. arXiv:2002.02208  [pdf, ps, other

    math.OC cs.LG

    Global Convergence of Frank Wolfe on One Hidden Layer Networks

    Authors: Alexandre d'Aspremont, Mert Pilanci

    Abstract: We derive global convergence bounds for the Frank Wolfe algorithm when training one hidden layer neural networks. When using the ReLU activation function, and under tractable preconditioning assumptions on the sample data set, the linear minimization oracle used to incrementally form the solution can be solved explicitly as a second order cone program. The classical Frank Wolfe algorithm then conv… ▽ More

    Submitted 6 February, 2020; originally announced February 2020.

  30. arXiv:2002.00864  [pdf, other

    math.OC cs.LG

    Optimal Iterative Sketching with the Subsampled Randomized Hadamard Transform

    Authors: Jonathan Lacotte, Sifan Liu, Edgar Dobriban, Mert Pilanci

    Abstract: Random projections or sketching are widely used in many algorithmic and learning contexts. Here we study the performance of iterative Hessian sketch for least-squares problems. By leveraging and extending recent results from random matrix theory on the limiting spectrum of matrices randomly projected with the subsampled randomized Hadamard transform, and truncated Haar matrices, we can study and c… ▽ More

    Submitted 23 October, 2020; v1 submitted 3 February, 2020; originally announced February 2020.

  31. arXiv:1912.03514  [pdf, ps, other

    math.OC cs.CC

    M-IHS: An Accelerated Randomized Preconditioning Method Avoiding Costly Matrix Decompositions

    Authors: Ibrahim Kurban Ozaslan, Mert Pilanci, Orhan Arikan

    Abstract: Momentum Iterative Hessian Sketch (M-IHS) techniques, a group of solvers for large scale regularized linear Least Squares (LS) problems, are proposed and analyzed in detail. Proposed M-IHS techniques are obtained by incorporating the Heavy Ball Acceleration into the Iterative Hessian Sketch algorithm and they provide significant improvements over the randomized preconditioning techniques. By using… ▽ More

    Submitted 28 November, 2020; v1 submitted 7 December, 2019; originally announced December 2019.

    MSC Class: 15B52; 65F08; 65F10; 65F22; 65F50; 68W20; 90C06 ACM Class: G.1.3; G.1.6

  32. arXiv:1911.02675  [pdf, other

    math.NA cs.CC math.OC

    Faster Least Squares Optimization

    Authors: Jonathan Lacotte, Mert Pilanci

    Abstract: We investigate iterative methods with randomized preconditioners for solving overdetermined least-squares problems, where the preconditioners are based on a random embedding of the data matrix. We consider two distinct approaches: the sketch is either computed once (fixed preconditioner), or, the random projection is refreshed at each iteration, i.e., sampled independently of previous ones (varyin… ▽ More

    Submitted 13 April, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

  33. arXiv:1907.05984  [pdf, other

    cs.DC cs.IT cs.LG

    Distributed Black-Box Optimization via Error Correcting Codes

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We introduce a novel distributed derivative-free optimization framework that is resilient to stragglers. The proposed method employs coded search directions at which the objective function is evaluated, and a decoding step to find the next iterate. Our framework can be seen as an extension of evolution strategies and structured exploration methods where structured search directions were utilized.… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

  34. arXiv:1906.11809  [pdf, other

    math.OC cs.LG

    High-Dimensional Optimization in Adaptive Random Subspaces

    Authors: Jonathan Lacotte, Mert Pilanci, Marco Pavone

    Abstract: We propose a new randomized optimization method for high-dimensional problems which can be seen as a generalization of coordinate descent to random subspaces. We show that an adaptive sampling strategy for the random subspace significantly outperforms the oblivious sampling method, which is the common choice in the recent literature. The adaptive subspace can be efficiently generated by a correlat… ▽ More

    Submitted 18 December, 2019; v1 submitted 27 June, 2019; originally announced June 2019.

  35. arXiv:1901.06811  [pdf, other

    cs.IT cs.DC cs.LG

    Straggler Resilient Serverless Computing Based on Polar Codes

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We propose a serverless computing mechanism for distributed computation based on polar codes. Serverless computing is an emerging cloud based computation model that lets users run their functions on the cloud without provisioning or managing servers. Our proposed approach is a hybrid computing framework that carries out computationally expensive tasks such as linear algebraic operations involving… ▽ More

    Submitted 12 July, 2019; v1 submitted 21 January, 2019; originally announced January 2019.

    Comments: New results added in the new version. More discussion on serverless computing

  36. arXiv:1901.00035  [pdf, other

    cs.LG stat.ML

    Convex Relaxations of Convolutional Neural Nets

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We propose convex relaxations for convolutional neural nets with one hidden layer where the output weights are fixed. For convex activation functions such as rectified linear units, the relaxations are convex second order cone programs which can be solved very efficiently. We prove that the relaxation recovers the global minimum under a planted model assumption, given sufficiently many training sa… ▽ More

    Submitted 31 December, 2018; originally announced January 2019.

  37. arXiv:1803.05288  [pdf, other

    stat.ML cs.LG

    Domain Adaptation on Graphs by Learning Aligned Graph Bases

    Authors: Mehmet Pilanci, Elif Vural

    Abstract: A common assumption in semi-supervised learning with graph models is that the class label function varies smoothly on the data graph, resulting in the rather strict prior that the label function has low-frequency content. Meanwhile, in many classification problems, the label function may vary abruptly in certain graph regions, resulting in high-frequency components. Although the semi-supervised es… ▽ More

    Submitted 4 February, 2020; v1 submitted 14 March, 2018; originally announced March 2018.

  38. arXiv:1505.02250  [pdf, other

    math.OC cs.DS cs.LG stat.ML

    Newton Sketch: A Linear-time Optimization Algorithm with Linear-Quadratic Convergence

    Authors: Mert Pilanci, Martin J. Wainwright

    Abstract: We propose a randomized second-order method for optimization known as the Newton Sketch: it is based on performing an approximate Newton step using a randomly projected or sub-sampled Hessian. For self-concordant functions, we prove that the algorithm has super-linear convergence with exponentially high probability, with convergence and complexity guarantees that are independent of condition numbe… ▽ More

    Submitted 9 May, 2015; originally announced May 2015.

  39. arXiv:1501.06195  [pdf, ps, other

    stat.ML cs.DS cs.LG stat.CO

    Randomized sketches for kernels: Fast and optimal non-parametric regression

    Authors: Yun Yang, Mert Pilanci, Martin J. Wainwright

    Abstract: Kernel ridge regression (KRR) is a standard method for performing non-parametric regression over reproducing kernel Hilbert spaces. Given $n$ samples, the time and space complexity of computing the KRR estimate scale as $\mathcal{O}(n^3)$ and $\mathcal{O}(n^2)$ respectively, and so is prohibitive in many cases. We propose approximations of KRR based on $m$-dimensional randomized sketches of the ke… ▽ More

    Submitted 25 January, 2015; originally announced January 2015.

    Comments: 27 pages, 3 figures

  40. arXiv:1411.0347  [pdf, other

    math.OC cs.IT cs.LG stat.ML

    Iterative Hessian sketch: Fast and accurate solution approximation for constrained least-squares

    Authors: Mert Pilanci, Martin J. Wainwright

    Abstract: We study randomized sketching methods for approximately solving least-squares problem with a general convex constraint. The quality of a least-squares approximation can be assessed in different ways: either in terms of the value of the quadratic objective function (cost approximation), or in terms of some distance measure between the approximate minimizer and the true minimizer (solution approxima… ▽ More

    Submitted 2 November, 2014; originally announced November 2014.

  41. arXiv:1404.7203  [pdf, ps, other

    cs.IT cs.DS math.OC stat.ML

    Randomized Sketches of Convex Programs with Sharp Guarantees

    Authors: Mert Pilanci, Martin J. Wainwright

    Abstract: Random projection (RP) is a classical technique for reducing storage and computational costs. We analyze RP-based approximations of convex programs, in which the original optimization problem is approximated by the solution of a lower-dimensional problem. Such dimensionality reduction is essential in computation-limited settings, since the complexity of general convex programming can be quite high… ▽ More

    Submitted 28 April, 2014; originally announced April 2014.