Skip to main content

Showing 1–13 of 13 results for author: Bartan, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.00682  [pdf, other

    cs.DC cs.IT cs.LG

    Randomized Polar Codes for Anytime Distributed Machine Learning

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We present a novel distributed computing framework that is robust to slow compute nodes, and is capable of both approximate and exact computation of linear operations. The proposed mechanism integrates the concepts of randomized sketching and polar codes in the context of coded computation. We propose a sequential decoding algorithm designed to handle real valued data while maintaining low computa… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  2. arXiv:2304.14463  [pdf, other

    cs.LG cs.AI

    Moccasin: Efficient Tensor Rematerialization for Neural Networks

    Authors: Burak Bartan, Haoming Li, Harris Teague, Christopher Lott, Bistra Dilkina

    Abstract: The deployment and training of neural networks on edge computing devices pose many challenges. The low memory nature of edge devices is often one of the biggest limiting factors encountered in the deployment of large neural network models. Tensor rematerialization or recompute is a way to address high memory requirements for neural network training and inference. In this paper we consider the prob… ▽ More

    Submitted 30 May, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

  3. arXiv:2203.09755  [pdf, other

    math.OC cs.DC cs.IT cs.LG

    Distributed Sketching for Randomized Optimization: Exact Characterization, Concentration and Lower Bounds

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We consider distributed optimization methods for problems where forming the Hessian is computationally challenging and communication is a significant bottleneck. We leverage randomized sketches for reducing the problem dimensions as well as preserving privacy and improving straggler resilience in asynchronous distributed systems. We derive novel approximation guarantees for classical sketching met… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2002.06540

  4. arXiv:2107.05680  [pdf, other

    cs.LG cs.CV eess.IV math.OC stat.ML

    Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions

    Authors: Arda Sahiner, Tolga Ergen, Batu Ozturkler, Burak Bartan, John Pauly, Morteza Mardani, Mert Pilanci

    Abstract: Generative Adversarial Networks (GANs) are commonly used for modeling complex distributions of data. Both the generators and discriminators of GANs are often modeled by neural networks, posing a non-transparent optimization problem which is non-convex and non-concave over the generator and discriminator, respectively. Such networks are often heuristically optimized with gradient descent-ascent (GD… ▽ More

    Submitted 21 March, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Published as paper in ICLR 2022. First two authors contributed equally to this work; 34 pages, 11 figures

  5. arXiv:2105.01420  [pdf, ps, other

    cs.LG stat.ML

    Training Quantized Neural Networks to Global Optimality via Semidefinite Programming

    Authors: Burak Bartan, Mert Pilanci

    Abstract: Neural networks (NNs) have been extremely successful across many tasks in machine learning. Quantization of NN weights has become an important topic due to its impact on their energy efficiency, inference time and deployment on hardware. Although post-training quantization is well-studied, training optimal quantized NNs involves combinatorial non-convex optimization problems which appear intractab… ▽ More

    Submitted 5 May, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: v2: Minor edits in the text. The results are unchanged

  6. arXiv:2101.02429  [pdf, other

    cs.LG cs.CC math.OC stat.ML

    Neural Spectrahedra and Semidefinite Lifts: Global Convex Optimization of Polynomial Activation Neural Networks in Fully Polynomial-Time

    Authors: Burak Bartan, Mert Pilanci

    Abstract: The training of two-layer neural networks with nonlinear activation functions is an important non-convex optimization problem with numerous applications and promising performance in layerwise deep learning. In this paper, we develop exact convex optimization formulations for two-layer neural networks with second degree polynomial activations based on semidefinite programming. Remarkably, we show t… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

  7. arXiv:2007.01327  [pdf, other

    cs.LG math.OC stat.ML

    Debiasing Distributed Second Order Optimization with Surrogate Sketching and Scaled Regularization

    Authors: Michał Dereziński, Burak Bartan, Mert Pilanci, Michael W. Mahoney

    Abstract: In distributed second order optimization, a standard strategy is to average many local estimates, each of which is based on a small sketch or batch of the data. However, the local estimates on each machine are typically biased, relative to the full solution on all of the data, and this can limit the effectiveness of averaging. Here, we introduce a new technique for debiasing the local estimates, w… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

  8. arXiv:2002.06540  [pdf, other

    stat.ML cs.DC cs.LG

    Distributed Averaging Methods for Randomized Second Order Optimization

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We consider distributed optimization problems where forming the Hessian is computationally challenging and communication is a significant bottleneck. We develop unbiased parameter averaging methods for randomized second order optimization that employ sampling and sketching of the Hessian. Existing works do not take the bias of the estimators into consideration, which limits their application to ma… ▽ More

    Submitted 16 February, 2020; originally announced February 2020.

  9. arXiv:2002.06538  [pdf, other

    cs.DC cs.CR cs.LG

    Distributed Sketching Methods for Privacy Preserving Regression

    Authors: Burak Bartan, Mert Pilanci

    Abstract: In this work, we study distributed sketching methods for large scale regression problems. We leverage multiple randomized sketches for reducing the problem dimensions as well as preserving privacy and improving straggler resilience in asynchronous distributed systems. We derive novel approximation guarantees for classical sketching methods and analyze the accuracy of parameter averaging for distri… ▽ More

    Submitted 19 June, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

  10. arXiv:1907.05984  [pdf, other

    cs.DC cs.IT cs.LG

    Distributed Black-Box Optimization via Error Correcting Codes

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We introduce a novel distributed derivative-free optimization framework that is resilient to stragglers. The proposed method employs coded search directions at which the objective function is evaluated, and a decoding step to find the next iterate. Our framework can be seen as an extension of evolution strategies and structured exploration methods where structured search directions were utilized.… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

  11. arXiv:1901.06811  [pdf, other

    cs.IT cs.DC cs.LG

    Straggler Resilient Serverless Computing Based on Polar Codes

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We propose a serverless computing mechanism for distributed computation based on polar codes. Serverless computing is an emerging cloud based computation model that lets users run their functions on the cloud without provisioning or managing servers. Our proposed approach is a hybrid computing framework that carries out computationally expensive tasks such as linear algebraic operations involving… ▽ More

    Submitted 12 July, 2019; v1 submitted 21 January, 2019; originally announced January 2019.

    Comments: New results added in the new version. More discussion on serverless computing

  12. arXiv:1901.00035  [pdf, other

    cs.LG stat.ML

    Convex Relaxations of Convolutional Neural Nets

    Authors: Burak Bartan, Mert Pilanci

    Abstract: We propose convex relaxations for convolutional neural nets with one hidden layer where the output weights are fixed. For convex activation functions such as rectified linear units, the relaxations are convex second order cone programs which can be solved very efficiently. We prove that the relaxation recovers the global minimum under a planted model assumption, given sufficiently many training sa… ▽ More

    Submitted 31 December, 2018; originally announced January 2019.

  13. arXiv:1707.02241  [pdf, other

    cs.IT

    Repairing Multiple Failures for Scalar MDS Codes

    Authors: Jay Mardia, Burak Bartan, Mary Wootters

    Abstract: In distributed storage, erasure codes -- like Reed-Solomon Codes -- are often employed to provide reliability. In this setting, it is desirable to be able to repair one or more failed nodes while minimizing the repair bandwidth. In this work, motivated by Reed-Solomon codes, we study the problem of repairing multiple failed nodes in a scalar MDS code. We extend the framework of (Guruswami and Woot… ▽ More

    Submitted 19 April, 2018; v1 submitted 7 July, 2017; originally announced July 2017.