
Showing 1–11 of 11 results for author: Devarakonda, A

  1. arXiv:2406.18001  [pdf, other]

    cs.DC stat.ML

    Scalable Dual Coordinate Descent for Kernel Methods

    Authors: Zishan Shao, Aditya Devarakonda

    Abstract: Dual Coordinate Descent (DCD) and Block Dual Coordinate Descent (BDCD) are important iterative methods for solving convex optimization problems. In this work, we develop scalable DCD and BDCD methods for the kernel support vector machines (K-SVM) and kernel ridge regression (K-RR) problems. On distributed-memory parallel machines the scalability of these methods is limited by the need to communica…

    Submitted 25 June, 2024; originally announced June 2024.

    MSC Class: 65Y05 ACM Class: D.1.3; G.4; F.2.1

  2. arXiv:2402.08677  [pdf, other]

    cond-mat.mtrl-sci cond-mat.supr-con

    Striped electronic phases in an incommensurately modulated van der Waals superlattice

    Authors: Aravind Devarakonda, Alan Chen, Shiang Fang, David Graf, Markus Kriener, Austin J. Akey, David C. Bell, Takehito Suzuki, Joseph G. Checkelsky

    Abstract: Electronic properties of crystals can be manipulated using spatially periodic modulations. Long-wavelength, incommensurate modulations are of particular interest, exemplified recently by moiré patterned van der Waals (vdW) heterostructures. Bulk vdW superlattices hosting interfaces between clean 2D layers represent scalable bulk analogs of vdW heterostructures and present a complementary venue to…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 19 pages, 4 figures

  3. arXiv:2308.02772  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Probing charge order of monolayer NbSe$_2$ within a bulk crystal

    Authors: Doron Azoury, Edoardo Baldini, Aravind Devarakonda, Jiarui Li, Shiang Fang, Pheona Williams, Riccardo Comin, Joseph Checkelsky, Nuh Gedik

    Abstract: Atomically thin transition metal dichalcogenides can exhibit markedly different electronic properties compared to their bulk counterparts. In the case of NbSe$_2$, the question of whether its charge density wave (CDW) phase is enhanced in the monolayer limit has been the subject of intense debate, primarily due to the difficulty of decoupling this order from its environment. Here, we address this…

    Submitted 4 August, 2023; originally announced August 2023.

  4. arXiv:2307.16652  [pdf, other]

    cs.DC cs.LG stat.ML

    Sequential and Shared-Memory Parallel Algorithms for Partitioned Local Depths

    Authors: Aditya Devarakonda, Grey Ballard

    Abstract: In this work, we design, analyze, and optimize sequential and shared-memory parallel algorithms for partitioned local depths (PaLD). Given a set of data points and pairwise distances, PaLD is a method for identifying strength of pairwise relationships based on relative distances, enabling the identification of strong ties within dense and sparse communities even if their sizes and within-community…

    Submitted 31 July, 2023; originally announced July 2023.

    MSC Class: 68W10 ACM Class: D.1.3

  5. arXiv:2011.08281  [pdf, other]

    cs.LG cs.DC

    Avoiding Communication in Logistic Regression

    Authors: Aditya Devarakonda, James Demmel

    Abstract: Stochastic gradient descent (SGD) is one of the most widely used optimization methods for solving various machine learning problems. SGD solves an optimization problem by iteratively sampling a few data points from the input data, computing gradients for the selected data points, and updating the solution. However, in a parallel setting, SGD requires interprocess communication at every iteration.…

    Submitted 16 November, 2020; originally announced November 2020.

  6. arXiv:1906.02065  [pdf]

    cond-mat.supr-con

    Clean 2D superconductivity in a bulk van der Waals superlattice

    Authors: Aravind Devarakonda, Hisashi Inoue, Shiang Fang, Cigdem Ozsoy-Keskinbora, Takehito Suzuki, Markus Kriener, Liang Fu, Efthimios Kaxiras, David C. Bell, Joseph G. Checkelsky

    Abstract: Advances in low-dimensional superconductivity are often realized through improvements in material quality. Apart from a small group of organic materials, there is a near absence of clean-limit two-dimensional (2D) superconductors, which presents an impediment to the pursuit of numerous long-standing predictions for exotic superconductivity with fragile pairing symmetries. Here, we report the devel…

    Submitted 11 October, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: Accepted version with revised title, discussion, structure, and figures. 38 pages, 4 figures

    Journal ref: Science 370, 231-236 (2020)

  7. arXiv:1712.06047  [pdf, other]

    cs.DC cs.LG math.OC stat.ML

    Avoiding Synchronization in First-Order Methods for Sparse Convex Optimization

    Authors: Aditya Devarakonda, Kimon Fountoulakis, James Demmel, Michael W. Mahoney

    Abstract: Parallel computing has played an important role in speeding up convex optimization methods for big data analytics and large-scale machine learning (ML). However, the scalability of these optimization methods is inhibited by the cost of communicating and synchronizing processors in a parallel setting. Iterative ML methods are particularly sensitive to communication cost since they often require com…

    Submitted 16 December, 2017; originally announced December 2017.

    MSC Class: 68W10; 90C25 ACM Class: G.1.6

  8. arXiv:1712.02029  [pdf, other]

    cs.LG cs.CV cs.DC stat.ML

    AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks

    Authors: Aditya Devarakonda, Maxim Naumov, Michael Garland

    Abstract: Training deep neural networks with Stochastic Gradient Descent, or its variants, requires careful choice of both learning rate and batch size. While smaller batch sizes generally converge in fewer training epochs, larger batch sizes offer more parallelism and hence better computational efficiency. We have developed a new training approach that, rather than statically choosing a single batch size f…

    Submitted 13 February, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

    Comments: 14 pages

    MSC Class: 68T05 ACM Class: I.2.6; I.5.0

  9. arXiv:1710.08883  [pdf, other]

    cs.DC cs.LG math.NA math.OC

    Avoiding Communication in Proximal Methods for Convex Optimization Problems

    Authors: Saeed Soori, Aditya Devarakonda, James Demmel, Mert Gurbuzbalaban, Maryam Mehri Dehnavi

    Abstract: The fast iterative soft thresholding algorithm (FISTA) is used to solve convex regularized optimization problems in machine learning. Distributed implementations of the algorithm have become popular since they enable the analysis of large datasets. However, existing formulations of FISTA communicate data at every iteration which reduces its performance on modern distributed architectures. The comm…

    Submitted 24 October, 2017; originally announced October 2017.

  10. arXiv:1612.04003  [pdf, other]

    cs.DC

    Avoiding communication in primal and dual block coordinate descent methods

    Authors: Aditya Devarakonda, Kimon Fountoulakis, James Demmel, Michael W. Mahoney

    Abstract: Primal and dual block coordinate descent methods are iterative methods for solving regularized and unregularized optimization problems. Distributed-memory parallel implementations of these methods have become popular in analyzing large machine learning datasets. However, existing implementations communicate at every iteration which, on modern data center and supercomputing architectures, often dom…

    Submitted 1 May, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

    MSC Class: 68W10; 65F10 ACM Class: G.1.0; G.1.3; G.1.6

  11. arXiv:1607.01335  [pdf, other]

    cs.DC

    Matrix Factorization at Scale: a Comparison of Scientific Data Analytics in Spark and C+MPI Using Three Case Studies

    Authors: Alex Gittens, Aditya Devarakonda, Evan Racah, Michael Ringenburg, Lisa Gerhardt, Jey Kottalam, Jialin Liu, Kristyn Maschhoff, Shane Canon, Jatin Chhugani, Pramod Sharma, Jiyan Yang, James Demmel, Jim Harrell, Venkat Krishnamurthy, Michael W. Mahoney, Prabhat

    Abstract: We explore the trade-offs of performing linear algebra using Apache Spark, compared to traditional C and MPI implementations on HPC platforms. Spark is designed for data analytics on cluster computing platforms with access to local disks and is optimized for data-parallel tasks. We examine three widely-used and important matrix factorizations: NMF (for physical plausibility), PCA (for its ubiquity…

    Submitted 20 September, 2016; v1 submitted 5 July, 2016; originally announced July 2016.

    ACM Class: G.1.3; C.2.4