Skip to main content

Showing 1–2 of 2 results for author: Sitdikov, T

.
  1. arXiv:2401.11202  [pdf, other

    cs.LG cs.DC cs.PL

    PartIR: Composing SPMD Partitioning Strategies for Machine Learning

    Authors: Sami Alabed, Daniel Belov, Bart Chrzaszcz, Juliana Franco, Dominik Grewe, Dougal Maclaurin, James Molloy, Tom Natan, Tamara Norman, Xiaoyue Pan, Adam Paszke, Norman A. Rink, Michael Schaarschmidt, Timur Sitdikov, Agnieszka Swietlik, Dimitrios Vytiniotis, Joel Wee

    Abstract: Training of modern large neural networks (NN) requires a combination of parallelization strategies encompassing data, model, or optimizer sharding. When strategies increase in complexity, it becomes necessary for partitioning tools to be 1) expressive, allowing the composition of simpler strategies, and 2) predictable to estimate performance analytically. We present PartIR, our design for a NN par… ▽ More

    Submitted 3 March, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  2. arXiv:2103.07692  [pdf, ps, other

    cs.CC cs.DM

    The complexity of multilayer $d$-dimensional circuits

    Authors: T. R. Sitdikov, G. V. Kalachev

    Abstract: In this paper we research a model of multilayer circuits with a single logical layer. We consider $λ$-separable graphs as a support for circuits. We establish the Shannon function lower bound $\max \bigl(\frac{2^n}{n}, \frac{2^n (1 - λ)}{\log k} \bigr)$ for this type of circuits where $k$ is the number of layers. For $d$-dimensional graphs, which are $λ$-separable for $λ= \frac{d - 1}{d}$, this gi… ▽ More

    Submitted 13 March, 2021; originally announced March 2021.