Showing 1–14 of 14 results for author: Gavranović, B

  1. arXiv:2404.00408  [pdf, other]

    cs.LG cs.LO

    Deep Learning with Parametric Lenses

    Authors: Geoffrey S. H. Cruttwell, Bruno Gavranovic, Neil Ghani, Paul Wilson, Fabio Zanasi

    Abstract: We propose a categorical semantics for machine learning algorithms in terms of lenses, parametric maps, and reverse derivative categories. This foundation provides a powerful explanatory and unifying framework: it encompasses a variety of gradient descent algorithms such as ADAM, AdaGrad, and Nesterov momentum, as well as a variety of loss functions such as MSE and Softmax cross-entropy, and diffe…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2403.13001

  2. arXiv:2403.16388  [pdf, ps, other]

    math.CT cs.IT cs.LO

    On a fibrational construction for optics, lenses, and Dialectica categories

    Authors: Matteo Capucci, Bruno Gavranović, Abdullah Malik, Francisco Rios, Jonathan Weinberger

    Abstract: Categories of lenses/optics and Dialectica categories are both comprised of bidirectional morphisms of basically the same form. In this work we show how they can be considered a special case of an overarching fibrational construction, generalizing Hofstra's construction of Dialectica fibrations and Spivak's construction of generalized lenses. This construction turns a tower of Grothendieck fibrati…

    Submitted 12 June, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: v2: 18 pp. Project results from the American Mathematical Society's Math Research Community on Applied Category Theory 2022. Some corrections and clarifications after peer review. Accepted pre-proceedings version for MFPS 2024

    MSC Class: 18M35; 18D30; 18M05; 03G30; 03B38 ACM Class: F.4.1

  3. arXiv:2403.13001  [pdf, other]

    cs.LG cs.AI math.CT

    Fundamental Components of Deep Learning: A category-theoretic approach

    Authors: Bruno Gavranović

    Abstract: Deep learning, despite its remarkable achievements, is still a young field. Like the early stages of many scientific disciplines, it is marked by the discovery of new phenomena, ad-hoc design decisions, and the lack of a uniform and compositional mathematical foundation. From the intricacies of the implementation of backpropagation, through a growing zoo of neural network architectures, to the new…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: PhD Thesis defended at University of Strathclyde

  4. arXiv:2402.15332  [pdf, ps, other]

    cs.LG cs.AI math.CT math.RA stat.ML

    Position: Categorical Deep Learning is an Algebraic Theory of All Architectures

    Authors: Bruno Gavranović, Paul Lessard, Andrew Dudzik, Tamara von Glehn, João G. M. Araújo, Petar Veličković

    Abstract: We present our position on the elusive quest for a general-purpose framework for specifying and studying deep learning architectures. Our opinion is that the key attempts made so far lack a coherent bridge between specifying constraints which models must satisfy and specifying their implementations. Focusing on building such a bridge, we propose to apply category theory -- precisely, the univers…

    Submitted 5 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: To appear in ICML 2024. Comments welcome. More info at categoricaldeeplearning.com

  5. arXiv:2212.00542  [pdf, other]

    math.CT cs.LG

    Graph Convolutional Neural Networks as Parametric CoKleisli morphisms

    Authors: Bruno Gavranović, Mattia Villani

    Abstract: We define the bicategory of Graph Convolutional Neural Networks $\mathbf{GCNN}_n$ for an arbitrary graph with $n$ nodes. We show it can be factored through the already existing categorical constructions for deep learning called $\mathbf{Para}$ and $\mathbf{Lens}$ with the base category set to the CoKleisli category of the product comonad. We prove that there exists an injective-on-objects, faithfu…

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 21 pages

  6. arXiv:2209.09351  [pdf, other]

    math.CT cs.LG

    Space-time tradeoffs of lenses and optics via higher category theory

    Authors: Bruno Gavranović

    Abstract: Optics and lenses are abstract categorical gadgets that model systems with bidirectional data flow. In this paper we observe that the denotational definition of optics - identifying two optics as equivalent by observing their behaviour from the outside - is not suitable for operational, software oriented approaches where optics are not merely observed, but built with their internal setups in mind.…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 28 pages

  7. arXiv:2203.16351  [pdf, other]

    math.CT

    Actegories for the Working Amthematician

    Authors: Matteo Capucci, Bruno Gavranović

    Abstract: Actions of monoidal categories on categories, also known as actegories, have been familiar to category theorists for a long time, and yet a comprehensive overview of this topic seems to be missing from the literature. Recently, actegories have been increasingly employed in applied category theory, thereby encouraging an effort to fill this gap according to the new needs of these applications. This…

    Submitted 11 December, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: 90 pages

    MSC Class: 18-02

  8. arXiv:2112.11145  [pdf, ps, other]

    math.CT

    Fibre optics

    Authors: Dylan Braithwaite, Matteo Capucci, Bruno Gavranović, Jules Hedges, Eigil Fjeldgren Rischel

    Abstract: Lenses, optics and dependent lenses (or equivalently morphisms of containers, or equivalently natural transformations of polynomial functors) are all widely used in applied category theory as models of bidirectional processes. From the definition of lenses over a finite product category, optics weaken the required structure to actions of monoidal categories, and dependent lenses make use of the ad…

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: This is an extended abstract of a forthcoming paper

  9. arXiv:2106.07032  [pdf, ps, other]

    cs.LG

    Category Theory in Machine Learning

    Authors: Dan Shiebler, Bruno Gavranović, Paul Wilson

    Abstract: Over the past two decades machine learning has permeated almost every realm of technology. At the same time, many researchers have begun using category theory as a unifying language, facilitating communication between different scientific disciplines. It is therefore unsurprising that there is a burgeoning interest in applying category theory to machine learning. We aim to document the motivations…

    Submitted 13 June, 2021; originally announced June 2021.

  10. Towards Foundations of Categorical Cybernetics

    Authors: Matteo Capucci, Bruno Gavranović, Jules Hedges, Eigil Fjeldgren Rischel

    Abstract: We propose a categorical framework for processes which interact bidirectionally with both an environment and a 'controller'. Examples include open learners, in which the controller is an optimiser such as gradient descent, and an approach to compositional game theory closely related to open games, in which the controller is a composite of game-theoretic agents. We believe that 'cybernetic' is an a…

    Submitted 3 November, 2022; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: In Proceedings ACT 2021, arXiv:2211.01102

    Journal ref: EPTCS 372, 2022, pp. 235-248

  11. arXiv:2103.01931  [pdf, other]

    cs.LG math.CT

    Categorical Foundations of Gradient-Based Learning

    Authors: G. S. H. Cruttwell, Bruno Gavranović, Neil Ghani, Paul Wilson, Fabio Zanasi

    Abstract: We propose a categorical semantics of gradient-based machine learning algorithms in terms of lenses, parametrised maps, and reverse derivative categories. This foundation provides a powerful explanatory and unifying framework: it encompasses a variety of gradient descent algorithms such as ADAM, AdaGrad, and Nesterov momentum, as well as a variety of loss functions such as MSE and Softmax cross…

    Submitted 13 July, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: 14 pages

  12. Compositional Game Theory, Compositionally

    Authors: Robert Atkey, Bruno Gavranović, Neil Ghani, Clemens Kupke, Jérémy Ledent, Fredrik Nordvall Forsberg

    Abstract: We present a new compositional approach to compositional game theory (CGT) based upon Arrows, a concept originally from functional programming, closely related to Tambara modules, and operators to build new Arrows from old. We model equilibria as a bimodule over an Arrow and define an operator to build a new Arrow from such a bimodule over an existing Arrow. We also model strategies as graded Ar…

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: In Proceedings ACT 2020, arXiv:2101.07888

    ACM Class: F.3.m

    Journal ref: EPTCS 333, 2021, pp. 198-214

  13. arXiv:2009.06837  [pdf, ps, other]

    cs.LG cs.AI math.CT

    Learning Functors using Gradient Descent

    Authors: Bruno Gavranović

    Abstract: Neural networks are a general framework for differentiable optimization which includes many other machine learning approaches as special cases. In this paper we build a category-theoretic formalism around a neural network system called CycleGAN. CycleGAN is a general approach to unpaired image-to-image translation that has been getting attention in recent years. Inspired by categorical databas…

    Submitted 14 September, 2020; originally announced September 2020.

    Comments: In Proceedings ACT 2019, arXiv:2009.06334. This paper is a condensed version of the master thesis of the author (arXiv:1907.08292)

    Journal ref: EPTCS 323, 2020, pp. 230-245

  14. arXiv:1907.08292  [pdf, other]

    cs.LG cs.AI math.CT

    Compositional Deep Learning

    Authors: Bruno Gavranović

    Abstract: Neural networks have become an increasingly popular tool for solving many real-world problems. They are a general framework for differentiable optimization which includes many other machine learning approaches as special cases. In this thesis we build a category-theoretic formalism around a class of neural networks exemplified by CycleGAN. CycleGAN is a collection of neural networks, closed under…

    Submitted 16 July, 2019; originally announced July 2019.

    Comments: 56 pages