Skip to main content

Showing 1–7 of 7 results for author: Brüel-Gabrielsson, R

.
  1. arXiv:2407.00066  [pdf, other

    cs.DC cs.AI cs.CL cs.LG

    Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead

    Authors: Rickard Brüel-Gabrielsson, Jiacheng Zhu, Onkar Bhardwaj, Leshem Choshen, Kristjan Greenewald, Mikhail Yurochkin, Justin Solomon

    Abstract: Fine-tuning large language models (LLMs) with low-rank adapters (LoRAs) has become common practice, often yielding numerous copies of the same LLM differing only in their LoRA updates. This paradigm presents challenges for systems that serve real-time responses to queries that each involve a different LoRA. Prior works optimize the design of such systems but still require continuous loading and of… ▽ More

    Submitted 17 June, 2024; originally announced July 2024.

  2. arXiv:2303.14537  [pdf, other

    cs.LG cs.CL cs.CV

    Deep Augmentation: Self-Supervised Learning with Transformations in Activation Space

    Authors: Rickard Brüel-Gabrielsson, Tongzhou Wang, Manel Baradad, Justin Solomon

    Abstract: We introduce Deep Augmentation, an approach to implicit data augmentation using dropout or PCA to transform a targeted layer within a neural network to improve performance and generalization. We demonstrate Deep Augmentation through extensive experiments on contrastive learning tasks in NLP, computer vision, and graph learning. We observe substantial performance gains with Transformers, ResNets, a… ▽ More

    Submitted 26 February, 2024; v1 submitted 25 March, 2023; originally announced March 2023.

  3. arXiv:2202.01145  [pdf, ps, other

    cs.CL

    Relative Position Prediction as Pre-training for Text Encoders

    Authors: Rickard Brüel-Gabrielsson, Chris Scarvelis

    Abstract: Meaning is defined by the company it keeps. However, company is two-fold: It's based on the identity of tokens and also on their position (topology). We argue that a position-centric perspective is more general and useful. The classic MLM and CLM objectives in NLP are easily phrased as position predictions over the whole vocabulary. Adapting the relative position encoding paradigm in NLP to create… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

  4. arXiv:2201.12674  [pdf, other

    cs.LG

    Rewiring with Positional Encodings for Graph Neural Networks

    Authors: Rickard Brüel-Gabrielsson, Mikhail Yurochkin, Justin Solomon

    Abstract: Several recent works use positional encodings to extend the receptive fields of graph neural network (GNN) layers equipped with attention mechanisms. These techniques, however, extend receptive fields to the complete graph, at substantial computational cost and risking a change in the inductive biases of conventional GNNs, or require complex architecture adjustments. As a conservative alternative,… ▽ More

    Submitted 13 December, 2023; v1 submitted 29 January, 2022; originally announced January 2022.

  5. arXiv:2003.06706  [pdf, other

    cs.DS cs.LG stat.ML

    Universal Function Approximation on Graphs

    Authors: Rickard Brüel-Gabrielsson

    Abstract: In this work we produce a framework for constructing universal function approximators on graph isomorphism classes. We prove how this framework comes with a collection of theoretically desirable properties and enables novel analysis. We show how this allows us to achieve state-of-the-art performance on four different well-known datasets in graph classification and separate classes of graphs that o… ▽ More

    Submitted 26 October, 2020; v1 submitted 14 March, 2020; originally announced March 2020.

  6. arXiv:1905.12200  [pdf, other

    cs.LG math.AT stat.ML

    A Topology Layer for Machine Learning

    Authors: Rickard Brüel-Gabrielsson, Bradley J. Nelson, Anjan Dwaraknath, Primoz Skraba, Leonidas J. Guibas, Gunnar Carlsson

    Abstract: Topology applied to real world data using persistent homology has started to find applications within machine learning, including deep learning. We present a differentiable topology layer that computes persistent homology based on level set filtrations and edge-based filtrations. We present three novel applications: the topological layer can (i) regularize data reconstruction or the weights of mac… ▽ More

    Submitted 24 April, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  7. arXiv:1811.12543  [pdf, other

    cs.CG cs.GR

    Topology-Aware Surface Reconstruction for Point Clouds

    Authors: Rickard Brüel-Gabrielsson, Vignesh Ganapathi-Subramanian, Primoz Skraba, Leonidas J. Guibas

    Abstract: We present an approach to inform the reconstruction of a surface from a point scan through topological priors. The reconstruction is based on basis functions which are optimized to provide a good fit to the point scan while satisfying predefined topological constraints. We optimize the parameters of a model to obtain likelihood function over the reconstruction domain. The topological constraints a… ▽ More

    Submitted 15 September, 2021; v1 submitted 29 November, 2018; originally announced November 2018.

    Journal ref: Computer Graphics Forum 39 (5), 197-207, 2020