Showing 1–11 of 11 results for author: Saratchandran, H

Searching in archive cs.
  1. arXiv:2403.19243  [pdf, other]

    cs.LG cs.CV cs.NE

    Sine Activated Low-Rank Matrices for Parameter Efficient Learning

    Authors: Yiping Ji, Hemanth Saratchandran, Cameron Gordon, Zeyu Zhang, Simon Lucey

    Abstract: Low-rank decomposition has emerged as a vital tool for enhancing parameter efficiency in neural network architectures, gaining traction across diverse applications in machine learning. These techniques significantly lower the number of parameters, striking a balance between compactness and performance. However, a common challenge has been the compromise between parameter efficiency and the accurac…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: The first two authors contributed equally

  2. arXiv:2403.19205  [pdf, other]

    cs.CV cs.LG

    From Activation to Initialization: Scaling Insights for Optimizing Neural Fields

    Authors: Hemanth Saratchandran, Sameera Ramasinghe, Simon Lucey

    Abstract: In the realm of computer vision, Neural Fields have gained prominence as a contemporary tool harnessing neural networks for signal representation. Despite the remarkable progress in adapting these networks to solve a variety of problems, the field still lacks a comprehensive theoretical framework. This article aims to address this gap by delving into the intricate interplay between initialization…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  3. arXiv:2403.19163  [pdf, other]

    cs.LG cs.CV

    D'OH: Decoder-Only random Hypernetworks for Implicit Neural Representations

    Authors: Cameron Gordon, Lachlan Ewen MacDonald, Hemanth Saratchandran, Simon Lucey

    Abstract: Deep implicit functions have been found to be an effective tool for efficiently encoding all manner of natural signals. Their attractiveness stems from their ability to compactly represent signals with little to no off-line training data. Instead, they leverage the implicit bias of deep networks to decouple hidden redundancies within the signal. In this paper, we explore the hypothesis that additi…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 29 pages, 17 figures

  4. arXiv:2402.08784  [pdf, other]

    cs.CV cs.LG

    Preconditioners for the Stochastic Training of Implicit Neural Representations

    Authors: Shin-Fang Chng, Hemanth Saratchandran, Simon Lucey

    Abstract: Implicit neural representations have emerged as a powerful technique for encoding complex continuous multidimensional signals as neural networks, enabling a wide range of applications in computer vision, robotics, and geometry. While Adam is commonly used for training due to its stochastic proficiency, it entails lengthy training durations. To address this, we explore alternative optimization tech…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: The first two authors contributed equally

  5. arXiv:2402.05427  [pdf, other]

    cs.LG

    A Sampling Theory Perspective on Activations for Implicit Neural Representations

    Authors: Hemanth Saratchandran, Sameera Ramasinghe, Violetta Shevchenko, Alexander Long, Simon Lucey

    Abstract: Implicit Neural Representations (INRs) have gained popularity for encoding signals as compact, differentiable entities. While commonly using techniques like Fourier positional encodings or non-traditional activation functions (e.g., Gaussian, sinusoid, or wavelets) to capture high-frequency content, their properties lack exploration within a unified theoretical framework. Addressing this gap, we c…

    Submitted 8 February, 2024; originally announced February 2024.

  6. arXiv:2402.04783  [pdf, other]

    cs.LG

    Analyzing the Neural Tangent Kernel of Periodically Activated Coordinate Networks

    Authors: Hemanth Saratchandran, Shin-Fang Chng, Simon Lucey

    Abstract: Recently, neural networks utilizing periodic activation functions have been proven to demonstrate superior performance in vision tasks compared to traditional ReLU-activated networks. However, there is still a limited understanding of the underlying reasons for this improved performance. In this paper, we aim to address this gap by providing a theoretical understanding of periodically activated ne…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.02711

  7. arXiv:2402.02711  [pdf, other]

    cs.LG

    Architectural Strategies for the optimization of Physics-Informed Neural Networks

    Authors: Hemanth Saratchandran, Shin-Fang Chng, Simon Lucey

    Abstract: Physics-informed neural networks (PINNs) offer a promising avenue for tackling both forward and inverse problems in partial differential equations (PDEs) by incorporating deep learning with fundamental physics principles. Despite their remarkable empirical success, PINNs have garnered a reputation for their notorious training challenges across a spectrum of PDEs. In this work, we delve into the in…

    Submitted 4 February, 2024; originally announced February 2024.

  8. arXiv:2305.08552  [pdf, other]

    cs.CV

    Curvature-Aware Training for Coordinate Networks

    Authors: Hemanth Saratchandran, Shin-Fang Chng, Sameera Ramasinghe, Lachlan MacDonald, Simon Lucey

    Abstract: Coordinate networks are widely used in computer vision due to their ability to represent signals as compressed, continuous entities. However, training these networks with first-order optimizers can be slow, hindering their use in real-time applications. Recent works have opted for shallow voxel-based representations to achieve faster training, but this sacrifices memory efficiency. This work propo…

    Submitted 15 May, 2023; originally announced May 2023.

  9. arXiv:2303.05728  [pdf, other]

    cs.LG eess.SP

    On the effectiveness of neural priors in modeling dynamical systems

    Authors: Sameera Ramasinghe, Hemanth Saratchandran, Violetta Shevchenko, Simon Lucey

    Abstract: Modelling dynamical systems is an integral component for understanding the natural world. To this end, neural networks are becoming an increasingly popular candidate owing to their ability to learn complex functions from large amounts of data. Despite this recent progress, there has not been an adequate discussion on the architectural regularization that neural networks offer when learning such sy…

    Submitted 10 March, 2023; originally announced March 2023.

  10. arXiv:2210.05371  [pdf, other]

    cs.LG

    On skip connections and normalisation layers in deep optimisation

    Authors: Lachlan Ewen MacDonald, Jack Valmadre, Hemanth Saratchandran, Simon Lucey

    Abstract: We introduce a general theoretical framework, designed for the study of gradient optimisation of deep neural networks, that encompasses ubiquitous architecture choices including batch normalisation, weight normalisation and skip connections. Our framework determines the curvature and regularity properties of multilayer loss landscapes in terms of their constituent layers, thereby elucidating the r…

    Submitted 4 December, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2023

  11. arXiv:2206.08558  [pdf, other]

    cs.LG

    How You Start Matters for Generalization

    Authors: Sameera Ramasinghe, Lachlan MacDonald, Moshiur Farazi, Hemanth Saratchandran, Simon Lucey

    Abstract: Characterizing the remarkable generalization properties of over-parameterized neural networks remains an open problem. In this paper, we promote a shift of focus towards initialization rather than neural architecture or (stochastic) gradient descent to explain this implicit regularization. Through a Fourier lens, we derive a general result for the spectral bias of neural networks and show that the…

    Submitted 10 July, 2022; v1 submitted 17 June, 2022; originally announced June 2022.