Showing 1–27 of 27 results for author: Ramasinghe, S

Searching in archive cs.
  1. arXiv:2403.19205

    cs.CV cs.LG

    From Activation to Initialization: Scaling Insights for Optimizing Neural Fields

    Authors: Hemanth Saratchandran, Sameera Ramasinghe, Simon Lucey

    Abstract: In the realm of computer vision, Neural Fields have gained prominence as a contemporary tool harnessing neural networks for signal representation. Despite the remarkable progress in adapting these networks to solve a variety of problems, the field still lacks a comprehensive theoretical framework. This article aims to address this gap by delving into the intricate interplay between initialization…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  2. arXiv:2403.18442

    cs.CV

    Backpropagation-free Network for 3D Test-time Adaptation

    Authors: Yanshuo Wang, Ali Cheraghian, Zeeshan Hayder, Jie Hong, Sameera Ramasinghe, Shafin Rahman, David Ahmedt-Aristizabal, Xuesong Li, Lars Petersson, Mehrtash Harandi

    Abstract: Real-world systems often encounter new data over time, which leads to experiencing target domain shifts. Existing Test-Time Adaptation (TTA) methods tend to apply computationally heavy and memory-intensive backpropagation-based approaches to handle this. Here, we propose a novel method that uses a backpropagation-free approach for TTA for the specific case of 3D data. Our model uses a two-stream a…

    Submitted 24 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  3. arXiv:2402.18842

    cs.CV

    ViewFusion: Towards Multi-View Consistency via Interpolated Denoising

    Authors: Xianghui Yang, Yan Zuo, Sameera Ramasinghe, Loris Bazzani, Gil Avraham, Anton van den Hengel

    Abstract: Novel-view synthesis through diffusion models has demonstrated remarkable potential for generating diverse and high-quality images. Yet, the independent process of image generation in these prevailing methods leads to challenges in maintaining multiple-view consistency. To address this, we introduce ViewFusion, a novel, training-free algorithm that can be seamlessly integrated into existing pre-tr…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: CVPR 2024, homepage: https://wi-sc.github.io/ViewFusion.github.io/

  4. arXiv:2402.05427

    cs.LG

    A Sampling Theory Perspective on Activations for Implicit Neural Representations

    Authors: Hemanth Saratchandran, Sameera Ramasinghe, Violetta Shevchenko, Alexander Long, Simon Lucey

    Abstract: Implicit Neural Representations (INRs) have gained popularity for encoding signals as compact, differentiable entities. While commonly using techniques like Fourier positional encodings or non-traditional activation functions (e.g., Gaussian, sinusoid, or wavelets) to capture high-frequency content, their properties lack exploration within a unified theoretical framework. Addressing this gap, we c…

    Submitted 8 February, 2024; originally announced February 2024.

  5. arXiv:2310.03669

    cs.CV

    LumiNet: The Bright Side of Perceptual Knowledge Distillation

    Authors: Md. Ismail Hossain, M M Lutfe Elahi, Sameera Ramasinghe, Ali Cheraghian, Fuad Rahman, Nabeel Mohammed, Shafin Rahman

    Abstract: In knowledge distillation literature, feature-based methods have dominated due to their ability to effectively tap into extensive teacher models. In contrast, logit-based approaches, which aim to distill `dark knowledge' from teachers, typically exhibit inferior performance compared to feature-based methods. To bridge this gap, we present LumiNet, a novel knowledge distillation algorithm designed…

    Submitted 9 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  6. arXiv:2309.00339

    cs.CV

    Robust Point Cloud Processing through Positional Embedding

    Authors: Jianqiao Zheng, Xueqian Li, Sameera Ramasinghe, Simon Lucey

    Abstract: End-to-end trained per-point embeddings are an essential ingredient of any state-of-the-art 3D point cloud processing such as detection or alignment. Methods like PointNet, or the more recent point cloud transformer -- and its variants -- all employ learned per-point embeddings. Despite impressive performance, such approaches are sensitive to out-of-distribution (OOD) noise and outliers. In this p…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 18 pages, 13 figures, 5 tables

  7. arXiv:2305.08552

    cs.CV

    Curvature-Aware Training for Coordinate Networks

    Authors: Hemanth Saratchandran, Shin-Fang Chng, Sameera Ramasinghe, Lachlan MacDonald, Simon Lucey

    Abstract: Coordinate networks are widely used in computer vision due to their ability to represent signals as compressed, continuous entities. However, training these networks with first-order optimizers can be slow, hindering their use in real-time applications. Recent works have opted for shallow voxel-based representations to achieve faster training, but this sacrifices memory efficiency. This work propo…

    Submitted 15 May, 2023; originally announced May 2023.

  8. arXiv:2303.05728

    cs.LG eess.SP

    On the effectiveness of neural priors in modeling dynamical systems

    Authors: Sameera Ramasinghe, Hemanth Saratchandran, Violetta Shevchenko, Simon Lucey

    Abstract: Modelling dynamical systems is an integral component for understanding the natural world. To this end, neural networks are becoming an increasingly popular candidate owing to their ability to learn complex functions from large amounts of data. Despite this recent progress, there has not been an adequate discussion on the architectural regularization that neural networks offer when learning such sy…

    Submitted 10 March, 2023; originally announced March 2023.

  9. arXiv:2302.13543

    cs.CV

    BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling

    Authors: Sameera Ramasinghe, Violetta Shevchenko, Gil Avraham, Anton Van Den Hengel

    Abstract: Reasoning the 3D structure of a non-rigid dynamic scene from a single moving camera is an under-constrained problem. Inspired by the remarkable progress of neural radiance fields (NeRFs) in photo-realistic novel view synthesis of static scenes, extensions have been proposed for dynamic settings. These methods heavily rely on neural priors in order to regularize the problem. In this work, we take a…

    Submitted 24 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  10. arXiv:2206.08558

    cs.LG

    How You Start Matters for Generalization

    Authors: Sameera Ramasinghe, Lachlan MacDonald, Moshiur Farazi, Hemanth Saratchandran, Simon Lucey

    Abstract: Characterizing the remarkable generalization properties of over-parameterized neural networks remains an open problem. In this paper, we promote a shift of focus towards initialization rather than neural architecture or (stochastic) gradient descent to explain this implicit regularization. Through a Fourier lens, we derive a general result for the spectral bias of neural networks and show that the…

    Submitted 10 July, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  11. arXiv:2205.15225

    cs.CV

    Few-shot Class-incremental Learning for 3D Point Cloud Objects

    Authors: Townim Chowdhury, Ali Cheraghian, Sameera Ramasinghe, Sahar Ahmadi, Morteza Saberi, Shafin Rahman

    Abstract: Few-shot class-incremental learning (FSCIL) aims to incrementally fine-tune a model (trained on base classes) for a novel set of classes using a few examples without forgetting the previous training. Recent efforts address this problem primarily on 2D images. However, due to the advancement of camera technology, 3D point cloud data has become more available than ever, which warrants considering FS…

    Submitted 22 July, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

  12. arXiv:2205.08987

    cs.CV

    Trading Positional Complexity vs. Deepness in Coordinate Networks

    Authors: Jianqiao Zheng, Sameera Ramasinghe, Xueqian Li, Simon Lucey

    Abstract: It is well noted that coordinate-based MLPs benefit -- in terms of preserving high-frequency information -- through the encoding of coordinate positions as an array of Fourier features. Hitherto, the rationale for the effectiveness of these positional encodings has been mainly studied through a Fourier lens. In this paper, we strive to broaden this understanding by showing that alternative non-Fou…

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2107.02561

  13. arXiv:2204.05735

    cs.CV

    GARF: Gaussian Activated Radiance Fields for High Fidelity Reconstruction and Pose Estimation

    Authors: Shin-Fang Chng, Sameera Ramasinghe, Jamie Sherrah, Simon Lucey

    Abstract: Despite Neural Radiance Fields (NeRF) showing compelling results in photorealistic novel views synthesis of real-world scenes, most existing approaches require accurate prior camera poses. Although approaches for jointly recovering the radiance field and camera pose exist (BARF), they rely on a cumbersome coarse-to-fine auxiliary positional embedding to ensure good performance. We present Gaussian…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: Project page: https://sfchng.github.io/garf/

  14. arXiv:2202.00790

    cs.LG cs.CV

    On Regularizing Coordinate-MLPs

    Authors: Sameera Ramasinghe, Lachlan MacDonald, Simon Lucey

    Abstract: We show that typical implicit regularization assumptions for deep neural networks (for regression) do not hold for coordinate-MLPs, a family of MLPs that are now ubiquitous in computer vision for representing high-frequency signals. Lack of such implicit bias disrupts smooth interpolations between training samples, and hampers generalizing across signal regions with different spectra. We investiga…

    Submitted 1 February, 2022; originally announced February 2022.

  15. arXiv:2112.11577

    cs.LG

    Learning Positional Embeddings for Coordinate-MLPs

    Authors: Sameera Ramasinghe, Simon Lucey

    Abstract: We propose a novel method to enhance the performance of coordinate-MLPs by learning instance-specific positional embeddings. End-to-end optimization of positional embedding parameters along with network weights leads to poor generalization performance. Instead, we develop a generic framework to learn the positional embedding based on the classic graph-Laplacian regularization, which can implicitly…

    Submitted 20 March, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

  16. arXiv:2111.15135

    cs.LG

    Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs

    Authors: Sameera Ramasinghe, Simon Lucey

    Abstract: Coordinate-MLPs are emerging as an effective tool for modeling multidimensional continuous signals, overcoming many drawbacks associated with discrete grid-based approximations. However, coordinate-MLPs with ReLU activations, in their rudimentary form, demonstrate poor performance in representing signals with high fidelity, promoting the need for positional embedding layers. Recently, Sitzmann et…

    Submitted 17 March, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

  17. arXiv:2111.08251

    cs.CV cs.LG cs.NE

    Enabling equivariance for arbitrary Lie groups

    Authors: Lachlan Ewen MacDonald, Sameera Ramasinghe, Simon Lucey

    Abstract: Although provably robust to translational perturbations, convolutional neural networks (CNNs) are known to suffer from extreme performance degradation when presented at test time with more general geometric transformations of inputs. Recently, this limitation has motivated a shift in focus from CNNs to Capsule Networks (CapsNets). However, CapsNets suffer from admitting relatively few theoretical…

    Submitted 30 March, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Oral presentation at the Conference on Computer Vision and Pattern Recognition (CVPR), 2022

  18. arXiv:2107.02561

    cs.LG cs.CV

    Rethinking Positional Encoding

    Authors: Jianqiao Zheng, Sameera Ramasinghe, Simon Lucey

    Abstract: It is well noted that coordinate based MLPs benefit -- in terms of preserving high-frequency information -- through the encoding of coordinate positions as an array of Fourier features. Hitherto, the rationale for the effectiveness of these positional encodings has been solely studied through a Fourier lens. In this paper, we strive to broaden this understanding by showing that alternative non-Fou…

    Submitted 11 October, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

  19. arXiv:2102.03509

    cs.LG stat.ML

    Robust normalizing flows using Bernstein-type polynomials

    Authors: Sameera Ramasinghe, Kasun Fernando, Salman Khan, Nick Barnes

    Abstract: Modeling real-world distributions can often be challenging due to sample data that are subjected to perturbations, e.g., instrumentation errors, or added random noise. Since flow models are typically nonlinear algorithms, they amplify these initial errors, leading to poor generalizations. This paper proposes a framework to construct Normalizing Flows (NF), which demonstrates higher robustness agai…

    Submitted 9 October, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

  20. arXiv:2011.13055

    cs.CV

    Rethinking conditional GAN training: An approach using geometrically structured latent manifolds

    Authors: Sameera Ramasinghe, Moshiur Farazi, Salman Khan, Nick Barnes, Stephen Gould

    Abstract: Conditional GANs (cGAN), in their rudimentary form, suffer from critical drawbacks such as the lack of diversity in generated outputs and distortion between the latent and output manifolds. Although efforts have been made to improve results, they can suffer from unpleasant side-effects such as the topology mismatch between latent and output spaces. In contrast, we tackle this problem from a geomet…

    Submitted 2 June, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

  21. arXiv:2010.03132

    cs.LG cs.CV

    Conditional Generative Modeling via Learning the Latent Space

    Authors: Sameera Ramasinghe, Kanchana Ranasinghe, Salman Khan, Nick Barnes, Stephen Gould

    Abstract: Although deep learning has achieved appealing results on several machine learning tasks, most of the models are deterministic at inference, limiting their application to single-modal settings. We propose a novel general-purpose framework for conditional generation in multimodal spaces, that uses latent variables to model generalizable learning patterns while minimizing a family of regression cost…

    Submitted 8 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

  22. arXiv:1912.01800

    cs.CV

    Spectral-GANs for High-Resolution 3D Point-cloud Generation

    Authors: Sameera Ramasinghe, Salman Khan, Nick Barnes, Stephen Gould

    Abstract: Point-clouds are a popular choice for vision and graphics tasks due to their accurate shape description and direct acquisition from range-scanners. This demands the ability to synthesize and reconstruct high-quality point-clouds. Current deep generative models for 3D data generally work on simplified representations (e.g., voxelized objects) and cannot deal with the inherent redundancy and irregul…

    Submitted 19 July, 2020; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: 1 page: Added affiliations

  23. arXiv:1912.01454

    cs.CV

    Representation Learning on Unit Ball with 3D Roto-Translational Equivariance

    Authors: Sameera Ramasinghe, Salman Khan, Nick Barnes, Stephen Gould

    Abstract: Convolution is an integral operation that defines how the shape of one function is modified by another function. This powerful concept forms the basis of hierarchical feature learning in deep neural networks. Although performing convolution in Euclidean geometries is fairly straightforward, its extension to other topological spaces---such as a sphere ($\mathbb{S}^2$) or a unit ball (…

    Submitted 29 November, 2019; originally announced December 2019.

    Comments: arXiv admin note: text overlap with arXiv:1901.00616

  24. arXiv:1908.10209

    cs.LG stat.ML

    Blended Convolution and Synthesis for Efficient Discrimination of 3D Shapes

    Authors: Sameera Ramasinghe, Salman Khan, Nick Barnes, Stephen Gould

    Abstract: Existing networks directly learn feature representations on 3D point clouds for shape analysis. We argue that 3D point clouds are highly redundant and hold irregular (permutation-invariant) structure, which makes it difficult to achieve inter-class discrimination efficiently. In this paper, we propose a two-faceted solution to this problem that is seamlessly integrated in a single `Blended Convolu…

    Submitted 19 July, 2020; v1 submitted 24 August, 2019; originally announced August 2019.

    Comments: 10 pages: corrected typos and added affiliations. The IEEE Winter Conference on Applications of Computer Vision. 2020

  25. arXiv:1901.00616

    cs.LG cs.CV stat.ML

    Volumetric Convolution: Automatic Representation Learning in Unit Ball

    Authors: Sameera Ramasinghe, Salman Khan, Nick Barnes

    Abstract: Convolution is an efficient technique to obtain abstract feature representations using hierarchical layers in deep networks. Although performing convolution in Euclidean geometries is fairly straightforward, its extension to other topological spaces---such as a sphere ($\mathbb{S}^2$) or a unit ball ($\mathbb{B}^3$)---entails unique challenges. In this work, we propose a novel `\emph{volumetric co…

    Submitted 3 January, 2019; originally announced January 2019.

  26. Combined Static and Motion Features for Deep-Networks Based Activity Recognition in Videos

    Authors: Sameera Ramasinghe, Jathushan Rajasegaran, Vinoj Jayasundara, Kanchana Ranasinghe, Ranga Rodrigo, Ajith A. Pasqual

    Abstract: Activity recognition in videos in a deep-learning setting---or otherwise---uses both static and pre-computed motion components. The method of combining the two components, whilst keeping the burden on the deep network less, still remains uninvestigated. Moreover, it is not clear what the level of contribution of individual components is, and how to control the contribution. In this work, we use a…

    Submitted 16 October, 2018; originally announced October 2018.

    Journal ref: IEEE Transactions on Circuits and Systems for Video Technology (2017)

  27. arXiv:1810.06231

    cs.CV

    A Context-aware Capsule Network for Multi-label Classification

    Authors: Sameera Ramasinghe, C. D. Athuralya, Salman Khan

    Abstract: Recently proposed Capsule Network is a brain inspired architecture that brings a new paradigm to deep learning by modelling input domain variations through vector based representations. Despite being a seminal contribution, CapsNet does not explicitly model structured relationships between the detected entities and among the capsule features for related inputs. Motivated by the working of cortical…

    Submitted 16 October, 2018; v1 submitted 15 October, 2018; originally announced October 2018.