Skip to main content

Showing 1–6 of 6 results for author: Deora, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.05738  [pdf, other

    cs.LG math.OC stat.ML

    Implicit Bias and Fast Convergence Rates for Self-attention

    Authors: Bhavya Vasudeva, Puneesh Deora, Christos Thrampoulidis

    Abstract: Self-attention, the core mechanism of transformers, distinguishes them from traditional neural networks and drives their outstanding performance. Towards develo** the fundamental optimization principles of self-attention, we investigate the implicit bias of gradient descent (GD) in training a self-attention layer with fixed linear decoder in binary classification. Drawing inspiration from the st… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 41 pages, 7 figures

  2. arXiv:2310.12680  [pdf, other

    cs.LG math.OC stat.ML

    On the Optimization and Generalization of Multi-head Attention

    Authors: Puneesh Deora, Rouzbeh Ghaderi, Hossein Taheri, Christos Thrampoulidis

    Abstract: The training and generalization dynamics of the Transformer's core mechanism, namely the Attention mechanism, remain under-explored. Besides, existing analyses primarily focus on single-head attention. Inspired by the demonstrated benefits of overparameterization when training fully-connected networks, we investigate the potential optimization and generalization advantages of using multiple attent… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 48 page; presented in the Workshop on High-dimensional Learning Dynamics, ICML 2023

  3. arXiv:2108.09335  [pdf, other

    cs.CV cs.LG

    LoOp: Looking for Optimal Hard Negative Embeddings for Deep Metric Learning

    Authors: Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Umapada Pal, Sukalpa Chanda

    Abstract: Deep metric learning has been effectively used to learn distance metrics for different visual tasks like image retrieval, clustering, etc. In order to aid the training process, existing methods either use a hard mining strategy to extract the most informative samples or seek to generate hard synthetics using an additional network. Such approaches face different challenges and can lead to biased em… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: 17 pages, 9 figures, 5 tables. Accepted at The IEEE/CVF International Conference on Computer Vision (ICCV) 2021

  4. arXiv:2011.04994  [pdf, other

    cs.CV eess.IV

    AIM 2020 Challenge on Learned Image Signal Processing Pipeline

    Authors: Andrey Ignatov, Radu Timofte, Zhilu Zhang, Ming Liu, Haolin Wang, Wangmeng Zuo, Jiawei Zhang, Ruimao Zhang, Zhanglin Peng, Sijie Ren, Linhui Dai, Xiaohong Liu, Chengqi Li, Jun Chen, Yuichi Ito, Bhavya Vasudeva, Puneesh Deora, Umapada Pal, Zhenyu Guo, Yu Zhu, Tian Liang, Chenghua Li, Cong Leng, Zhihong Pan, Baopu Li , et al. (14 additional authors not shown)

    Abstract: This paper reviews the second AIM learned ISP challenge and provides the description of the proposed solutions and results. The participating teams were solving a real-world RAW-to-RGB map** problem, where to goal was to map the original low-quality RAW images captured by the Huawei P20 device to the same photos obtained with the Canon 5D DSLR camera. The considered task embraced a number of com… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: Published in ECCV 2020 Workshops (Advances in Image Manipulation), https://data.vision.ee.ethz.ch/cvl/aim20/

  5. arXiv:2002.10523  [pdf, other

    eess.IV cs.CV cs.LG

    Co-VeGAN: Complex-Valued Generative Adversarial Network for Compressive Sensing MR Image Reconstruction

    Authors: Bhavya Vasudeva, Puneesh Deora, Saumik Bhattacharya, Pyari Mohan Pradhan

    Abstract: Compressive sensing (CS) is widely used to reduce the acquisition time of magnetic resonance imaging (MRI). Although state-of-the-art deep learning based methods have been able to obtain fast, high-quality reconstruction of CS-MR images, their main drawback is that they treat complex-valued MRI data as real-valued entities. Most methods either extract the magnitude from the complex-valued entities… ▽ More

    Submitted 24 September, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

  6. arXiv:1910.06067  [pdf, other

    eess.IV cs.CV cs.LG

    Structure Preserving Compressive Sensing MRI Reconstruction using Generative Adversarial Networks

    Authors: Puneesh Deora, Bhavya Vasudeva, Saumik Bhattacharya, Pyari Mohan Pradhan

    Abstract: Compressive sensing magnetic resonance imaging (CS-MRI) accelerates the acquisition of MR images by breaking the Nyquist sampling limit. In this work, a novel generative adversarial network (GAN) based framework for CS-MRI reconstruction is proposed. Leveraging a combination of patch-based discriminator and structural similarity index based loss, our model focuses on preserving high frequency cont… ▽ More

    Submitted 26 April, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: Accepted in IEEE CVPR Workshop on NTIRE 2020