Skip to main content

Showing 1–10 of 10 results for author: Mahapatra, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20531  [pdf, ps, other

    cs.LG

    Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation

    Authors: Louis L. Chen, Bobbie Chern, Eric Eckstrand, Amogh Mahapatra, Johannes O. Royset

    Abstract: Labeling errors in datasets are common, if not systematic, in practice. They naturally arise in a variety of contexts-human labeling, noisy labeling, and weak labeling (i.e., image classification), for example. This presents a persistent and pervasive stress on machine learning practice. In particular, neural network (NN) architectures can withstand minor amounts of dataset imperfection with tradi… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2404.12391  [pdf, other

    cs.CV cs.GR cs.LG

    On the Content Bias in Fréchet Video Distance

    Authors: Songwei Ge, Aniruddha Mahapatra, Gaurav Parmar, Jun-Yan Zhu, Jia-Bin Huang

    Abstract: Fréchet Video Distance (FVD), a prominent metric for evaluating video generation models, is known to conflict with human perception occasionally. In this paper, we aim to explore the extent of FVD's bias toward per-frame quality over temporal realism and identify its sources. We first quantify the FVD's sensitivity to the temporal axis by decoupling the frame and motion quality and find that the F… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project webpage: https://content-debiased-fvd.github.io/

  3. arXiv:2311.17138  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Shadows Don't Lie and Lines Can't Bend! Generative Models don't know Projective Geometry...for now

    Authors: Ayush Sarkar, Hanlin Mai, Amitabh Mahapatra, Svetlana Lazebnik, D. A. Forsyth, Anand Bhattad

    Abstract: Generative models can produce impressively realistic images. This paper demonstrates that generated images have geometric features different from those of real images. We build a set of collections of generated images, prequalified to fool simple, signal-based classifiers into believing they are real. We then show that prequalified generated images can be identified reliably by classifiers that on… ▽ More

    Submitted 30 May, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Project Page: https://projective-geometry.github.io | First three authors contributed equally

  4. arXiv:2307.03190  [pdf, other

    cs.CV cs.GR cs.LG

    Text-Guided Synthesis of Eulerian Cinemagraphs

    Authors: Aniruddha Mahapatra, Aliaksandr Siarohin, Hsin-Ying Lee, Sergey Tulyakov, Jun-Yan Zhu

    Abstract: We introduce Text2Cinemagraph, a fully automated method for creating cinemagraphs from text descriptions - an especially challenging task when prompts feature imaginary elements and artistic styles, given the complexity of interpreting the semantics and motions of these images. We focus on cinemagraphs of fluid elements, such as flowing rivers, and drifting clouds, which exhibit continuous motion… ▽ More

    Submitted 25 September, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Project website: https://text2cinemagraph.github.io/website/

  5. arXiv:2306.02680  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion

    Authors: Ahana Deb, Sayan Nag, Ayan Mahapatra, Soumitri Chattopadhyay, Aritra Marik, Pijush Kanti Gayen, Shankha Sanyal, Archi Banerjee, Samir Karmakar

    Abstract: Spoken languages often utilise intonation, rhythm, intensity, and structure, to communicate intention, which can be interpreted differently depending on the rhythm of speech of their utterance. These speech acts provide the foundation of communication and are unique in expression to the language. Recent advancements in attention-based models, demonstrating their ability to learn powerful represent… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted at INTERSPEECH 2023

  6. arXiv:2207.03729  [pdf, other

    cs.CV

    GEMS: Scene Expansion using Generative Models of Graphs

    Authors: Rishi Agarwal, Tirupati Saketh Chandra, Vaidehi Patil, Aniruddha Mahapatra, Kuldeep Kulkarni, Vishwa Vinay

    Abstract: Applications based on image retrieval require editing and associating in intermediate spaces that are representative of the high-level concepts like objects and their relationships rather than dense, pixel-level representations like RGB images or semantic-label maps. We focus on one such representation, scene graphs, and propose a novel scene expansion task where we enrich an input seed graph by a… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  7. Fuzzing+Hardware Performance Counters-Based Detection of Algorithm Subversion Attacks on Post-Quantum Signature Schemes

    Authors: Animesh Basak Chowdhury, Anushree Mahapatra, Deepraj Soni, Ramesh Karri

    Abstract: NIST is standardizing Post Quantum Cryptography (PQC) algorithms that are resilient to the computational capability of quantum computers. Past works show malicious subversion with cryptographic software (algorithm subversion attacks) that weaken the implementations. We show that PQC digital signature codes can be subverted in line with previously reported flawed implementations that generate verif… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: Accepted in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

  8. arXiv:2112.03051  [pdf, other

    cs.CV

    Controllable Animation of Fluid Elements in Still Images

    Authors: Aniruddha Mahapatra, Kuldeep Kulkarni

    Abstract: We propose a method to interactively control the animation of fluid elements in still images to generate cinemagraphs. Specifically, we focus on the animation of fluid elements like water, smoke, fire, which have the properties of repeating textures and continuous fluid motion. Taking inspiration from prior works, we represent the motion of such fluid elements in the image in the form of a constan… ▽ More

    Submitted 25 September, 2023; v1 submitted 6 December, 2021; originally announced December 2021.

  9. arXiv:2108.13702  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    SemIE: Semantically-aware Image Extrapolation

    Authors: Bholeshwar Khurana, Soumya Ranjan Dash, Abhishek Bhatia, Aniruddha Mahapatra, Hrituraj Singh, Kuldeep Kulkarni

    Abstract: We propose a semantically-aware novel paradigm to perform image extrapolation that enables the addition of new object instances. All previous methods are limited in their capability of extrapolation to merely extending the already existing objects in the image. However, our proposed approach focuses not only on (i) extending the already present objects but also on (ii) adding new objects in the ex… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: To appear in International Conference on Computer Vision (ICCV) 2021. Project URL: https://semie-iccv.github.io

  10. arXiv:1407.7626  [pdf

    cs.CV

    A Survey on Two Dimensional Cellular Automata and Its Application in Image Processing

    Authors: Deepak Ranjan Nayak, Prashanta Kumar Patra, Amitav Mahapatra

    Abstract: Parallel algorithms for solving any image processing task is a highly demanded approach in the modern world. Cellular Automata (CA) are the most common and simple models of parallel computation. So, CA has been successfully used in the domain of image processing for the last couple of years. This paper provides a survey of available literatures of some methodologies employed by different researche… ▽ More

    Submitted 29 July, 2014; originally announced July 2014.

    Comments: 10 pages, 10 figures, 4 tables