Skip to main content

Showing 1–12 of 12 results for author: Venkataramanan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.13484  [pdf, other

    eess.IV cs.CV

    Joint Quality Assessment and Example-Guided Image Processing by Disentangling Picture Appearance from Content

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Hassene Tmar, Alan C. Bovik

    Abstract: The deep learning revolution has strongly impacted low-level image processing tasks such as style/domain transfer, enhancement/restoration, and visual quality assessments. Despite often being treated separately, the aforementioned tasks share a common theme of understanding, editing, or enhancing the appearance of input images without modifying the underlying content. We leverage this observation… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  2. arXiv:2404.13452  [pdf, other

    eess.IV cs.CV

    Cut-FUNQUE: An Objective Quality Model for Compressed Tone-Mapped High Dynamic Range Videos

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Hassene Tmar, Alan C. Bovik

    Abstract: High Dynamic Range (HDR) videos have enjoyed a surge in popularity in recent years due to their ability to represent a wider range of contrast and color than Standard Dynamic Range (SDR) videos. Although HDR video capture has seen increasing popularity because of recent flagship mobile phones such as Apple iPhones, Google Pixels, and Samsung Galaxy phones, a broad swath of consumers still utilize… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  3. arXiv:2403.15061  [pdf, other

    eess.IV cs.CV

    Subjective Quality Assessment of Compressed Tone-Mapped High Dynamic Range Videos

    Authors: Abhinau K. Venkataramanan, Alan C. Bovik

    Abstract: High Dynamic Range (HDR) videos are able to represent wider ranges of contrasts and colors than Standard Dynamic Range (SDR) videos, giving more vivid experiences. Due to this, HDR videos are expected to grow into the dominant video modality of the future. However, HDR videos are incompatible with existing SDR displays, which form the majority of affordable consumer displays on the market. Because… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  4. arXiv:2312.08524  [pdf, other

    eess.IV cs.CV

    A FUNQUE Approach to the Quality Assessment of Compressed HDR Videos

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: Recent years have seen steady growth in the popularity and availability of High Dynamic Range (HDR) content, particularly videos, streamed over the internet. As a result, assessing the subjective quality of HDR videos, which are generally subjected to compression, is of increasing importance. In particular, we target the task of full-reference quality assessment of compressed HDR videos. The state… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  5. arXiv:2312.03993  [pdf, other

    cs.CV cs.AI

    Style Transfer to Calvin and Hobbes comics using Stable Diffusion

    Authors: Sloke Shrestha, Sundar Sripada V. S., Asvin Venkataramanan

    Abstract: This project report summarizes our journey to perform stable diffusion fine-tuning on a dataset containing Calvin and Hobbes comics. The purpose is to convert any given input image into the comic style of Calvin and Hobbes, essentially performing style transfer. We train stable-diffusion-v1.5 using Low Rank Adaptation (LoRA) to efficiently speed up the fine-tuning process. The diffusion itself is… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Project report for ECE 371Q Digital Image Processing at UT Austin

  6. arXiv:2311.15437  [pdf, ps, other

    eess.IV cs.CV math.ST

    Quality Modeling Under A Relaxed Natural Scene Statistics Model

    Authors: Abhinau K. Venkataramanan, Alan C. Bovik

    Abstract: Information-theoretic image quality assessment (IQA) models such as Visual Information Fidelity (VIF) and Spatio-temporal Reduced Reference Entropic Differences (ST-RRED) have enjoyed great success by seamlessly integrating natural scene statistics (NSS) with information theory. The Gaussian Scale Mixture (GSM) model that governs the wavelet subband coefficients of natural images forms the foundat… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  7. arXiv:2308.08431  [pdf, other

    cs.CV

    Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval

    Authors: Aishwarya Venkataramanan, Martin Laviale, Cédric Pradalier

    Abstract: Most of the research in content-based image retrieval (CBIR) focus on develo** robust feature representations that can effectively retrieve instances from a database of images that are visually similar to a query. However, the retrieved images sometimes contain results that are not semantically related to the query. To address this, we propose a method for CBIR that captures both visual and sema… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: Accepted in ICVS 2023

  8. arXiv:2305.13849  [pdf, other

    cs.CV cs.LG

    Gaussian Latent Representations for Uncertainty Estimation using Mahalanobis Distance in Deep Classifiers

    Authors: Aishwarya Venkataramanan, Assia Benbihi, Martin Laviale, Cedric Pradalier

    Abstract: Recent works show that the data distribution in a network's latent space is useful for estimating classification uncertainty and detecting Out-of-distribution (OOD) samples. To obtain a well-regularized latent space that is conducive for uncertainty estimation, existing methods bring in significant changes to model architectures and training procedures. In this paper, we present a lightweight, fas… ▽ More

    Submitted 29 September, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: ICCV Workshop 2023

  9. arXiv:2304.10669  [pdf, other

    eess.IV cs.CV

    Edge-Aware Image Color Appearance and Difference Modeling

    Authors: Abhinau K. Venkataramanan

    Abstract: The perception of color is one of the most important aspects of human vision. From an evolutionary perspective, the accurate perception of color is crucial to distinguishing friend from foe, and food from fatal poison. As a result, humans have developed a keen sense of color and are able to detect subtle differences in appearance, while also robustly identifying colors across illumination and view… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

  10. arXiv:2202.11241  [pdf, other

    cs.CV eess.IV

    FUNQUE: Fusion of Unified Quality Evaluators

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Alan C. Bovik

    Abstract: Fusion-based quality assessment has emerged as a powerful method for develo** high-performance quality models from quality models that individually achieve lower performances. A prominent example of such an algorithm is VMAF, which has been widely adopted as an industry standard for video quality prediction along with SSIM. In addition to advancing the state-of-the-art, it is imperative to allev… ▽ More

    Submitted 6 July, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: Accepted at ICIP 2022

  11. arXiv:2109.11891  [pdf, other

    cs.CV

    Tackling Inter-Class Similarity and Intra-Class Variance for Microscopic Image-based Classification

    Authors: Aishwarya Venkataramanan, Martin Laviale, Cécile Figus, Philippe Usseglio-Polatera, Cédric Pradalier

    Abstract: Automatic classification of aquatic microorganisms is based on the morphological features extracted from individual images. The current works on their classification do not consider the inter-class similarity and intra-class variance that causes misclassification. We are particularly interested in the case where variance within a class occurs due to discrete visual changes in microscopic images. I… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: 13th International Conference on Computer Vision Systems (2021)

  12. arXiv:2101.06354  [pdf, other

    eess.IV cs.CV cs.MM

    A Hitchhiker's Guide to Structural Similarity

    Authors: Abhinau K. Venkataramanan, Chengyang Wu, Alan C. Bovik, Ioannis Katsavounidis, Zafar Shahid

    Abstract: The Structural Similarity (SSIM) Index is a very widely used image/video quality model that continues to play an important role in the perceptual evaluation of compression algorithms, encoding recipes and numerous other image/video processing algorithms. Several public implementations of the SSIM and Multiscale-SSIM (MS-SSIM) algorithms have been developed, which differ in efficiency and performan… ▽ More

    Submitted 30 January, 2021; v1 submitted 15 January, 2021; originally announced January 2021.

    Comments: Submitted final version to IEEE Access on January 30, 2021