Skip to main content

Showing 1–16 of 16 results for author: Ranjan, R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.13370  [pdf, other

    eess.IV cs.CV cs.LG

    Low-Resolution Chest X-ray Classification via Knowledge Distillation and Multi-task Learning

    Authors: Yasmeena Akhter, Rishabh Ranjan, Richa Singh, Mayank Vatsa

    Abstract: This research addresses the challenges of diagnosing chest X-rays (CXRs) at low resolutions, a common limitation in resource-constrained healthcare settings. High-resolution CXR imaging is crucial for identifying small but critical anomalies, such as nodules or opacities. However, when images are downsized for processing in Computer-Aided Diagnosis (CAD) systems, vital spatial details and receptiv… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: IEEE ISBI 2024

  2. arXiv:2402.09233  [pdf, other

    cs.RO cs.AI cs.MA eess.SY math.OC

    Design and Realization of a Benchmarking Testbed for Evaluating Autonomous Platooning Algorithms

    Authors: Michael Shaham, Risha Ranjan, Engin Kirda, Taskin Padir

    Abstract: Autonomous vehicle platoons present near- and long-term opportunities to enhance operational efficiencies and save lives. The past 30 years have seen rapid development in the autonomous driving space, enabling new technologies that will alleviate the strain placed on human drivers and reduce vehicle emissions. This paper introduces a testbed for evaluating and benchmarking platooning algorithms on… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: To be published in International Symposium on Experimental Robotics, 2023

  3. arXiv:2402.02634  [pdf, other

    cs.CV cs.LG eess.IV

    Key-Graph Transformer for Image Restoration

    Authors: Bin Ren, Yawei Li, **gyun Liang, Rakesh Ranjan, Mengyuan Liu, Rita Cucchiara, Luc Van Gool, Nicu Sebe

    Abstract: While it is crucial to capture global information for effective image restoration (IR), integrating such cues into transformer-based methods becomes computationally expensive, especially with high input resolution. Furthermore, the self-attention mechanism in transformers is prone to considering unnecessary global cues from unrelated objects or regions, introducing computational inefficiencies. In… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 9 pages, 6 figures

  4. arXiv:2312.14239  [pdf, other

    cs.CV eess.IV

    PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar

    Authors: Tzofi Klinghoffer, Xiaoyu Xiang, Siddharth Somasundaram, Yuchen Fan, Christian Richardt, Ramesh Raskar, Rakesh Ranjan

    Abstract: 3D reconstruction from a single-view is challenging because of the ambiguity from monocular cues and lack of information about occluded regions. Neural radiance fields (NeRF), while popular for view synthesis and 3D reconstruction, are typically reliant on multi-view images. Existing methods for single-view 3D reconstruction with NeRF rely on either data priors to hallucinate views of occluded reg… ▽ More

    Submitted 5 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Project Page: https://platonerf.github.io/

  5. arXiv:2312.03640  [pdf, other

    eess.IV cs.CV

    Training Neural Networks on RAW and HDR Images for Restoration Tasks

    Authors: Lei Luo, Alexandre Chapiro, Xiaoyu Xiang, Yuchen Fan, Rakesh Ranjan, Rafal Mantiuk

    Abstract: The vast majority of standard image and video content available online is represented in display-encoded color spaces, in which pixel values are conveniently scaled to a limited range (0-1) and the color distribution is approximately perceptually uniform. In contrast, both camera RAW and high dynamic range (HDR) images are often represented in linear color spaces, in which color values are linearl… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  6. arXiv:2311.11325  [pdf, other

    cs.CV eess.IV

    MoVideo: Motion-Aware Video Generation with Diffusion Models

    Authors: **gyun Liang, Yuchen Fan, Kai Zhang, Radu Timofte, Luc Van Gool, Rakesh Ranjan

    Abstract: While recent years have witnessed great progress on using diffusion models for video generation, most of them are simple extensions of image generation frameworks, which fail to explicitly consider one of the key differences between videos and images, i.e., motion. In this paper, we propose a novel motion-aware video generation (MoVideo) framework that takes motion into consideration from two aspe… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: project homepage: https://**gyunliang.github.io/MoVideo

  7. arXiv:2310.09653  [pdf, other

    cs.SD cs.AI eess.AS

    SelfVC: Voice Conversion With Iterative Refinement using Self Transformations

    Authors: Paarth Neekhara, Shehzeen Hussain, Rafael Valle, Boris Ginsburg, Rishabh Ranjan, Shlomo Dubnov, Farinaz Koushanfar, Julian McAuley

    Abstract: We propose SelfVC, a training strategy to iteratively improve a voice conversion model with self-synthesized examples. Previous efforts on voice conversion focus on factorizing speech into explicitly disentangled representations that separately encode speaker characteristics and linguistic content. However, disentangling speech representations to capture such attributes using task-specific loss te… ▽ More

    Submitted 3 May, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Accepted at ICML 2024

  8. arXiv:2310.08805  [pdf, other

    eess.IV cs.CV

    Two-Stage Deep Learning Framework for Quality Assessment of Left Atrial Late Gadolinium Enhanced MRI Images

    Authors: K M Arefeen Sultan, Benjamin Orkild, Alan Morris, Eugene Kholmovski, Erik Bieging, Eugene Kwan, Ravi Ranjan, Ed DiBella, Shireen Elhabian

    Abstract: Accurate assessment of left atrial fibrosis in patients with atrial fibrillation relies on high-quality 3D late gadolinium enhancement (LGE) MRI images. However, obtaining such images is challenging due to patient motion, changing breathing patterns, or sub-optimal choice of pulse sequence parameters. Automated assessment of LGE-MRI image diagnostic quality is clinically significant as it would en… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted to STACOM 2023. 11 pages, 3 figures

  9. arXiv:2307.06669  [pdf, other

    cs.SD cs.CR eess.AS

    Uncovering the Deceptions: An Analysis on Audio Spoofing Detection and Future Prospects

    Authors: Rishabh Ranjan, Mayank Vatsa, Richa Singh

    Abstract: Audio has become an increasingly crucial biometric modality due to its ability to provide an intuitive way for humans to interact with machines. It is currently being used for a range of applications, including person authentication to banking to virtual assistants. Research has shown that these systems are also susceptible to spoofing and attacks. Therefore, protecting audio processing systems ag… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Accepted in IJCAI 2023

  10. arXiv:2211.08658  [pdf, other

    eess.IV cs.CV

    Consistent Direct Time-of-Flight Video Depth Super-Resolution

    Authors: Zhanghao Sun, Wei Ye, **hui Xiong, Gyeongmin Choe, Jialiang Wang, Shuochen Su, Rakesh Ranjan

    Abstract: Direct time-of-flight (dToF) sensors are promising for next-generation on-device 3D sensing. However, limited by manufacturing capabilities in a compact module, the dToF data has a low spatial resolution (e.g., $\sim 20\times30$ for iPhone dToF), and it requires a super-resolution step before being passed to downstream tasks. In this paper, we solve this super-resolution problem by fusing the low-… ▽ More

    Submitted 3 May, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  11. arXiv:2206.02146  [pdf, other

    cs.CV eess.IV

    Recurrent Video Restoration Transformer with Guided Deformable Attention

    Authors: **gyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, Jiezhang Cao, Kai Zhang, Radu Timofte, Luc Van Gool

    Abstract: Video restoration aims at restoring multiple high-quality frames from multiple low-quality frames. Existing video restoration methods generally fall into two extreme cases, i.e., they either restore all frames in parallel or restore the video frame by frame in a recurrent way, which would result in different merits and drawbacks. Typically, the former has the advantage of temporal information fusi… ▽ More

    Submitted 12 November, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: Accepted by NeurIPS 2022. Code: https://github.com/**gyunLiang/RVRT

  12. arXiv:2201.12288  [pdf, other

    cs.CV eess.IV

    VRT: A Video Restoration Transformer

    Authors: **gyun Liang, Jiezhang Cao, Yuchen Fan, Kai Zhang, Rakesh Ranjan, Yawei Li, Radu Timofte, Luc Van Gool

    Abstract: Video restoration (e.g., video super-resolution) aims to restore high-quality frames from low-quality frames. Different from single image restoration, video restoration generally requires to utilize temporal information from multiple adjacent but usually misaligned video frames. Existing deep methods generally tackle with this by exploiting a sliding window strategy or a recurrent architecture, wh… ▽ More

    Submitted 15 June, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: add results on VFI and STVSR; SOTA results (+up to 2.16dB) on video SR, video deblurring, video denoising, video frame interpolation and space-time video super-resolution. Code: https://github.com/**gyunLiang/VRT

  13. Evaluating Sensor Data Quality in Internet ofThings Smart Agriculture Applications

    Authors: Kaneez Fizza, Prem Prakash Jayaraman, Abhik Banerjee, Dimitrios Georgakopoulos, Rajiv Ranjan

    Abstract: The unprecedented growth of Internet of Things (IoT) and its applications in areas such as Smart Agriculture compels the need to devise newer ways for evaluating the quality of such applications. While existing models for application quality focus on the quality experienced by the end-user (captured using likert scale), IoT applications have minimal human involvement and rely on machine to machine… ▽ More

    Submitted 28 April, 2021; originally announced May 2021.

    Comments: Technical Report under review with IEEE micro

    Report number: 1937-4143

    Journal ref: IEEE Micro 21 December 2021

  14. arXiv:2103.01524  [pdf, other

    eess.IV cs.CV cs.LG

    Feature-Align Network with Knowledge Distillation for Efficient Denoising

    Authors: Lucas D. Young, Fitsum A. Reda, Rakesh Ranjan, Jon Morton, Jun Hu, Yazhu Ling, Xiaoyu Xiang, David Liu, Vikas Chandra

    Abstract: We propose an efficient neural network for RAW image denoising. Although neural network-based denoising has been extensively studied for image restoration, little attention has been given to efficient denoising for compute limited and power sensitive devices, such as smartphones and smartwatches. In this paper, we present a novel architecture and a suite of training techniques for high quality den… ▽ More

    Submitted 17 March, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    MSC Class: 94A08 (Primary) 68T07; 65D19 (Secondary) ACM Class: I.4.5; I.2.6

  15. arXiv:2012.02228  [pdf, other

    cs.CV cs.LG eess.IV

    EVRNet: Efficient Video Restoration on Edge Devices

    Authors: Sachin Mehta, Amit Kumar, Fitsum Reda, Varun Nasery, Vikram Mulukutla, Rakesh Ranjan, Vikas Chandra

    Abstract: Video transmission applications (e.g., conferencing) are gaining momentum, especially in times of global health pandemic. Video signals are transmitted over lossy channels, resulting in low-quality received signals. To restore videos on recipient edge devices in real-time, we introduce an efficient video restoration network, EVRNet. EVRNet efficiently allocates parameters inside the network using… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: Technical report

  16. arXiv:1911.11373  [pdf, other

    eess.AS cs.SD

    A two-step system for sound event localization and detection

    Authors: T. N. T. Nguyen, D. L. Jones, R. Ranjan, S. Jayabalan, W. S. Gan

    Abstract: Sound event detection and sound event localization requires different features from audio input signals. While sound event detection mainly relies on time-frequency patterns to distinguish different event classes, sound event localization uses magnitude or phase differences between microphones to estimate source directions. Therefore, we propose a two-step system to do sound event localization and… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: 5 pages