Skip to main content

Showing 1–4 of 4 results for author: Dagli, R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.10724  [pdf, other

    eess.IV cs.CV cs.LG

    Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft

    Authors: Ian Vyse, Rishit Dagli, Dav Vrat Chadha, John P. Ma, Hector Chen, Isha Ruparelia, Prithvi Seran, Matthew Xie, Eesa Aamer, Aidan Armstrong, Naveen Black, Ben Borstein, Kevin Caldwell, Orrin Dahanaggamaarachchi, Joe Dai, Abeer Fatima, Stephanie Lu, Maxime Michet, Anoushka Paul, Carrie Ann Po, Shivesh Prakash, Noa Prosser, Riddhiman Roy, Mirai Shinjo, Iliya Shofman , et al. (4 additional authors not shown)

    Abstract: Satellite remote sensing missions have gained popularity over the past fifteen years due to their ability to cover large swaths of land at regular intervals, making them ideal for monitoring environmental trends. The FINCH mission, a 3U+ CubeSat equipped with a hyperspectral camera, aims to monitor crop residue cover in agricultural fields. Although hyperspectral imaging captures both spectral and… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: To appear in 38th Annual Small Satellite Conference

  2. arXiv:2406.06612  [pdf, other

    cs.CV cs.LG cs.SD eess.AS

    SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

    Authors: Rishit Dagli, Shivesh Prakash, Robert Wu, Houman Khosravani

    Abstract: Generating combined visual and auditory sensory experiences is critical for the consumption of immersive content. Recent advances in neural generative models have enabled the creation of high-resolution content across multiple modalities such as images, text, speech, and videos. Despite these successes, there remains a significant gap in the generation of high-quality spatial audio that complement… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project Page: https://see2sound.github.io/

  3. arXiv:2402.18575  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    DiffuseRAW: End-to-End Generative RAW Image Processing for Low-Light Images

    Authors: Rishit Dagli

    Abstract: Imaging under extremely low-light conditions presents a significant challenge and is an ill-posed problem due to the low signal-to-noise ratio (SNR) caused by minimal photon capture. Previously, diffusion models have been used for multiple kinds of generative tasks and image-to-image tasks, however, these models work as a post-processing step. These diffusion models are trained on processed images… ▽ More

    Submitted 12 December, 2023; originally announced February 2024.

  4. arXiv:2402.10100  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Tuning In: Analysis of Audio Classifier Performance in Clinical Settings with Limited Data

    Authors: Hamza Mahdi, Eptehal Nashnoush, Rami Saab, Arjun Balachandar, Rishit Dagli, Lucas X. Perri, Houman Khosravani

    Abstract: This study assesses deep learning models for audio classification in a clinical setting with the constraint of small datasets reflecting real-world prospective data collection. We analyze CNNs, including DenseNet and ConvNeXt, alongside transformer models like ViT, SWIN, and AST, and compare them against pre-trained audio models such as YAMNet and VGGish. Our method highlights the benefits of pre-… ▽ More

    Submitted 5 April, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: CHIL 2024