Skip to main content

Showing 1–17 of 17 results for author: Sharma, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2311.00082  [pdf, other

    eess.IV

    UAV Immersive Video Streaming: A Comprehensive Survey, Benchmarking, and Open Challenges

    Authors: Mohit K. Sharma, Chen-Feng Liu, Ibrahim Farhat, Nassim Sehad, Wassim Hamidouche, Merouane Debbah

    Abstract: Over the past decade, the utilization of UAVs has witnessed significant growth, owing to their agility, rapid deployment, and maneuverability. In particular, the use of UAV-mounted 360-degree cameras to capture omnidirectional videos has enabled truly immersive viewing experiences with up to 6DoF. However, achieving this immersive experience necessitates encoding omnidirectional videos in high res… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  2. arXiv:2210.01447  [pdf, other

    cs.CV eess.IV

    A Novel Light Field Coding Scheme Based on Deep Belief Network & Weighted Binary Images for Additive Layered Displays

    Authors: Sally Khaidem, Mansi Sharma

    Abstract: Light-field displays create an immersive experience by providing binocular depth sensation and motion parallax. Stacking light attenuating layers is one approach to implement a light field display with a broader depth of field, wide viewing angles and high resolution. Due to the transparent holographic optical element (HOE) layers, additive layered displays can be integrated into augmented reality… ▽ More

    Submitted 21 April, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: The paper is under consideration at Pattern Recognition Letters

  3. arXiv:2206.11095  [pdf, other

    cs.CV eess.IV

    A High Resolution Multi-exposure Stereoscopic Image & Video Database of Natural Scenes

    Authors: Rohit Choudhary, Mansi Sharma, Aditya Wadaskar

    Abstract: Immersive displays such as VR headsets, AR glasses, Multiview displays, Free point televisions have emerged as a new class of display technologies in recent years, offering a better visual experience and viewer engagement as compared to conventional displays. With the evolution of 3D video and display technologies, the consumer market for High Dynamic Range (HDR) cameras and displays is quickly gr… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  4. arXiv:2206.11048  [pdf, other

    eess.IV cs.CV cs.LG

    Automated GI tract segmentation using deep learning

    Authors: Manhar Sharma

    Abstract: The job of Radiation oncologists is to deliver x-ray beams pointed toward the tumor and at the same time avoid the stomach and intestines. With MR-Linacs (magnetic resonance imaging and linear accelerator systems), oncologists can visualize the position of the tumor and allow for precise dose according to tumor cell presence which can vary from day to day. The current job of outlining the position… ▽ More

    Submitted 5 September, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: 8 pages, 9 figures

  5. arXiv:2206.10131  [pdf, other

    cs.CV eess.IV

    An Integrated Representation & Compression Scheme Based on Convolutional Autoencoders with 4D DCT Perceptual Encoding for High Dynamic Range Light Fields

    Authors: Sally Khaidem, Mansi Sharma

    Abstract: The emerging and existing light field displays are highly capable of realistic presentation of 3D scenes on auto-stereoscopic glasses-free platforms. The light field size is a major drawback while utilising 3D displays and streaming purposes. When a light field is of high dynamic range, the size increases drastically. In this paper, we propose a novel compression algorithm for a high dynamic range… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  6. arXiv:2202.06166  [pdf, other

    eess.SP physics.ins-det physics.space-ph

    Do cities have a unique magnetic pulse?

    Authors: Vincent Dumont, Trevor A. Bowen, Roger Roglans, Gregory Dobler, Mohit S. Sharma, Andy Karpf, Stuart D. Bale, Arne Wickenbrock, Elena Zhivun, Tom Kornack, Jonathan S. Wurtele, Dmitry Budker

    Abstract: We present a comparative analysis of urban magnetic fields between two American cities: Berkeley (California) and Brooklyn Borough of New York City (New York). Our analysis uses data taken over a four-week period during which magnetic field data were continuously recorded using a fluxgate magnetometer of 70 pT/$\sqrt{\mathrm{Hz}}$ sensitivity. We identified significant differences in the magnetic… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

    Comments: 8 pages, 7 figures

    Journal ref: Journal of Applied Physics 131, 204902 (2022)

  7. arXiv:2108.12399  [pdf, other

    eess.IV cs.CV

    A Novel Hierarchical Light Field Coding Scheme Based on Hybrid Stacked Multiplicative Layers and Fourier Disparity Layers for Glasses-Free 3D Displays

    Authors: Joshitha Ravishankar, Mansi Sharma

    Abstract: This paper presents a novel hierarchical coding scheme for light fields based on transmittance patterns of low-rank multiplicative layers and Fourier disparity layers. The proposed scheme identifies multiplicative layers of light field view subsets optimized using a convolutional neural network for different scanning orders. Our approach exploits the hidden low-rank structure in the multiplicative… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

  8. arXiv:2106.03851  [pdf, other

    cs.SD cs.LG eess.AS

    Impact of data-splits on generalization: Identifying COVID-19 from cough and context

    Authors: Makkunda Sharma, Nikhil Shenoy, Jigar Doshi, Piyush Bagad, Aman Dalmia, Parag Bhamare, Amrita Mahale, Saurabh Rane, Neeraj Agrawal, Rahul Panicker

    Abstract: Rapidly scaling screening, testing and quarantine has shown to be an effective strategy to combat the COVID-19 pandemic. We consider the application of deep learning techniques to distinguish individuals with COVID from non-COVID by using data acquirable from a phone. Using cough and context (symptoms and meta-data) represent such a promising approach. Several independent works in this direction h… ▽ More

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: Published as a workshop paper at ICLR 2021 AI for Public Health Workshop and ICLR 20201 Machine Learning for Preventing and Combating Pandemics Workshop

  9. arXiv:2104.04678  [pdf, other

    cs.MM eess.IV

    A Flexible Lossy Depth Video Coding Scheme Based on Low-rank Tensor Modelling and HEVC Intra Prediction for Free Viewpoint Video

    Authors: Mansi Sharma, Santosh Kumar

    Abstract: The compression quality losses of depth sequences determine quality of view synthesis in free-viewpoint video. The depth map intra prediction in 3D extensions of the HEVC applies intra modes with auxiliary depth modeling modes (DMMs) to better preserve depth edges and handle motion discontinuities. Although such modes enable high efficiency compression, but at the cost of very high encoding comple… ▽ More

    Submitted 10 April, 2021; originally announced April 2021.

  10. arXiv:2101.10039  [pdf, other

    cs.MM cs.CV eess.IV

    Latent Factor Modeling of Users Subjective Perception for Stereoscopic 3D Video Recommendation

    Authors: Balasubramanyam Appina, Mansi Sharma, Santosh Kumar

    Abstract: Numerous stereoscopic 3D movies are released every year to theaters and created large revenues. Despite the improvement in stereo capturing and 3D video post-production technology, stereoscopic artifacts which cause viewer discomfort continue to appear even in high-budget films. Existing automatic 3D video quality measurement tools can detect distortions in stereoscopic images or videos, but they… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

  11. arXiv:2008.08243  [pdf, other

    cs.RO eess.SP

    Enabling Remote Whole-Body Control with 5G Edge Computing

    Authors: Huaijiang Zhu, Manali Sharma, Kai Pfeiffer, Marco Mezzavilla, Jia Shen, Sundeep Rangan, Ludovic Righetti

    Abstract: Real-world applications require light-weight, energy-efficient, fully autonomous robots. Yet, increasing autonomy is oftentimes synonymous with escalating computational requirements. It might thus be desirable to offload intensive computation--not only sensing and planning, but also low-level whole-body control--to remote servers in order to reduce on-board computational needs. Fifth Generation (5… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

  12. arXiv:2005.00698  [pdf

    cs.HC cs.LG eess.SP

    Deep ConvLSTM with self-attention for human activity decoding using wearables

    Authors: Satya P. Singh, Aimé Lay-Ekuakille, Deepak Gangwar, Madan Kumar Sharma, Sukrit Gupta

    Abstract: Decoding human activity accurately from wearable sensors can aid in applications related to healthcare and context awareness. The present approaches in this domain use recurrent and/or convolutional models to capture the spatio-temporal features from time-series data from multiple sensors. We propose a deep neural network architecture that not only captures the spatio-temporal features of multiple… ▽ More

    Submitted 17 December, 2020; v1 submitted 2 May, 2020; originally announced May 2020.

    Comments: 8 pages, 2 figures, 3 tables. IEEE Sensors Journal, 2020

  13. arXiv:2004.09347  [pdf, other

    eess.AS cs.CL cs.LG cs.SD stat.ML

    End-to-End Whisper to Natural Speech Conversion using Modified Transformer Network

    Authors: Abhishek Niranjan, Mukesh Sharma, Sai Bharath Chandra Gutha, M Ali Basha Shaik

    Abstract: Machine recognition of an atypical speech like whispered speech, is a challenging task. We introduce whisper-to-natural-speech conversion using sequence-to-sequence approach by proposing enhanced transformer architecture, which uses both parallel and non-parallel data. We investigate different features like Mel frequency cepstral coefficients and smoothed spectral features. The proposed networks a… ▽ More

    Submitted 5 April, 2021; v1 submitted 20 April, 2020; originally announced April 2020.

  14. arXiv:2001.01469  [pdf, other

    cs.CV cs.LG eess.IV

    TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images

    Authors: Shubham Paliwal, Vishwanath D, Rohit Rahul, Monika Sharma, Lovekesh Vig

    Abstract: With the widespread use of mobile phones and scanners to photograph and upload documents, the need for extracting the information trapped in unstructured document images such as retail receipts, insurance claim forms and financial invoices is becoming more acute. A major hurdle to this objective is that these images often contain information in the form of tables and extracting data from tabular s… ▽ More

    Submitted 6 January, 2020; originally announced January 2020.

  15. arXiv:1907.04294  [pdf, other

    cs.IR cs.SD eess.AS

    An Attention Mechanism for Musical Instrument Recognition

    Authors: Siddharth Gururani, Mohit Sharma, Alexander Lerch

    Abstract: While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings. Datasets for polyphonic instrument recognition can be categorized into roughly two categories. Some, such as MedleyDB, have strong per-frame instrument activity annotations but are usually small… ▽ More

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: To appear in: Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Delft, 2019

  16. arXiv:1903.03652  [pdf, ps, other

    eess.SP cs.LG math.OC

    Deep Learning Based Online Power Control for Large Energy Harvesting Networks

    Authors: Mohit K Sharma, Alessio Zappone, Merouane Debbah, Mohamad Assaad

    Abstract: In this paper, we propose a deep learning based approach to design online power control policies for large EH networks, which are often intractable stochastic control problems. In the proposed approach, for a given EH network, the optimal online power control rule is learned by training a deep neural network (DNN), using the solution of offline policy design problem. Under the proposed scheme, in… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

    Comments: 5 pages, to appear at ICASSP 2019

  17. arXiv:1903.03195  [pdf, other

    cs.SD eess.AS

    The life of a New York City noise sensor network

    Authors: Charlie Mydlarz, Mohit Sharma, Yitzchak Lockerman, Ben Steers, Claudio Silva, Juan Pablo Bello

    Abstract: Noise pollution is one of the topmost quality of life issues for urban residents in the United States. Continued exposure to high levels of noise has proven effects on health, including acute effects such as sleep disruption, and long-term effects such as hypertension, heart disease, and hearing loss. To investigate and ultimately aid in the mitigation of urban noise, a network of 55 sensor nodes… ▽ More

    Submitted 26 March, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: This article belongs to the Section Intelligent Sensors, 24 pages, 15 figures, 3 tables, 45 references

    Journal ref: Sensors 2019, 19, 1415