Search | arXiv e-print repository

Lossless Image Compression Using Multi-level Dictionaries: Binary Images

Authors: Samar Agnihotri, Renu Rameshan, Ritwik Ghosal

Abstract: Lossless image compression is required in various applications to reduce storage or transmission costs of images, while requiring the reconstructed images to have zero information loss compared to the original. Existing lossless image compression methods either have simple design but poor compression performance, or complex design, better performance, but with no performance guarantees. In our endeavor to develop a lossless image compression method with low complexity and guaranteed performance, we argue that compressibility of a color image is essentially derived from the patterns in its spatial structure, intensity variations, and color variations. Thus, we divide the overall design of a lossless image compression scheme into three parts that exploit corresponding redundancies. We further argue that the binarized version of an image captures its fundamental spatial structure and in this work, we propose a scheme for lossless compression of binary images. The proposed scheme first learns dictionaries of $16\times16$, $8\times8$, $4\times4$, and $2\times 2$ square pixel patterns from various datasets of binary images. It then uses these dictionaries to encode binary images. These dictionaries have various interesting properties that are further exploited to construct an efficient scheme.
Our preliminary results show that the proposed scheme consistently outperforms existing conventional and learning based lossless compression approaches, and provides, on average, as much as $1.5\times$ better performance than a common general purpose lossless compression scheme (WebP), more than $3\times$ better performance than a state of the art learning based scheme, and better performance than a specialized scheme for binary image compression (JBIG2).

Submitted 5 June, 2024; originally announced June 2024.

Comments: 11 pages, 7 figures, and 5 tables

arXiv:2404.08238 [pdf, other]

Simulation of a Vision Correction Display System

Authors: Vidya Sunil, Renu M Rameshan

Abstract: Eyes serve as our primary sensory organs, responsible for processing up to 80% of our sensory input. However, common visual aberrations like myopia and hyperopia affect a significant portion of the global population. This paper focuses on simulating a Vision Correction Display (VCD) to enhance the visual experience of individuals with various visual impairments. Utilising Blender, we digitally model the functionality of a VCD in correcting refractive errors such as myopia and hyperopia. With these simulations we can see potential improvements in visual acuity and comfort. These simulations provide valuable insights for the design and development of future VCD technologies, ultimately advancing accessibility and usability for individuals with visual challenges.

Submitted 12 April, 2024; originally announced April 2024.

arXiv:2310.05943 [pdf, other]

Analysis of Learned Features and Framework for Potato Disease Detection

Authors: Shikha Gupta, Soma Chakraborty, Renu Rameshan

Abstract: For applications like plant disease detection, usually, a model is trained on publicly available data and tested on field data. This means that the test data distribution is not the same as the training data distribution, which affects the classifier performance adversely. We handle this dataset shift by ensuring that the features are learned from disease spots in the leaf or healthy regions, as applicable. This is achieved using a faster Region-based convolutional neural network (RCNN) as one of the solutions and an attention-based network as the other. The average classification accuracies of these classifiers are approximately 95% when evaluated on the test set corresponding to their training dataset. These classifiers also performed equivalently, with an average score of 84% on a dataset not seen during the training phase.

Submitted 29 August, 2023; originally announced October 2023.

Comments: 15 pages, 8 figures

arXiv:2210.09846 [pdf, other]

G-PECNet: Towards a Generalizable Pedestrian Trajectory Prediction System

Authors: Aryan Garg, Renu M. Rameshan

Abstract: Navigating dynamic physical environments without obstructing or damaging human assets is of quintessential importance for social robots. In this work, we solve autonomous drone navigation's sub-problem of predicting out-of-domain human and agent trajectories using a deep generative model. Our method: General-PECNet or G-PECNet observes an improvement of 9.5% on the Final Displacement Error (FDE) on 2020's benchmark: PECNet through a combination of architectural improvements inspired by periodic activation functions and synthetic trajectory (data) augmentations using Hidden Markov Models (HMMs) and Reinforcement Learning (RL). Additionally, we propose a simple geometry-inspired metric for trajectory non-linearity and outlier detection, helpful for the task. Code available at https://github.com/Aryan-Garg/PECNet-Pedestrian-Trajectory-Prediction.git

Submitted 31 March, 2024; v1 submitted 15 October, 2022; originally announced October 2022.

Comments: Notable ICLR Tiny Paper 2024

arXiv:2106.11558 [pdf, ps, other]

Learning-Based Practical Light Field Image Compression Using A Disparity-Aware Model

Authors: Mohana Singh, Renu M. Rameshan

Abstract: Light field technology has increasingly attracted the attention of the research community with its many possible applications. The lenslet array in commercial plenoptic cameras helps capture both the spatial and angular information of light rays in a single exposure. While the resulting high dimensionality of light field data enables its superior capabilities, it also impedes its extensive adoption. Hence, there is a compelling need for efficient compression of light field images. Existing solutions are commonly composed of several separate modules, some of which may not have been designed for the specific structure and quality of light field data. This increases the complexity of the codec and results in impractical decoding runtimes. We propose a new learning-based, disparity-aided model for compression of 4D light field images capable of parallel decoding. The model is end-to-end trainable, eliminating the need for hand-tuning separate modules and allowing joint learning of rate and distortion. The disparity-aided approach ensures the structural integrity of the reconstructed light fields. Comparisons with the state of the art show encouraging performance in terms of PSNR and MS-SSIM metrics. Also, there is a notable gain in the encoding and decoding runtimes. Source code is available at https://moha23.github.io/LF-DAAE.

Submitted 23 June, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

Comments: accepted to Picture Coding Symposium 2021, corrected typo in link to source code

arXiv:2105.08865 [pdf, other]

Learning optimally separated class-specific subspace representations using convolutional autoencoder

Authors: Krishan Sharma, Shikha Gupta, Renu Rameshan

Abstract: In this work, we propose a novel convolutional autoencoder based architecture to generate subspace specific feature representations that are best suited for the classification task. The class-specific data is assumed to lie in low dimensional linear subspaces, which could be noisy and not well separated, i.e., subspace distance (principal angle) between two classes is very low. The proposed network uses a novel class-specific self expressiveness (CSSE) layer sandwiched between encoder and decoder networks to generate class-wise subspace representations which are well separated. The CSSE layer along with the encoder/decoder are trained in such a way that data still lies in subspaces in the feature space with minimum principal angle much higher than that of the input space. To demonstrate the effectiveness of the proposed approach, several experiments have been carried out on state-of-the-art machine learning datasets and a significant improvement in classification performance is observed over existing subspace based transformation learning methods.

Submitted 18 May, 2021; originally announced May 2021.

arXiv:2102.03512 [pdf, other]

MOTS R-CNN: Cosine-margin-triplet loss for multi-object tracking

Authors: Amit Satish Unde, Renu M. Rameshan

Abstract: One of the central tasks of multi-object tracking involves learning a distance metric that is consistent with the semantic similarities of objects. The design of an appropriate loss function that encourages discriminative feature learning is among the most crucial challenges in deep neural network-based metric learning. Despite significant progress, the slow convergence and poor local optima of the existing contrastive and triplet loss based deep metric learning methods necessitate a better solution. In this paper, we propose cosine-margin-contrastive (CMC) and cosine-margin-triplet (CMT) loss by reformulating both contrastive and triplet loss functions from the perspective of cosine distance. The proposed reformulation as a cosine loss is achieved by feature normalization which distributes the learned features on a hypersphere. We then propose the MOTS R-CNN framework for joint multi-object tracking and segmentation, particularly targeted at improving the tracking performance. Specifically, the tracking problem is addressed through deep metric learning based on the proposed loss functions. We propose scale-invariant tracking using a multi-layer feature aggregation scheme to make the model robust against object scale variations and occlusions. The MOTS R-CNN achieves the state-of-the-art tracking performance on the KITTI MOTS dataset. We show that the MOTS R-CNN reduces the identity switching by $62\%$ and $61\%$ on cars and pedestrians, respectively, in comparison to Track R-CNN.

Submitted 6 February, 2021; originally announced February 2021.

Comments: 10 pages, 2 figures

arXiv:1201.1438 [pdf, ps, other]

doi 10.1103/PhysRevA.85.032506

Towards the production of ultracold ground-state RbCs molecules: Feshbach resonances, weakly bound states, and coupled-channel model

Authors: Tetsu Takekoshi, Markus Debatin, Raffael Rameshan, Francesca Ferlaino, Rudolf Grimm, Hanns-Christoph Nägerl, C. Ruth Le Sueur, Jeremy M. Hutson, Paul S. Julienne, Svetlana Kotochigova, Eberhard Tiemann

Abstract: We have studied interspecies scattering in an ultracold mixture of $^{87}$Rb and $^{133}$Cs atoms, both in their lowest-energy spin states. The three-body loss signatures of 30 incoming s- and p-wave magnetic Feshbach resonances over the range 0 to 667 G have been catalogued. Magnetic field modulation spectroscopy was used to observe molecular states bound by up to 2.5 MHz$\times h$. Magnetic moment spectroscopy along the magneto-association pathway from 197 to 182 G gives results consistent with the observed and calculated dependence of the binding energy on magnetic field strength. We have created RbCs Feshbach molecules using two of the resonances. We have set up a coupled-channel model of the interaction and have used direct least-squares fitting to refine its parameters to fit the experimental results from the Feshbach molecules, in addition to the Feshbach resonance positions and the spectroscopic results for deeply bound levels. The final model gives a good description of all the experimental results and predicts a large resonance near 790 G, which may be useful for tuning the interspecies scattering properties. Quantum numbers and vibrational wavefunctions from the model can also be used to choose optimal initial states of Feshbach molecules for their transfer to the rovibronic ground state using stimulated Raman adiabatic passage (STIRAP).

Submitted 7 March, 2012; v1 submitted 6 January, 2012; originally announced January 2012.

Comments: 16 pages, 11 figures, submitted to PRA

Journal ref: Phys. Rev. A 85, 032506 (2012)

arXiv:1106.0129 [pdf, ps, other]

doi 10.1039/C1CP21769K

Molecular spectroscopy for ground-state transfer of ultracold RbCs molecules

Authors: Markus Debatin, Tetsu Takekoshi, Raffael Rameshan, Lukas Reichsöllner, Francesca Ferlaino, Rudolf Grimm, Romain Vexiau, Nadia Bouloufa, Olivier Dulieu, Hanns-Christoph Naegerl

Abstract: We perform one- and two-photon high resolution spectroscopy on ultracold samples of RbCs Feshbach molecules with the aim to identify a suitable route for efficient ground-state transfer in the quantum-gas regime to produce quantum gases of dipolar RbCs ground-state molecules. One-photon loss spectroscopy allows us to probe deeply bound rovibrational levels of the mixed excited $(A^1\Sigma^+ - b^3\Pi_0)\,0^+$ molecular states. Two-photon dark state spectroscopy connects the initial Feshbach state to the rovibronic ground state. We determine the binding energy of the lowest rovibrational level $|v''=0, J''=0\rangle$ of the $X^1\Sigma^+$ ground state to be $D_0^X = 3811.5755(16)$ cm$^{-1}$, a 300-fold improvement in accuracy with respect to previous data. We are now in the position to perform stimulated two-photon Raman transfer to the rovibronic ground state.

Submitted 1 June, 2011; originally announced June 2011.

Comments: Submitted to PCCP themed issue: Physics and Chemistry of Cold Molecules

arXiv:1101.1409 [pdf, other]

doi 10.1140/epjd/e2011-20015-6

Production of a dual-species Bose-Einstein condensate of Rb and Cs atoms

Authors: A. D. Lercher, T. Takekoshi, M. Debatin, B. Schuster, R. Rameshan, F. Ferlaino, R. Grimm, H. -C. Nägerl

Abstract: We report the simultaneous production of Bose-Einstein condensates (BECs) of $^{87}$Rb and $^{133}$Cs atoms in separate optical traps. The two samples are mixed during laser cooling and loading but are separated by $400\,\mu$m for the final stage of evaporative cooling. This is done to avoid considerable interspecies three-body recombination, which causes heating and evaporative loss. We characterize the BEC production process, discuss limitations, and outline the use of the dual-species BEC in future experiments to produce rovibronic ground state molecules, including a scheme facilitated by the superfluid-to-Mott-insulator (SF-MI) phase transition.

Submitted 7 January, 2011; originally announced January 2011.

Journal ref: Eur. Phys. J. D Topical issue: Cold Quantum Matter - Achievements and Prospects (2011)

Showing 1–10 of 10 results for author: Rameshan, R