-
Lossless Image Compression Using Multi-level Dictionaries: Binary Images
Authors:
Samar Agnihotri,
Renu Rameshan,
Ritwik Ghosal
Abstract:
Lossless image compression is required in various applications to reduce storage or transmission costs of images, while requiring the reconstructed images to have zero information loss compared to the original. Existing lossless image compression methods either have simple design but poor compression performance, or complex design, better performance, but with no performance guarantees. In our end…
▽ More
Lossless image compression is required in various applications to reduce storage or transmission costs of images, while requiring the reconstructed images to have zero information loss compared to the original. Existing lossless image compression methods either have simple design but poor compression performance, or complex design, better performance, but with no performance guarantees. In our endeavor to develop a lossless image compression method with low complexity and guaranteed performance, we argue that compressibility of a color image is essentially derived from the patterns in its spatial structure, intensity variations, and color variations. Thus, we divide the overall design of a lossless image compression scheme into three parts that exploit corresponding redundancies. We further argue that the binarized version of an image captures its fundamental spatial structure and in this work, we propose a scheme for lossless compression of binary images.
The proposed scheme first learns dictionaries of $16\times16$, $8\times8$, $4\times4$, and $2\times 2$ square pixel patterns from various datasets of binary images. It then uses these dictionaries to encode binary images. These dictionaries have various interesting properties that are further exploited to construct an efficient scheme. Our preliminary results show that the proposed scheme consistently outperforms existing conventional and learning based lossless compression approaches, and provides, on average, as much as $1.5\times$ better performance than a common general purpose lossless compression scheme (WebP), more than $3\times$ better performance than a state of the art learning based scheme, and better performance than a specialized scheme for binary image compression (JBIG2).
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Simulation of a Vision Correction Display System
Authors:
Vidya Sunil,
Renu M Rameshan
Abstract:
Eyes serve as our primary sensory organs, responsible for processing up to 80\% of our sensory input. However, common visual aberrations like myopia and hyperopia affect a significant portion of the global population. This paper focuses on simulating a Vision Correction Display (VCD) to enhance the visual experience of individuals with various visual impairments. Utilising Blender, we digitally mo…
▽ More
Eyes serve as our primary sensory organs, responsible for processing up to 80\% of our sensory input. However, common visual aberrations like myopia and hyperopia affect a significant portion of the global population. This paper focuses on simulating a Vision Correction Display (VCD) to enhance the visual experience of individuals with various visual impairments. Utilising Blender, we digitally model the functionality of a VCD in correcting refractive errors such as myopia and hyperopia. With these simulations we can see potential improvements in visual acuity and comfort. These simulations provide valuable insights for the design and development of future VCD technologies, ultimately advancing accessibility and usability for individuals with visual challenges.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Analysis of Learned Features and Framework for Potato Disease Detection
Authors:
Shikha Gupta,
Soma Chakraborty,
Renu Rameshan
Abstract:
For applications like plant disease detection, usually, a model is trained on publicly available data and tested on field data. This means that the test data distribution is not the same as the training data distribution, which affects the classifier performance adversely. We handle this dataset shift by ensuring that the features are learned from disease spots in the leaf or healthy regions, as a…
▽ More
For applications like plant disease detection, usually, a model is trained on publicly available data and tested on field data. This means that the test data distribution is not the same as the training data distribution, which affects the classifier performance adversely. We handle this dataset shift by ensuring that the features are learned from disease spots in the leaf or healthy regions, as applicable. This is achieved using a faster Region-based convolutional neural network (RCNN) as one of the solutions and an attention-based network as the other. The average classification accuracies of these classifiers are approximately 95% while evaluated on the test set corresponding to their training dataset. These classifiers also performed equivalently, with an average score of 84% on a dataset not seen during the training phase.
△ Less
Submitted 29 August, 2023;
originally announced October 2023.
-
G-PECNet: Towards a Generalizable Pedestrian Trajectory Prediction System
Authors:
Aryan Garg,
Renu M. Rameshan
Abstract:
Navigating dynamic physical environments without obstructing or damaging human assets is of quintessential importance for social robots. In this work, we solve autonomous drone navigation's sub-problem of predicting out-of-domain human and agent trajectories using a deep generative model. Our method: General-PECNet or G-PECNet observes an improvement of 9.5\% on the Final Displacement Error (FDE)…
▽ More
Navigating dynamic physical environments without obstructing or damaging human assets is of quintessential importance for social robots. In this work, we solve autonomous drone navigation's sub-problem of predicting out-of-domain human and agent trajectories using a deep generative model. Our method: General-PECNet or G-PECNet observes an improvement of 9.5\% on the Final Displacement Error (FDE) on 2020's benchmark: PECNet through a combination of architectural improvements inspired by periodic activation functions and synthetic trajectory (data) augmentations using Hidden Markov Models (HMMs) and Reinforcement Learning (RL). Additionally, we propose a simple geometry-inspired metric for trajectory non-linearity and outlier detection, helpful for the task. Code available at https://github.com/Aryan-Garg/PECNet-Pedestrian-Trajectory-Prediction.git
△ Less
Submitted 31 March, 2024; v1 submitted 15 October, 2022;
originally announced October 2022.
-
Learning-Based Practical Light Field Image Compression Using A Disparity-Aware Model
Authors:
Mohana Singh,
Renu M. Rameshan
Abstract:
Light field technology has increasingly attracted the attention of the research community with its many possible applications. The lenslet array in commercial plenoptic cameras helps capture both the spatial and angular information of light rays in a single exposure. While the resulting high dimensionality of light field data enables its superior capabilities, it also impedes its extensive adoptio…
▽ More
Light field technology has increasingly attracted the attention of the research community with its many possible applications. The lenslet array in commercial plenoptic cameras helps capture both the spatial and angular information of light rays in a single exposure. While the resulting high dimensionality of light field data enables its superior capabilities, it also impedes its extensive adoption. Hence, there is a compelling need for efficient compression of light field images. Existing solutions are commonly composed of several separate modules, some of which may not have been designed for the specific structure and quality of light field data. This increases the complexity of the codec and results in impractical decoding runtimes. We propose a new learning-based, disparity-aided model for compression of 4D light field images capable of parallel decoding. The model is end-to-end trainable, eliminating the need for hand-tuning separate modules and allowing joint learning of rate and distortion. The disparity-aided approach ensures the structural integrity of the reconstructed light fields. Comparisons with the state of the art show encouraging performance in terms of PSNR and MS-SSIM metrics. Also, there is a notable gain in the encoding and decoding runtimes. Source code is available at https://moha23.github.io/LF-DAAE.
△ Less
Submitted 23 June, 2021; v1 submitted 22 June, 2021;
originally announced June 2021.
-
Learning optimally separated class-specific subspace representations using convolutional autoencoder
Authors:
Krishan Sharma,
Shikha Gupta,
Renu Rameshan
Abstract:
In this work, we propose a novel convolutional autoencoder based architecture to generate subspace specific feature representations that are best suited for classification task. The class-specific data is assumed to lie in low dimensional linear subspaces, which could be noisy and not well separated, i.e., subspace distance (principal angle) between two classes is very low. The proposed network us…
▽ More
In this work, we propose a novel convolutional autoencoder based architecture to generate subspace specific feature representations that are best suited for classification task. The class-specific data is assumed to lie in low dimensional linear subspaces, which could be noisy and not well separated, i.e., subspace distance (principal angle) between two classes is very low. The proposed network uses a novel class-specific self expressiveness (CSSE) layer sandwiched between encoder and decoder networks to generate class-wise subspace representations which are well separated. The CSSE layer along with encoder/ decoder are trained in such a way that data still lies in subspaces in the feature space with minimum principal angle much higher than that of the input space. To demonstrate the effectiveness of the proposed approach, several experiments have been carried out on state-of-the-art machine learning datasets and a significant improvement in classification performance is observed over existing subspace based transformation learning methods.
△ Less
Submitted 18 May, 2021;
originally announced May 2021.
-
MOTS R-CNN: Cosine-margin-triplet loss for multi-object tracking
Authors:
Amit Satish Unde,
Renu M. Rameshan
Abstract:
One of the central tasks of multi-object tracking involves learning a distance metric that is consistent with the semantic similarities of objects. The design of an appropriate loss function that encourages discriminative feature learning is among the most crucial challenges in deep neural network-based metric learning. Despite significant progress, slow convergence and a poor local optimum of the…
▽ More
One of the central tasks of multi-object tracking involves learning a distance metric that is consistent with the semantic similarities of objects. The design of an appropriate loss function that encourages discriminative feature learning is among the most crucial challenges in deep neural network-based metric learning. Despite significant progress, slow convergence and a poor local optimum of the existing contrastive and triplet loss based deep metric learning methods necessitates a better solution. In this paper, we propose cosine-margin-contrastive (CMC) and cosine-margin-triplet (CMT) loss by reformulating both contrastive and triplet loss functions from the perspective of cosine distance. The proposed reformulation as a cosine loss is achieved by feature normalization which distributes the learned features on a hypersphere. We then propose the MOTS R-CNN framework for joint multi-object tracking and segmentation, particularly targeted at improving the tracking performance. Specifically, the tracking problem is addressed through deep metric learning based on the proposed loss functions. We propose a scale-invariant tracking by using a multi-layer feature aggregation scheme to make the model robust against object scale variations and occlusions. The MOTS R-CNN achieves the state-of-the-art tracking performance on the KITTI MOTS dataset. We show that the MOTS R-CNN reduces the identity switching by $62\%$ and $61\%$ on cars and pedestrians, respectively in comparison to Track R-CNN.
△ Less
Submitted 6 February, 2021;
originally announced February 2021.
-
Towards the production of ultracold ground-state RbCs molecules: Feshbach resonances, weakly bound states, and coupled-channel model
Authors:
Tetsu Takekoshi,
Markus Debatin,
Raffael Rameshan,
Francesca Ferlaino,
Rudolf Grimm,
Hanns-Christoph Nägerl,
C. Ruth Le Sueur,
Jeremy M. Hutson,
Paul S. Julienne,
Svetlana Kotochigova,
Eberhard Tiemann
Abstract:
We have studied interspecies scattering in an ultracold mixture of $^{87}$Rb and $^{133}$Cs atoms, both in their lowest-energy spin states. The three-body loss signatures of 30 incoming s- and p-wave magnetic Feshbach resonances over the range 0 to 667 G have been catalogued. Magnetic field modulation spectroscopy was used to observe molecular states bound by up to 2.5 MHz$\times h$. Magnetic mome…
▽ More
We have studied interspecies scattering in an ultracold mixture of $^{87}$Rb and $^{133}$Cs atoms, both in their lowest-energy spin states. The three-body loss signatures of 30 incoming s- and p-wave magnetic Feshbach resonances over the range 0 to 667 G have been catalogued. Magnetic field modulation spectroscopy was used to observe molecular states bound by up to 2.5 MHz$\times h$. Magnetic moment spectroscopy along the magneto-association pathway from 197 to 182 G gives results consistent with the observed and calculated dependence of the binding energy on magnetic field strength. We have created RbCs Feshbach molecules using two of the resonances. We have set up a coupled-channel model of the interaction and have used direct least-squares fitting to refine its parameters to fit the experimental results from the Feshbach molecules, in addition to the Feshbach resonance positions and the spectroscopic results for deeply bound levels. The final model gives a good description of all the experimental results and predicts a large resonance near 790 G, which may be useful for tuning the interspecies scattering properties. Quantum numbers and vibrational wavefunctions from the model can also be used to choose optimal initial states of Feshbach molecules for their transfer to the rovibronic ground state using stimulated Raman adiabatic passage (STIRAP).
△ Less
Submitted 7 March, 2012; v1 submitted 6 January, 2012;
originally announced January 2012.
-
Molecular spectroscopy for ground-state transfer of ultracold RbCs molecules
Authors:
Markus Debatin,
Tetsu Takekoshi,
Raffael Rameshan,
Lukas Reichsöllner,
Francesca Ferlaino,
Rudolf Grimm,
Romain Vexiau,
Nadia Bouloufa,
Olivier Dulieu,
Hanns-Christoph Naegerl
Abstract:
We perform one- and two-photon high resolution spectroscopy on ultracold samples of RbCs Feshbach molecules with the aim to identify a suitable route for efficient ground-state transfer in the quantum-gas regime to produce quantum gases of dipolar RbCs ground-state molecules. One-photon loss spectroscopy allows us to probe deeply bound rovibrational levels of the mixed excited (A1Σ+ - b3Π0) 0+ mol…
▽ More
We perform one- and two-photon high resolution spectroscopy on ultracold samples of RbCs Feshbach molecules with the aim to identify a suitable route for efficient ground-state transfer in the quantum-gas regime to produce quantum gases of dipolar RbCs ground-state molecules. One-photon loss spectroscopy allows us to probe deeply bound rovibrational levels of the mixed excited (A1Σ+ - b3Π0) 0+ molecular states. Two-photon dark state spectroscopy connects the initial Feshbach state to the rovibronic ground state. We determine the binding energy of the lowest rovibrational level |v"=0,J"=0> of the X1Σ+ ground state to be DX 0 = 3811.5755(16) 1/cm, a 300-fold improvement in accuracy with respect to previous data. We are now in the position to perform stimulated two-photon Raman transfer to the rovibronic ground state.
△ Less
Submitted 1 June, 2011;
originally announced June 2011.
-
Production of a dual-species Bose-Einstein condensate of Rb and Cs atoms
Authors:
A. D. Lercher,
T. Takekoshi,
M. Debatin,
B. Schuster,
R. Rameshan,
F. Ferlaino,
R. Grimm,
H. -C. Nägerl
Abstract:
We report the simultaneous production of Bose-Einstein condensates (BECs) of $^{87}$Rb and $^{133}$Cs atoms in separate optical traps. The two samples are mixed during laser cooling and loading but are separated by $400 μ$m for the final stage of evaporative cooling. This is done to avoid considerable interspecies three-body recombination, which causes heating and evaporative loss. We characterize…
▽ More
We report the simultaneous production of Bose-Einstein condensates (BECs) of $^{87}$Rb and $^{133}$Cs atoms in separate optical traps. The two samples are mixed during laser cooling and loading but are separated by $400 μ$m for the final stage of evaporative cooling. This is done to avoid considerable interspecies three-body recombination, which causes heating and evaporative loss. We characterize the BEC production process, discuss limitations, and outline the use of the dual-species BEC in future experiments to produce rovibronic ground state molecules, including a scheme facilitated by the superfluid-to-Mott-insulator (SF-MI) phase transition.
△ Less
Submitted 7 January, 2011;
originally announced January 2011.