
Orthogonal Transform based Generative Adversarial Network for Image Dehazing

Authors: Ahlad Kumar, Mantra Sanathra, Manish Khare, Vijeta Khare

Abstract: Image dehazing has become one of the crucial preprocessing steps for any computer vision task. Most of the dehazing methods try to estimate the transmission map along with the atmospheric light to get the dehazed image in the image domain. In this paper, we propose a novel end-to-end architecture that directly estimates dehazed image in Krawtchouk transform domain. For this a customized Krawtchouk Convolution Layer (KCL) in the architecture is added. KCL is constructed using Krawtchouk basis functions which converts the image from the spatial domain to the Krawtchouk transform domain. Another convolution layer is added at the end of the architecture named as Inverse Krawtchouk Convolution Layer (IKCL) which converts the image back to the spatial domain from the transform domain. It has been observed that the haze is mainly present in lower frequencies of hazy images, wherein the Krawtchouk transform helps to analyze the high and low frequencies of the images separately. We have divided our architecture into two branches, the upper branch deals with the higher frequencies while the lower branch deals with the lower frequencies of the image. The lower branch is made deeper in terms of the layers as compared to the upper branch to address the haze present in the lower frequencies. Using the proposed Orthogonal Transform based Generative Adversarial Network (OTGAN) architecture for image dehazing, we were able to achieve competitive results when compared to the present state-of-the-art methods.

Submitted 3 June, 2022; originally announced June 2022.

Comments: 12 pages, 14 figures

arXiv:2104.12385

Syft 0.5: A Platform for Universally Deployable Structured Transparency

Authors: Adam James Hall, Madhava Jay, Tudor Cebere, Bogdan Cebere, Koen Lennart van der Veen, George Muraru, Tongye Xu, Patrick Cason, William Abramson, Ayoub Benaissa, Chinmay Shah, Alan Aboudib, Théo Ryffel, Kritika Prakash, Tom Titcombe, Varun Kumar Khare, Maddie Shang, Ionesio Junior, Animesh Gupta, Jason Paumier, Nahua Kang, Vova Manannikov, Andrew Trask

Abstract: We present Syft 0.5, a general-purpose framework that combines a core group of privacy-enhancing technologies that facilitate a universal set of structured transparency systems. This framework is demonstrated through the design and implementation of a novel privacy-preserving inference information flow where we pass homomorphically encrypted activation signals through a split neural network for inference. We show that splitting the model further up the computation chain significantly reduces the computation time of inference and the payload size of activation signals at the cost of model secrecy. We evaluate our proposed flow with respect to its provision of the core structural transparency principles.

Submitted 27 April, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

Comments: ICLR 2021 Workshop on Distributed and Private Machine Learning (DPML 2021)

arXiv:1911.01562

DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning

Authors: Bharathan Balaji, Sunil Mallya, Sahika Genc, Saurabh Gupta, Leo Dirac, Vineet Khare, Gourav Roy, Tao Sun, Yunzhe Tao, Brian Townsend, Eddie Calleja, Sunil Muralidhara, Dhanasekar Karuppasamy

Abstract: DeepRacer is a platform for end-to-end experimentation with RL and can be used to systematically investigate the key challenges in developing intelligent control systems. Using the platform, we demonstrate how a 1/18th scale car can learn to drive autonomously using RL with a monocular camera. It is trained in simulation with no additional tuning in physical world and demonstrates: 1) formulation and solution of a robust reinforcement learning algorithm, 2) narrowing the reality gap through joint perception and dynamics, 3) distributed on-demand compute architecture for training optimal policies, and 4) a robust evaluation method to identify when to stop training. It is the first successful large-scale deployment of deep reinforcement learning on a robotic control agent that uses only raw camera images as observations and a model-free learning method to perform robust path planning. We open source our code and video demo on GitHub: https://git.io/fjxoJ.

Submitted 4 November, 2019; originally announced November 2019.

arXiv:1910.11959

FineText: Text Classification via Attention-based Language Model Fine-tuning

Authors: Yunzhe Tao, Saurabh Gupta, Satyapriya Krishna, Xiong Zhou, Orchid Majumder, Vineet Khare

Abstract: Training deep neural networks from scratch on natural language processing (NLP) tasks requires significant amount of manually labeled text corpus and substantial time to converge, which usually cannot be satisfied by the customers. In this paper, we aim to develop an effective transfer learning algorithm by fine-tuning a pre-trained language model. The goal is to provide expressive and convenient-to-use feature extractors for downstream NLP tasks, and achieve improvement in terms of accuracy, data efficiency, and generalization to new domains. Therefore, we propose an attention-based fine-tuning algorithm that automatically selects relevant contextualized features from the pre-trained language model and uses those features on downstream text classification tasks. We test our methods on six widely-used benchmarking datasets, and achieve new state-of-the-art performance on all of them. Moreover, we then introduce an alternative multi-task learning approach, which is an end-to-end algorithm given the pre-trained model. By doing multi-task learning, one can largely reduce the total training time by trading off some classification accuracy.

Submitted 25 October, 2019; originally announced October 2019.

arXiv:1906.03038

A Generative Framework for Zero-Shot Learning with Adversarial Domain Adaptation

Authors: Varun Khare, Divyat Mahajan, Homanga Bharadhwaj, Vinay Verma, Piyush Rai

Abstract: We present a domain adaptation based generative framework for zero-shot learning. Our framework addresses the problem of domain shift between the seen and unseen class distributions in zero-shot learning and minimizes the shift by developing a generative model trained via adversarial domain adaptation. Our approach is based on end-to-end learning of the class distributions of seen classes and unseen classes. To enable the model to learn the class distributions of unseen classes, we parameterize these class distributions in terms of the class attribute information (which is available for both seen and unseen classes). This provides a very simple way to learn the class distribution of any unseen class, given only its class attribute information, and no labeled training data. Training this model with adversarial domain adaptation further provides robustness against the distribution mismatch between the data from seen and unseen classes. Our approach also provides a novel way for training neural net based classifiers to overcome the hubness problem in zero-shot learning. Through a comprehensive set of experiments, we show that our model yields superior accuracies as compared to various state-of-the-art zero shot learning models, on a variety of benchmark datasets. Code for the experiments is available at github.com/vkkhare/ZSL-ADA

Submitted 22 February, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

Comments: Proceedings of Winter Conference on Applications of Computer Vision (WACV) 2020

arXiv:cond-mat/0304320

doi 10.1103/PhysRevE.67.056110

Breaking of general rotational symmetries by multi-dimensional classical ratchets

Authors: A. W. Ghosh, S. V. Khare

Abstract: We demonstrate that a particle driven by a set of spatially uncorrelated, independent colored noise forces in a bounded, multidimensional potential exhibits rotations that are independent of the initial conditions. We calculate the particle currents in terms of the noise statistics and the potential asymmetries by deriving an n-dimensional Fokker-Planck equation in the small correlation time limit. We analyze a variety of flow patterns for various potential structures, generating various combinations of laminar and rotational flows.

Submitted 14 April, 2003; originally announced April 2003.

Comments: Accepted, Physical Review E

arXiv:cond-mat/9911113

doi 10.1103/PhysRevLett.84.5243

Rotation in an asymmetric multidimensional periodic potential due to colored noise

Authors: A. W. Ghosh, S. V. Khare

Abstract: We analyze the motion of an overdamped classical particle in a multidimensional periodic potential, driven by a weak external noise. We demonstrate that in steady-state, the presence of temporal correlations in the noise and spatial asymmetry within a period of the potential could lead to particle rotation. The rotation is a direct consequence of a change in sign of the noise-induced drift motion in each dimension. By choosing different potentials, we can generate a variety of flow patterns from laminar drifts to rotations.

Submitted 8 November, 1999; originally announced November 1999.

Comments: 11 pages and 3 figures

Showing 1–7 of 7 results for author: Khare, V