Search | arXiv e-print repository

arXiv:2407.00234 [pdf, other]

Medium-scale thermospheric gravity waves in the high-resolution Whole Atmosphere Model: Seasonal, local time, and longitudinal variations

Authors: Garima Malhotra, Timothy Fuller-Rowell, Tzu-Wei Fang, Valery Yudin, Svetlana Karol, Erich Becker, Adam Marshall Kubaryk

Abstract: This paper presents a study of the global medium-scale (scales<620 km) gravity wave (GW) activity (in terms of zonal wind variance) and its seasonal, local time and longitudinal variations by employing the enhanced-resolution (~50 km) Whole Atmosphere Model (WAMT254) and space-based observations for geomagnetically quiet conditions. It is found that the GW hotspots produced by WAMT254 in the troposphere and stratosphere agree well with previously well-studied orographic and non-orographic sources. In the ionosphere-thermosphere (IT) region, GWs spread out forming latitudinal band-like hotspots. During solstices, a primary maximum in GW activity is observed in WAMT254 and GOCE over winter mid-high latitudes, likely associated with higher-order waves with primary sources in polar night jet, fronts and polar vortex. During all the seasons, the enhancement of GWs around the geomagnetic poles as observed by GOCE (at ~250 km) is well captured by simulations. WAMT254 GWs in the IT region also show dependence on local time due to their interaction with migrating tides leading to diurnal and semidiurnal variations. The GWs are more likely to propagate up from the MLT region during westward/weakly-eastward phase of thermospheric tides, signifying the dominance of eastward GW momentum flux in the MLT. Additionally, as a novel finding, a wavenumber-4 signature in GW activity is predicted by WAMT254 between 6-12 LT in the tropics at ~250 km, which propagates eastward with local time. This behavior is likely associated with the modulation of GWs by wave-4 signal of non-migrating tides in the lower thermospheric zonal winds.

Submitted 28 June, 2024; originally announced July 2024.

arXiv:2404.05290 [pdf, other]

MindSet: Vision. A toolbox for testing DNNs on key psychological experiments

Authors: Valerio Biscione, Dong Yin, Gaurav Malhotra, Marin Dujmovic, Milton L. Montero, Guillermo Puebla, Federico Adolfi, Rachel F. Heaton, John E. Hummel, Benjamin D. Evans, Karim Habashy, Jeffrey S. Bowers

Abstract: Multiple benchmarks have been developed to assess the alignment between deep neural networks (DNNs) and human vision. In almost all cases these benchmarks are observational in the sense they are composed of behavioural and brain responses to naturalistic images that have not been manipulated to test hypotheses regarding how DNNs or humans perceive and identify objects. Here we introduce the toolbox MindSet: Vision, consisting of a collection of image datasets and related scripts designed to test DNNs on 30 psychological findings. In all experimental conditions, the stimuli are systematically manipulated to test specific hypotheses regarding human visual perception and object recognition. In addition to providing pre-generated datasets of images, we provide code to regenerate these datasets, offering many configurable parameters which greatly extend the dataset versatility for different research contexts, and code to facilitate the testing of DNNs on these image datasets using three different methods (similarity judgments, out-of-distribution classification, and decoder method), accessible at https://github.com/MindSetVision/mindset-vision. We test ResNet-152 on each of these methods as an example of how the toolbox can be used.

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2204.02283 [pdf, other]

Lost in Latent Space: Disentangled Models and the Challenge of Combinatorial Generalisation

Authors: Milton L. Montero, Jeffrey S. Bowers, Rui Ponte Costa, Casimir J. H. Ludwig, Gaurav Malhotra

Abstract: Recent research has shown that generative models with highly disentangled representations fail to generalise to unseen combinations of generative factor values. These findings contradict earlier research which showed improved performance in out-of-training distribution settings when compared to entangled representations. Additionally, it is not clear if the reported failures are due to (a) encoders failing to map novel combinations to the proper regions of the latent space or (b) novel combinations being mapped correctly but the decoder/downstream process being unable to render the correct output for the unseen combinations. We investigate these alternatives by testing several models on a range of datasets and training settings. We find that (i) when models fail, their encoders also fail to map unseen combinations to correct regions of the latent space and (ii) when models succeed, it is either because the test conditions do not exclude enough examples, or because excluded generative factors determine independent parts of the output image. Based on these results, we argue that to generalise properly, models not only need to capture factors of variation, but also understand how to invert the generative process that was used to generate the data.

Submitted 14 June, 2024; v1 submitted 5 April, 2022; originally announced April 2022.

Comments: 10 pages and 7 figures in main text (not including references). 27 pages and 31 figures in appendix. Updated to match the camera-ready version

ACM Class: I.2.6; I.2.10; I.4.5; I.4.10; I.5.1; I.5.3

Journal ref: Adv.Neur.Info.Proc.Sys. 35 (2022) 10136-1049

arXiv:2112.06049 [pdf, other]

Auto-Tag: Tagging-Data-By-Example in Data Lakes

Authors: Yeye He, Jie Song, Yue Wang, Surajit Chaudhuri, Vishal Anil, Blake Lassiter, Yaron Goland, Gaurav Malhotra

Abstract: As data lakes become increasingly popular in large enterprises today, there is a growing need to tag or classify data assets (e.g., files and databases) in data lakes with additional metadata (e.g., semantic column-types), as the inferred metadata can enable a range of downstream applications like data governance (e.g., GDPR compliance), and dataset search. Given the sheer size of today's enterprise data lakes with petabytes of data and millions of data assets, it is imperative that data assets can be "auto-tagged", using lightweight inference algorithms and minimal user input. In this work, we develop Auto-Tag, a corpus-driven approach that automates data-tagging of custom data types in enterprise data lakes. Using Auto-Tag, users only need to provide one example column to demonstrate the desired data-type to tag. Leveraging an index structure built offline using a lightweight scan of the data lake, which is analogous to pre-training in machine learning, Auto-Tag can infer suitable data patterns to best "describe" the underlying "domain" of the given column at an interactive speed, which can then be used to tag additional data of the same "type" in data lakes. The Auto-Tag approach can adapt to custom data-types, and is shown to be both accurate and efficient. Part of Auto-Tag ships as a "custom-classification" feature in a cloud-based data governance and catalog solution, Azure Purview.

Submitted 11 December, 2021; originally announced December 2021.

arXiv:2111.06647 [pdf, other]

Speaker and Time-aware Joint Contextual Learning for Dialogue-act Classification in Counselling Conversations

Authors: Ganeshan Malhotra, Abdul Waheed, Aseem Srivastava, Md Shad Akhtar, Tanmoy Chakraborty

Abstract: The onset of the COVID-19 pandemic has put people's mental health at risk. Social counselling has gained remarkable significance in this environment. Unlike general goal-oriented dialogues, a conversation between a patient and a therapist is considerably implicit, though the objective of the conversation is quite apparent. In such a case, understanding the intent of the patient is imperative in providing effective counselling in therapy sessions, and the same applies to a dialogue system as well. In this work, we take a small but important step forward in the development of an automated dialogue system for mental-health counselling. We develop a novel dataset, named HOPE, to provide a platform for the dialogue-act classification in counselling conversations. We identify the requirements of such conversations and propose twelve domain-specific dialogue-act (DAC) labels. We collect 12.9K utterances from publicly-available counselling session videos on YouTube, extract their transcripts, clean, and annotate them with DAC labels. Further, we propose SPARTA, a transformer-based architecture with a novel speaker- and time-aware contextual learning for the dialogue-act classification. Our evaluation shows convincing performance over several baselines, achieving state-of-the-art on HOPE. We also supplement our experiments with extensive empirical and qualitative analyses of SPARTA.

Submitted 12 November, 2021; originally announced November 2021.

Comments: 9 pages; Accepted to WSDM 2022

arXiv:1912.10640 [pdf, ps, other]

Pricing of the Geometric Asian Options Under a Multifactor Stochastic Volatility Model

Authors: Gifty Malhotra, R. Srivastava, H. C. Taneja

Abstract: This paper focuses on the pricing of continuous geometric Asian options (GAOs) under a multifactor stochastic volatility model. The model considers fast and slow mean reverting factors of volatility, where the slow volatility factor is approximated by a quadratic arc. The asymptotic expansion of the price function is assumed, and the first order price approximation is derived using perturbation techniques for both floating and fixed strike GAOs. Much simplified pricing formulae for the GAOs are obtained in this multifactor stochastic volatility framework. The zeroth order term in the price approximation is the modified Black-Scholes price for the GAOs. This modified price is expressed in terms of the Black-Scholes price for the GAOs. The accuracy of the approximate option pricing formulae is established, and the model parameter is also estimated by capturing the volatility smiles.

Submitted 23 December, 2019; originally announced December 2019.

Comments: 29 pages, 2 figures

arXiv:1912.10237 [pdf, ps, other]

Comparative Study of Two Extensions of Heston Stochastic Volatility Model

Authors: Gifty Malhotra, R. Srivastava, H. C. Taneja

Abstract: In the option valuation literature, the shortcomings of one factor stochastic volatility models have traditionally been addressed by adding jumps to the stock price process. An alternate approach in the context of option pricing and calibration of implied volatility is the addition of a few other factors to the volatility process. This paper considers two extensions of the Heston stochastic volatility model: one adds jumps to the stock price process (a stochastic volatility jump diffusion model) and the other adds a stochastic volatility factor varying at a different time scale (a multiscale stochastic volatility model). An empirical analysis is carried out on the market data of options with different strike prices and maturities, to compare the pricing performance of these models and to capture their implied volatility fit. The unknown parameters of these models are calibrated using non-linear least squares optimization. It has been found that the multiscale stochastic volatility model performs better than the Heston stochastic volatility model and the stochastic volatility jump diffusion model for the data set under consideration.

Submitted 21 December, 2019; originally announced December 2019.

Comments: 15 pages, 3 pages

arXiv:1910.03085 [pdf, other]

Correlation of Auroral Dynamics and GNSS Scintillation with an Autoencoder

Authors: Kara Lamb, Garima Malhotra, Athanasios Vlontzos, Edward Wagstaff, Atılım Günes Baydin, Anahita Bhiwandiwalla, Yarin Gal, Alfredo Kalaitzis, Anthony Reina, Asti Bhatt

Abstract: High energy particles originating from solar activity travel along the Earth's magnetic field and interact with the atmosphere around the higher latitudes. These interactions often manifest as aurora in the form of visible light in the Earth's ionosphere. These interactions also result in irregularities in the electron density, which cause disruptions in the amplitude and phase of the radio signals from the Global Navigation Satellite Systems (GNSS), known as 'scintillation'. In this paper we use a multi-scale residual autoencoder (Res-AE) to show the correlation between specific dynamic structures of the aurora and the magnitude of the GNSS phase scintillations ($σ_φ$). Auroral images are encoded in a lower dimensional feature space using the Res-AE, which in turn are clustered with t-SNE and UMAP. Both methods produce similar clusters, and specific clusters demonstrate greater correlations with observed phase scintillations. Our results suggest that specific dynamic structures of auroras are highly correlated with GNSS phase scintillations.

Submitted 4 October, 2019; originally announced October 2019.

Comments: Four first authors contributed equally; Paper accepted in Machine Learning for the Physical Sciences workshop of NeurIPS 2019; Camera Ready Version to Follow

arXiv:1910.01570 [pdf, other]

Prediction of GNSS Phase Scintillations: A Machine Learning Approach

Authors: Kara Lamb, Garima Malhotra, Athanasios Vlontzos, Edward Wagstaff, Atılım Günes Baydin, Anahita Bhiwandiwalla, Yarin Gal, Alfredo Kalaitzis, Anthony Reina, Asti Bhatt

Abstract: A Global Navigation Satellite System (GNSS) uses a constellation of satellites around the earth for accurate navigation, timing, and positioning. Natural phenomena like space weather introduce irregularities in the Earth's ionosphere, disrupting the propagation of the radio signals that GNSS relies upon. Such disruptions affect both the amplitude and the phase of the propagated waves. No physics-based model currently exists to predict the time and location of these disruptions with sufficient accuracy and at relevant scales. In this paper, we focus on predicting the phase fluctuations of GNSS radio waves, known as phase scintillations. We propose a novel architecture and loss function to predict 1 hour in advance the magnitude of phase scintillations within a time window of plus-minus 5 minutes with state-of-the-art performance.

Submitted 3 October, 2019; originally announced October 2019.

Comments: First 4 authors contributed equally; Paper accepted in Machine Learning for the Physical Sciences workshop of NeurIPS 2019; Camera Ready Version to Follow

arXiv:1703.10825 [pdf, ps, other]

Quadratic approximation of slow factor of volatility in a Multi-factor Stochastic volatility Model

Authors: Gifty Malhotra, R. Srivastava, H. C. Taneja

Abstract: In the present work, we propose a new multifactor stochastic volatility model in which the slow factor of volatility is approximated by a parabolic arc. We restrict ourselves to the perturbation technique to obtain an approximate expression for European option prices. We introduce the notion of a modified Black-Scholes price. We obtain a simplified expression for the European option price, which is perturbed around the modified Black-Scholes price, and also express the modified price in terms of the Black-Scholes price.

Submitted 31 March, 2017; originally announced March 2017.

Comments: 11 pages, 1 figure

arXiv:1503.00900 [pdf]

Normalization based K means Clustering Algorithm

Authors: Deepali Virmani, Shweta Taneja, Geetika Malhotra

Abstract: K-means is an effective clustering technique used to separate similar data into groups based on initial centroids of clusters. In this paper, a normalization based K-means clustering algorithm (N-K means) is proposed. The proposed N-K means clustering algorithm applies normalization to the available data prior to clustering and calculates initial centroids based on weights. Experimental results show that the proposed N-K means clustering algorithm improves on the existing K-means clustering algorithm in terms of complexity and overall performance.

Submitted 3 March, 2015; originally announced March 2015.

Comments: 5 pages, 4 figures; in International Journal of Advanced Engineering Research and Science (IJAERS), Feb 2015

Showing 1–11 of 11 results for author: Malhotra, G