-
Medium-scale thermospheric gravity waves in the high-resolution Whole Atmosphere Model: Seasonal, local time, and longitudinal variations
Authors:
Garima Malhotra,
Timothy Fuller-Rowell,
Tzu-Wei Fang,
Valery Yudin,
Svetlana Karol,
Erich Becker,
Adam Marshall Kubaryk
Abstract:
This paper presents a study of the global medium-scale (scales<620 km) gravity wave (GW) activity (in terms of zonal wind variance) and its seasonal, local time and longitudinal variations by employing the enhanced-resolution (~50 km) Whole Atmosphere Model (WAMT254) and space-based observations for geomagnetically quiet conditions. It is found that the GW hotspots produced by WAMT254 in the tropo…
▽ More
This paper presents a study of the global medium-scale (scales<620 km) gravity wave (GW) activity (in terms of zonal wind variance) and its seasonal, local time and longitudinal variations by employing the enhanced-resolution (~50 km) Whole Atmosphere Model (WAMT254) and space-based observations for geomagnetically quiet conditions. It is found that the GW hotspots produced by WAMT254 in the troposphere and stratosphere agree well with previously well-studied orographic and non-orographic sources. In the ionosphere-thermosphere (IT) region, GWs spread out forming latitudinal band-like hotspots. During solstices, a primary maximum in GW activity is observed in WAMT254 and GOCE over winter mid-high latitudes, likely associated with higher-order waves with primary sources in polar night jet, fronts and polar vortex. During all the seasons, the enhancement of GWs around the geomagnetic poles as observed by GOCE (at ~250 km) is well captured by simulations. WAMT254 GWs in the IT region also show dependence on local time due to their interaction with migrating tides leading to diurnal and semidiurnal variations. The GWs are more likely to propagate up from the MLT region during westward/weakly-eastward phase of thermospheric tides, signifying the dominance of eastward GW momentum flux in the MLT. Additionally, as a novel finding, a wavenumber-4 signature in GW activity is predicted by WAMT254 between 6-12 LT in the tropics at ~250 km, which propagates eastward with local time. This behavior is likely associated with the modulation of GWs by wave-4 signal of non-migrating tides in the lower thermospheric zonal winds.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
MindSet: Vision. A toolbox for testing DNNs on key psychological experiments
Authors:
Valerio Biscione,
Dong Yin,
Gaurav Malhotra,
Marin Dujmovic,
Milton L. Montero,
Guillermo Puebla,
Federico Adolfi,
Rachel F. Heaton,
John E. Hummel,
Benjamin D. Evans,
Karim Habashy,
Jeffrey S. Bowers
Abstract:
Multiple benchmarks have been developed to assess the alignment between deep neural networks (DNNs) and human vision. In almost all cases these benchmarks are observational in the sense they are composed of behavioural and brain responses to naturalistic images that have not been manipulated to test hypotheses regarding how DNNs or humans perceive and identify objects. Here we introduce the toolbo…
▽ More
Multiple benchmarks have been developed to assess the alignment between deep neural networks (DNNs) and human vision. In almost all cases these benchmarks are observational in the sense they are composed of behavioural and brain responses to naturalistic images that have not been manipulated to test hypotheses regarding how DNNs or humans perceive and identify objects. Here we introduce the toolbox MindSet: Vision, consisting of a collection of image datasets and related scripts designed to test DNNs on 30 psychological findings. In all experimental conditions, the stimuli are systematically manipulated to test specific hypotheses regarding human visual perception and object recognition. In addition to providing pre-generated datasets of images, we provide code to regenerate these datasets, offering many configurable parameters which greatly extend the dataset versatility for different research contexts, and code to facilitate the testing of DNNs on these image datasets using three different methods (similarity judgments, out-of-distribution classification, and decoder method), accessible at https://github.com/MindSetVision/mindset-vision. We test ResNet-152 on each of these methods as an example of how the toolbox can be used.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Lost in Latent Space: Disentangled Models and the Challenge of Combinatorial Generalisation
Authors:
Milton L. Montero,
Jeffrey S. Bowers,
Rui Ponte Costa,
Casimir J. H. Ludwig,
Gaurav Malhotra
Abstract:
Recent research has shown that generative models with highly disentangled representations fail to generalise to unseen combination of generative factor values. These findings contradict earlier research which showed improved performance in out-of-training distribution settings when compared to entangled representations. Additionally, it is not clear if the reported failures are due to (a) encoders…
▽ More
Recent research has shown that generative models with highly disentangled representations fail to generalise to unseen combination of generative factor values. These findings contradict earlier research which showed improved performance in out-of-training distribution settings when compared to entangled representations. Additionally, it is not clear if the reported failures are due to (a) encoders failing to map novel combinations to the proper regions of the latent space or (b) novel combinations being mapped correctly but the decoder/downstream process is unable to render the correct output for the unseen combinations. We investigate these alternatives by testing several models on a range of datasets and training settings. We find that (i) when models fail, their encoders also fail to map unseen combinations to correct regions of the latent space and (ii) when models succeed, it is either because the test conditions do not exclude enough examples, or because excluded generative factors determine independent parts of the output image. Based on these results, we argue that to generalise properly, models not only need to capture factors of variation, but also understand how to invert the generative process that was used to generate the data.
△ Less
Submitted 14 June, 2024; v1 submitted 5 April, 2022;
originally announced April 2022.
-
Auto-Tag: Tagging-Data-By-Example in Data Lakes
Authors:
Yeye He,
Jie Song,
Yue Wang,
Surajit Chaudhuri,
Vishal Anil,
Blake Lassiter,
Yaron Goland,
Gaurav Malhotra
Abstract:
As data lakes become increasingly popular in large enterprises today, there is a growing need to tag or classify data assets (e.g., files and databases) in data lakes with additional metadata (e.g., semantic column-types), as the inferred metadata can enable a range of downstream applications like data governance (e.g., GDPR compliance), and dataset search. Given the sheer size of today's enterpri…
▽ More
As data lakes become increasingly popular in large enterprises today, there is a growing need to tag or classify data assets (e.g., files and databases) in data lakes with additional metadata (e.g., semantic column-types), as the inferred metadata can enable a range of downstream applications like data governance (e.g., GDPR compliance), and dataset search. Given the sheer size of today's enterprise data lakes with petabytes of data and millions of data assets, it is imperative that data assets can be ``auto-tagged'', using lightweight inference algorithms and minimal user input. In this work, we develop Auto-Tag, a corpus-driven approach that automates data-tagging of \textit{custom} data types in enterprise data lakes. Using Auto-Tag, users only need to provide \textit{one} example column to demonstrate the desired data-type to tag. Leveraging an index structure built offline using a lightweight scan of the data lake, which is analogous to pre-training in machine learning, Auto-Tag can infer suitable data patterns to best ``describe'' the underlying ``domain'' of the given column at an interactive speed, which can then be used to tag additional data of the same ``type'' in data lakes. The Auto-Tag approach can adapt to custom data-types, and is shown to be both accurate and efficient. Part of Auto-Tag ships as a ``custom-classification'' feature in a cloud-based data governance and catalog solution \textit{Azure Purview}.
△ Less
Submitted 11 December, 2021;
originally announced December 2021.
-
Speaker and Time-aware Joint Contextual Learning for Dialogue-act Classification in Counselling Conversations
Authors:
Ganeshan Malhotra,
Abdul Waheed,
Aseem Srivastava,
Md Shad Akhtar,
Tanmoy Chakraborty
Abstract:
The onset of the COVID-19 pandemic has brought the mental health of people under risk. Social counselling has gained remarkable significance in this environment. Unlike general goal-oriented dialogues, a conversation between a patient and a therapist is considerably implicit, though the objective of the conversation is quite apparent. In such a case, understanding the intent of the patient is impe…
▽ More
The onset of the COVID-19 pandemic has brought the mental health of people under risk. Social counselling has gained remarkable significance in this environment. Unlike general goal-oriented dialogues, a conversation between a patient and a therapist is considerably implicit, though the objective of the conversation is quite apparent. In such a case, understanding the intent of the patient is imperative in providing effective counselling in therapy sessions, and the same applies to a dialogue system as well. In this work, we take forward a small but an important step in the development of an automated dialogue system for mental-health counselling. We develop a novel dataset, named HOPE, to provide a platform for the dialogue-act classification in counselling conversations. We identify the requirement of such conversation and propose twelve domain-specific dialogue-act (DAC) labels. We collect 12.9K utterances from publicly-available counselling session videos on YouTube, extract their transcripts, clean, and annotate them with DAC labels. Further, we propose SPARTA, a transformer-based architecture with a novel speaker- and time-aware contextual learning for the dialogue-act classification. Our evaluation shows convincing performance over several baselines, achieving state-of-the-art on HOPE. We also supplement our experiments with extensive empirical and qualitative analyses of SPARTA.
△ Less
Submitted 12 November, 2021;
originally announced November 2021.
-
Pricing of the Geometric Asian Options Under a Multifactor Stochastic Volatility Model
Authors:
Gifty Malhotra,
R. Srivastava,
H. C. Taneja
Abstract:
This paper focuses on the pricing of continuous geometric Asian options (GAOs) under a multifactor stochastic volatility model. The model considers fast and slow mean reverting factors of volatility, where slow volatility factor is approximated by a quadratic arc. The asymptotic expansion of the price function is assumed, and the first order price approximation is derived using the perturbation te…
▽ More
This paper focuses on the pricing of continuous geometric Asian options (GAOs) under a multifactor stochastic volatility model. The model considers fast and slow mean reverting factors of volatility, where slow volatility factor is approximated by a quadratic arc. The asymptotic expansion of the price function is assumed, and the first order price approximation is derived using the perturbation techniques for both floating and fixed strike GAOs. Much simplified pricing formulae for the GAOs are obtained in this multifactor stochastic volatility framework. The zeroth order term in the price approximation is the modified Black-Scholes price for the GAOs. This modified price is expressed in terms of the Black-Scholes price for the GAOs. The accuracy of the approximate option pricing formulae is established, and the model parameter is also estimated by capturing the volatility smiles.
△ Less
Submitted 23 December, 2019;
originally announced December 2019.
-
Comparative Study of Two Extensions of Heston Stochastic Volatility Model
Authors:
Gifty Malhotra,
R. Srivastava,
H. C. Taneja
Abstract:
In the option valuation literature, the shortcomings of one factor stochastic volatility models have traditionally been addressed by adding jumps to the stock price process. An alternate approach in the context of option pricing and calibration of implied volatility is the addition of a few other factors to the volatility process. This paper contemplates two extensions of the Heston stochastic vol…
▽ More
In the option valuation literature, the shortcomings of one factor stochastic volatility models have traditionally been addressed by adding jumps to the stock price process. An alternate approach in the context of option pricing and calibration of implied volatility is the addition of a few other factors to the volatility process. This paper contemplates two extensions of the Heston stochastic volatility model. Out of which, one considers the addition of jumps to the stock price process (a stochastic volatility jump diffusion model) and another considers an additional stochastic volatility factor varying at a different time scale (a multiscale stochastic volatility model). An empirical analysis is carried out on the market data of options with different strike prices and maturities, to compare the pricing performance of these models and to capture their implied volatility fit. The unknown parameters of these models are calibrated using the non-linear least square optimization. It has been found that the multiscale stochastic volatility model performs better than the Heston stochastic volatility model and the stochastic volatility jump diffusion model for the data set under consideration.
△ Less
Submitted 21 December, 2019;
originally announced December 2019.
-
Correlation of Auroral Dynamics and GNSS Scintillation with an Autoencoder
Authors:
Kara Lamb,
Garima Malhotra,
Athanasios Vlontzos,
Edward Wagstaff,
Atılım Günes Baydin,
Anahita Bhiwandiwalla,
Yarin Gal,
Alfredo Kalaitzis,
Anthony Reina,
Asti Bhatt
Abstract:
High energy particles originating from solar activity travel along the the Earth's magnetic field and interact with the atmosphere around the higher latitudes. These interactions often manifest as aurora in the form of visible light in the Earth's ionosphere. These interactions also result in irregularities in the electron density, which cause disruptions in the amplitude and phase of the radio si…
▽ More
High energy particles originating from solar activity travel along the the Earth's magnetic field and interact with the atmosphere around the higher latitudes. These interactions often manifest as aurora in the form of visible light in the Earth's ionosphere. These interactions also result in irregularities in the electron density, which cause disruptions in the amplitude and phase of the radio signals from the Global Navigation Satellite Systems (GNSS), known as 'scintillation'. In this paper we use a multi-scale residual autoencoder (Res-AE) to show the correlation between specific dynamic structures of the aurora and the magnitude of the GNSS phase scintillations ($σ_φ$). Auroral images are encoded in a lower dimensional feature space using the Res-AE, which in turn are clustered with t-SNE and UMAP. Both methods produce similar clusters, and specific clusters demonstrate greater correlations with observed phase scintillations. Our results suggest that specific dynamic structures of auroras are highly correlated with GNSS phase scintillations.
△ Less
Submitted 4 October, 2019;
originally announced October 2019.
-
Prediction of GNSS Phase Scintillations: A Machine Learning Approach
Authors:
Kara Lamb,
Garima Malhotra,
Athanasios Vlontzos,
Edward Wagstaff,
Atılım Günes Baydin,
Anahita Bhiwandiwalla,
Yarin Gal,
Alfredo Kalaitzis,
Anthony Reina,
Asti Bhatt
Abstract:
A Global Navigation Satellite System (GNSS) uses a constellation of satellites around the earth for accurate navigation, timing, and positioning. Natural phenomena like space weather introduce irregularities in the Earth's ionosphere, disrupting the propagation of the radio signals that GNSS relies upon. Such disruptions affect both the amplitude and the phase of the propagated waves. No physics-b…
▽ More
A Global Navigation Satellite System (GNSS) uses a constellation of satellites around the earth for accurate navigation, timing, and positioning. Natural phenomena like space weather introduce irregularities in the Earth's ionosphere, disrupting the propagation of the radio signals that GNSS relies upon. Such disruptions affect both the amplitude and the phase of the propagated waves. No physics-based model currently exists to predict the time and location of these disruptions with sufficient accuracy and at relevant scales. In this paper, we focus on predicting the phase fluctuations of GNSS radio waves, known as phase scintillations. We propose a novel architecture and loss function to predict 1 hour in advance the magnitude of phase scintillations within a time window of plus-minus 5 minutes with state-of-the-art performance.
△ Less
Submitted 3 October, 2019;
originally announced October 2019.
-
Quadratic approximation of slow factor of volatility in a Multi-factor Stochastic volatility Model
Authors:
Gifty Malhotra,
R. Srivastava,
H. C. Taneja
Abstract:
In the present work, we propose a new multifactor stochastic volatility model in which slow factor of volatility is approximated by a parabolic arc. We retain ourselves to the perturbation technique to obtain approximate expression for European option prices. We introduce the notion of modified Black-Scholes price. We obtain a simplified expression for European option price which is perturbed arou…
▽ More
In the present work, we propose a new multifactor stochastic volatility model in which slow factor of volatility is approximated by a parabolic arc. We retain ourselves to the perturbation technique to obtain approximate expression for European option prices. We introduce the notion of modified Black-Scholes price. We obtain a simplified expression for European option price which is perturbed around the modified Black-Scholes price and have also obtained the expression of modified price in terms of Black-Scholes price.
△ Less
Submitted 31 March, 2017;
originally announced March 2017.
-
Normalization based K means Clustering Algorithm
Authors:
Deepali Virmani,
Shweta Taneja,
Geetika Malhotra
Abstract:
K-means is an effective clustering technique used to separate similar data into groups based on initial centroids of clusters. In this paper, Normalization based K-means clustering algorithm(N-K means) is proposed. Proposed N-K means clustering algorithm applies normalization prior to clustering on the available data as well as the proposed approach calculates initial centroids based on weights. E…
▽ More
K-means is an effective clustering technique used to separate similar data into groups based on initial centroids of clusters. In this paper, Normalization based K-means clustering algorithm(N-K means) is proposed. Proposed N-K means clustering algorithm applies normalization prior to clustering on the available data as well as the proposed approach calculates initial centroids based on weights. Experimental results prove the betterment of proposed N-K means clustering algorithm over existing K-means clustering algorithm in terms of complexity and overall performance.
△ Less
Submitted 3 March, 2015;
originally announced March 2015.