Search | arXiv e-print repository

arXiv:2406.16383 [pdf, other]

Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model

Authors: Sai Ganesh, Anupam Purwar, Gautam B

Abstract: Generating high-quality answers consistently by providing contextual information embedded in the prompt passed to the Large Language Model (LLM) is dependent on the quality of information retrieval. As the corpus of contextual information grows, the answer/inference quality of Retrieval Augmented Generation (RAG) based Question Answering (QA) systems declines. This work solves this problem by comb… ▽ More Generating high-quality answers consistently by providing contextual information embedded in the prompt passed to the Large Language Model (LLM) is dependent on the quality of information retrieval. As the corpus of contextual information grows, the answer/inference quality of Retrieval Augmented Generation (RAG) based Question Answering (QA) systems declines. This work solves this problem by combining classical text classification with the Large Language Model (LLM) to enable quick information retrieval from the vector store and ensure the relevancy of retrieved information. For the same, this work proposes a new approach Context Augmented retrieval (CAR), where partitioning of vector database by real-time classification of information flowing into the corpus is done. CAR demonstrates good quality answer generation along with significant reduction in information retrieval and answer generation time. △ Less

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2405.12817 [pdf, other]

Curve of Growth Analysis of SZ Lyn

Authors: Janaka Adassuriya, Shashikiran Ganesh, Peter de Cat, Santosh Joshi, Chandana Jayaratne

Abstract: We present one high-resolution and a time series of 561 low-resolution follow-up spectroscopic observations of SZ Lyn. It is a high-amplitude Delta Scuti-type pulsating star in a binary system. The photometric observations reveal the existence of radial and non-radial oscillation modes in SZ Lyn. In spectroscopy, the variation of equivalent width of the line profiles reflects the temperature varia… ▽ More We present one high-resolution and a time series of 561 low-resolution follow-up spectroscopic observations of SZ Lyn. It is a high-amplitude Delta Scuti-type pulsating star in a binary system. The photometric observations reveal the existence of radial and non-radial oscillation modes in SZ Lyn. In spectroscopy, the variation of equivalent width of the line profiles reflects the temperature variations. The equivalent widths of the Balmer lines, H-alpha, H-hbeta, and H-gamma were measured over the pulsation cycle of SZ Lyn using time sequence spectra. Hence, the temperature profile of SZ Lyn was derived using the curve of growth analysis. Furthermore, the stellar parameters were determined through the best fit analysis of observed and synthetic high-resolution spectral lines. The best fit determines a model of Teff=6750 K, log(g)=3.5 dex, and vrot=10 km/s for solar abundance. △ Less

Submitted 21 May, 2024; originally announced May 2024.

Comments: 8 pages, 4 figures, accepted in the Bulletin of Liège Royal Society of Sciences (Proceedings paper for the 3rd BINA Workshop held at ARIES, India)

arXiv:2405.03903 [pdf, other]

Unified Locational Differential Privacy Framework

Authors: Aman Priyanshu, Yash Maurya, Suriya Ganesh, Vy Tran

Abstract: Aggregating statistics over geographical regions is important for many applications, such as analyzing income, election results, and disease spread. However, the sensitive nature of this data necessitates strong privacy protections to safeguard individuals. In this work, we present a unified locational differential privacy (DP) framework to enable private aggregation of various data types, includi… ▽ More Aggregating statistics over geographical regions is important for many applications, such as analyzing income, election results, and disease spread. However, the sensitive nature of this data necessitates strong privacy protections to safeguard individuals. In this work, we present a unified locational differential privacy (DP) framework to enable private aggregation of various data types, including one-hot encoded, boolean, float, and integer arrays, over geographical regions. Our framework employs local DP mechanisms such as randomized response, the exponential mechanism, and the Gaussian mechanism. We evaluate our approach on four datasets representing significant location data aggregation scenarios. Results demonstrate the utility of our framework in providing formal DP guarantees while enabling geographical data analysis. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 10 pages, 7 figures

arXiv:2404.02108 [pdf, ps, other]

Variance-Reduced Policy Gradient Approaches for Infinite Horizon Average Reward Markov Decision Processes

Authors: Swetha Ganesh, Washim Uddin Mondal, Vaneet Aggarwal

Abstract: We present two Policy Gradient-based methods with general parameterization in the context of infinite horizon average reward Markov Decision Processes. The first approach employs Implicit Gradient Transport for variance reduction, ensuring an expected regret of the order $\tilde{\mathcal{O}}(T^{3/5})$. The second approach, rooted in Hessian-based techniques, ensures an expected regret of the order… ▽ More We present two Policy Gradient-based methods with general parameterization in the context of infinite horizon average reward Markov Decision Processes. The first approach employs Implicit Gradient Transport for variance reduction, ensuring an expected regret of the order $\tilde{\mathcal{O}}(T^{3/5})$. The second approach, rooted in Hessian-based techniques, ensures an expected regret of the order $\tilde{\mathcal{O}}(\sqrt{T})$. These results significantly improve the state of the art of the problem, which achieves a regret of $\tilde{\mathcal{O}}(T^{3/4})$. △ Less

Submitted 2 April, 2024; originally announced April 2024.

Comments: 34 pages

arXiv:2403.20043 [pdf, other]

Foreground Dust Properties towards the Cluster NGC 7380

Authors: Sadhana Singh, Jeewan C. Pandey, Thiem Hoang, Neelam Panwar, Biman J. Medhi, Vishal Joshi, Shashikiran Ganesh

Abstract: Using starlight polarization, we present the properties of foreground dust towards cluster NGC 7380 embedded in H{\sc ii} region Sh 2-142. Observations of starlight polarization are carried out in four filters using an imaging polarimeter equipped with a 104-cm ARIES telescope. Polarization vectors of stars are aligned along the Galactic magnetic field. Towards the east and southeast regions, the… ▽ More Using starlight polarization, we present the properties of foreground dust towards cluster NGC 7380 embedded in H{\sc ii} region Sh 2-142. Observations of starlight polarization are carried out in four filters using an imaging polarimeter equipped with a 104-cm ARIES telescope. Polarization vectors of stars are aligned along the Galactic magnetic field. Towards the east and southeast regions, the dust structure appears much denser than in other regions (inferred from extinction contours and colour composite image) and is also reflected in polarization distribution. We find that the polarization degree and extinction tend to increase with distance and indication for the presence of a dust layer at a distance of around 1.2 $kpc$. We have identified eight potential candidates exhibiting intrinsic polarization by employing three distinct criteria to distinguish between stars of intrinsic polarization and interstellar polarized stars. For interstellar polarized stars, we find that the maximum polarization degree increases with the color excess and has a strong scatter, with the mean value of 1.71$\pm$0.57$\%$. The peak wavelength spans $0.40-0.88μ$m with the mean value of 0.56$\pm$0.07 $μm$, suggesting similar grain sizes in the region as the average diffuse interstellar medium. The polarization efficiency is also found to decrease with visual extinction as $P_{max}/A_{V}\propto A_{V}^{-0.61}$. Our observational results are found to be consistent with the predictions by the radiative torque alignment theory. △ Less

Submitted 29 March, 2024; originally announced March 2024.

Comments: 20 pages, 7 figures, 3 tables. Accepted for publication in AJ

arXiv:2403.13019 [pdf, ps, other]

Many body gravity and the galaxy rotation curves

Authors: S Ganesh

Abstract: A novel theory was proposed earlier to model systems with thermal gradients, based on the postulate that the spatial and temporal variation in temperature can be recast as a variation in the metric. Combining the variation in the metric due to the thermal variations and gravity, leads to the concept of thermal gravity in a 5-D space-time-temperature setting. When the 5-D Einstein field equations a… ▽ More A novel theory was proposed earlier to model systems with thermal gradients, based on the postulate that the spatial and temporal variation in temperature can be recast as a variation in the metric. Combining the variation in the metric due to the thermal variations and gravity, leads to the concept of thermal gravity in a 5-D space-time-temperature setting. When the 5-D Einstein field equations are projected to a 4-D space, they result in additional terms in the field equations. This may lead to unique phenomena such as the spontaneous symmetry breaking of scalar particles in the presence of a strong gravitational field. This theory, originally conceived in a quantum mechanical framework, is now adapted to explain the galaxy rotation curves. A galaxy is not in a state of thermal equilibrium. A parameter called the "degree of thermalization" is introduced to model partially thermalized systems. The generalization of thermal gravity to partially thermalized systems, leads to the theory of many-body gravity. The theory of many-body gravity is now shown to be able to explain the rotation curves of the Milky Way and the M31 (Andromeda) galaxies, to a fair extent. The radial acceleration relation (RAR) for 21 galaxies, with variations spanning three orders of magnitude in galactic mass, is also reproduced. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: 13 pages, 6 figures

arXiv:2403.10704 [pdf, other]

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Authors: Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis **, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Simral Chaudhary, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon

Abstract: Reinforcement Learning from Human Feedback (RLHF) has proven to be a strong method to align Pretrained Large Language Models (LLMs) with human preferences. But training models with RLHF is computationally expensive, and an overall complex process. In this work, we study RLHF where the underlying models are trained using the parameter efficient method of Low-Rank Adaptation (LoRA) introduced by Hu… ▽ More Reinforcement Learning from Human Feedback (RLHF) has proven to be a strong method to align Pretrained Large Language Models (LLMs) with human preferences. But training models with RLHF is computationally expensive, and an overall complex process. In this work, we study RLHF where the underlying models are trained using the parameter efficient method of Low-Rank Adaptation (LoRA) introduced by Hu et al. [2021]. We investigate the setup of "Parameter Efficient Reinforcement Learning" (PERL), in which we perform reward model training and reinforcement learning using LoRA. We compare PERL to conventional fine-tuning (full-tuning) across various configurations for 7 benchmarks, including 2 novel datasets, of reward modeling and reinforcement learning. We find that PERL performs on par with the conventional RLHF setting, while training faster, and with less memory. This enables the high performance of RLHF, while reducing the computational burden that limits its adoption as an alignment technique for Large Language Models. We also release 2 novel thumbs up/down preference datasets: "Taskmaster Coffee", and "Taskmaster Ticketing" to promote research around RLHF. △ Less

Submitted 15 March, 2024; originally announced March 2024.

arXiv:2403.09940 [pdf, ps, other]

Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries

Authors: Swetha Ganesh, Jiayu Chen, Gugan Thoppe, Vaneet Aggarwal

Abstract: Federated Reinforcement Learning (FRL) allows multiple agents to collaboratively build a decision making policy without sharing raw trajectories. However, if a small fraction of these agents are adversarial, it can lead to catastrophic results. We propose a policy gradient based approach that is robust to adversarial agents which can send arbitrary values to the server. Under this setting, our res… ▽ More Federated Reinforcement Learning (FRL) allows multiple agents to collaboratively build a decision making policy without sharing raw trajectories. However, if a small fraction of these agents are adversarial, it can lead to catastrophic results. We propose a policy gradient based approach that is robust to adversarial agents which can send arbitrary values to the server. Under this setting, our results form the first global convergence guarantees with general parametrization. These results demonstrate resilience with adversaries, while achieving sample complexity of order $\tilde{\mathcal{O}}\left( \frac{1}{ε^2} \left( \frac{1}{N-f} + \frac{f^2}{(N-f)^2}\right)\right)$, where $N$ is the total number of agents and $f$ is the number of adversarial agents. △ Less

Submitted 14 March, 2024; originally announced March 2024.

Comments: 27 pages, 6 figures

arXiv:2403.01595 [pdf, other]

doi 10.1093/mnras/stae666

Optical spectroscopy of comets using Hanle Echelle Spectrograph (HESP)

Authors: K Aravind, Kumar Venkataramani, Shashikiran Ganesh, Arun Surya, Thirupathi Sivarani, Devendra Sahu, Athira Unni, Anil Bhardwaj

Abstract: Observing the vibrational/rotational lines in a comet's optical spectrum requires high-resolution spectroscopy, as they are otherwise seen as a blended feature. To achieve this, we have obtained medium and high-resolution (R ($λ/Δλ$) = 30000 and 60000) spectra of several comets, including C/2015 V2 (Johnson), 46P/Wirtanen, 41P/Tuttle-Giacobini-Kresák and 38P/Stephan-Oterma, using the Hanle Echelle… ▽ More Observing the vibrational/rotational lines in a comet's optical spectrum requires high-resolution spectroscopy, as they are otherwise seen as a blended feature. To achieve this, we have obtained medium and high-resolution (R ($λ/Δλ$) = 30000 and 60000) spectra of several comets, including C/2015 V2 (Johnson), 46P/Wirtanen, 41P/Tuttle-Giacobini-Kresák and 38P/Stephan-Oterma, using the Hanle Echelle Spectrograph (HESP) mounted on the 2-m Himalayan Chandra Telescope (HCT) in India. The spectra effectively cover the wavelength range 3700 - 10,000 Å, allowing us to probe the various vibrational bands and band sequences to identify the rotational lines in the cometary molecular emission. We were also able to separate the cometary Oxygen lines from the telluric lines and analyse the green-to-red (G/R) forbidden oxygen [OI] ratios in a few comets. For comets C/2015 V2, 46P, and 41P, the computed G/R ratios, 0.04$\pm$0.01, 0.04$\pm$0.01, and 0.08$\pm$0.02 respectively, point to H$_2$O being a major source of Oxygen emissions. Notably, in the second fibre pointing at a location 1000 km away from the photocenter of comet 46P, the G/R ratio reduced by more than half the value observed in the first fibre, indicating the effects of quenching within the inner coma. We also measured the NH$_2$ ortho-to-para ratio of comet 46P to be about 3.41$\pm$0.05 and derived an ammonia ratio of 1.21$\pm$0.03 corresponding to a spin temperature of $\sim$26 K. With these, we present the results of the study of four comets from different cometary reservoirs using medium and high-resolution optical spectroscopy, emphasising the capabilities of the instrument for future cometary studies. △ Less

Submitted 3 March, 2024; originally announced March 2024.

Comments: 12 pages, 21 figures, 6 tables, Accepted for publication in MNRAS

arXiv:2402.17932 [pdf, other]

A Heterogeneous Agent Model of Mortgage Servicing: An Income-based Relief Analysis

Authors: Deepeka Garg, Benjamin Patrick Evans, Leo Ardon, Annapoorani Lakshmi Narayanan, Jared Vann, Udari Madhushani, Makada Henry-Nickie, Sumitra Ganesh

Abstract: Mortgages account for the largest portion of household debt in the United States, totaling around \$12 trillion nationwide. In times of financial hardship, alleviating mortgage burdens is essential for supporting affected households. The mortgage servicing industry plays a vital role in offering this assistance, yet there has been limited research modelling the complex relationship between househo… ▽ More Mortgages account for the largest portion of household debt in the United States, totaling around \$12 trillion nationwide. In times of financial hardship, alleviating mortgage burdens is essential for supporting affected households. The mortgage servicing industry plays a vital role in offering this assistance, yet there has been limited research modelling the complex relationship between households and servicers. To bridge this gap, we developed an agent-based model that explores household behavior and the effectiveness of relief measures during financial distress. Our model represents households as adaptive learning agents with realistic financial attributes. These households experience exogenous income shocks, which may influence their ability to make mortgage payments. Mortgage servicers provide relief options to these households, who then choose the most suitable relief based on their unique financial circumstances and individual preferences. We analyze the impact of various external shocks and the success of different mortgage relief strategies on specific borrower subgroups. Through this analysis, we show that our model can not only replicate real-world mortgage studies but also act as a tool for conducting a broad range of what-if scenario analyses. Our approach offers fine-grained insights that can inform the development of more effective and inclusive mortgage relief solutions. △ Less

Submitted 29 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

Comments: AAAI 2024 - AI in Finance for Social Impact

arXiv:2402.02476 [pdf, other]

Constraints on Triton atmospheric evolution from occultations: 1989-2022

Authors: B. Sicardy, A. Tej, A. R. Gomes-Junior, F. D. Romanov, T. Bertrand, N. M. Ashok, E. Lellouch, B. E. Morgado, M. Assafin, J. Desmars, J. I. B. Camargo, Y. Kilic, J. L. Ortiz, R. Vieira-Martins, F. Braga-Ribas, J. P. Ninan, B. C. Bhatt, S. Pramod Kumar, V. Swain, S. Sharma, A. Saha, D. K. Ojha, G. Pawar, S. Deshmukh, A. Deshpande , et al. (27 additional authors not shown)

Abstract: Context - Around the year 2000, Triton's south pole experienced an extreme summer solstice that occurs every about 650 years, when the subsolar latitude reached about 50°. Bracketing this epoch, a few occultations probed Triton's atmosphere in 1989, 1995, 1997, 2008 and 2017. A recent ground-based stellar occultation observed on 6 October 2022 provides a new measurement of Triton's atmospheric pre… ▽ More Context - Around the year 2000, Triton's south pole experienced an extreme summer solstice that occurs every about 650 years, when the subsolar latitude reached about 50°. Bracketing this epoch, a few occultations probed Triton's atmosphere in 1989, 1995, 1997, 2008 and 2017. A recent ground-based stellar occultation observed on 6 October 2022 provides a new measurement of Triton's atmospheric pressure which is presented here. Aims- The goal is to constrain the Volatile Transport Models (VTMs) of Triton's atmosphere that is basically in vapor pressure equilibrium with the nitrogen ice at its surface. Methods - Fits to the occultation light curves yield Triton's atmospheric pressure at the reference radius 1400 km, from which the surface pressure is induced. Results - The fits provide a pressure p_1400= 1.211 +/- 0.039 microbar at radius 1400 km (47 km altitude), from which a surface pressure of p_surf= 14.54 +/- 0.47 microbar is induced (1-sigma error bars). To within error bars, this is identical to the pressure derived from the previous occultation of 5 October 2017, p_1400 = 1.18 +/- 0.03 microbar and p_surf= 14.1 +/- 0.4 microbar, respectively. Based on recent models of Triton's volatile cycles, the overall evolution over the last 30 years of the surface pressure is consistent with N2 condensation taking place in the northern hemisphere. However, models typically predict a steady decrease in surface pressure for the period 2005-2060, which is not confirmed by this observation. Complex surface-atmosphere interactions, such as ice albedo runaway and formation of local N2 frosts in the equatorial regions of Triton could explain the relatively constant pressure between 2017 and 2022. △ Less

Submitted 4 February, 2024; originally announced February 2024.

Comments: 8 pages, 4 figures, accepted for publication in Astronomy and Astrophysics

arXiv:2402.00787 [pdf, other]

Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning

Authors: Benjamin Patrick Evans, Sumitra Ganesh

Abstract: Agent-based models (ABMs) have shown promise for modelling various real world phenomena incompatible with traditional equilibrium analysis. However, a critical concern is the manual definition of behavioural rules in ABMs. Recent developments in multi-agent reinforcement learning (MARL) offer a way to address this issue from an optimisation perspective, where agents strive to maximise their utilit… ▽ More Agent-based models (ABMs) have shown promise for modelling various real world phenomena incompatible with traditional equilibrium analysis. However, a critical concern is the manual definition of behavioural rules in ABMs. Recent developments in multi-agent reinforcement learning (MARL) offer a way to address this issue from an optimisation perspective, where agents strive to maximise their utility, eliminating the need for manual rule specification. This learning-focused approach aligns with established economic and financial models through the use of rational utility-maximising agents. However, this representation departs from the fundamental motivation for ABMs: that realistic dynamics emerging from bounded rationality and agent heterogeneity can be modelled. To resolve this apparent disparity between the two approaches, we propose a novel technique for representing heterogeneous processing-constrained agents within a MARL framework. The proposed approach treats agents as constrained optimisers with varying degrees of strategic skills, permitting departure from strict utility maximisation. Behaviour is learnt through repeated simulations with policy gradients to adjust action likelihoods. To allow efficient computation, we use parameterised shared policy learning with distributions of agent skill levels. Shared policy learning avoids the need for agents to learn individual policies yet still enables a spectrum of bounded rational behaviours. We validate our model's effectiveness using real-world data on a range of canonical $n$-agent settings, demonstrating significantly improved predictive capability. △ Less

Submitted 1 February, 2024; originally announced February 2024.

Comments: Accepted as a full paper at AAMAS 2024

arXiv:2401.03869 [pdf, other]

doi 10.1007/s12036-024-09996-6

Long-term spectroscopic monitoring of comet 46P/Wirtanen

Authors: K. Aravind, Kumar Venkataramani, Shashikiran Ganesh, Emmanuel Jehin, Youssef Moulane

Abstract: Jupiter Family Comets (JFCs), having orbital period less than 20 years, provide us with an opportunity to observe their activity and analyse the homogeneity in their coma composition over multiple apparitions. Comet 46P/Wirtanen with its exceptionally close approach to Earth during its 2018 apparition offered the possibility for a long-term spectroscopic observations. We used a 1.2 m telescope equ… ▽ More Jupiter Family Comets (JFCs), having orbital period less than 20 years, provide us with an opportunity to observe their activity and analyse the homogeneity in their coma composition over multiple apparitions. Comet 46P/Wirtanen with its exceptionally close approach to Earth during its 2018 apparition offered the possibility for a long-term spectroscopic observations. We used a 1.2 m telescope equipped with a low-resolution spectrograph to monitor the comet's activity and compute the relative abundances in the coma, as a function of heliocentric distance. We report the production rates of four molecules CN, C$_2$, C$_3$ and NH$_2$, and Af$ρ$ parameter, a proxy to the dust production, before and after perihelion. We found that 46P has a typical coma composition with almost constant abundance ratios with respect to CN across the epochs of observation. Comparing the coma composition of comet 46P during the current and previous apparitions, we conclude the comet has a highly homogeneous chemical composition in the nucleus with an enhancement in ammonia abundance compared to the average abundance in comets. △ Less

Submitted 8 January, 2024; originally announced January 2024.

Comments: 11 pages, 6 figures, 2 tables, accepted for publication in JOAA

arXiv:2311.10927 [pdf, other]

Learning Payment-Free Resource Allocation Mechanisms

Authors: Sihan Zeng, Sujay Bhatt, Eleonora Kreacic, Parisa Hassanzadeh, Alec Koppel, Sumitra Ganesh

Abstract: We consider the design of mechanisms that allocate limited resources among self-interested agents using neural networks. Unlike the recent works that leverage machine learning for revenue maximization in auctions, we consider welfare maximization as the key objective in the payment-free setting. Without payment exchange, it is unclear how we can align agents' incentives to achieve the desired obje… ▽ More We consider the design of mechanisms that allocate limited resources among self-interested agents using neural networks. Unlike the recent works that leverage machine learning for revenue maximization in auctions, we consider welfare maximization as the key objective in the payment-free setting. Without payment exchange, it is unclear how we can align agents' incentives to achieve the desired objectives of truthfulness and social welfare simultaneously, without resorting to approximations. Our work makes novel contributions by designing an approximate mechanism that desirably trade-off social welfare with truthfulness. Specifically, (i) we contribute a new end-to-end neural network architecture, ExS-Net, that accommodates the idea of "money-burning" for mechanism design without payments; (ii)~we provide a generalization bound that guarantees the mechanism performance when trained under finite samples; and (iii) we provide an experimental demonstration of the merits of the proposed mechanism. △ Less

Submitted 12 April, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

arXiv:2311.10493 [pdf, other]

Optical spectroscopy of comets

Authors: K. Aravind, Shashikiran Ganesh

Abstract: Comets are pristine remnants of the Solar system, composed of dust and ice. They remain inactive and undetectable for most of their orbit due to low temperatures. However, as they approach the Sun, volatile materials sublimate, expelling dust and creating a visible coma. Spectroscopic observations of comets help the simultaneous study of both the gas emissions and reflected sunlight from dust part… ▽ More Comets are pristine remnants of the Solar system, composed of dust and ice. They remain inactive and undetectable for most of their orbit due to low temperatures. However, as they approach the Sun, volatile materials sublimate, expelling dust and creating a visible coma. Spectroscopic observations of comets help the simultaneous study of both the gas emissions and reflected sunlight from dust particles. By implementing a long slit, the spatial variations in molecular emissions can be analysed to be further used for other computations. Additionally, spatial information aids in extracting the characteristic profile of the Af(rho) parameter, revealing insights into the behaviour of dust emissions. A sufficiently long slit would prove advantageous in extracting information about the emissions occurring at different parts of the coma or even the tail. We can gain an overall comprehensive understanding of a comet's chemical composition and dust emission by constructively utilising low-resolution spectroscopy with the help of a long slit. △ Less

Submitted 17 November, 2023; originally announced November 2023.

Comments: 10 pages, 8 figures, accepted for publication in Bulletin de la Société Royale des Sciences de Liège (2023)

arXiv:2311.09617 [pdf, other]

Optical polarisation study of Galactic Open clusters

Authors: Namita Uppal, Shashikiran Ganesh, Santosh Joshi, Mrinmoy Sarkar, Prachi Prajapati, Athul Dileep

Abstract: Dust is a ubiquitous component in our Galaxy. It accounts for only $1\%$ mass of the ISM but still is an essential part of the Galaxy. It affects our view of the Galaxy by obscuring the starlight at shorter wavelengths and re-emitting in longer wavelengths. Studying the dust distribution in the Galaxy at longer wavelengths may cause discrepancies due to distance ambiguity caused by unknown Galacti… ▽ More Dust is a ubiquitous component in our Galaxy. It accounts for only $1\%$ mass of the ISM but still is an essential part of the Galaxy. It affects our view of the Galaxy by obscuring the starlight at shorter wavelengths and re-emitting in longer wavelengths. Studying the dust distribution in the Galaxy at longer wavelengths may cause discrepancies due to distance ambiguity caused by unknown Galactic potential. However, another aspect of dust, i.e., the polarisation of the background starlight, when combined with distance information, will help to give direct observational evidence of the number of dust clouds encountered in the line of sight. We observed 15 open clusters distributed at increasing distances in three lines of sight using two Indian national facilities. The measured polarisation results used to scrutinize the dust distribution and orientation of the local plane of sky magnetic fields towards selected directions. The analysis of the stars observed towards the distant cluster King 8 cluster shows two foreground layers at a distance of $\sim 500$ pc and $\sim$ 3500 pc. Similar analysis towards different clusters also results in multiple dust layers. △ Less

Submitted 16 November, 2023; originally announced November 2023.

Comments: 11 pages, 5 figures, 1 table Accepted for publication in the Bulletin of Liège Royal Society of Sciences as a part of 3rd Belgo-Indian Network for Astronomy and Astrophysics (BINA) workshop, 22-24 March 2023

arXiv:2311.09616 [pdf, other]

doi 10.1093/mnras/stad3525

Warp and flare of the old Galactic disc as traced by the red clump stars

Authors: Namita Uppal, Shashikiran Ganesh, Mathias Schultheis

Abstract: Our study aims to investigate the outer disc structure of the Milky Way galaxy using the red clump (RC) stars. We analysed the distribution of the largest sample of RC stars to date, homogeneously covering the entire Galactic plane in the range of $40^\circ \le \ell \le 340^\circ$ and $-10^\circ \le b \le +10^\circ$. This sample allows us to model the RC star distribution in the Galactic disc to b… ▽ More Our study aims to investigate the outer disc structure of the Milky Way galaxy using the red clump (RC) stars. We analysed the distribution of the largest sample of RC stars to date, homogeneously covering the entire Galactic plane in the range of $40^\circ \le \ell \le 340^\circ$ and $-10^\circ \le b \le +10^\circ$. This sample allows us to model the RC star distribution in the Galactic disc to better constrain the properties of the flare and warp of the Galaxy. Our results show that the scale length of the old stellar disc weakly depends on azimuth, with an average value of $1.95 \pm0.26$ kpc. On the other hand, a significant disc flaring is detected, where the scale height of the disc increases from 0.38 kpc in the solar neighbourhood to $\sim 2.2$ kpc at R $\approx 15$ kpc. The flare exhibits a slight asymmetry, with $\sim 1$ kpc more scale height below the Galactic plane as compared to the Northern flare. We also confirm the war** of the outer disc, which can be modelled with $Z_w = (0.0057 \pm 0.0050)~ [R-(7358 \pm 368) (pc)]^{1.40 \pm 0.09} \sin(φ- (-2^\circ.03 \pm 0^\circ .18))$. Our analysis reveals a noticeable north-south asymmetry in the warp, with a greater amplitude observed in the southern direction compared to the northern. Comparing our findings with younger tracers from the literature, we observe an age dependency of both the flare and warp. An increase in flare strength with age suggests the secular evolution of the disc as the preferred mechanism for forming the flare. The increase of the maximum warp amplitude with age indicates that the warp dynamics could be the possible cause of the variation in the warp properties with age. △ Less

Submitted 16 November, 2023; originally announced November 2023.

Comments: 11 pages, 12 figures, 4 tables Accepted for publication in MNRAS

arXiv:2310.14403 [pdf, other]

O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models

Authors: Yuchen Xiao, Yanchao Sun, Mengda Xu, Udari Madhushani, Jared Vann, Deepeka Garg, Sumitra Ganesh

Abstract: Recent advancements in large language models (LLMs) have exhibited promising performance in solving sequential decision-making problems. By imitating few-shot examples provided in the prompts (i.e., in-context learning), an LLM agent can interact with an external environment and complete given tasks without additional training. However, such few-shot examples are often insufficient to generate hig… ▽ More Recent advancements in large language models (LLMs) have exhibited promising performance in solving sequential decision-making problems. By imitating few-shot examples provided in the prompts (i.e., in-context learning), an LLM agent can interact with an external environment and complete given tasks without additional training. However, such few-shot examples are often insufficient to generate high-quality solutions for complex and long-horizon tasks, while the limited context length cannot consume larger-scale demonstrations with long interaction horizons. To this end, we propose an offline learning framework that utilizes offline data at scale (e.g, logs of human interactions) to improve LLM-powered policies without finetuning. The proposed method O3D (Offline Data-driven Discovery and Distillation) automatically discovers reusable skills and distills generalizable knowledge across multiple tasks based on offline interaction data, advancing the capability of solving downstream tasks. Empirical results under two interactive decision-making benchmarks (ALFWorld and WebShop) verify that O3D can notably enhance the decision-making capabilities of LLMs through the offline discovery and distillation process, and consistently outperform baselines across various LLMs. △ Less

Submitted 26 February, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

arXiv:2309.02666 [pdf, other]

Fast and Resource-Efficient Object Tracking on Edge Devices: A Measurement Study

Authors: Sanjana Vijay Ganesh, Yanzhao Wu, Gaowen Liu, Ramana Kompella, Ling Liu

Abstract: Object tracking is an important functionality of edge video analytic systems and services. Multi-object tracking (MOT) detects the moving objects and tracks their locations frame by frame as real scenes are being captured into a video. However, it is well known that real time object tracking on the edge poses critical technical challenges, especially with edge devices of heterogeneous computing re… ▽ More Object tracking is an important functionality of edge video analytic systems and services. Multi-object tracking (MOT) detects the moving objects and tracks their locations frame by frame as real scenes are being captured into a video. However, it is well known that real time object tracking on the edge poses critical technical challenges, especially with edge devices of heterogeneous computing resources. This paper examines the performance issues and edge-specific optimization opportunities for object tracking. We will show that even the well trained and optimized MOT model may still suffer from random frame drop** problems when edge devices have insufficient computation resources. We present several edge specific performance optimization strategies, collectively coined as EMO, to speed up the real time object tracking, ranging from window-based optimization to similarity based optimization. Extensive experiments on popular MOT benchmarks demonstrate that our EMO approach is competitive with respect to the representative methods for on-device object tracking techniques in terms of run-time performance and tracking accuracy. EMO is released on Github at https://github.com/git-disl/EMO. △ Less

Submitted 5 September, 2023; originally announced September 2023.

arXiv:2306.11082

Frequency measurements of $5s5p^{3}$P$_{0}\to5s6d^{3}$D$_{1}$ and observation of nonlinearities in King plot with Sr

Authors: S. Zhang, B. T. Tiwari, S. Ganesh, Y. Singh

Abstract: We report the first precision measurement of the absolute frequency of $5s5p^{3}$P$_{0}\to5s6d^{3}$D$_{1}$ for all four stable Sr isotopes with an accuracy of $\sim$25 kHz employing repum** induced spectroscopy. By combining the isotope shifts of this transition with the existing measurement data on the intercombination line, the King plot is established which reveals a deviation from the linear… ▽ More We report the first precision measurement of the absolute frequency of $5s5p^{3}$P$_{0}\to5s6d^{3}$D$_{1}$ for all four stable Sr isotopes with an accuracy of $\sim$25 kHz employing repum** induced spectroscopy. By combining the isotope shifts of this transition with the existing measurement data on the intercombination line, the King plot is established which reveals a deviation from the linearity at the 5.2$σ$ level. △ Less

Submitted 22 June, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

Comments: We plan to include the theoretical calculation together with the experimental measurements presented here, which will take some extra time to finalize the paper. Once the new version is completed, we will upload it here

arXiv:2306.09937 [pdf, other]

Collective scattering in lattice-trapped Sr atoms via dipole-dipole interactions

Authors: Shengnan Zhang, Sandhya Ganesh, Balsant Shivanand Tiwari, Kai Bongs, Yeshpal Singh

Abstract: We investigate, based on the coupled dipole model, collective properties of dense Sr ensembles trapped in a three-dimensional (3D) optical lattice in the presence of dipole-dipole interactions induced on the 5$s5p^{3}$P$_{0}\to5s4d^{3}$D$_{1}$ transition. Our results reveal that the collective scattering properties, such as the scattered light intensity, frequency shift and linewidth, strongly dep… ▽ More We investigate, based on the coupled dipole model, collective properties of dense Sr ensembles trapped in a three-dimensional (3D) optical lattice in the presence of dipole-dipole interactions induced on the 5$s5p^{3}$P$_{0}\to5s4d^{3}$D$_{1}$ transition. Our results reveal that the collective scattering properties, such as the scattered light intensity, frequency shift and linewidth, strongly depend on the interatomic distance and the atom number in the lattice. Moreover, the emission intensity is strongly dependent on the atomic distribution in lattices, the laser polarization and the detection position. The results not only offer the understanding of collective behaviors of lattice-trapped ensembles with an atom number equivalent to the experimental scale, but also provide an excellent platform for exploring many-body physics, thereby, opening a new window for applications like quantum information processing and quantum simulation. △ Less

Submitted 7 August, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

arXiv:2304.01525 [pdf, other]

Online Learning with Adversaries: A Differential-Inclusion Analysis

Authors: Swetha Ganesh, Alexandre Reiffers-Masson, Gugan Thoppe

Abstract: We introduce an observation-matrix-based framework for fully asynchronous online Federated Learning (FL) with adversaries. In this work, we demonstrate its effectiveness in estimating the mean of a random vector. Our main result is that the proposed algorithm almost surely converges to the desired mean $μ.$ This makes ours the first asynchronous FL method to have an a.s. convergence guarantee in t… ▽ More We introduce an observation-matrix-based framework for fully asynchronous online Federated Learning (FL) with adversaries. In this work, we demonstrate its effectiveness in estimating the mean of a random vector. Our main result is that the proposed algorithm almost surely converges to the desired mean $μ.$ This makes ours the first asynchronous FL method to have an a.s. convergence guarantee in the presence of adversaries. We derive this convergence using a novel differential-inclusion-based two-timescale analysis. Two other highlights of our proof include (a) the use of a novel Lyapunov function to show that $μ$ is the unique global attractor for our algorithm's limiting dynamics, and (b) the use of martingale and stop**-time theory to show that our algorithm's iterates are almost surely bounded. △ Less

Submitted 26 September, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

Comments: 6 pages, 2 figures

arXiv:2302.13626 [pdf, other]

doi 10.1007/s12036-022-09905-9

Infrared polarisation study of Lynds 1340: A case of RNO 8

Authors: Archita Rai, Shashikiran Ganesh

Abstract: This paper describes the polarisation study of a Lynds cloud, LDN 1340, $α$ = 2h32m & $δ$ = $73^{\circ} 00^\prime$ corresponding to galactic coordinates of $\ell=$ 130$^{\circ}$.07 $b=$ 11$^{\circ}$.6, with emphasis on the RNO 8 area. The cloud has been observed using the 1.2 m telescope at Mt.Abu Infrared Observatory, in the infrared wavelength band using the Near-Infrared Camera, Spectrograph &… ▽ More This paper describes the polarisation study of a Lynds cloud, LDN 1340, $α$ = 2h32m & $δ$ = $73^{\circ} 00^\prime$ corresponding to galactic coordinates of $\ell=$ 130$^{\circ}$.07 $b=$ 11$^{\circ}$.6, with emphasis on the RNO 8 area. The cloud has been observed using the 1.2 m telescope at Mt.Abu Infrared Observatory, in the infrared wavelength band using the Near-Infrared Camera, Spectrograph & Polarimeter (NICSPol) instrument. The polarimetric observations were used to map the magnetic field geometry around the region. We combined our measurements with archival data from the 2MASS and WISE surveys. The Gaia EDR3 & DR3 data for the same region were used for distance, proper motion, and other astrophysical information. The analysis of the data reveals areas with ordered polarisation vectors in the region of RNO 8. The position angle measurements reveal polarisation due to dichroic extinction which is consistent with the Galactic magnetic field. The magnetic field strength was calculated for the RNO 8 region using the Chandrashekhar-Fermi method and the value estimated is $\sim$ 42$μ$G. △ Less

Submitted 27 February, 2023; originally announced February 2023.

Comments: 10 pages, 10 figures

Journal ref: J. Astrophys. Astr. (2023) 44:16

arXiv:2301.04827 [pdf, ps, other]

doi 10.1088/1361-6382/acb24c

5-D thermal field theory, Einstein field equations and spontaneous symmetry breaking

Authors: S. Ganesh

Abstract: It has been shown previously, that the spatial thermal variation of a thermal medium can be recast as a variation in the Euclidean metric. It is now extended to temporal variations in temperature, for a non-relativistic thermal bath, which remains in local thermal equilibrium. This is achieved by examining the thermal field theory in a five-dimensional space-time-temperature. The bulk thermodynami… ▽ More It has been shown previously, that the spatial thermal variation of a thermal medium can be recast as a variation in the Euclidean metric. It is now extended to temporal variations in temperature, for a non-relativistic thermal bath, which remains in local thermal equilibrium. This is achieved by examining the thermal field theory in a five-dimensional space-time-temperature. The bulk thermodynamic quantity, namely the energy density, is calculated for a neutral scalar field with a time-dependent Hamiltonian. Furthermore, the concept of recasting thermal variations as a variation in the metric is extended to thermal systems in a gravitational field. The Einstein field equations, in the 5-D space-time-temperature, is determined. It is shown that the resulting Ricci scalar can then lead to spontaneous symmetry breaking, leading to the Higgs mechanism. In essence, the asymmetry in the distribution of temperature in space-time can translate to spontaneous symmetry breaking of particle fields, in a very strong gravitational field. △ Less

Submitted 12 January, 2023; originally announced January 2023.

Comments: 10 pages, 1 figure. Accepted for publication in Classical and Quantum Gravity. Refer to the journal for the accepted/published version

Journal ref: Classical and Quantum Gravity, Vol 40, No. 4, 045008 (2023)

arXiv:2301.03758 [pdf, other]

Sequential Fair Resource Allocation under a Markov Decision Process Framework

Authors: Parisa Hassanzadeh, Eleonora Kreacic, Sihan Zeng, Yuchen Xiao, Sumitra Ganesh

Abstract: We study the sequential decision-making problem of allocating a limited resource to agents that reveal their stochastic demands on arrival over a finite horizon. Our goal is to design fair allocation algorithms that exhaust the available resource budget. This is challenging in sequential settings where information on future demands is not available at the time of decision-making. We formulate the… ▽ More We study the sequential decision-making problem of allocating a limited resource to agents that reveal their stochastic demands on arrival over a finite horizon. Our goal is to design fair allocation algorithms that exhaust the available resource budget. This is challenging in sequential settings where information on future demands is not available at the time of decision-making. We formulate the problem as a discrete time Markov decision process (MDP). We propose a new algorithm, SAFFE, that makes fair allocations with respect to the entire demands revealed over the horizon by accounting for expected future demands at each arrival time. The algorithm introduces regularization which enables the prioritization of current revealed demands over future potential demands depending on the uncertainty in agents' future demands. Using the MDP formulation, we show that SAFFE optimizes allocations based on an upper bound on the Nash Social Welfare fairness objective, and we bound its gap to optimality with the use of concentration bounds on total future demands. Using synthetic and real data, we compare the performance of SAFFE against existing approaches and a reinforcement learning policy trained on the MDP. We show that SAFFE leads to more fair and efficient allocations and achieves close-to-optimal performance in settings with dense arrivals. △ Less

Submitted 16 June, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

arXiv:2211.15589 [pdf, other]

Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

Authors: Leo Ardon, Alberto Pozanco, Daniel Borrajo, Sumitra Ganesh

Abstract: Reinforcement Learning (RL) algorithms are known to scale poorly to environments with many available actions, requiring numerous samples to learn an optimal policy. The traditional approach of considering the same fixed action space in every possible state implies that the agent must understand, while also learning to maximize its reward, to ignore irrelevant actions such as… ▽ More Reinforcement Learning (RL) algorithms are known to scale poorly to environments with many available actions, requiring numerous samples to learn an optimal policy. The traditional approach of considering the same fixed action space in every possible state implies that the agent must understand, while also learning to maximize its reward, to ignore irrelevant actions such as $\textit{inapplicable actions}$ (i.e. actions that have no effect on the environment when performed in a given state). Knowing this information can help reduce the sample complexity of RL algorithms by masking the inapplicable actions from the policy distribution to only explore actions relevant to finding an optimal policy. While this technique has been formalized for quite some time within the Automated Planning community with the concept of precondition in the STRIPS language, RL algorithms have never formally taken advantage of this information to prune the search space to explore. This is typically done in an ad-hoc manner with hand-crafted domain logic added to the RL algorithm. In this paper, we propose a more systematic approach to introduce this knowledge into the algorithm. We (i) standardize the way knowledge can be manually specified to the agent; and (ii) present a new framework to autonomously learn the partial action model encapsulating the precondition of an action jointly with the policy. We show experimentally that learning inapplicable actions greatly improves the sample efficiency of the algorithm by providing a reliable signal to mask out irrelevant actions. Moreover, we demonstrate that thanks to the transferability of the knowledge acquired, it can be reused in other tasks and domains to make the learning process more efficient. △ Less

Submitted 11 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

arXiv:2210.08206 [pdf, other]

doi 10.1051/0004-6361/202244548

The Outer spiral arm of the Milky Way using Red Clump stars

Authors: Namita Uppal, Shashikiran Ganesh, Mathias Schultheis

Abstract: Aims: Our aim is to provide an observational view of the old Disc structure of the Milky Way galaxy using the distribution of red clump stars. The spiral arms, warp structure, and other asymmetries present in the Disc are re-visited using a systematic study of red clump star counts over the disc of the Galaxy. Methods: We developed a method to systematically extract the red clump stars from 2MAS… ▽ More Aims: Our aim is to provide an observational view of the old Disc structure of the Milky Way galaxy using the distribution of red clump stars. The spiral arms, warp structure, and other asymmetries present in the Disc are re-visited using a systematic study of red clump star counts over the disc of the Galaxy. Methods: We developed a method to systematically extract the red clump stars from 2MASS ($J-K_s, ~J$) colour-magnitude diagram of $1^\circ \times 1^\circ$ bins in $\ell \times b$ covering the range $40^\circ \le \ell \le 320^\circ$ and $-10^\circ \le b \le 10^\circ$. 2MASS data continues to be important since it is able to identify and trace the red clump stars to much farther distances than any optical survey of the Disc. The foreground star contamination in the selected sample is removed by utilising the accurate astrometric data from Gaia EDR3. Results: We have generated a face-on-view (XY-plane) of the Galaxy depicting the density distribution and count ratio above and below the Galactic plane. The resulting over-density of red clump stars traces the continuous morphology of the Outer arm from the second to the third Galactic quadrant. This is the first study to map the Outer arms across the disc using red clump stars. Through this study, we are able to trace the Outer arm well into the 3rd Galactic quadrant for the first time. Apart from the spiral structures, we also see a wave-like asymmetry above and below the Galactic plane with respect to longitudes indicating the warp structure. The warp structure is studied systematically by tracing the ratio of red clump stars above and below the Galactic plane. We provide the first direct observational evidence of the asymmetry in the Outer spiral arms confirming that the spiral arms traced by the older population are also warped, similar to the Disc. △ Less

Submitted 23 March, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

Comments: 11 pages, 8 figures, accepted for publication in 'Astronomy and Astrophysics' Journal

Journal ref: A&A 673, A99 (2023)

arXiv:2210.07184 [pdf, other]

Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations

Authors: Nelson Vadori, Leo Ardon, Sumitra Ganesh, Thomas Spooner, Selim Amrouni, Jared Vann, Mengda Xu, Zeyu Zheng, Tucker Balch, Manuela Veloso

Abstract: We study a game between liquidity provider and liquidity taker agents interacting in an over-the-counter market, for which the typical example is foreign exchange. We show how a suitable design of parameterized families of reward functions coupled with shared policy learning constitutes an efficient solution to this problem. By playing against each other, our deep-reinforcement-learning-driven age… ▽ More We study a game between liquidity provider and liquidity taker agents interacting in an over-the-counter market, for which the typical example is foreign exchange. We show how a suitable design of parameterized families of reward functions coupled with shared policy learning constitutes an efficient solution to this problem. By playing against each other, our deep-reinforcement-learning-driven agents learn emergent behaviors relative to a wide spectrum of objectives encompassing profit-and-loss, optimal execution and market share. In particular, we find that liquidity providers naturally learn to balance hedging and skewing, where skewing refers to setting their buy and sell prices asymmetrically as a function of their inventory. We further introduce a novel RL-based calibration algorithm which we found performed well at imposing constraints on the game equilibrium. On the theoretical side, we are able to show convergence rates for our multi-agent policy gradient algorithm under a transitivity assumption, closely related to generalized ordinal potential games. △ Less

Submitted 1 August, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

arXiv:2210.06012 [pdf, other]

Phantom -- A RL-driven multi-agent framework to model complex systems

Authors: Leo Ardon, Jared Vann, Deepeka Garg, Tom Spooner, Sumitra Ganesh

Abstract: Agent based modelling (ABM) is a computational approach to modelling complex systems by specifying the behaviour of autonomous decision-making components or agents in the system and allowing the system dynamics to emerge from their interactions. Recent advances in the field of Multi-agent reinforcement learning (MARL) have made it feasible to study the equilibrium of complex environments where mul… ▽ More Agent based modelling (ABM) is a computational approach to modelling complex systems by specifying the behaviour of autonomous decision-making components or agents in the system and allowing the system dynamics to emerge from their interactions. Recent advances in the field of Multi-agent reinforcement learning (MARL) have made it feasible to study the equilibrium of complex environments where multiple agents learn simultaneously. However, most ABM frameworks are not RL-native, in that they do not offer concepts and interfaces that are compatible with the use of MARL to learn agent behaviours. In this paper, we introduce a new open-source framework, Phantom, to bridge the gap between ABM and MARL. Phantom is an RL-driven framework for agent-based modelling of complex multi-agent systems including, but not limited to economic systems and markets. The framework aims to provide the tools to simplify the ABM specification in a MARL-compatible way - including features to encode dynamic partial observability, agent utility functions, heterogeneity in agent preferences or types, and constraints on the order in which agents can act (e.g. Stackelberg games, or more complex turn-taking environments). In this paper, we present these features, their design rationale and present two new environments leveraging the framework. △ Less

Submitted 19 May, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

Comments: 2022 ACM International Conference on Artificial Intelligence in Finance - Benchmarks for AI in Finance Workshop 2023 Autonomous Agents and Multiagent Systems - Extended Abstract

arXiv:2210.03485 [pdf, other]

doi 10.1016/j.jcp.2023.112523

Gradient-based optimisation of the conditional-value-at-risk using the multi-level Monte Carlo method

Authors: Sundar Ganesh, Fabio Nobile

Abstract: In this work, we tackle the problem of minimising the Conditional-Value-at-Risk (CVaR) of output quantities of complex differential models with random input data, using gradient-based approaches in combination with the Multi-Level Monte Carlo (MLMC) method. In particular, we consider the framework of multi-level Monte Carlo for parametric expectations and propose modifications of the MLMC estimato… ▽ More In this work, we tackle the problem of minimising the Conditional-Value-at-Risk (CVaR) of output quantities of complex differential models with random input data, using gradient-based approaches in combination with the Multi-Level Monte Carlo (MLMC) method. In particular, we consider the framework of multi-level Monte Carlo for parametric expectations and propose modifications of the MLMC estimator, error estimation procedure, and adaptive MLMC parameter selection to ensure the estimation of the CVaR and sensitivities for a given design with a prescribed accuracy. We then propose combining the MLMC framework with an alternating inexact minimisation-gradient descent algorithm, for which we prove exponential convergence in the optimisation iterations under the assumptions of strong convexity and Lipschitz continuity of the gradient of the objective function. We demonstrate the performance of our approach on two numerical examples of practical relevance, which evidence the same optimal asymptotic cost-tolerance behaviour as standard MLMC methods for fixed design computations of output expectations. △ Less

Submitted 13 October, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

Comments: 26 pages, 18 figures, 1 table, Related to arXiv:2208.07252, Data available at https://zenodo.org/record/7193448

MSC Class: 65C05; 65K10 ACM Class: G.3; G.1.6

Journal ref: Journal of Computational Physics, Volume 495, 112523 (2023)

arXiv:2209.13109 [pdf, other]

R-fiducial: Reliable and Scalable Radar Fiducials for Smart mmwave Sensing

Authors: Kshitiz Bansal, Manideep Dunna, Sanjeev Anthia Ganesh, Eamon Patamsing, Dinesh Bharadia

Abstract: Millimeter wave sensing has recently attracted a lot of attention given its environmental robust nature. In situations where visual sensors like cameras fail to perform, mmwave radars can be used to achieve reliable performance. However, because of the poor scattering performance and lack of texture in millimeter waves, radars can not be used in several situations that require precise identificati… ▽ More Millimeter wave sensing has recently attracted a lot of attention given its environmental robust nature. In situations where visual sensors like cameras fail to perform, mmwave radars can be used to achieve reliable performance. However, because of the poor scattering performance and lack of texture in millimeter waves, radars can not be used in several situations that require precise identification of objects. In this paper, we take insight from camera fiducials which are very easily identifiable by the camera, and present R-fiducial tags, which smartly augment the current infrastructure to enable a myriad of applications with mmwave radars. R-fiducial acts as a fiducial for mmwave sensing, similar to camera fiducials, and can be reliably identified by a mmwave radar. We identify a precise list of requirements that a millimeter wave fiducial has to follow and show how R-fiducial achieves all of them. R-fiducial uses a novel spread-spectrum modulation technique that provides low latency with high reliability. Our evaluations show that R-fiducial can be reliably detected upto 25m and upto 120 degrees field of view with a latency of the order of milliseconds. We also conduct experiments and case studies in adverse and low visibility conditions to showcase the applicability of R-fiducial in a wide range of applications. △ Less

Submitted 26 September, 2022; originally announced September 2022.

arXiv:2208.07252 [pdf, other]

doi 10.1615/Int.J.UncertaintyQuantification.2023045259

Quantifying uncertain system outputs via the multi-level Monte Carlo method -- distribution and robustness measures

Authors: Quentin Ayoul-Guilmard, Sundar Ganesh, Sebastian Krumscheid, Fabio Nobile

Abstract: In this work, we consider the problem of estimating the probability distribution, the quantile or the conditional expectation above the quantile, the so called conditional-value-at-risk, of output quantities of complex random differential models by the MLMC method. We follow the approach of (reference), which recasts the estimation of the above quantities to the computation of suitable parametric… ▽ More In this work, we consider the problem of estimating the probability distribution, the quantile or the conditional expectation above the quantile, the so called conditional-value-at-risk, of output quantities of complex random differential models by the MLMC method. We follow the approach of (reference), which recasts the estimation of the above quantities to the computation of suitable parametric expectations. In this work, we present novel computable error estimators for the estimation of such quantities, which are then used to optimally tune the MLMC hierarchy in a continuation type adaptive algorithm. We demonstrate the efficiency and robustness of our adaptive continuation-MLMC in an array of numerical test cases. △ Less

Submitted 22 May, 2023; v1 submitted 29 July, 2022; originally announced August 2022.

Comments: 35 pages, 48 figures. Data available at https://doi.org/10.5281/zenodo.7025018

MSC Class: 65C05; 91G60 ACM Class: G.1.8; G.3; J.2; G.3

Journal ref: International Journal for Uncertainty Quantification, Volume 13, Issue 5, 2023, pp. 61-98

arXiv:2206.13324 [pdf, ps, other]

doi 10.1142/S0217751X22501251

Quantum theory, thermal gradients and the curved Euclidean space

Authors: S. Ganesh

Abstract: The Euclidean space, obtained by the analytical continuation of time, to an imaginary time, is used to model thermal systems. In this work, it is taken a step further to systems with spatial thermal variation, by develo** an equivalence between the spatial variation of temperature in a thermal bath and the curvature of the Euclidean space. The variation in temperature is recast as a variation in… ▽ More The Euclidean space, obtained by the analytical continuation of time, to an imaginary time, is used to model thermal systems. In this work, it is taken a step further to systems with spatial thermal variation, by develo** an equivalence between the spatial variation of temperature in a thermal bath and the curvature of the Euclidean space. The variation in temperature is recast as a variation in the metric, leading to a curved Euclidean space. The equivalence is substantiated by analyzing the Polyakov loop, the partition function and the periodicity of the correlation function. The bulk thermodynamic properties like the energy, entropy and the Helmholtz free energy are calculated from the partition function, for small metric perturbations, for a neutral scalar field. The Dirac equation for an external Dirac spinor, traversing in a thermal bath with spatial thermal gradients, is solved in the curved Euclidean space. The fundamental behavior exhibited by the Dirac spinor eigenstate, may provide a possible mechanism to validate the theory, at a more basal level, than examining only bulk thermodynamic properties. Furthermore, in order to verify the equivalence at the level of classical mechanics, the geodesic equation is analyzed in a classical backdrop. The mathematical apparatus is borrowed from the physics of quantum theory in a gravity-induced space-time curvature. As spatial thermal variations are obtainable at QCD or QED energies, it may be feasible for the proposed formulation to be validated experimentally. △ Less

Submitted 27 June, 2022; originally announced June 2022.

Comments: 20 pages, 8 figures, Accepted for publication in Int. J. Mod. Phys. A

Journal ref: Int. J. Mod. Phys. A, Vol 37, Issue No. 17, Article No. 2250125 (2022)

arXiv:2206.10158 [pdf, other]

Certifiably Robust Policy Learning against Adversarial Communication in Multi-agent Systems

Authors: Yanchao Sun, Ruijie Zheng, Parisa Hassanzadeh, Yongyuan Liang, Soheil Feizi, Sumitra Ganesh, Furong Huang

Abstract: Communication is important in many multi-agent reinforcement learning (MARL) problems for agents to share information and make good decisions. However, when deploying trained communicative agents in a real-world application where noise and potential attackers exist, the safety of communication-based policies becomes a severe issue that is underexplored. Specifically, if communication messages are… ▽ More Communication is important in many multi-agent reinforcement learning (MARL) problems for agents to share information and make good decisions. However, when deploying trained communicative agents in a real-world application where noise and potential attackers exist, the safety of communication-based policies becomes a severe issue that is underexplored. Specifically, if communication messages are manipulated by malicious attackers, agents relying on untrustworthy communication may take unsafe actions that lead to catastrophic consequences. Therefore, it is crucial to ensure that agents will not be misled by corrupted communication, while still benefiting from benign communication. In this work, we consider an environment with $N$ agents, where the attacker may arbitrarily change the communication from any $C<\frac{N-1}{2}$ agents to a victim agent. For this strong threat model, we propose a certifiable defense by constructing a message-ensemble policy that aggregates multiple randomly ablated message sets. Theoretical analysis shows that this message-ensemble policy can utilize benign communication while being certifiably robust to adversarial communication, regardless of the attacking algorithm. Experiments in multiple environments verify that our defense significantly improves the robustness of trained policies against various types of attacks. △ Less

Submitted 2 July, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

arXiv:2205.13153 [pdf, other]

doi 10.3847/1538-3881/ac7445

Optical linear polarization study towards Czernik 3 open cluster at different spatial scales

Authors: Namita Uppal, Shashikiran Ganesh, D. Bisht

Abstract: We present the optical linear polarization observation of stars towards the core of the Czernik 3 cluster in the Sloan i-band. The data were obtained using the EMPOL instrument on the 1.2 m telescope at Mount Abu Observatory. We study the dust distribution towards this cluster by combining the results from our polarization observations with the data from Gaia EDR3, WISE, and the HI, $^{12}$CO surv… ▽ More We present the optical linear polarization observation of stars towards the core of the Czernik 3 cluster in the Sloan i-band. The data were obtained using the EMPOL instrument on the 1.2 m telescope at Mount Abu Observatory. We study the dust distribution towards this cluster by combining the results from our polarization observations with the data from Gaia EDR3, WISE, and the HI, $^{12}$CO surveys. In addition, we use the polarimetric data of previously studied clusters within 15$^\circ$ of Czernik 3 to understand the large scale dust distribution. The observational results of Czernik 3 show a large range in the degree of polarization, indicating that the dust is not uniformly distributed over the plane of the sky, even on a small scale. The distance to the Czernik 3 is constrained to $3.6\pm0.8$ kpc using the member stars in the core region identified from Gaia EDR3 astrometry. This makes it one of the most distant clusters observed for optical polarization so far. The variation of observed degree of polarization and extinction towards this cluster direction suggests the presence of at least two dust layers along this line of sight at distances of $\sim 1$ kpc and $\sim 3.4$ kpc. There is an indication of the presence of dust in the centre of the cluster, as seen from an increase in the degree of polarization and WISE W4 flux. The large scale distribution of dust reveals the presence of a region of low dust content between the local arm and the Perseus arm. △ Less

Submitted 26 May, 2022; originally announced May 2022.

Comments: 22 pages, 15 figures, 6 tables, Accepted for publication in The Astronomical Journal

arXiv:2204.09727 [pdf, other]

doi 10.1016/j.icarus.2022.115042

Optical observations and dust modelling of comet 156P/Russell-LINEAR

Authors: K. Aravind, Prithish Halder, Shashikiran Ganesh, Devendra Sahu, Miquel Serra-Ricart, José J. Chambó, Dorje Angchuk, Thirupathi Sivarani

Abstract: Comet 156P/Russell-LINEAR is a short period Jupiter family comet with an orbital period of 6.44 years. The results from spectroscopic, photometric, polarimetric observations and dust modelling studies are presented here. From the spectroscopic study, strong emissions from $CN (Δν= 0)$, $C_3 (λ4050$ Å), $C_2 (Δν= +1)$ and $C_2 (Δν= 0)$ can be observed during both the epochs of our observations. The… ▽ More Comet 156P/Russell-LINEAR is a short period Jupiter family comet with an orbital period of 6.44 years. The results from spectroscopic, photometric, polarimetric observations and dust modelling studies are presented here. From the spectroscopic study, strong emissions from $CN (Δν= 0)$, $C_3 (λ4050$ Å), $C_2 (Δν= +1)$ and $C_2 (Δν= 0)$ can be observed during both the epochs of our observations. The Q($C_2$)/Q(CN) ratio classifies the comet as a typical comet. The imaging data reveals the presence of jets. The dust emission from the comet is observed to have a non-steady state outflow due to the presence of these strong jets which subside in later epochs, resulting in a steady state outflow. Polarimetric study at two different phase angles reveals the degree of polarization to be comparable to Jupiter family comets at similar phase angles. Localized variations in polarization values are observed in the coma. The dust modelling studies suggest the presence of high amount of silicate/low absorbing material and indicate the coma to be dominated by higher amount of large size grains with low porosity having power law size distribution index = 2.4. The observed activity and dust properties points to a similarity to another Jupiter family comet, 67P/Churyumov-Gerasimenko. △ Less

Submitted 20 April, 2022; originally announced April 2022.

Comments: 24 pages, 15 figures, Accepted for publication in Icarus

arXiv:2201.01853 [pdf, other]

Mixture of basis for interpretable continual learning with distribution shifts

Authors: Mengda Xu, Sumitra Ganesh, Pranay Pasula

Abstract: Continual learning in environments with shifting data distributions is a challenging problem with several real-world applications. In this paper we consider settings in which the data distribution(task) shifts abruptly and the timing of these shifts are not known. Furthermore, we consider a semi-supervised task-agnostic setting in which the learning algorithm has access to both task-segmented and… ▽ More Continual learning in environments with shifting data distributions is a challenging problem with several real-world applications. In this paper we consider settings in which the data distribution(task) shifts abruptly and the timing of these shifts are not known. Furthermore, we consider a semi-supervised task-agnostic setting in which the learning algorithm has access to both task-segmented and unsegmented data for offline training. We propose a novel approach called mixture of Basismodels (MoB) for addressing this problem setting. The core idea is to learn a small set of basis models and to construct a dynamic, task-dependent mixture of the models to predict for the current task. We also propose a new methodology to detect observations that are out-of-distribution with respect to the existing basis models and to instantiate new models as needed. We test our approach in multiple domains and show that it attains better prediction error than existing methods in most cases while using fewer models than other multiple model approaches. Moreover, we analyze the latent task representations learned by MoB and show that similar tasks tend to cluster in the latent space and that the latent representation shifts at the task boundaries when tasks are dissimilar. △ Less

Submitted 5 January, 2022; originally announced January 2022.

arXiv:2110.15547 [pdf, ps, other]

Does Momentum Help? A Sample Complexity Analysis

Authors: Swetha Ganesh, Rohan Deb, Gugan Thoppe, Amarjit Budhiraja

Abstract: Stochastic Heavy Ball (SHB) and Nesterov's Accelerated Stochastic Gradient (ASG) are popular momentum methods in stochastic optimization. While benefits of such acceleration ideas in deterministic settings are well understood, their advantages in stochastic optimization is still unclear. In fact, in some specific instances, it is known that momentum does not help in the sample complexity sense. Ou… ▽ More Stochastic Heavy Ball (SHB) and Nesterov's Accelerated Stochastic Gradient (ASG) are popular momentum methods in stochastic optimization. While benefits of such acceleration ideas in deterministic settings are well understood, their advantages in stochastic optimization is still unclear. In fact, in some specific instances, it is known that momentum does not help in the sample complexity sense. Our work shows that a similar outcome actually holds for the whole of quadratic optimization. Specifically, we obtain a lower bound on the sample complexity of SHB and ASG for this family and show that the same bound can be achieved by the vanilla SGD. We note that there exist results claiming the superiority of momentum based methods in quadratic optimization, but these are based on one-sided or flawed analyses. △ Less

Submitted 11 July, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

arXiv:2110.06829 [pdf, other]

doi 10.1145/3490354.3494372

Towards a fully RL-based Market Simulator

Authors: Leo Ardon, Nelson Vadori, Thomas Spooner, Mengda Xu, Jared Vann, Sumitra Ganesh

Abstract: We present a new financial framework where two families of RL-based agents representing the Liquidity Providers and Liquidity Takers learn simultaneously to satisfy their objective. Thanks to a parametrized reward formulation and the use of Deep RL, each group learns a shared policy able to generalize and interpolate over a wide range of behaviors. This is a step towards a fully RL-based market si… ▽ More We present a new financial framework where two families of RL-based agents representing the Liquidity Providers and Liquidity Takers learn simultaneously to satisfy their objective. Thanks to a parametrized reward formulation and the use of Deep RL, each group learns a shared policy able to generalize and interpolate over a wide range of behaviors. This is a step towards a fully RL-based market simulator replicating complex market conditions particularly suited to study the dynamics of the financial market under various scenarios. △ Less

Submitted 8 November, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

Journal ref: ACM International Conference on AI in Finance, 2021

arXiv:2106.02615 [pdf, other]

Consensus Multiplicative Weights Update: Learning to Learn using Projector-based Game Signatures

Authors: Nelson Vadori, Rahul Savani, Thomas Spooner, Sumitra Ganesh

Abstract: Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update method - OMWU and MWU - display opposite convergence properties depending on whether the game is zero-sum or cooperative. Inspired by this work and the recent literature on learning to optimize for single functions, we introduce a new framework for learning last-iterate convergence to Nash Equilibria… ▽ More Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update method - OMWU and MWU - display opposite convergence properties depending on whether the game is zero-sum or cooperative. Inspired by this work and the recent literature on learning to optimize for single functions, we introduce a new framework for learning last-iterate convergence to Nash Equilibria in games, where the update rule's coefficients (learning rates) along a trajectory are learnt by a reinforcement learning policy that is conditioned on the nature of the game: \textit{the game signature}. We construct the latter using a new decomposition of two-player games into eight components corresponding to commutative projection operators, generalizing and unifying recent game concepts studied in the literature. We compare the performance of various update rules when their coefficients are learnt, and show that the RL policy is able to exploit the game signature across a wide range of game types. In doing so, we introduce CMWU, a new algorithm that extends consensus optimization to the constrained case, has local convergence guarantees for zero-sum bimatrix games, and show that it enjoys competitive performance on both zero-sum games with constant coefficients and across a spectrum of games when its coefficients are learnt. △ Less

Submitted 11 June, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

Comments: ICML 2022, the 39th International Conference on Machine Learning

arXiv:2105.08119 [pdf, other]

doi 10.3847/1538-4357/ac01d1

X-ray Observations of 1ES 1959+650 in its high activity state in 2016-2017 with AstroSat and Swift

Authors: Sunil Chandra, Markus Boettcher, Pranjupriya Goswami, Kulinder Pal Singh, Michael Zacharias, Navpreet Kaur, Sudip Bhattacharyya, Shashikiran Ganesh, Daniela Dorner

Abstract: We present a comprehensive multi-frequency study of the HBL 1ES 1959+650 using data from various facilities during the period 2016-2017, including X-ray data from {\it AstroSat} and {\it Swift} during the historically high X-ray flux state of the source observed until February 2021. The unprecedented quality of X-ray data from high cadence monitoring with the {\it AstroSat} during 2016-2017 enable… ▽ More We present a comprehensive multi-frequency study of the HBL 1ES 1959+650 using data from various facilities during the period 2016-2017, including X-ray data from {\it AstroSat} and {\it Swift} during the historically high X-ray flux state of the source observed until February 2021. The unprecedented quality of X-ray data from high cadence monitoring with the {\it AstroSat} during 2016-2017 enables us to establish a detailed description of X-ray flares in 1ES 1959+650. The synchrotron peak shifts significantly between different flux states, in a manner consistent with a geometric (changing Doppler factor) interpretation. A time-dependent leptonic diffusive-shock-acceleration and radiation transfer model is used to reproduce the spectral energy distributions (SEDs) and X-ray light curves, to provide insight into the particle acceleration during the major activity periods observed in 2016 and 2017. The extensive data of {\it Swift}-XRT from December 2015 to February 2021 (Exp. = 411.3 ks) reveals a positive correlation between flux and peak position. △ Less

Submitted 17 May, 2021; originally announced May 2021.

Comments: 25 pages, 10 figures, 10 tables [Accepted by the Astrophysical Journal]

arXiv:2103.04596 [pdf, ps, other]

doi 10.1093/mnras/stab691

Multi-colour photometry and Gaia EDR3 astrometry of two couples of binary clusters (NGC 5617 and Trumpler 22) and (NGC 3293 and NGC 3324)

Authors: D. Bisht, Qingfeng Zhu, R. K. S. Yadav, Shashikiran Ganesh, Geeta Rangwal, Alok Durgapal, Devesh P. Sariya, Ing-Guey Jiang

Abstract: This paper presents a comprehensive analysis of two pairs of binary clusters (NGC 5617 and Trumpler 22) and (NGC 3293 and NGC 3324) located in the fourth quadrant of our Galaxy. For this purpose we use different data taken from VVV survey, WISE, VPHAS, APASS, GLIMPSE along with Gaia~EDR3 astrometric data. We identified 584, 429, 692 and 273 most probable cluster members with membership probability… ▽ More This paper presents a comprehensive analysis of two pairs of binary clusters (NGC 5617 and Trumpler 22) and (NGC 3293 and NGC 3324) located in the fourth quadrant of our Galaxy. For this purpose we use different data taken from VVV survey, WISE, VPHAS, APASS, GLIMPSE along with Gaia~EDR3 astrometric data. We identified 584, 429, 692 and 273 most probable cluster members with membership probability higher than 80 % towards the region of clusters NGC 5617, Trumpler 22, NGC 3293 and NGC 3324. We estimated the value of R as ~ 3.1 for clusters NGC 5617 and Trumpler 22, which indicates normal extinction law. The value of R ~ 3.8 and 1.9 represent the abnormal extinction law towards the clusters NGC 3293 and NGC 3324. Our Kinematical analysis show that all these clusters have circular orbits. Ages are found to be 90\pm10 and 12\pm3 Myr for the cluster pairs (NGC 5617 and Trumpler 22) and (NGC 3293 and NGC 3324), respectively. The distances of 2.43\pm0.08, 2.64\pm0.07, 2.59\pm0.1 and 2.80\pm0.2 kpc estimated using parallax are alike to the values calculated by using the distance modulus. We have also identified 18 and 44 young stellar object candidates present in NGC 5617 and Trumpler 22, respectively. Mass function slopes are found to be in fair agreement with the Salpeter's value. The dynamical study of these objects shows a lack of faint stars in their inner regions, which leads to the mass-segregation effect. Our study indicates that NGC 5617 and Trumpler 22 are dynamically relaxed but the other pair of clusters are not. Orbital alongwith the physical parameters show that the clusters in both pairs are physically connected. △ Less

Submitted 8 March, 2021; originally announced March 2021.

Comments: 21 pages, 20 figures, 7 tables, the article has been accepted for the publication MNRAS

arXiv:2103.02169 [pdf]

doi 10.5121/ijcsit.2021.13104

Real Time Vigilance Detection using Frontal EEG

Authors: Siddarth Ganesh, Ram Gurumoorthy

Abstract: Vigilance of an operator is compromised in performing many monotonous activities like workshop and manufacturing floor tasks, driving, night shift workers, flying, and in general any activity which requires keen attention of an individual over prolonged periods of time. Driver or operator fatigue in these situations leads to drowsiness and lowered vigilance which is one of the largest contributors… ▽ More Vigilance of an operator is compromised in performing many monotonous activities like workshop and manufacturing floor tasks, driving, night shift workers, flying, and in general any activity which requires keen attention of an individual over prolonged periods of time. Driver or operator fatigue in these situations leads to drowsiness and lowered vigilance which is one of the largest contributors to injuries and fatalities amongst road accidents or workshop floor accidents. Having a vigilance monitoring system to detect drop in vigilance in these situations becomes very important. This paper presents a system which uses non-invasively recorded Frontal EEG from an easy-to-use commercially available Brain Computer Interface wearable device to determine the vigilance state of an individual. The change in the power spectrum in the Frontal Theta Band (4-8Hz) of an individual's brain wave predicts the changes in the attention level of an individual - providing an early detection and warning system. This method provides an accurate, yet cheap and practical system for vigilance monitoring across different environments. △ Less

Submitted 2 March, 2021; originally announced March 2021.

Journal ref: International Journal of Computer Science & Information Technology (IJCSIT) Vol 13, No 1, February 2021

arXiv:2102.10362 [pdf, other]

Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs

Authors: Thomas Spooner, Nelson Vadori, Sumitra Ganesh

Abstract: Policy gradient methods can solve complex tasks but often fail when the dimensionality of the action-space or objective multiplicity grow very large. This occurs, in part, because the variance on score-based gradient estimators scales quadratically. In this paper, we address this problem through a factor baseline which exploits independence structure encoded in a novel action-target influence netw… ▽ More Policy gradient methods can solve complex tasks but often fail when the dimensionality of the action-space or objective multiplicity grow very large. This occurs, in part, because the variance on score-based gradient estimators scales quadratically. In this paper, we address this problem through a factor baseline which exploits independence structure encoded in a novel action-target influence network. Factored policy gradients (FPGs), which follow, provide a common framework for analysing key state-of-the-art algorithms, are shown to generalise traditional policy gradients, and yield a principled way of incorporating prior knowledge of a problem domain's generative processes. We provide an analysis of the proposed estimator and identify the conditions under which variance is reduced. The algorithmic aspects of FPGs are discussed, including optimal policy factorisation, as characterised by minimum biclique coverings, and the implications for the bias-variance trade-off of incorrectly specifying the network. Finally, we demonstrate the performance advantages of our algorithm on large-scale bandit and traffic intersection problems, providing a novel contribution to the latter in the form of a spatial approximation. △ Less

Submitted 23 November, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

Comments: NeurIPS 2021; 19 pages, 19 figures, 1 table

arXiv:2101.02752 [pdf, other]

doi 10.1093/mnras/stab084

Activity of the first interstellar comet 2I/Borisov around perihelion: Results from Indian observatories

Authors: Aravind Krishnakumar, Shashikiran Ganesh, Kumar Venkataramani, Devendra Sahu, Dorje Angchuk, Thirupathi Sivarani, Athira Unni

Abstract: Comet 2I/Borisov is the first true interstellar comet discovered. Here we present results from observational programs at two Indian observatories, 2 m Himalayan Chandra Telescope at the Indian Astronomical Observatory, Hanle (HCT) and 1.2 m telescope at the Mount Abu Infrared Observatory (MIRO). Two epochs of imaging and spectroscopy were carried out at the HCT and three epochs of imaging at MIRO.… ▽ More Comet 2I/Borisov is the first true interstellar comet discovered. Here we present results from observational programs at two Indian observatories, 2 m Himalayan Chandra Telescope at the Indian Astronomical Observatory, Hanle (HCT) and 1.2 m telescope at the Mount Abu Infrared Observatory (MIRO). Two epochs of imaging and spectroscopy were carried out at the HCT and three epochs of imaging at MIRO. We found CN to be the dominant molecular emission on both epochs, 31/11/2019 and 22/12/2019, at distances of r$_H$ = 2.013 and 2.031 AU respectively. The comet was inferred to be relatively depleted in Carbon bearing molecules on the basis of low $C_2$ and $C_3$ abundances. We find the production rate ratio, Q($C_2$)/Q(CN) = 0.54 $\pm$ 0.18, pre-perihelion and Q($C_2$)/Q(CN) = 0.34 $\pm$ 0.12 post-perihelion. This classifies the comet as being moderately depleted in carbon chain molecules. Using the results from spectroscopic observations, we believe the comet to have a chemically heterogeneous surface having variation in abundance of carbon chain molecules. From imaging observations we infer a dust-to-gas ratio similar to carbon chain depleted comets of the Solar system. We also compute the nucleus size to be in the range $0.18\leq r \leq 3.1$ Km. Our observations show that 2I/Borisov's behaviour is analogous to that of the Solar system comets. △ Less

Submitted 7 January, 2021; originally announced January 2021.

Comments: Accepted for publication in MNRAS, 10 pages, 8 figures

arXiv:2012.12616 [pdf, other]

doi 10.1007/s12036-021-09696-5

Multi-wavelength view of the galactic black-hole binary GRS 1716-249

Authors: Sandeep K. Rout, Santosh V. Vadawale, Aarthy E., Shashikiran Ganesh, Vishal Joshi, Jayashree Roy, Ranjeev Misra, J. S. Yadav

Abstract: The origins of X-ray and radio emissions during an X-ray binary outburst are comparatively better understood than those of ultraviolet, optical and infrared radiation. This is because multiple competing mechanisms peak in these mid-energy ranges. Ascertaining the true emission mechanism and segregating the contribution of different mechanisms, if present, is important for correct understanding of… ▽ More The origins of X-ray and radio emissions during an X-ray binary outburst are comparatively better understood than those of ultraviolet, optical and infrared radiation. This is because multiple competing mechanisms peak in these mid-energy ranges. Ascertaining the true emission mechanism and segregating the contribution of different mechanisms, if present, is important for correct understanding of the energetics of the system and hence its geometry. We have studied the multi-wavelength spectral energy distribution of the galactic X-ray binary GRS 1716-249 ranging from near infrared (0.0005 keV) to hard X-rays (120 keV) using observations from AstroSat, Swift, and Mount Abu Infrared Observatory. Broadband spectral fitting suggests that the irradiated accretion disk dominates emission in ultraviolet and optical regimes. The near infrared emission exhibits some excess than the prediction of the irradiated disk model, which is most likely due to Synchrotron emission from jets as suggested by radio emission. Irradiation of the inner disk by the hard X-ray emission from the Corona also plays a significant role in accounting for the soft X-ray emission. △ Less

Submitted 11 June, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

Comments: 11 pages, 8 figures and 2 tables. Published in the Journal of Astrophysics & Astronomy

Journal ref: Journal of Astrophysics and Astronomy volume 42, Article number: 39 (2021)

arXiv:2012.12458 [pdf, other]

TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems

Authors: Bill Byrne, Karthik Krishnamoorthi, Saravanan Ganesh, Mihir Sanjay Kale

Abstract: We present a data-driven, end-to-end approach to transaction-based dialog systems that performs at near-human levels in terms of verbal response quality and factual grounding accuracy. We show that two essential components of the system produce these results: a sufficiently large and diverse, in-domain labeled dataset, and a neural network-based, pre-trained model that generates both verbal respon… ▽ More We present a data-driven, end-to-end approach to transaction-based dialog systems that performs at near-human levels in terms of verbal response quality and factual grounding accuracy. We show that two essential components of the system produce these results: a sufficiently large and diverse, in-domain labeled dataset, and a neural network-based, pre-trained model that generates both verbal responses and API call predictions. In terms of data, we introduce TicketTalk, a movie ticketing dialog dataset with 23,789 annotated conversations. The movie ticketing conversations range from completely open-ended and unrestricted to more structured, both in terms of their knowledge base, discourse features, and number of turns. In qualitative human evaluations, model-generated responses trained on just 10,000 TicketTalk dialogs were rated to "make sense" 86.5 percent of the time, almost the same as human responses in the same contexts. Our simple, API-focused annotation schema results in a much easier labeling task making it faster and more cost effective. It is also the key component for being able to predict API calls accurately. We handle factual grounding by incorporating API calls in the training data, allowing our model to learn which actions to take and when. Trained on the same 10,000-dialog set, the model's API call predictions were rated to be correct 93.9 percent of the time in our evaluations, surpassing the ratings for the corresponding human labels. We show how API prediction and response generation scores improve as the dataset size incrementally increases from 5000 to 21,000 dialogs. Our analysis also clearly illustrates the benefits of pre-training. We are publicly releasing the TicketTalk dataset with this paper to facilitate future work on transaction-based dialogs. △ Less

Submitted 27 December, 2020; v1 submitted 22 December, 2020; originally announced December 2020.

Comments: Eight pages, 4 figures, 7 tables

arXiv:2012.11940 [pdf, other]

doi 10.1093/mnras/staa3923

Asteroseismology of sz lyn using multi-band high time resolution photometry from ground and space

Authors: J. Adassuriya, S. Ganesh, J. L. Gutierrez, G. Handler, Santosh Joshi, K. P. S. C. Jayaratne, K. S. Baliyan

Abstract: We report the analysis of high temporal resolution ground and space based photometric observations of SZ Lyncis, a binary star one of whose components is a high amplitude $δ$ Scuti. UBVR photometric observations were obtained from Mt. Abu Infrared Observatory and Fairborn Observatory; archival observations from the WASP project were also included. Furthermore, the continuous, high quality light cu… ▽ More We report the analysis of high temporal resolution ground and space based photometric observations of SZ Lyncis, a binary star one of whose components is a high amplitude $δ$ Scuti. UBVR photometric observations were obtained from Mt. Abu Infrared Observatory and Fairborn Observatory; archival observations from the WASP project were also included. Furthermore, the continuous, high quality light curve from the TESS project was extensively used for the analysis. The well resolved light curve from TESS reveals the presence of 23 frequencies with four independent modes, 13 harmonics of the main pulsation frequency of 8.296943$\pm$0.000002 d$^{-1}$ and their combinations. The frequency 8.296 d$^{-1}$ is identified as the fundamental radial mode by amplitude ratio method and using the estimated pulsation constant. The frequencies 14.535 d$^{-1}$, 32.620 d$^{-1}$ and 4.584 d$^{-1}$ are newly discovered for SZ Lyn. Out of these three, 14.535 d$^{-1}$ and 32.620 d$^{-1}$ are identified as non-radial lower order p-modes and 4.584 d$^{-1}$ could be an indication of a g-mode in a $δ$ Scuti star. As a result of frequency determination and mode identification, the physical parameters of SZ Lyn were revised by optimizations of stellar pulsation models with the observed frequencies. The theoretical models correspond to 7500 K $\le $T$_{\rm eff}$ $\le$ 7800 K, log(g)=3.81$\pm$0.06. The mass of SZ Lyn was estimated to be close to 1.7--2.0 M$_\odot$ using evolutionary sequences. The period-density relation estimates a mean density $ρ$ of 0.1054$\pm$0.0016 g cm$^{-3}$ △ Less

Submitted 22 December, 2020; originally announced December 2020.

Comments: 16 pages, 11 figures, accepted for publication in MNRAS

arXiv:2012.11380 [pdf, other]

doi 10.1051/0004-6361/202039687

VHE gamma-ray detection of FSRQ QSO B1420+326 and modeling of its enhanced broadband state in 2020

Authors: V. A. Acciari, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, M. Artero, K. Asano, D. Baack, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, J. Becerra González, W. Bednarek, L. Bellizzi, E. Bernardini, M. Bernardos, A. Berti, J. Besenrieder, W. Bhattacharyya, C. Bigongiari, A. Biland, O. Blanch, G. Bonnoli, Ž. Bošnjak, G. Busetto , et al. (209 additional authors not shown)

Abstract: Context. QSO B1420+326 is a blazar classified as a Flat Spectrum Radio Quasar (FSRQ). In the beginning of 2020 it underwent an enhanced flux state. An extensive multiwavelength campaign allowed us to trace the evolution of the flare. Aims. We search for VHE gamma-ray emission from QSO B1420+326 during this flaring state. We aim to characterize and model the broadband emission of the source over di… ▽ More Context. QSO B1420+326 is a blazar classified as a Flat Spectrum Radio Quasar (FSRQ). In the beginning of 2020 it underwent an enhanced flux state. An extensive multiwavelength campaign allowed us to trace the evolution of the flare. Aims. We search for VHE gamma-ray emission from QSO B1420+326 during this flaring state. We aim to characterize and model the broadband emission of the source over different phases of the flare. Methods. The source was observed with a number of instruments in radio, near infrared, optical (including polarimetry and spectroscopy), ultra-violet, X-ray and gamma-ray bands. We use dedicated optical spectroscopy results to estimate the accretion disk and the dust torus luminosity. We perform spectral energy distribution modeling in the framework of combined Synchrotron-Self-Compton and External Compton scenario in which the electron energy distribution is partially determined from acceleration and cooling processes. Results. During the enhanced state the flux of both SED components drastically increased and the peaks were shifted to higher energies. Follow up observations with the MAGIC telescopes led to the detection of very-high-energy gamma-ray emission from this source, making it one of only a handful of FSRQs known in this energy range. Modeling allows us to constrain the evolution of the magnetic field and electron energy distribution in the emission region. The gamma-ray flare was accompanied by a rotation of the optical polarization vector during a low polarization state. Also, a new, superluminal radio knot contemporaneously appeared in the radio image of the jet. The optical spectroscopy shows a prominent FeII bump with flux evolving together with the continuum emission and a MgII line with varying equivalent width. △ Less

Submitted 21 December, 2020; originally announced December 2020.

Comments: 20 pages, 15 figures, 7 tables, accepted for publication in A&A

Journal ref: A&A 647, A163 (2021)

arXiv:2012.08813 [pdf]

doi 10.1117/12.2561339

Mechanical aspects of Near-Infrared Imager Spectrometer and Polarimeter

Authors: Prashanth Kumar Kasarla, Pitamber Singh Patwal, Hitesh Kumar L. Adalja, Satya Narain Mathur, Deekshya Roy Sarkar, Alka Singh, Archita Rai, Prachi Vinod Prajapati, Sachindra Naik, Amish B. Shah, Shashikiran Ganesh, Kiran S. Baliyan

Abstract: Near-infrared Imager Spectrometer and Polarimeter (NISP) is a camera, an intermediate resolution spectrograph and an imaging polarimeter being developed for upcoming 2.5m telescope of Physical Research Laboratory at Mount Abu, India. NISP is designed to work in the Near-IR (0.8-2.5 micron) using a H2RG detector. Collimator and camera lenses would transfer the image from the focal plane of the tele… ▽ More Near-infrared Imager Spectrometer and Polarimeter (NISP) is a camera, an intermediate resolution spectrograph and an imaging polarimeter being developed for upcoming 2.5m telescope of Physical Research Laboratory at Mount Abu, India. NISP is designed to work in the Near-IR (0.8-2.5 micron) using a H2RG detector. Collimator and camera lenses would transfer the image from the focal plane of the telescope to the detector plane. The entire optics, mechanical support structures, detector-SIDECAR assembly will be cooled to cryo-temperatures using an open cycle Liquid Nitrogen tank inside a vacuum Dewar. GFRP support structures would be used to isolate cryogenic system from the Dewar. Two layer thermal shielding would be used to reduce the radiative heat transfer. Molecular sieve (getter) would be used to enhance the vacuum level inside Dewar. Magnet-reedswitch combination are used for absolute positioning of filterwheels. Here we describe the mechanical aspects in detail. △ Less

Submitted 16 December, 2020; originally announced December 2020.

Comments: 11 pages, 12 figures, Submitted to SPIE Conference Astronomical Telescopes + Instrumentation 2020

Journal ref: Proc. SPIE 11447, Ground-based and Airborne Instrumentation for Astronomy VIII, 114476U, 2020

Showing 1–50 of 115 results for author: Ganesh, S