Skip to main content

Showing 1–50 of 53 results for author: Ao, L

.
  1. arXiv:2403.17552  [pdf, other

    cs.CL

    Naive Bayes-based Context Extension for Large Language Models

    Authors: Jianlin Su, Murtadha Ahmed, Wenbo, Luo Ao, Mingren Zhu, Yunfeng Liu

    Abstract: Large Language Models (LLMs) have shown promising in-context learning abilities. However, conventional In-Context Learning (ICL) approaches are often impeded by length limitations of transformer architecture, which pose challenges when attempting to effectively integrate supervision from a substantial number of demonstration examples. In this paper, we introduce a novel framework, called Naive Bay… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to main NAACL 2024

  2. arXiv:2401.07462  [pdf, other

    hep-ex physics.ins-det

    Nonproportionality of NaI(Tl) Scintillation Detector for Dark Matter Search Experiments

    Authors: S. M. Lee, G. Adhikari, N. Carlin, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Fran. a, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, S. W. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim , et al. (37 additional authors not shown)

    Abstract: We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced… ▽ More

    Submitted 10 May, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: 12 pages, 7 figures

    Journal ref: Eur. Phys. J. C 84 (2024) 484

  3. arXiv:2311.18699  [pdf, other

    stat.ME

    Gaussian processes Correlated Bayesian Additive Regression Trees

    Authors: Xuetao Lu a, Robert E. McCulloch

    Abstract: In recent years, Bayesian Additive Regression Trees (BART) has garnered increased attention, leading to the development of various extensions for diverse applications. However, there has been limited exploration of its utility in analyzing correlated data. This paper introduces a novel extension of BART, named Correlated BART (CBART). Unlike the original BART with independent errors, CBART is spec… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  4. Photochemistry and Haze Formation

    Authors: Mandt K. E., Luspay-Kuti A., Cheng A., Jessup K. -L., Gao P

    Abstract: One of the many exciting revelations of the New Horizons flyby of Pluto was the observation of global haze layers at altitudes as high as 200 km in the visible wavelengths. This haze is produced in the upper atmosphere through photochemical processes, similar to the processes in Titan's atmosphere. As the haze particles grow in size and descend to the lower atmosphere, they coagulate and interact… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    MSC Class: 85-01

    Journal ref: In Pluto System After New Horizons (S. A. Stern, R. P. Binzel, W. M. Grundy, J. M. Moore, and L. A. Young, eds.), Univ. of Arizona, Tucson (2021)

  5. arXiv:2310.18743  [pdf, other

    cs.LG

    Optimization of utility-based shortfall risk: A non-asymptotic viewpoint

    Authors: Sumedh Gupte, Prashanth L. A., Sanjay P. Bhat

    Abstract: We consider the problems of estimation and optimization of utility-based shortfall risk (UBSR), which is a popular risk measure in finance. In the context of UBSR estimation, we derive a non-asymptotic bound on the mean-squared error of the classical sample average approximation (SAA) of UBSR. Next, in the context of UBSR optimization, we derive an expression for the UBSR gradient under a smooth p… ▽ More

    Submitted 30 March, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

  6. arXiv:2308.16386  [pdf

    cs.CV

    RGB-T Tracking via Multi-Modal Mutual Prompt Learning

    Authors: Yang Luo, Xiqing Guo, Hui Feng, Lei Ao

    Abstract: Object tracking based on the fusion of visible and thermal im-ages, known as RGB-T tracking, has gained increasing atten-tion from researchers in recent years. How to achieve a more comprehensive fusion of information from the two modalities with fewer computational costs has been a problem that re-searchers have been exploring. Recently, with the rise of prompt learning in computer vision, we can… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 9 pages, 5 figures, 5 tables

  7. arXiv:2307.16495  [pdf

    physics.flu-dyn physics.app-ph

    Through-chip microchannels for three-dimensional integrated circuits cooling

    Authors: Lihong Ao, Aymeric Ramiere

    Abstract: Cooling high-power electronics in multilayer integrated circuits (ICs) is challenging for existing cooling methods. In this work, we designed through-chip microchannels (TCMCs) that cross the entire chip perpendicularly to the layers, with water circulating inside to provide direct cooling to each layer. TCMCs are organized in a square array where the pitch and radius of the microchannels are expl… ▽ More

    Submitted 19 December, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

    Journal ref: Thermal Science and Engineering Progress, 102333 (2024)

  8. Searching for Milky Way twins: Radial abundance distribution as a strict criterion

    Authors: Pilyugin L. S., Tautvaisiene G., Lara-Lopez M. A

    Abstract: We search for Milky Way-like galaxies among a sample of approximately 500 galaxies. The characteristics we considered of the candidate galaxies are the following: stellar mass M_star, optical radius R_25, rotation velocity V_rot, central oxygen abundance (O/H)_0, and abundance at the optical radius (O/H)_R25. If the values of R_25 and M_star of the galaxy were close to that of the Milky Way, then… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted to Astronomy and Astrophysics, 28 pages, 13 figures

    Journal ref: A&A 676, A57 (2023)

  9. arXiv:2304.10951  [pdf, ps, other

    cs.LG math.OC stat.ML

    A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning

    Authors: Mizhaan Prajit Maniyar, Akash Mondal, Prashanth L. A., Shalabh Bhatnagar

    Abstract: We consider the problem of control in the setting of reinforcement learning (RL), where model information is not available. Policy gradient algorithms are a popular solution approach for this problem and are usually shown to converge to a stationary point of the value function. In this paper, we propose two policy Newton algorithms that incorporate cubic regularization. Both algorithms employ the… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  10. arXiv:2211.05910  [pdf, other

    eess.IV cs.CV

    Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, **gang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, **woo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li , et al. (71 additional authors not shown)

    Abstract: Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256

  11. arXiv:2210.05918  [pdf, ps, other

    cs.LG cs.AI eess.SY stat.ML

    Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation

    Authors: Gandharv Patil, Prashanth L. A., Dheeraj Nagaraj, Doina Precup

    Abstract: We study the finite-time behaviour of the popular temporal difference (TD) learning algorithm when combined with tail-averaging. We derive finite time bounds on the parameter error of the tail-averaged TD iterate under a step-size choice that does not require information about the eigenvalues of the matrix underlying the projected TD fixed point. Our analysis shows that tail-averaged TD converges… ▽ More

    Submitted 11 September, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Journal ref: Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, Proceedings of Machine Learning Research, 2023

  12. Calibration-based abundances in the interstellar gas of galaxies from slit and IFU spectra

    Authors: Pilyugin L. S., Lara-Lopez M. A., Vilchez J. M., Duarte Puertas S., Zinchenko I. A., Dors O. L

    Abstract: In this work we make use of available Integral Field Unit (IFU) spectroscopy and slit spectra of several nearby galaxies. The pre-existing empirical R and S calibrations for abundance determinations are constructed using a sample of HII regions with high quality slit spectra. In this paper, we test the applicability of those calibrations to the IFU spectra. We estimate the calibration-based abunda… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 15 pages, 14 figures, accepted to the Astronomy and Astrophysics

    Journal ref: A&A 668, A5 (2022)

  13. The ASTRI Mini-Array of Cherenkov Telescopes at the Observatorio del Teide

    Authors: Scuderi S., Giuliani A., Pareschi G., Tosti G., Catalano O., Amato E., Antonelli L. A., Becerra Gonzáles J., Bellassai G., Bigongiari, C., Biondo B., Böttcher M., Bonanno G., Bonnoli G., Bruno P., Bulgarelli A., Canestrari R., Capalbi M., Caraveo P., Cardillo M., Conforti V., Contino G., Corpora M., Costa A. , et al. (73 additional authors not shown)

    Abstract: The ASTRI Mini-Array (MA) is an INAF project to build and operate a facility to study astronomical sources emitting at very high-energy in the TeV spectral band. The ASTRI MA consists of a group of nine innovative Imaging Atmospheric Cherenkov telescopes. The telescopes will be installed at the Teide Astronomical Observatory of the Instituto de Astrofisica de Canarias (IAC) in Tenerife (Canary Isl… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: 19 pages, 22 figures

    Journal ref: Journal of High Energy Astrophysics, Volume 35, p. 52-68 (2022)

  14. arXiv:2208.00290  [pdf, ps, other

    math.OC cs.LG

    A Gradient Smoothed Functional Algorithm with Truncated Cauchy Random Perturbations for Stochastic Optimization

    Authors: Akash Mondal, Prashanth L. A., Shalabh Bhatnagar

    Abstract: In this paper, we present a stochastic gradient algorithm for minimizing a smooth objective function that is an expectation over noisy cost samples, and only the latter are observed for any given parameter. Our algorithm employs a gradient estimation scheme with random perturbations, which are formed using the truncated Cauchy distribution from the delta sphere. We analyze the bias and variance of… ▽ More

    Submitted 30 June, 2023; v1 submitted 30 July, 2022; originally announced August 2022.

  15. arXiv:2205.05843  [pdf, ps, other

    stat.ML cs.IT cs.LG

    A Survey of Risk-Aware Multi-Armed Bandits

    Authors: Vincent Y. F. Tan, Prashanth L. A., Krishna Jagannathan

    Abstract: In several applications such as clinical trials and financial portfolio optimization, the expected value (or the average reward) does not satisfactorily capture the merits of a drug or a portfolio. In such applications, risk plays a crucial role, and a risk-aware performance measure is preferable, so as to capture losses in the case of adverse events. This survey aims to consolidate and summarise… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: 11 pages; Unabridged version of a a survey paper of the same title accepted to IJCAI-ECAI, 2022

  16. arXiv:2204.11026  [pdf

    q-bio.BM

    Bioinformatic analysis for structure and function of Glutamine synthetase(GS)

    Authors: Jiahao Ma, Guotong Xu, Le Ao, Siqi Chen, **gze Liu

    Abstract: Objective: To predict structure and function of Glutamine synthetase (GS) from Pseudoalteromonas sp. by bioinformatics technology, and to provide a theoretical basis for further study. Methods: Open reading frame (ORF) of GS sequence from Pseudoalteromonas sp. was obtained by ORF finder and was translated into amino acid residue. The structure domain was analyzed by Blast. By the method of analysi… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: 8 pages, 8 figures

  17. arXiv:2202.11046  [pdf, other

    cs.LG

    A policy gradient approach for optimization of smooth risk measures

    Authors: Nithia Vijayan, Prashanth L. A

    Abstract: We propose policy gradient algorithms for solving a risk-sensitive reinforcement learning (RL) problem in on-policy as well as off-policy settings. We consider episodic Markov decision processes, and model the risk using the broad class of smooth risk measures of the cumulative discounted reward. We propose two template policy gradient algorithms that optimize a smooth risk measure in on-policy an… ▽ More

    Submitted 23 June, 2024; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:2107.04422

  18. Local heating due to convective overshooting and the solar modelling problem

    Authors: Baraffe I, Constantino T, Clarke J, Le Saux A, Goffrey T, Guillet T, Pratt J, Vlaykov D. G

    Abstract: Recent hydrodynamical simulations of convection in a solar-like model suggest that penetrative convective flows at the boundary of the convective envelope modify the thermal background in the overshooting layer. Based on these results, we implement in one-dimensional stellar evolution codes a simple prescription to modify the temperature gradient below the convective boundary of a solar model. Thi… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: 7 pages, 4 figures, accepted for publication in A&A

    Journal ref: A&A 659, A53 (2022)

  19. arXiv:2111.00481  [pdf, other

    physics.ins-det gr-qc

    Measurements of thermal relaxation of the OGRAN underground setup

    Authors: Gavrilyuk Y. M., Gusev A. V., Kvashnin N. L., Lugovoy A. A., Oreshkin S. I., Popov S. M., Rudenko V. N., Semenov V. V., Syrovatsky I. A

    Abstract: An upgraded version of the OGRAN -- combined optical-acoustic gravitational wave detector -- has been investigated in a long-term operation mode. This installation, located at the Baksan Neutrino Observatory (BNO) INR RAS, is designed to work under the program for detecting collapsing stars in parallel with the neutrino detector: Baksan Underground Scintillation Telescope (BUST). Such joint search… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

  20. Current status of PAPYRUS : the pyramid based adaptive optics system at LAM/OHP

    Authors: Muslimov E., Levraud N., Chambouleyron V., Boudjema I., Lau A., Caillat A., Pedreros F., Otten G., El Hadi K., Joaquina K., Lopez M., El Morsy M., Beltramo Martin O., Fetick R., Ke Z., Sauvage J-F., Neichel B., Fusco T., Schmitt J., Le Van Suu A., Charton J., Schimpf A., Martin B., Dintrono F., Esposito S. , et al. (1 additional authors not shown)

    Abstract: The Provence Adaptive optics Pyramid Run System (PAPYRUS) is a pyramid-based Adaptive Optics (AO) system that will be installed at the Coude focus of the 1.52m telescope (T152) at the Observatoire de Haute Provence (OHP). The project is being developed by PhD students and Postdocs across France with support from staff members consolidating the existing expertise and hardware into an R&D testbed. T… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: 19 pages, 11 figures

    Journal ref: Proc. SPIE 11876, Optical Instrument Science, Technology, and Applications II, 118760H (24 September 2021);

  21. arXiv:2107.04422  [pdf, other

    cs.LG

    Policy Gradient Methods for Distortion Risk Measures

    Authors: Nithia Vijayan, Prashanth L. A

    Abstract: We propose policy gradient algorithms which learn risk-sensitive policies in a reinforcement learning (RL) framework. Our proposed algorithms maximize the distortion risk measure (DRM) of the cumulative reward in an episodic Markov decision process in on-policy and off-policy RL settings, respectively. We derive a variant of the policy gradient theorem that caters to the DRM objective, and integra… ▽ More

    Submitted 4 February, 2024; v1 submitted 9 July, 2021; originally announced July 2021.

  22. arXiv:2106.11331  [pdf, other

    astro-ph.EP

    Exploiting timing capabilities of the CHEOPS mission with warm-Jupiter planets

    Authors: Borsato L, Piotto G, Gandolfi D, Nascimbeni V, Lacedelli G, Marzari F, Billot N, Maxted P, Sousa S G, Cameron A C, Bonfanti A, Wilson T, Serrano L, Garai Z, Alibert Y, Alonso R, Asquier J, Bárczy T, Bandy T, Barrado D, Barros S C, Baumjohann W, Beck M, Beck T, Benz W , et al. (53 additional authors not shown)

    Abstract: We present 17 transit light curves of seven known warm-Jupiters observed with the CHaracterising ExOPlanet Satellite (CHEOPS). The light curves have been collected as part of the CHEOPS Guaranteed Time Observation (GTO) program that searches for transit-timing variation (TTV) of warm-Jupiters induced by a possible external perturber to shed light on the evolution path of such planetary systems. We… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: 23 pages, 19 figures, 8 tables. Accepted for publication in MNRAS

  23. arXiv:2101.02137  [pdf, other

    cs.LG

    Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint

    Authors: Nithia Vijayan, Prashanth L. A

    Abstract: We propose two policy gradient algorithms for solving the problem of control in an off-policy reinforcement learning (RL) context. Both algorithms incorporate a smoothed functional (SF) based gradient estimation scheme. The first algorithm is a straightforward combination of importance sampling-based off-policy evaluation with SF-based gradient estimation. The second algorithm, inspired by the sto… ▽ More

    Submitted 23 June, 2024; v1 submitted 6 January, 2021; originally announced January 2021.

  24. arXiv:2011.14280  [pdf, other

    cs.CL cs.IR cs.LG

    A Novel Sentiment Analysis Engine for Preliminary Depression Status Estimation on Social Media

    Authors: Sudhir Kumar Suman, Hrithwik Shalu, Lakshya A Agrawal, Archit Agrawal, Juned Kadiwala

    Abstract: Text sentiment analysis for preliminary depression status estimation of users on social media is a widely exercised and feasible method, However, the immense variety of users accessing the social media websites and their ample mix of vocabularies makes it difficult for commonly applied deep learning-based classifiers to perform. To add to the situation, the lack of adaptability of traditional supe… ▽ More

    Submitted 28 November, 2020; originally announced November 2020.

  25. arXiv:2011.06273  [pdf, ps, other

    math.NT

    Algebraic properties of summation of exponential Taylor polynomials

    Authors: Lingfeng Ao, Shaofang Hong

    Abstract: Let $n\ge 1$ be an integer and $e_n(x)$ denote the truncated exponential Taylor polynomial, i.e. $e_{n}(x)=\sum_{i=0}^n\frac{x^i}{i!}$. A well-known theorem of Schur states that the Galois group of $e_n(x)$ over $\Q$ is the alternating group $A_n$ if $n$ is divisible by 4 or the symmetric group $S_n$ otherwise. In this paper, we study algebraic properties of the summation of two truncated exponent… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: 14 pages

  26. arXiv:2011.05163  [pdf, other

    cs.CR cs.NI

    Amadeus: Scalable, Privacy-Preserving Live Video Analytics

    Authors: Sandeep Dsouza, Victor Bahl, Lixiang Ao, Landon P. Cox

    Abstract: Smart-city applications ranging from traffic management to public-safety alerts rely on live analytics of video from surveillance cameras in public spaces. However, a growing number of government regulations stipulate how data collected from these cameras must be handled in order to protect citizens' privacy. This paper describes Amadeus, which balances privacy and utility by redacting video in ne… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: 17 pages, 19 figures

    ACM Class: D.4.7

  27. arXiv:2002.11440  [pdf, ps, other

    cs.LG math.OC stat.ML

    Non-asymptotic bounds for stochastic optimization with biased noisy gradient oracles

    Authors: Nirav Bhavsar, Prashanth L. A

    Abstract: We introduce biased gradient oracles to capture a setting where the function measurements have an estimation error that can be controlled through a batch size parameter. Our proposed oracles are appealing in several practical contexts, for instance, risk measure estimation from a batch of independent and identically distributed (i.i.d.) samples, or simulation optimization, where the function measu… ▽ More

    Submitted 16 May, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

  28. arXiv:1912.10398  [pdf, other

    cs.LG stat.ML

    Estimation of Spectral Risk Measures

    Authors: Ajay Kumar Pandey, Prashanth L. A., Sanjay P. Bhat

    Abstract: We consider the problem of estimating a spectral risk measure (SRM) from i.i.d. samples, and propose a novel method that is based on numerical integration. We show that our SRM estimate concentrates exponentially, when the underlying distribution has bounded support. Further, we also consider the case when the underlying distribution is either Gaussian or exponential, and derive a concentration bo… ▽ More

    Submitted 22 December, 2019; originally announced December 2019.

  29. arXiv:1904.00092  [pdf, other

    cond-mat.other physics.optics

    $\mathcal{PT}$-symmetric tight-binding model with asymmetric couplings

    Authors: Moreno-Rodríguez L. A., Izrailev F. M., Méndez-Bermúdez J. A

    Abstract: We study spectral and transport properties of one-dimensional tight-binding $\mathcal{PT}$-symmetric chains with alternating couplings. Based on the transfer matrix method, we have analytically developed the expressions for the transmission and reflection coefficients for any values of control parameters. These expressions are obtained in a very compact form which separately imbed the generic ener… ▽ More

    Submitted 29 March, 2019; originally announced April 2019.

  30. arXiv:1902.10709  [pdf, ps, other

    math.ST cs.LG stat.ML

    A Wasserstein distance approach for concentration of empirical risk estimates

    Authors: Prashanth L. A., Sanjay P. Bhat

    Abstract: This paper presents a unified approach based on Wasserstein distance to derive concentration bounds for empirical estimates for two broad classes of risk measures defined in the paper. The classes of risk measures introduced include as special cases well known risk measures from the finance literature such as conditional value at risk (CVaR), optimized certainty equivalent risk, spectral risk meas… ▽ More

    Submitted 10 May, 2022; v1 submitted 27 February, 2019; originally announced February 2019.

  31. arXiv:1902.02953  [pdf, ps, other

    cs.LG stat.ML

    Correlated bandits or: How to minimize mean-squared error online

    Authors: Vinay Praneeth Boda, Prashanth L. A

    Abstract: While the objective in traditional multi-armed bandit problems is to find the arm with the highest mean, in many settings, finding an arm that best captures information about other arms is of interest. This objective, however, requires learning the underlying correlation structure and not just the means of the arms. Sensors placement for industrial surveillance and cellular network monitoring are… ▽ More

    Submitted 26 June, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

  32. A Wide Orbit Exoplanet OGLE-2012-BLG-0838Lb

    Authors: Poleski R., Suzuki D., Udalski A., Xie X., Yee J. C., Koshimoto N., Gaudi B. S., Gould A., Skowron J., Szymanski M. K., Soszynski I., Pietrukowicz P., Kozlowski S., Wyrzykowski L., Ulaczyk K., Abe F., Barry R. K., Bennett D. P., Bhattacharya A., Bond I. A., Donachie M., Fujii H., Fukui A., Itow Y., Hirao Y. , et al. (26 additional authors not shown)

    Abstract: We present the discovery of a planet on a very wide orbit in the microlensing event OGLE-2012-BLG-0838. The signal of the planet is well separated from the main peak of the event and the planet-star projected separation is found to be twice larger than the Einstein ring radius, which roughly corresponds to a projected separation of ~4 AU. Similar planets around low-mass stars are very hard to find… ▽ More

    Submitted 17 November, 2021; v1 submitted 16 January, 2019; originally announced January 2019.

    Comments: 26 pages, 11 figures

    Journal ref: Astronomical Journal, Volume 159, Issue 6, id.261, 16 pp. (2020)

  33. arXiv:1901.00997  [pdf, ps, other

    cs.LG stat.ML

    Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions

    Authors: Prashanth L. A., Krishna Jagannathan, Ravi Kumar Kolla

    Abstract: Conditional Value-at-Risk (CVaR) is a widely used risk metric in applications such as finance. We derive concentration bounds for CVaR estimates, considering separately the cases of light-tailed and heavy-tailed distributions. In the light-tailed case, we use a classical CVaR estimator based on the empirical distribution constructed from the samples. For heavy-tailed random variables, we assume a… ▽ More

    Submitted 25 August, 2019; v1 submitted 4 January, 2019; originally announced January 2019.

  34. arXiv:1810.09126  [pdf, ps, other

    cs.LG math.OC stat.ML

    Risk-Sensitive Reinforcement Learning via Policy Gradient Search

    Authors: Prashanth L. A., Michael Fu

    Abstract: The objective in a traditional reinforcement learning (RL) problem is to find a policy that optimizes the expected value of a performance metric such as the infinite-horizon cumulative discounted or long-run average cost/reward. In practice, optimizing the expected value alone may not be satisfactory, in that it may be desirable to incorporate the notion of risk into the optimization problem formu… ▽ More

    Submitted 23 May, 2022; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: To appear in "Foundations and Trends in Machine Learning"

  35. arXiv:1808.02871  [pdf, ps, other

    math.OC cs.LG

    Random directions stochastic approximation with deterministic perturbations

    Authors: Prashanth L A, Shalabh Bhatnagar, Nirav Bhavsar, Michael Fu, Steven I. Marcus

    Abstract: We introduce deterministic perturbation schemes for the recently proposed random directions stochastic approximation (RDSA) [17], and propose new first-order and second-order algorithms. In the latter case, these are the first second-order algorithms to incorporate deterministic perturbations. We show that the gradient and/or Hessian estimates in the resulting algorithms with deterministic perturb… ▽ More

    Submitted 28 March, 2019; v1 submitted 8 August, 2018; originally announced August 2018.

  36. arXiv:1808.01739  [pdf, ps, other

    cs.LG stat.ML

    Concentration bounds for empirical conditional value-at-risk: The unbounded case

    Authors: Ravi Kumar Kolla, Prashanth L. A., Sanjay P. Bhat, Krishna Jagannathan

    Abstract: In several real-world applications involving decision making under uncertainty, the traditional expected value objective may not be suitable, as it may be necessary to control losses in the case of a rare but extreme event. Conditional Value-at-Risk (CVaR) is a popular risk measure for modeling the aforementioned objective. We consider the problem of estimating CVaR from i.i.d. samples of an unbou… ▽ More

    Submitted 6 August, 2018; originally announced August 2018.

  37. arXiv:1708.04847  [pdf, ps, other

    quant-ph

    Unnormalized quasi-distributions and tomograms of quantum states

    Authors: Man'ko V. I., Markovich L. A

    Abstract: Tomograms and quasi-distribution functions like Wigner, Glauber - Sudarshan $P$- and Husimi $Q$- functions that violate the standard normalization condition are considered. Conditions under which a reconstruction of the density matrix using these tomograms and quasi-distribution functions is possible are obtained. Three different examples of states like the de Broglie plane wave, the Moschinsky sh… ▽ More

    Submitted 16 August, 2017; originally announced August 2017.

    Comments: 19 pages, no figures

  38. arXiv:1611.10283  [pdf, ps, other

    cs.LG stat.ML

    Bandit algorithms to emulate human decision making using probabilistic distortions

    Authors: Ravi Kumar Kolla, Prashanth L. A., Aditya Gopalan, Krishna Jagannathan, Michael Fu, Steve Marcus

    Abstract: Motivated by models of human decision making proposed to explain commonly observed deviations from conventional expected value preferences, we formulate two stochastic multi-armed bandit problems with distorted probabilities on the reward distributions: the classic $K$-armed bandit and the linearly parameterized bandit settings. We consider the aforementioned problems in the regret minimization as… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 November, 2016; originally announced November 2016.

    Comments: The material in this paper was presented in part at the 2017 AAAI Conference on Artificial Intelligence

  39. arXiv:1609.07087  [pdf, other

    cs.LG stat.ML

    (Bandit) Convex Optimization with Biased Noisy Gradient Oracles

    Authors: Xiaowei Hu, Prashanth L. A., András György, Csaba Szepesvári

    Abstract: Algorithms for bandit convex optimization and online learning often rely on constructing noisy gradient estimates, which are then used in appropriately adjusted first-order algorithms, replacing actual gradients. Depending on the properties of the function to be optimized and the nature of ``noise'' in the bandit feedback, the bias and variance of gradient estimates exhibit various tradeoffs. In t… ▽ More

    Submitted 4 July, 2020; v1 submitted 22 September, 2016; originally announced September 2016.

  40. f(Lovelock) theories of gravity

    Authors: Pablo Bueno, Pablo A. Cano, Oscar Lasso A., Pedro F. Ramirez

    Abstract: f(Lovelock) gravities are simple generalizations of the usual f(R) and Lovelock theories in which the gravitational action depends on some arbitrary function of the corresponding dimensionally-extended Euler densities. In this paper we study several aspects of these theories in general dimensions. We start by identifying the generalized boundary term which makes the gravitational variational probl… ▽ More

    Submitted 8 April, 2016; v1 submitted 23 February, 2016; originally announced February 2016.

    Comments: 46 pages, no figures; v3: minor modifications to match published version, references added

    Report number: IFT-UAM/CSIC-16-015

    Journal ref: JHEP 1604 (2016) 028

  41. arXiv:1507.07984  [pdf, ps, other

    cs.LG math.OC

    A constrained optimization perspective on actor critic algorithms and application to network routing

    Authors: Prashanth L. A., H. L. Prasad, Shalabh Bhatnagar, Prakash Chandra

    Abstract: We propose a novel actor-critic algorithm with guaranteed convergence to an optimal policy for a discounted reward Markov decision process. The actor incorporates a descent direction that is motivated by the solution of a certain non-linear optimization problem. We also discuss an extension to incorporate function approximation and demonstrate the practicality of our algorithms on a network routin… ▽ More

    Submitted 28 July, 2015; originally announced July 2015.

  42. arXiv:1506.02632  [pdf, other

    cs.LG math.OC

    Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control

    Authors: Prashanth L. A., Cheng Jie, Michael Fu, Steve Marcus, Csaba Szepesvári

    Abstract: Cumulative prospect theory (CPT) is known to model human decisions well, with substantial empirical evidence supporting this claim. CPT works by distorting probabilities and is more general than the classic expected utility and coherent risk measures. We bring this idea to a risk-sensitive reinforcement learning (RL) setting and design algorithms for both estimation and control. The RL setting pre… ▽ More

    Submitted 26 February, 2016; v1 submitted 8 June, 2015; originally announced June 2015.

  43. arXiv:1502.05577  [pdf, ps, other

    math.OC cs.LG

    Adaptive system optimization using random directions stochastic approximation

    Authors: Prashanth L. A., Shalabh Bhatnagar, Michael Fu, Steve Marcus

    Abstract: We present novel algorithms for simulation optimization using random directions stochastic approximation (RDSA). These include first-order (gradient) as well as second-order (Newton) schemes. We incorporate both continuous-valued as well as discrete-valued perturbations into both our algorithms. The former are chosen to be independent and identically distributed (i.i.d.) symmetric, uniformly distr… ▽ More

    Submitted 8 August, 2015; v1 submitted 19 February, 2015; originally announced February 2015.

  44. arXiv:1405.2690  [pdf, ps, other

    stat.ML cs.LG math.OC

    Policy Gradients for CVaR-Constrained MDPs

    Authors: Prashanth L. A.

    Abstract: We study a risk-constrained version of the stochastic shortest path (SSP) problem, where the risk measure considered is Conditional Value-at-Risk (CVaR). We propose two algorithms that obtain a locally risk-optimal policy by employing four tools: stochastic approximation, mini batches, policy gradients and importance sampling. Both the algorithms incorporate a CVaR estimation procedure, along the… ▽ More

    Submitted 12 May, 2014; originally announced May 2014.

  45. arXiv:1403.6530  [pdf, other

    cs.LG math.OC stat.ML

    Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs

    Authors: Prashanth L. A., Mohammad Ghavamzadeh

    Abstract: In many sequential decision-making problems we may want to manage risk by minimizing some measure of variability in rewards in addition to maximizing a standard criterion. Variance related risk measures are among the most common risk-sensitive criteria in finance and operations research. However, optimizing many such criteria is known to be a hard problem. In this paper, we consider both discounte… ▽ More

    Submitted 18 March, 2015; v1 submitted 25 March, 2014; originally announced March 2014.

  46. arXiv:1312.7292  [pdf, ps, other

    eess.SY cs.LG

    Two Timescale Convergent Q-learning for Sleep--Scheduling in Wireless Sensor Networks

    Authors: Prashanth L. A., Abhranil Chatterjee, Shalabh Bhatnagar

    Abstract: In this paper, we consider an intrusion detection application for Wireless Sensor Networks (WSNs). We study the problem of scheduling the sleep times of the individual sensors to maximize the network lifetime while kee** the tracking error to a minimum. We formulate this problem as a partially-observable Markov decision process (POMDP) with continuous state-action spaces, in a manner similar to… ▽ More

    Submitted 23 March, 2014; v1 submitted 27 December, 2013; originally announced December 2013.

  47. arXiv:1307.3176  [pdf, other

    cs.LG stat.ML

    Fast gradient descent for drifting least squares regression, with application to bandits

    Authors: Nathaniel Korda, Prashanth L. A., Rémi Munos

    Abstract: Online learning algorithms require to often recompute least squares regression estimates of parameters. We study improving the computational complexity of such algorithms by using stochastic gradient descent (SGD) type schemes in place of classic regression solvers. We show that SGD schemes efficiently track the true solutions of the regression problems, even in the presence of a drift. This findi… ▽ More

    Submitted 20 November, 2014; v1 submitted 11 July, 2013; originally announced July 2013.

  48. arXiv:1112.0795  [pdf

    cs.NI cs.CR

    An Approach to Log Management: Prototy** a Design of Agent for Log Harvesting

    Authors: Mayol Arnao Reinaldo, Nuñez Luis A., Lobo Antonio

    Abstract: This paper describes a work in progress implementing a solution for harvesting and transporting information logs from network devices in a e-science environment. The system is composed for servers, agents, active devices and a transporting protocol. This document describes the state of development of agents. Agents capture logs from devices, normalize, reduce and cataloged them by using metadata.… ▽ More

    Submitted 4 December, 2011; originally announced December 2011.

  49. Symmetries of parabolic contact structures

    Authors: Lenka Zalabov\' a

    Abstract: We generalize the concept of locally symmetric spaces to parabolic contact structures. We show that symmetric normal parabolic contact structures are torsion--free and some types of them have to be locally flat. We prove that each symmetry given at a point with non--zero harmonic curvature is involutive. Finally we give restrictions on number of different symmetries which can exist at such a point… ▽ More

    Submitted 29 March, 2010; originally announced March 2010.

    Comments: 19 pages

    MSC Class: 53C15; 53A40; 53C05; 53C35

    Journal ref: Journal of Geometry and Physics, Volume 60, Issue 11, November 2010,1698-1709

  50. arXiv:astro-ph/0003314  [pdf, ps, other

    astro-ph

    Physics of Grain Alignment

    Authors: Lazarian A

    Abstract: Aligned grains provide one of the easiest ways to study magnetic fields in diffuse gas and molecular clouds. How reliable our conclusions about the inferred magnetic field depends critically on our understanding of the physics of grain alignment. Although grain alignment is a problem of half a century standing recent progress achieved in the field makes us believe that we are approaching the sol… ▽ More

    Submitted 21 March, 2000; originally announced March 2000.

    Comments: 10 pages, review for conference "Cosmic Evolution and Galaxy Formation"

    Journal ref: ASPConf.Ser.215:69,2000