Skip to main content

Showing 1–5 of 5 results for author: Walid, A

Searching in archive stat. Search in all archives.
.
  1. Regenerative Particle Thompson Sampling

    Authors: Zeyu Zhou, Bruce Hajek, Nakjung Choi, Anwar Walid

    Abstract: This paper proposes regenerative particle Thompson sampling (RPTS), a flexible variation of Thompson sampling. Thompson sampling itself is a Bayesian heuristic for solving stochastic bandit problems, but it is hard to implement in practice due to the intractability of maintaining a continuous posterior distribution. Particle Thompson sampling (PTS) is an approximation of Thompson sampling obtained… ▽ More

    Submitted 22 January, 2024; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Mainbody 14 pages, appendix 32 pages, 16 figures

    Journal ref: "Particle Thompson Sampling with Static Particles" and "Improving Particle Thompson Sampling through Regenerative Particles," 2023 57th Annual Conference on Information Sciences and Systems (CISS), Baltimore, MD, USA, 2023

  2. arXiv:1906.05015  [pdf, other

    cs.LG cs.AI cs.RO eess.SY stat.ML

    Deep Reinforcement Learning for Unmanned Aerial Vehicle-Assisted Vehicular Networks

    Authors: Ming Zhu, Xiao-Yang Liu, Anwar Walid

    Abstract: Unmanned aerial vehicles (UAVs) are envisioned to complement the 5G communication infrastructure in future smart cities. Hot spots easily appear in road intersections, where effective communication among vehicles is challenging. UAVs may serve as relays with the advantages of low price, easy deployment, line-of-sight links, and flexible mobility. In this paper, we study a UAV-assisted vehicular ne… ▽ More

    Submitted 14 February, 2023; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: 28 pages

  3. arXiv:1812.00979  [pdf, other

    cs.LG stat.ML

    Deep Reinforcement Learning for Intelligent Transportation Systems

    Authors: Xiao-Yang Liu, Zihan Ding, Sem Borst, Anwar Walid

    Abstract: Intelligent Transportation Systems (ITSs) are envisioned to play a critical role in improving traffic flow and reducing congestion, which is a pervasive issue impacting urban areas around the globe. Rapidly advancing vehicular communication and edge cloud computation technologies provide key enablers for smart traffic management. However, operating viable real-time actuation mechanisms on a practi… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

  4. arXiv:1811.07522  [pdf, other

    cs.LG q-fin.TR stat.ML

    Practical Deep Reinforcement Learning Approach for Stock Trading

    Authors: Xiao-Yang Liu, Zhuoran Xiong, Shan Zhong, Hongyang Yang, Anwar Walid

    Abstract: Stock trading strategy plays a crucial role in investment companies. However, it is challenging to obtain optimal strategy in the complex and dynamic stock market. We explore the potential of deep reinforcement learning to optimize stock trading strategy and thus maximize investment return. 30 stocks are selected as our trading stocks and their daily prices are used as the training and trading mar… ▽ More

    Submitted 30 July, 2022; v1 submitted 19 November, 2018; originally announced November 2018.

  5. arXiv:1811.07342  [pdf, other

    cs.LG cs.AI stat.ML

    Transform-Based Multilinear Dynamical System for Tensor Time Series Analysis

    Authors: Weijun Lu, Xiao-Yang Liu, Qingwei Wu, Yue Sun, Anwar Walid

    Abstract: We propose a novel multilinear dynamical system (MLDS) in a transform domain, named $\mathcal{L}$-MLDS, to model tensor time series. With transformations applied to a tensor data, the latent multidimensional correlations among the frontal slices are built, and thus resulting in the computational independence in the transform domain. This allows the exact separability of the multi-dimensional probl… ▽ More

    Submitted 18 November, 2018; originally announced November 2018.