Skip to main content

Showing 1–5 of 5 results for author: Sasaki, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2209.15452  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Safe Exploration Method for Reinforcement Learning under Existence of Disturbance

    Authors: Yoshihiro Okawa, Tomotake Sasaki, Hitoshi Yanami, Toru Namerikawa

    Abstract: Recent rapid developments in reinforcement learning algorithms have been giving us novel possibilities in many fields. However, due to their exploring property, we have to take the risk into consideration when we apply those algorithms to safety-critical problems especially in real environments. In this study, we deal with a safe exploration problem in reinforcement learning under the existence of… ▽ More

    Submitted 20 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Accepted by the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD) 2022. The Version of Record is available at https://doi.org/10.1007/978-3-031-26412-2_9

  2. arXiv:2107.12632  [pdf, other

    physics.ins-det eess.IV hep-ex

    INTPIX4NA -- new integration-type silicon-on-insulator pixel detector for imaging application

    Authors: R. Nishimura, S. Kishimoto, T. Sasaki, S. Mitsui, M. Shinya, Y. Arai, T. Miyoshi

    Abstract: INTPIX4NA is an integration-type silicon-on-insulator pixel detector. This detector has a 14.1 x 8.7 mm^2 sensitive area, 425,984 (832 column x 512 row matrix) pixels and the pixel size is 17 x 17 um^2. This detector was developed for residual stress measurement using X-rays (the cos alpha method). The performance of INTPIX4NA was tested with the synchrotron beamlines of the Photon Factory (KEK),… ▽ More

    Submitted 14 January, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at JINST (2022/01/14 Typo correction ver.)

    Journal ref: 2021 JINST 16 P08054

  3. Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator

    Authors: Mei Minami, Yuka Masumoto, Yoshihiro Okawa, Tomotake Sasaki, Yutaka Hori

    Abstract: In many practical control applications, the performance level of a closed-loop system degrades over time due to the change of plant characteristics. Thus, there is a strong need for redesigning a controller without going through the system modeling process, which is often difficult for closed-loop systems. Reinforcement learning (RL) is one of the promising approaches that enable model-free redesi… ▽ More

    Submitted 30 November, 2023; v1 submitted 5 March, 2021; originally announced March 2021.

    Journal ref: SICE Journal of Control, Measurement, and System Integration, vol. 16, no. 1, pp. 349--362, 2023

  4. arXiv:2103.03656  [pdf, ps, other

    cs.LG cs.AI eess.SY stat.ML

    Automatic Exploration Process Adjustment for Safe Reinforcement Learning with Joint Chance Constraint Satisfaction

    Authors: Yoshihiro Okawa, Tomotake Sasaki, Hidenao Iwane

    Abstract: In reinforcement learning (RL) algorithms, exploratory control inputs are used during learning to acquire knowledge for decision making and control, while the true dynamics of a controlled object is unknown. However, this exploring property sometimes causes undesired situations by violating constraints regarding the state of the controlled object. In this paper, we propose an automatic exploration… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: Accepted to the 21st IFAC World Congress (IFAC-V 2020)

  5. arXiv:1812.05501  [pdf, other

    eess.SP cs.LG physics.data-an stat.ML

    Bayesian Spectral Deconvolution Based on Poisson Distribution: Bayesian Measurement and Virtual Measurement Analytics (VMA)

    Authors: Kenji Nagata, Yoh-ichi Mototake, Rei Muraoka, Takehiko Sasaki, Masato Okada

    Abstract: In this paper, we propose a new method of Bayesian measurement for spectral deconvolution, which regresses spectral data into the sum of unimodal basis function such as Gaussian or Lorentzian functions. Bayesian measurement is a framework for considering not only the target physical model but also the measurement model as a probabilistic model, and enables us to estimate the parameter of a physica… ▽ More

    Submitted 11 December, 2018; originally announced December 2018.

    Comments: 8 pages, 8 figures