Skip to main content

Showing 1–16 of 16 results for author: Ishii, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10306  [pdf, other

    cs.AI cs.GT cs.LG

    A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI

    Authors: Haruka Kita, Sotetsu Koyamada, Yotaro Yamaguchi, Shin Ishii

    Abstract: Contract bridge, a cooperative game characterized by imperfect information and multi-agent dynamics, poses significant challenges and serves as a critical benchmark in artificial intelligence (AI) research. Success in this domain requires agents to effectively cooperate with their partners. This study demonstrates that an appropriate combination of existing methods can perform surprisingly well in… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted version of IEEE CoG 2024

  2. arXiv:2406.00424  [pdf, other

    stat.ML cs.LG

    A Batch Sequential Halving Algorithm without Performance Degradation

    Authors: Sotetsu Koyamada, Soichiro Nishimori, Shin Ishii

    Abstract: In this paper, we investigate the problem of pure exploration in the context of multi-armed bandits, with a specific focus on scenarios where arms are pulled in fixed-size batches. Batching has been shown to enhance computational efficiency, but it can potentially lead to a degradation compared to the original sequential algorithm's performance due to delayed feedback and reduced adaptability. We… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted to RLC 2024

  3. arXiv:2304.09769  [pdf, other

    cs.AI

    End-to-End Policy Gradient Method for POMDPs and Explainable Agents

    Authors: Soichiro Nishimori, Sotetsu Koyamada, Shin Ishii

    Abstract: Real-world decision-making problems are often partially observable, and many can be formulated as a Partially Observable Markov Decision Process (POMDP). When we apply reinforcement learning (RL) algorithms to the POMDP, reasonable estimation of the hidden states can help solve the problems. Furthermore, explainable decision-making is preferable, considering their application to real-world tasks s… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: 10 pagee, 6 figures

  4. arXiv:2303.17503  [pdf, other

    cs.AI cs.LG

    Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning

    Authors: Sotetsu Koyamada, Shinri Okano, Soichiro Nishimori, Yu Murata, Keigo Habara, Haruka Kita, Shin Ishii

    Abstract: We propose Pgx, a suite of board game reinforcement learning (RL) environments written in JAX and optimized for GPU/TPU accelerators. By leveraging JAX's auto-vectorization and parallelization over accelerators, Pgx can efficiently scale to thousands of simultaneous simulations over accelerators. In our experiments on a DGX-A100 workstation, we discovered that Pgx can simulate RL environments 10-1… ▽ More

    Submitted 15 January, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

  5. arXiv:2102.03777  [pdf, other

    cs.HC cs.LG eess.SP

    EEGFuseNet: Hybrid Unsupervised Deep Feature Characterization and Fusion for High-Dimensional EEG with An Application to Emotion Recognition

    Authors: Zhen Liang, Rushuang Zhou, Li Zhang, Linling Li, Gan Huang, Zhiguo Zhang, Shin Ishii

    Abstract: How to effectively and efficiently extract valid and reliable features from high-dimensional electroencephalography (EEG), particularly how to fuse the spatial and temporal dynamic brain information into a better feature representation, is a critical issue in brain data analysis. Most current EEG studies work in a task driven manner and explore the valid EEG features with a supervised model, which… ▽ More

    Submitted 27 August, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Journal ref: IEEE Transactions on Neural Systems and Rehabilitation Engineering, 29(2021) 1913-1925

  6. arXiv:2004.10026  [pdf, other

    cs.HC

    ExerSense: Real-Tme Physical Exercise Segmentation, Classification, and Counting Algorithm Using an IMU Sensor

    Authors: Shun Ishii, Kizito Nkurikiyeyezu, Anna Yokokubo, Guillaume Lopez

    Abstract: Even though it is well known that physical exercises have numerous emotional and physical health benefits, maintaining a regular exercise routine is quite challenging. Fortunately, there exist technologies that promote physical activity. Nonetheless, almost all of these technologies only target a narrow set of physical activities (e.g., either running or walking but not both) and are only applicab… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

  7. arXiv:1908.00876  [pdf, other

    eess.IV cs.LG q-bio.NC stat.ML

    MarmoNet: a pipeline for automated projection map** of the common marmoset brain from whole-brain serial two-photon tomography

    Authors: Henrik Skibbe, Akiya Watakabe, Ken Nakae, Carlos Enrique Gutierrez, Hiromichi Tsukada, Junichi Hata, Takashi Kawase, Rui Gong, Alexander Woodward, Kenji Doya, Hideyuki Okano, Tetsuo Yamamori, Shin Ishii

    Abstract: Understanding the connectivity in the brain is an important prerequisite for understanding how the brain processes information. In the Brain/MINDS project, a connectivity study on marmoset brains uses two-photon microscopy fluorescence images of axonal projections to collect the neuron connectivity from defined brain regions at the mesoscopic scale. The processing of the images requires the detect… ▽ More

    Submitted 2 August, 2019; originally announced August 2019.

  8. arXiv:1711.06564  [pdf, other

    cs.CV

    Efficient Diverse Ensemble for Discriminative Co-Tracking

    Authors: Kourosh Meshgi, Shigeyuki Oba, Shin Ishii

    Abstract: Ensemble discriminative tracking utilizes a committee of classifiers, to label data samples, which are in turn, used for retraining the tracker to localize the target using the collective knowledge of the committee. Committee members could vary in their features, memory update schemes, or training data, however, it is inevitable to have committee members that excessively agree because of large ove… ▽ More

    Submitted 7 June, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: CVPR 2018 Submission

  9. arXiv:1706.10031  [pdf, other

    stat.ML cs.LG

    Neural Sequence Model Training via $α$-divergence Minimization

    Authors: Sotetsu Koyamada, Yuta Kikuchi, Atsunori Kanemura, Shin-ichi Maeda, Shin Ishii

    Abstract: We propose a new neural sequence model training method in which the objective function is defined by $α$-divergence. We demonstrate that the objective function generalizes the maximum-likelihood (ML)-based and reinforcement learning (RL)-based objective functions as special cases (i.e., ML corresponds to $α\to 0$ and RL to $α\to1$). We also show that the gradient of the objective function can be c… ▽ More

    Submitted 30 June, 2017; originally announced June 2017.

    Comments: 2017 ICML Workshop on Learning to Generate Natural Language (LGNL 2017)

  10. arXiv:1704.08821  [pdf, other

    cs.CV

    Active Collaborative Ensemble Tracking

    Authors: Kourosh Meshgi, Maryam Sadat Mirzaei, Shigeyuki Oba, Shin Ishii

    Abstract: A discriminative ensemble tracker employs multiple classifiers, each of which casts a vote on all of the obtained samples. The votes are then aggregated in an attempt to localize the target object. Such method relies on collective competence and the diversity of the ensemble to approach the target/non-target classification task from different views. However, by updating all of the ensemble using a… ▽ More

    Submitted 28 April, 2017; originally announced April 2017.

    Comments: AVSS 2017 Submission

  11. arXiv:1704.03976  [pdf, other

    stat.ML cs.LG

    Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning

    Authors: Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, Shin Ishii

    Abstract: We propose a new regularization method based on virtual adversarial loss: a new measure of local smoothness of the conditional label distribution given input. Virtual adversarial loss is defined as the robustness of the conditional label distribution around each input data point against local perturbation. Unlike adversarial training, our method defines the adversarial direction without label info… ▽ More

    Submitted 27 June, 2018; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: To be appeared in IEEE Transactions on Pattern Analysis and Machine Intelligence

  12. arXiv:1704.00299  [pdf, other

    cs.CV

    Efficient Version-Space Reduction for Visual Tracking

    Authors: Kourosh Meshgi, Shigeyuki Oba, Shin Ishii

    Abstract: Discrminative trackers, employ a classification approach to separate the target from its background. To cope with variations of the target shape and appearance, the classifier is updated online with different samples of the target and the background. Sample selection, labeling and updating the classifier is prone to various sources of errors that drift the tracker. We introduce the use of an effic… ▽ More

    Submitted 2 April, 2017; originally announced April 2017.

    Comments: CRV'17 Conference

  13. arXiv:1704.00083  [pdf, other

    cs.CV

    Efficient Asymmetric Co-Tracking using Uncertainty Sampling

    Authors: Kourosh Meshgi, Maryam Sadat Mirzaei, Shigeyuki Oba, Shin Ishii

    Abstract: Adaptive tracking-by-detection approaches are popular for tracking arbitrary objects. They treat the tracking problem as a classification task and use online learning techniques to update the object model. However, these approaches are heavily invested in the efficiency and effectiveness of their detectors. Evaluating a massive number of samples for each frame (e.g., obtained by a sliding window)… ▽ More

    Submitted 31 March, 2017; originally announced April 2017.

    Comments: Submitted to IEEE ICSIPA'2017

  14. arXiv:1507.00677  [pdf, other

    stat.ML cs.LG

    Distributional Smoothing with Virtual Adversarial Training

    Authors: Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, Ken Nakae, Shin Ishii

    Abstract: We propose local distributional smoothness (LDS), a new notion of smoothness for statistical model that can be used as a regularization term to promote the smoothness of the model distribution. We named the LDS based regularization as virtual adversarial training (VAT). The LDS of a model at an input datapoint is defined as the KL-divergence based robustness of the model distribution against local… ▽ More

    Submitted 11 June, 2016; v1 submitted 2 July, 2015; originally announced July 2015.

    Comments: Under review as a conference paper at ICLR 2016

  15. arXiv:1502.00093  [pdf, other

    stat.ML cs.LG q-bio.NC

    Deep learning of fMRI big data: a novel approach to subject-transfer decoding

    Authors: Sotetsu Koyamada, Yumi Shikauchi, Ken Nakae, Masanori Koyama, Shin Ishii

    Abstract: As a technology to read brain states from measurable brain activities, brain decoding are widely applied in industries and medical sciences. In spite of high demands in these applications for a universal decoder that can be applied to all individuals simultaneously, large variation in brain activities across individuals has limited the scope of many studies to the development of individual-specifi… ▽ More

    Submitted 31 January, 2015; originally announced February 2015.

  16. Principal Sensitivity Analysis

    Authors: Sotetsu Koyamada, Masanori Koyama, Ken Nakae, Shin Ishii

    Abstract: We present a novel algorithm (Principal Sensitivity Analysis; PSA) to analyze the knowledge of the classifier obtained from supervised machine learning techniques. In particular, we define principal sensitivity map (PSM) as the direction on the input space to which the trained classifier is most sensitive, and use analogously defined k-th PSM to define a basis for the input space. We train neural… ▽ More

    Submitted 11 March, 2015; v1 submitted 21 December, 2014; originally announced December 2014.