Showing 1–50 of 84 results for author: Kobayashi, K

Searching in archive cs.

  1. arXiv:2406.01098  [pdf, other]

    cs.LG stat.ML

    Learning Decision Trees and Forests with Algorithmic Recourse

    Authors: Kentaro Kanamori, Takuya Takagi, Ken Kobayashi, Yuichi Ike

    Abstract: This paper proposes a new algorithm for learning accurate tree-based models while ensuring the existence of recourse actions. Algorithmic Recourse (AR) aims to provide a recourse action for altering the undesired prediction result given by a model. Typical AR methods provide a reasonable action by solving an optimization task of minimizing the required effort among executable actions. In practice,…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 27 pages, 10 figures, to appear in the 41st International Conference on Machine Learning (ICML 2024)

  2. arXiv:2405.17492  [pdf, other]

    cs.SE cs.AI

    StatWhy: Formal Verification Tool for Statistical Hypothesis Testing Programs

    Authors: Yusuke Kawamoto, Kentaro Kobayashi, Kohei Suenaga

    Abstract: Statistical methods have been widely misused and misinterpreted in various scientific fields, raising significant concerns about the integrity of scientific research. To develop techniques to mitigate this problem, we propose a new method for formally specifying and automatically verifying the correctness of statistical programs. In this method, programmers are reminded to check the requirements f…

    Submitted 25 May, 2024; originally announced May 2024.

  3. arXiv:2403.09920  [pdf]

    eess.IV cs.AI cs.CV cs.CY

    Predicting Generalization of AI Colonoscopy Models to Unseen Data

    Authors: Joel Shor, Carson McNeil, Yotam Intrator, Joseph R Ledsam, Hiro-o Yamano, Daisuke Tsurumaru, Hiroki Kayama, Atsushi Hamabe, Koji Ando, Mitsuhiko Ota, Haruei Ogino, Hiroshi Nakase, Kaho Kobayashi, Masaaki Miyo, Eiji Oki, Ichiro Takemasa, Ehud Rivlin, Roman Goldenberg

    Abstract: Background: Generalizability of AI colonoscopy algorithms is important for wider adoption in clinical practice. However, current techniques for evaluating performance on unseen data require expensive and time-intensive labels. Methods…

    Submitted 22 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  4. arXiv:2312.09529  [pdf, other]

    eess.IV cs.CV

    Can Physician Judgment Enhance Model Trustworthiness? A Case Study on Predicting Pathological Lymph Nodes in Rectal Cancer

    Authors: Kazuma Kobayashi, Yasuyuki Takamizawa, Mototaka Miyake, Sono Ito, Lin Gu, Tatsuya Nakatsuka, Yu Akagi, Tatsuya Harada, Yukihide Kanemitsu, Ryuji Hamamoto

    Abstract: Explainability is key to enhancing artificial intelligence's trustworthiness in medicine. However, several issues remain concerning the actual benefit of explainable models for clinical decision-making. Firstly, there is a lack of consensus on an evaluation framework for quantitatively assessing the practical benefits that effective explainability should provide to practitioners. Secondly, physici…

    Submitted 14 December, 2023; originally announced December 2023.

  5. arXiv:2312.06833  [pdf]

    cs.LG cs.AI cs.CV cs.CY

    The unreasonable effectiveness of AI CADe polyp detectors to generalize to new countries

    Authors: Joel Shor, Hiro-o Yamano, Daisuke Tsurumaru, Yotami Intrator, Hiroki Kayama, Joe Ledsam, Atsushi Hamabe, Koji Ando, Mitsuhiko Ota, Haruei Ogino, Hiroshi Nakase, Kaho Kobayashi, Eiji Oki, Roman Goldenberg, Ehud Rivlin, Ichiro Takemasa

    Abstract: Background and aims…

    Submitted 17 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

  6. arXiv:2311.18207  [pdf, other]

    cs.LG cs.AI

    Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

    Authors: Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami, Ken Kobayashi, Kazuhide Nakata, Yuta Saito

    Abstract: Off-Policy Evaluation (OPE) aims to assess the effectiveness of counterfactual policies using only offline logged data and is often used to identify the top-k promising policies for deployment in online A/B tests. Existing evaluation metrics for OPE estimators primarily focus on the "accuracy" of OPE or that of downstream policy selection, neglecting risk-return tradeoff in the subsequent online p…

    Submitted 10 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: ICLR2024

  7. arXiv:2311.18206  [pdf, other]

    cs.LG cs.AI

    SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

    Authors: Haruka Kiyohara, Ren Kishimoto, Kosuke Kawakami, Ken Kobayashi, Kazuhide Nakata, Yuta Saito

    Abstract: This paper introduces SCOPE-RL, a comprehensive open-source Python software designed for offline reinforcement learning (offline RL), off-policy evaluation (OPE), and selection (OPS). Unlike most existing libraries that focus solely on either policy learning or evaluation, SCOPE-RL seamlessly integrates these two key aspects, facilitating flexible and complete implementations of both offline RL an…

    Submitted 10 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: preprint, open-source software: https://github.com/hakuhodo-technologies/scope-rl

  8. arXiv:2309.09627  [pdf, other]

    cs.SD eess.AS

    Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders

    Authors: Lester Phillip Violeta, Wen-Chin Huang, Ding Ma, Ryuichi Yamamoto, Kazuhiro Kobayashi, Tomoki Toda

    Abstract: We propose a novel framework for electrolaryngeal speech intelligibility enhancement through the use of robust linguistic encoders. Pretraining and fine-tuning approaches have proven to work well in this task, but in most cases, various mismatches, such as the speech type mismatch (electrolaryngeal vs. typical) or a speaker mismatch between the datasets used in each stage, can deteriorate the conv…

    Submitted 20 January, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP 2024. Demo page: lesterphillip.github.io/icassp2024_el_sie

  9. arXiv:2309.07598  [pdf, other]

    cs.SD eess.AS

    AAS-VC: On the Generalization Ability of Automatic Alignment Search based Non-autoregressive Sequence-to-sequence Voice Conversion

    Authors: Wen-Chin Huang, Kazuhiro Kobayashi, Tomoki Toda

    Abstract: Non-autoregressive (non-AR) sequence-to-sequence (seq2seq) models for voice conversion (VC) are attractive in their ability to effectively model the temporal structure while enjoying boosted intelligibility and fast inference thanks to non-AR modeling. However, the dependency of current non-AR seq2seq VC models on ground truth durations extracted from an external AR model greatly limits their generaliz…

    Submitted 15 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Submitted to ICASSP 2024. Demo: https://unilight.github.io/Publication-Demos/publications/aas-vc/index.html. Code: https://github.com/unilight/seq2seq-vc

  10. arXiv:2309.06006  [pdf, ps, other]

    cs.CV cs.AI

    SoccerNet 2023 Challenges Results

    Authors: Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim, et al. (77 additional authors not shown)

    Abstract: The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first theme, broadcast video understanding, is composed of three high-level tasks related to describing events occurring in the video broadcasts: (1) action spotting, fo…

    Submitted 12 September, 2023; originally announced September 2023.

  11. arXiv:2309.03331  [pdf, other]

    cs.CV eess.IV

    Expert Uncertainty and Severity Aware Chest X-Ray Classification by Multi-Relationship Graph Learning

    Authors: Mengliang Zhang, Xinyue Hu, Lin Gu, Liangchen Liu, Kazuma Kobayashi, Tatsuya Harada, Ronald M. Summers, Yingying Zhu

    Abstract: Patients undergoing chest X-rays (CXR) often endure multiple lung diseases. When evaluating a patient's condition, due to the complex pathologies, subtle texture changes of different lung lesions in images, and patient condition differences, radiologists may be uncertain even when they have experienced long-term clinical training and professional guidance, which introduces much noise in extracting di…

    Submitted 6 September, 2023; originally announced September 2023.

  12. arXiv:2308.07523  [pdf, other]

    stat.ML cs.LG stat.CO

    Deep Neural Operator Driven Real Time Inference for Nuclear Systems to Enable Digital Twin Solutions

    Authors: Kazuma Kobayashi, Syed Bahauddin Alam

    Abstract: This paper focuses on the feasibility of Deep Neural Operator (DeepONet) as a robust surrogate modeling method within the context of digital twin (DT) for nuclear energy systems. Through benchmarking and evaluation, this study showcases the generalizability and computational efficiency of DeepONet in solving a challenging particle transport problem. DeepONet also exhibits remarkable prediction acc…

    Submitted 28 April, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Journal ref: Sci Rep 14, 2101 (2024)

  13. Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering

    Authors: Xinyue Hu, Lin Gu, Qiyuan An, Mengliang Zhang, Liangchen Liu, Kazuma Kobayashi, Tatsuya Harada, Ronald M. Summers, Yingying Zhu

    Abstract: To contribute to automating the medical vision-language model, we propose a novel Chest-Xray Difference Visual Question Answering (VQA) task. Given a pair of main and reference images, this task attempts to answer several questions on both diseases and, more importantly, the differences between them. This is consistent with the radiologist's diagnosis practice that compares the current image with…

    Submitted 22 July, 2023; originally announced July 2023.

  14. arXiv:2307.10204  [pdf, ps, other]

    cs.IR cs.LG stat.ML

    An IPW-based Unbiased Ranking Metric in Two-sided Markets

    Authors: Keisho Oh, Naoki Nishimura, Minje Sung, Ken Kobayashi, Kazuhide Nakata

    Abstract: In modern recommendation systems, unbiased learning-to-rank (LTR) is crucial for prioritizing items from biased implicit user feedback, such as click data. Several techniques, such as Inverse Propensity Weighting (IPW), have been proposed for single-sided markets. However, less attention has been paid to two-sided markets, such as job platforms or dating services, where successful conversions requ…

    Submitted 13 July, 2023; originally announced July 2023.

  15. arXiv:2307.08966  [pdf, ps, other]

    cs.RO cs.MA

    Multi-Robot Patrol Algorithm with Distributed Coordination and Consciousness of the Base Station's Situation Awareness

    Authors: Kazuho Kobayashi, Seiya Ueno, Takehiro Higuchi

    Abstract: Multi-robot patrolling is a potential application for robotic systems to survey wide areas efficiently without human burdens and mistakes. However, such systems have few examples of real-world applications due to their lack of human predictability. This paper proposes an algorithm, Local Reactive (LR), for multi-robot patrolling to satisfy both needs: (i) patrol efficiently and (ii) provide humans…

    Submitted 10 October, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  16. arXiv:2304.14606  [pdf, other]

    cs.LG stat.ML

    Algorithmic Recourse with Missing Values

    Authors: Kentaro Kanamori, Takuya Takagi, Ken Kobayashi, Yuichi Ike

    Abstract: This paper proposes a new framework of algorithmic recourse (AR) that works even in the presence of missing values. AR aims to provide a recourse action for altering the undesired prediction result given by a classifier. Existing AR methods assume that we can access complete information on the features of an input instance. However, we often encounter missing values in a given instance (e.g., due…

    Submitted 22 May, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: 30 pages, 15 figures

  17. arXiv:2304.05949  [pdf, other]

    cond-mat.mes-hall cs.AI cs.ET cs.LG

    CMOS + stochastic nanomagnets: heterogeneous computers for probabilistic inference and learning

    Authors: Nihal Sanjay Singh, Keito Kobayashi, Qixuan Cao, Kemal Selcuk, Tianrui Hu, Shaila Niazi, Navid Anjum Aadit, Shun Kanai, Hideo Ohno, Shunsuke Fukami, Kerem Y. Camsari

    Abstract: Extending Moore's law by augmenting complementary-metal-oxide semiconductor (CMOS) transistors with emerging nanotechnologies (X) has become increasingly important. One important class of problems involves sampling-based Monte Carlo algorithms used in probabilistic machine learning, optimization, and quantum simulation. Here, we combine stochastic magnetic tunnel junction (sMTJ)-based probabilistic…

    Submitted 23 February, 2024; v1 submitted 12 April, 2023; originally announced April 2023.

    Journal ref: Nature Communications volume 15, Article number: 2685 (2024)

  18. arXiv:2303.11734  [pdf, other]

    cs.LG cs.AI

    Unlocking Layer-wise Relevance Propagation for Autoencoders

    Authors: Kenyu Kobayashi, Renata Khasanova, Arno Schneuwly, Felix Schmidt, Matteo Casserini

    Abstract: Autoencoders are a powerful and versatile tool often used for various problems such as anomaly detection, image processing and machine translation. However, their reconstructions are not always trivial to explain. Therefore, we propose a fast explainability solution by extending the Layer-wise Relevance Propagation method with the help of Deep Taylor Decomposition framework. Furthermore, we introd…

    Submitted 21 March, 2023; originally announced March 2023.

  19. arXiv:2303.03633  [pdf, other]

    cs.CV

    Sketch-based Medical Image Retrieval

    Authors: Kazuma Kobayashi, Lin Gu, Ryuichiro Hataya, Takaaki Mizuno, Mototaka Miyake, Hirokazu Watanabe, Masamichi Takahashi, Yasuyuki Takamizawa, Yukihiro Yoshida, Satoshi Nakamura, Nobuji Kouno, Amina Bolatkan, Yusuke Kurose, Tatsuya Harada, Ryuji Hamamoto

    Abstract: The amount of medical images stored in hospitals is increasing faster than ever; however, utilizing the accumulated medical images has been limited. This is because existing content-based medical image retrieval (CBMIR) systems usually require example images to construct query vectors; nevertheless, example images cannot always be prepared. Besides, there can be images with rare characteristics th…

    Submitted 6 March, 2023; originally announced March 2023.

  20. arXiv:2302.09636  [pdf, other]

    cs.CV

    Interpretable Medical Image Visual Question Answering via Multi-Modal Relationship Graph Learning

    Authors: Xinyue Hu, Lin Gu, Kazuma Kobayashi, Qiyuan An, Qingyu Chen, Zhiyong Lu, Chang Su, Tatsuya Harada, Yingying Zhu

    Abstract: Medical visual question answering (VQA) aims to answer clinically relevant questions regarding input medical images. This technique has the potential to improve the efficiency of medical professionals while relieving the burden on the public health system, particularly in resource-poor countries. Existing medical VQA methods tend to encode medical images and learn the correspondence between visual…

    Submitted 19 February, 2023; originally announced February 2023.

  21. arXiv:2301.06701  [pdf, other]

    cs.LG stat.AP stat.CO stat.ML

    Improved generalization with deep neural operators for engineering systems: Path towards digital twin

    Authors: Kazuma Kobayashi, James Daniell, Syed Bahauddin Alam

    Abstract: Neural Operator Networks (ONets) represent a novel advancement in machine learning algorithms, offering a robust and generalizable alternative for approximating solutions of partial differential equations (PDEs). Unlike traditional Neural Networks (NN), which directly approximate functions, ONets specialize in approximating mathematical operators, enhancing their efficacy in addressing complex PDEs.…

    Submitted 28 April, 2024; v1 submitted 16 January, 2023; originally announced January 2023.

    Journal ref: Engineering Applications of Artificial Intelligence 131 (2024): 107844

  22. arXiv:2301.06676  [pdf, other]

    cs.LG stat.AP stat.CO

    Explainable, Interpretable & Trustworthy AI for Intelligent Digital Twin: Case Study on Remaining Useful Life

    Authors: Kazuma Kobayashi, Syed Bahauddin Alam

    Abstract: Artificial intelligence (AI) and machine learning (ML) are increasingly used in energy and engineering systems, but these models must be fair, unbiased, and explainable. It is critical to have confidence in AI's trustworthiness. ML techniques have been useful in predicting important parameters and in improving model performance. However, for these AI techniques to be useful for making decisions, t…

    Submitted 28 April, 2024; v1 submitted 16 January, 2023; originally announced January 2023.

    Journal ref: Engineering Applications of Artificial Intelligence 129 (2024): 107620

  23. arXiv:2211.13157  [pdf, other]

    stat.AP cs.LG stat.ML

    Physics-Informed Multi-Stage Deep Learning Framework Development for Digital Twin-Centred State-Based Reactor Power Prediction

    Authors: James Daniell, Kazuma Kobayashi, Susmita Naskar, Dinesh Kumar, Souvik Chakraborty, Ayodeji Alajo, Ethan Taber, Joseph Graham, Syed Alam

    Abstract: Computationally efficient and trustworthy machine learning algorithms are necessary for Digital Twin (DT) framework development. Generally speaking, DT-enabling technologies consist of five major components: (i) Machine learning (ML)-driven prediction algorithm, (ii) Temporal synchronization between physics and digital assets utilizing advanced sensors/instrumentation, (iii) uncertainty propagatio…

    Submitted 24 November, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

  24. arXiv:2210.10314  [pdf, other]

    cs.SD eess.AS

    Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion

    Authors: Ding Ma, Lester Phillip Violeta, Kazuhiro Kobayashi, Tomoki Toda

    Abstract: Sequence-to-sequence (seq2seq) voice conversion (VC) models have greater potential in converting electrolaryngeal (EL) speech to normal speech (EL2SP) compared to conventional VC models. However, EL2SP based on seq2seq VC requires a sufficiently large amount of parallel data for the model training and it suffers from significant performance degradation when the amount of training data is insuffici…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to SLT 2022

  25. arXiv:2210.09055  [pdf, other]

    cs.CY eess.SY

    Data-driven multi-scale modeling and robust optimization of composite structure with uncertainty quantification

    Authors: Kazuma Kobayashi, Shoaib Usman, Carlos Castano, Dinesh Kumar, Syed Alam

    Abstract: Accurately modeling materials' properties at lower length scales (micro-level) while translating the effects to the component and/or system level (macro-level) can significantly reduce the amount of experimentation required to develop new technologies. Robustness analysis of fuel and structural performance for harsh environments (such as power uprated reactor systems or aerospace…

    Submitted 4 November, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Journal ref: Handbook of Smart Energy Systems, 2022

  26. arXiv:2210.00074  [pdf]

    cs.LG stat.AP

    Leveraging Industry 4.0 -- Deep Learning, Surrogate Model and Transfer Learning with Uncertainty Quantification Incorporated into Digital Twin for Nuclear System

    Authors: M. Rahman, Abid Khan, Sayeed Anowar, Md Al-Imran, Richa Verma, Dinesh Kumar, Kazuma Kobayashi, Syed Alam

    Abstract: Industry 4.0 targets the conversion of the traditional industries into intelligent ones through technological revolution. This revolution is only possible through innovation, optimization, interconnection, and rapid decision-making capability. Numerical models are believed to be the key components of Industry 4.0, facilitating quick decision-making through simulations instead of costly experiments…

    Submitted 30 September, 2022; originally announced October 2022.

  27. arXiv:2209.12146  [pdf]

    eess.SY cs.LG stat.ML

    Machine Learning and Artificial Intelligence-Driven Multi-Scale Modeling for High Burnup Accident-Tolerant Fuels for Light Water-Based SMR Applications

    Authors: Md. Shamim Hassan, Abid Hossain Khan, Richa Verma, Dinesh Kumar, Kazuma Kobayashi, Shoaib Usman, Syed Alam

    Abstract: The concept of small modular reactor has changed the outlook for tackling future energy crises. This new reactor technology is very promising considering its lower investment requirements, modularity, design simplicity, and enhanced safety features. The application of artificial intelligence-driven multi-scale modeling (neutronics, thermal hydraulics, fuel performance, etc.) incorporating Digital…

    Submitted 25 September, 2022; originally announced September 2022.

    Journal ref: Handbook of Smart Energy Systems, 2022

  28. arXiv:2205.11099  [pdf, other]

    math.OC cs.LG

    Bézier Flow: a Surface-wise Gradient Descent Method for Multi-objective Optimization

    Authors: Akiyoshi Sannai, Yasunari Hikima, Ken Kobayashi, Akinori Tanaka, Naoki Hamada

    Abstract: In this paper, we propose a strategy to construct a multi-objective optimization algorithm from a single-objective optimization algorithm by using the Bézier simplex model. Also, we extend the stability of optimization algorithms in the sense of Probably Approximately Correct (PAC) learning and define the PAC stability. We prove that it leads to an upper bound on the generalization with high pr…

    Submitted 23 May, 2022; originally announced May 2022.

  29. A Two-phase Framework with a Bézier Simplex-based Interpolation Method for Computationally Expensive Multi-objective Optimization

    Authors: Ryoji Tanabe, Youhei Akimoto, Ken Kobayashi, Hiroshi Umeki, Shinichi Shirakawa, Naoki Hamada

    Abstract: This paper proposes a two-phase framework with a Bézier simplex-based interpolation method (TPB) for computationally expensive multi-objective optimization. The first phase in TPB aims to approximate a few Pareto optimal solutions by optimizing a sequence of single-objective scalar problems. The first phase in TPB can fully exploit a state-of-the-art single-objective derivative-free optimizer. The…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: This is an accepted version of a paper published in the proceedings of GECCO 2022

  30. arXiv:2202.00380  [pdf, ps, other]

    quant-ph cond-mat.mtrl-sci cs.LG

    Machine-learning-enhanced quantum sensors for accurate magnetic field imaging

    Authors: Moeta Tsukamoto, Shuji Ito, Kensuke Ogawa, Yuto Ashida, Kento Sasaki, Kensuke Kobayashi

    Abstract: Local detection of magnetic fields is crucial for characterizing nano- and micro-materials and has been implemented using various scanning techniques or even diamond quantum sensors. Diamond nanoparticles (nanodiamonds) offer an attractive opportunity to achieve high spatial resolution because they can easily be close to the target within a few tens of nm simply by attaching them to its surface. A physi…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: 29 pages, 10 figures

  31. arXiv:2112.13208  [pdf, other]

    cs.LG

    Neural Network Module Decomposition and Recomposition

    Authors: Hiroaki Kingetsu, Kenichi Kobayashi, Taiji Suzuki

    Abstract: We propose a modularization method that decomposes a deep neural network (DNN) into small modules from a functionality perspective and recomposes them into a new model for some other task. Decomposed modules are expected to have the advantages of interpretability and verifiability due to their small size. In contrast to existing studies based on reusing models that involve retraining, such as a tr…

    Submitted 25 December, 2021; originally announced December 2021.

  32. arXiv:2112.12454  [pdf, other]

    math.OC cs.DS

    Cardinality-constrained Distributionally Robust Portfolio Optimization

    Authors: Ken Kobayashi, Yuichi Takano, Kazuhide Nakata

    Abstract: This paper studies a distributionally robust portfolio optimization model with a cardinality constraint for limiting the number of invested assets. We formulate this model as a mixed-integer semidefinite optimization (MISDO) problem by means of the moment-based ambiguity set of probability distributions of asset returns. To exactly solve large-scale problems, we propose a specialized cutting-plane…

    Submitted 21 December, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

  33. arXiv:2109.14108  [pdf, ps, other]

    math.CO cs.DS

    Connected domination in grid graphs

    Authors: Masahisa Goto, Koji M. Kobayashi

    Abstract: Given an undirected simple graph, a subset of the vertices of the graph is a dominating set if every vertex not in the subset is adjacent to at least one vertex in the subset. A subset of the vertices of the graph is a connected dominating set if the subset is a dominating set and the subgraph induced by the subset is connected. In this paper, we determine the minimum cardinality of a…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: 44 pages

  34. SpatialViewer: A Remote Work Sharing Tool that Considers Intimacy Among Workers

    Authors: Sicheng Li, Yudai Makioka, Kyousuke Kobayashi, Haoran Xie, Kentaro Takashima

    Abstract: Due to the influence of the new coronavirus disease (COVID-19), teleworking has been expanding rapidly. Although existing interactive remote working systems are convenient, they do not allow users to adjust their spatial distance to team members at will, and they ignore the…

    Submitted 7 September, 2021; originally announced September 2021.

    Comments: Proceedings of HCII 2021. 12 pages, 6 figures

  35. arXiv:2106.01415  [pdf, other]

    cs.SD cs.CL eess.AS

    A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion

    Authors: Wen-Chin Huang, Kazuhiro Kobayashi, Yu-Huai Peng, Ching-Feng Liu, Yu Tsao, Hsin-Min Wang, Tomoki Toda

    Abstract: We propose a new paradigm for maintaining speaker identity in dysarthric voice conversion (DVC). The poor quality of dysarthric speech can be greatly improved by statistical VC, but as the normal speech utterances of a dysarthria patient are nearly impossible to collect, previous work failed to recover the individuality of the patient. In light of this, we suggest a novel, two-stage approach for D…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted to Interspeech 2021. 5 pages, 3 figures, 1 table

  36. arXiv:2104.06793  [pdf, other]

    cs.SD cs.CL eess.AS

    Non-autoregressive sequence-to-sequence voice conversion

    Authors: Tomoki Hayashi, Wen-Chin Huang, Kazuhiro Kobayashi, Tomoki Toda

    Abstract: This paper proposes a novel voice conversion (VC) method based on non-autoregressive sequence-to-sequence (NAR-S2S) models. Inspired by the great success of NAR-S2S models such as FastSpeech in text-to-speech (TTS), we extend the FastSpeech2 model for the VC problem. We introduce the convolution-augmented Transformer (Conformer) instead of the Transformer, making it possible to capture both local…

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted to ICASSP2021. Demo HP: https://kan-bayashi.github.io/NonARSeq2SeqVC/

  37. arXiv:2104.04679  [pdf, other]

    cs.LG stat.ML

    Approximate Bayesian Computation of Bézier Simplices

    Authors: Akinori Tanaka, Akiyoshi Sannai, Ken Kobayashi, Naoki Hamada

    Abstract: Bézier simplex fitting algorithms have been recently proposed to approximate the Pareto set/front of multi-objective continuous optimization problems. These new methods have been shown to be successful at approximating various shapes of Pareto sets/fronts when sample points exactly lie on the Pareto set/front. However, if the sample points scatter away from the Pareto set/front, those methods often lik…

    Submitted 12 April, 2021; v1 submitted 10 April, 2021; originally announced April 2021.

    Report number: RIKEN-iTHEMS-Report-21

  38. arXiv:2103.12328  [pdf, other]

    cs.CV

    Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval

    Authors: Kazuma Kobayashi, Ryuichiro Hataya, Yusuke Kurose, Mototaka Miyake, Masamichi Takahashi, Akiko Nakagawa, Tatsuya Harada, Ryuji Hamamoto

    Abstract: In medical imaging, the characteristics purely derived from a disease should reflect the extent to which abnormal findings deviate from the normal features. Indeed, physicians often need corresponding images without abnormal findings of interest or, conversely, images that contain similar abnormal findings regardless of normal anatomical context. This is called comparative diagnostic reading of me…

    Submitted 23 March, 2021; originally announced March 2021.

  39. arXiv:2103.02858  [pdf, ps, other]

    eess.AS cs.SD

    crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder

    Authors: Kazuhiro Kobayashi, Wen-Chin Huang, Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda

    Abstract: In this paper, we present an open-source software for developing a nonparallel voice conversion (VC) system named crank. Although we have released an open-source VC software based on the Gaussian mixture model named sprocket in the last VC Challenge, it is not straightforward to apply it to any speech corpus because it is necessary to prepare parallel utterances of source and target speakers to model a…

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: Accepted to ICASSP 2021

  40. arXiv:2102.08014  [pdf, other]

    cs.AI cs.LG stat.ML

    Representing Hierarchical Structure by Using Cone Embedding

    Authors: Daisuke Takehara, Kei Kobayashi

    Abstract: Graph embedding is becoming an important method with applications in various areas, including social networks and knowledge graph completion. In particular, Poincaré embedding has been proposed to capture the hierarchical structure of graphs, and its effectiveness has been reported. However, most of the existing methods have isometric mappings in the embedding space, and the choice of the origin p…

    Submitted 10 May, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

  41. Ordered Counterfactual Explanation by Mixed-Integer Linear Optimization

    Authors: Kentaro Kanamori, Takuya Takagi, Ken Kobayashi, Yuichi Ike, Kento Uemura, Hiroki Arimura

    Abstract: Post-hoc explanation methods for machine learning models have been widely used to support decision-making. One of the popular methods is Counterfactual Explanation (CE), also known as Actionable Recourse, which provides a user with a perturbation vector of features that alters the prediction result. Given a perturbation vector, a user can interpret it as an "action" for obtaining one's desired dec…

    Submitted 14 March, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: 20 pages, 5 figures, to appear in the 35th AAAI Conference on Artificial Intelligence (AAAI 2021)

  42. arXiv:2011.06224  [pdf, other]

    eess.IV cs.CV

    Decomposing Normal and Abnormal Features of Medical Images for Content-based Image Retrieval

    Authors: Kazuma Kobayashi, Ryuichiro Hataya, Yusuke Kurose, Tatsuya Harada, Ryuji Hamamoto

    Abstract: Medical images can be decomposed into normal and abnormal features, which is considered as the compositionality. Based on this idea, we propose an encoder-decoder network to decompose a medical image into two discrete latent codes: a normal anatomy code and an abnormal anatomy code. Using these latent codes, we demonstrate a similarity retrieval by focusing on either normal or abnormal features of…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  43. arXiv:2010.13494  [pdf, other]

    cs.LG cs.AI cs.CY

    One-vs.-One Mitigation of Intersectional Bias: A General Method to Extend Fairness-Aware Binary Classification

    Authors: Kenji Kobayashi, Yuri Nakao

    Abstract: With the widespread adoption of machine learning in the real world, the impact of the discriminatory bias has attracted attention. In recent years, various methods to mitigate the bias have been proposed. However, most of them have not considered intersectional bias, which brings unfair situations where people belonging to specific subgroups of a protected group are treated worse when multiple sen…

    Submitted 26 October, 2020; originally announced October 2020.

    ACM Class: I.6.5; I.2.6

  44. arXiv:2010.04446  [pdf, other]

    eess.AS cs.CL cs.SD

    The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders

    Authors: Wen-Chin Huang, Patrick Lumban Tobing, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda

    Abstract: In this paper, we present the voice conversion (VC) systems developed at Nagoya University (NU) for the Voice Conversion Challenge 2020 (VCC2020). We aim to determine the effectiveness of two recent significant technologies in VC: sequence-to-sequence (seq2seq) models and autoregressive (AR) neural vocoders. Two respective systems were developed for the two tasks in the challenge: for task 1, we a…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: Accepted to the ISCA Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020

  45. Prediction of hierarchical time series using structured regularization and its application to artificial neural networks

    Authors: Tomokaze Shiratori, Ken Kobayashi, Yuichi Takano

    Abstract: This paper discusses the prediction of hierarchical time series, where each upper-level time series is calculated by summing appropriate lower-level time series. Forecasts for such hierarchical time series should be coherent, meaning that the forecast for an upper-level time series equals the sum of forecasts for corresponding lower-level time series. Previous methods for making coherent forecasts…

    Submitted 29 July, 2020; originally announced July 2020.

  46. Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network

    Authors: Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda

    Abstract: In this paper, a pitch-adaptive waveform generative model named Quasi-Periodic WaveNet (QPNet) is proposed to improve the limited pitch controllability of vanilla WaveNet (WN) using pitch-dependent dilated convolution neural networks (PDCNNs). Specifically, as a probabilistic autoregressive generation model with stacked dilated convolution layers, WN achieves high-fidelity audio waveform generatio…

    Submitted 27 March, 2021; v1 submitted 10 July, 2020; originally announced July 2020.

    Comments: 15 pages, 12 figures, 11 tables

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 1134-1148, 2021

  47. arXiv:2005.12573  [pdf, other]

    eess.IV cs.CV

    Learning Global and Local Features of Normal Brain Anatomy for Unsupervised Abnormality Detection

    Authors: Kazuma Kobayashi, Ryuichiro Hataya, Yusuke Kurose, Amina Bolatkan, Mototaka Miyake, Hirokazu Watanabe, Masamichi Takahashi, Jun Itami, Tatsuya Harada, Ryuji Hamamoto

    Abstract: In real-world clinical practice, overlooking unanticipated findings can result in serious consequences. However, supervised learning, which is the foundation for the current success of deep learning, only encourages models to identify abnormalities that are defined in datasets in advance. Therefore, abnormality detection must be implemented in medical images that are not limited to a specific dise…

    Submitted 8 May, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  48. Non-parallel Voice Conversion System with WaveNet Vocoder and Collapsed Speech Suppression

    Authors: Yi-Chiao Wu, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Hayashi, Tomoki Toda

    Abstract: In this paper, we integrate a simple non-parallel voice conversion (VC) system with a WaveNet (WN) vocoder and a proposed collapsed speech suppression technique. The effectiveness of WN as a vocoder for generating high-fidelity speech waveforms on the basis of acoustic features has been confirmed in recent works. However, when combining the WN vocoder with a VC system, the distorted acoustic featu…

    Submitted 6 April, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: 13 pages, 13 figures, 1 table, accepted to publish in IEEE Access

  49. arXiv:2003.00402  [pdf, other]

    stat.ML cs.CV cs.LG

    Why is the Mahalanobis Distance Effective for Anomaly Detection?

    Authors: Ryo Kamoi, Kei Kobayashi

    Abstract: The Mahalanobis distance-based confidence score, a recently proposed anomaly detection method for pre-trained neural classifiers, achieves state-of-the-art performance on both out-of-distribution (OoD) and adversarial examples detection. This work analyzes why this method exhibits such strong performance in practical settings while imposing an implausible assumption; namely, that class conditional…

    Submitted 30 April, 2020; v1 submitted 29 February, 2020; originally announced March 2020.

  50. arXiv:1911.06515  [pdf, other]

    stat.ML cs.LG

    Likelihood Assignment for Out-of-Distribution Inputs in Deep Generative Models is Sensitive to Prior Distribution Choice

    Authors: Ryo Kamoi, Kei Kobayashi

    Abstract: Recent work has shown that deep generative models assign higher likelihood to out-of-distribution inputs than to training data. We show that a factor underlying this phenomenon is a mismatch between the nature of the prior distribution and that of the data distribution, a problem found in widely used deep generative models such as VAEs and Glow. While a typical choice for a prior distribution is a…

    Submitted 15 November, 2019; originally announced November 2019.