Showing 1–41 of 41 results for author: Yamazaki, K

  1. arXiv:2406.18162

    cs.RO cs.HC

    Multimodal Reaching-Position Prediction for ADL Support Using Neural Networks

    Authors: Yutaka Takase, Kimitoshi Yamazaki

    Abstract: This study aimed to develop daily living support robots for patients with hemiplegia and the elderly. To support the daily living activities using robots in ordinary households without imposing physical and mental burdens on users, the system must detect the actions of the user and move appropriately according to their motions. We propose a reaching-position prediction scheme that targets the mo…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.00307

    cs.CV

    HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model

    Authors: Khoa Vo, Thinh Phan, Kashu Yamazaki, Minh Tran, Ngan Le

    Abstract: Current video-language models (VLMs) rely extensively on instance-level alignment between video and language modalities, which presents two major limitations: (1) visual reasoning disobeys the natural perception that humans do in first-person perspective, leading to a lack of reasoning interpretation; and (2) learning is limited in capturing inherent fine-grained relationships between two modaliti…

    Submitted 6 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: under submission

  3. arXiv:2405.01124

    stat.ML cs.CV cs.LG eess.IV math.ST

    Investigating Self-Supervised Image Denoising with Denaturation

    Authors: Hiroki Waida, Kimihiro Yamazaki, Atsushi Tokuhisa, Mutsuyo Wada, Yuichiro Wada

    Abstract: Self-supervised learning for image denoising problems in the presence of denaturation for noisy data is a crucial approach in machine learning. However, theoretical understanding of the performance of the approach that uses denatured data is lacking. To provide better understanding of the approach, in this paper, we analyze a self-supervised denoising algorithm that uses denatured data in depth th…

    Submitted 8 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  4. arXiv:2404.12631

    cs.NE cs.AI

    Breaching the Bottleneck: Evolutionary Transition from Reward-Driven Learning to Reward-Agnostic Domain-Adapted Learning in Neuromodulated Neural Nets

    Authors: Solvi Arnold, Reiji Suzuki, Takaya Arita, Kimitoshi Yamazaki

    Abstract: Advanced biological intelligence learns efficiently from an information-rich stream of stimulus information, even when feedback on behaviour quality is sparse or absent. Such learning exploits implicit assumptions about task domains. We refer to such learning as Domain-Adapted Learning (DAL). In contrast, AI learning algorithms rely on explicit externally provided measures of behaviour quality to…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures

    ACM Class: I.2.6

  5. arXiv:2402.10429

    stat.ML cs.LG

    Fixed Confidence Best Arm Identification in the Bayesian Setting

    Authors: Kyoungseok Jang, Junpei Komiyama, Kazutoshi Yamazaki

    Abstract: We consider the fixed-confidence best arm identification (FC-BAI) problem in the Bayesian setting. This problem aims to find the arm of the largest mean with a fixed confidence level when the bandit model has been sampled from the known prior. Most studies on the FC-BAI problem have been conducted in the frequentist setting, where the bandit model is predetermined before the game starts. We show t…

    Submitted 22 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  6. arXiv:2310.03923

    cs.CV cs.RO

    Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

    Authors: Kashu Yamazaki, Taisei Hanyu, Khoa Vo, Thang Pham, Minh Tran, Gianfranco Doretto, Anh Nguyen, Ngan Le

    Abstract: Precise 3D environmental mapping is pivotal in robotics. Existing methods often rely on predefined concepts during training or are time-intensive when generating semantic maps. This paper presents Open-Fusion, a groundbreaking approach for real-time open-vocabulary 3D mapping and queryable scene representation using RGB-D data. Open-Fusion harnesses the power of a pre-trained vision-language found…

    Submitted 5 October, 2023; originally announced October 2023.

  7. arXiv:2306.06842

    cs.CV

    AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

    Authors: Kashu Yamazaki, Taisei Hanyu, Minh Tran, Adrian de Luis, Roy McCann, Haitao Liao, Chase Rainwater, Meredith Adkins, Jackson Cothren, Ngan Le

    Abstract: Aerial Image Segmentation is a top-down perspective semantic segmentation and has several challenging characteristics such as strong imbalance in the foreground-background distribution, complex background, intra-class heterogeneity, inter-class homogeneity, and tiny objects. To handle these problems, we inherit the advantages of Transformers and propose AerialFormer, which unifies Transformers at…

    Submitted 1 October, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: under review

  8. arXiv:2305.08363

    cs.IT eess.SP

    User-Centric Clustering Under Fairness Scheduling in Cell-Free Massive MIMO

    Authors: Fabian Göttsch, Noboru Osawa, Yoshiaki Amano, Issei Kanno, Kosuke Yamazaki, Giuseppe Caire

    Abstract: We consider fairness scheduling in a user-centric cell-free massive MIMO network, where $L$ remote radio units, each with $M$ antennas, serve $K_{\rm tot} \approx LM$ user equipments (UEs). Recent results show that the maximum network sum throughput is achieved where $K_{\rm act} \approx \frac{LM}{2}$ UEs are simultaneously active in any given time-frequency slots. However, the number of users…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2211.15294

  9. arXiv:2212.06206

    cs.CV

    Contextual Explainable Video Representation: Human Perception-based Understanding

    Authors: Khoa Vo, Kashu Yamazaki, Phong X. Nguyen, Phat Nguyen, Khoa Luu, Ngan Le

    Abstract: Video understanding is a growing field and a subject of intense research, which includes many interesting tasks to understanding both spatial and temporal information, e.g., action detection, action recognition, video captioning, video retrieval. One of the most challenging problems in video understanding is dealing with feature extraction, i.e. extract contextual visual representation from given…

    Submitted 17 December, 2022; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: Accepted in Asilomar Conference 2022

  10. arXiv:2212.05136

    cs.CV

    CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection

    Authors: Hyekang Kevin Joo, Khoa Vo, Kashu Yamazaki, Ngan Le

    Abstract: Video anomaly detection (VAD) -- commonly formulated as a multiple-instance learning problem in a weakly-supervised manner due to its labor-intensive nature -- is a challenging problem in video surveillance where the frames of anomaly need to be localized in an untrimmed video. In this paper, we first propose to utilize the ViT-encoded visual features from CLIP, in contrast with the conventional C…

    Submitted 3 July, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: Published at the 30th IEEE International Conference on Image Processing (IEEE ICIP 2023)

  11. arXiv:2211.15294

    cs.IT

    Fairness Scheduling in Dense User-Centric Cell-Free Massive MIMO Networks

    Authors: Fabian Göttsch, Noboru Osawa, Takeo Ohseki, Yoshiaki Amano, Issei Kanno, Kosuke Yamazaki, Giuseppe Caire

    Abstract: We consider a user-centric scalable cell-free massive MIMO network with a total of $LM$ distributed remote radio unit antennas serving $K$ user equipments (UEs). Many works in the current literature assume $LM\gg K$, enabling high UE data rates but also leading to a system not operating at its maximum performance in terms of sum throughput. We provide a new perspective on cell-free massive MIMO ne…

    Submitted 28 November, 2022; originally announced November 2022.

  12. arXiv:2211.15103

    cs.CV

    VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

    Authors: Kashu Yamazaki, Khoa Vo, Sang Truong, Bhiksha Raj, Ngan Le

    Abstract: Video paragraph captioning aims to generate a multi-sentence description of an untrimmed video with several temporal event locations in coherent storytelling. Following the human perception process, where the scene is effectively understood by decomposing it into visual (e.g. human, animal) and non-visual components (e.g. action, relations) under the mutual influence of vision and language, we fir…

    Submitted 15 February, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted to AAAI 2023 Oral

  13. arXiv:2210.06323

    cs.CV

    AISFormer: Amodal Instance Segmentation with Transformer

    Authors: Minh Tran, Khoa Vo, Kashu Yamazaki, Arthur Fernandes, Michael Kidd, Ngan Le

    Abstract: Amodal Instance Segmentation (AIS) aims to segment the region of both visible and possible occluded parts of an object instance. While Mask R-CNN-based AIS approaches have shown promising results, they are unable to model high-level features coherence due to the limited receptive field. The most recent transformer-based models show impressive performance on vision tasks, even better than Convoluti…

    Submitted 17 March, 2024; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to BMVC2022

  14. arXiv:2210.02578

    cs.CV

    AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation

    Authors: Khoa Vo, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, Ngan Le

    Abstract: Temporal action proposal generation (TAPG) is a challenging task, which requires localizing action intervals in an untrimmed video. Intuitively, we as humans, perceive an action through the interactions between actors, relevant objects, and the surrounding environment. Despite the significant progress of TAPG, a vast majority of existing methods ignore the aforementioned principle of the human per…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted for publication in International Journal of Computer Vision

  15. arXiv:2207.11478

    cs.IT

    Overloaded Pilot Assignment with Pilot Decontamination for Cell-Free Systems

    Authors: Noboru Osawa, Fabian Göttsch, Issei Kanno, Takeo Ohseki, Yoshiaki Amano, Kosuke Yamazaki, Giuseppe Caire

    Abstract: The pilot contamination in cell-free massive multiple-input-multiple-output (CF-mMIMO) must be addressed for accommodating a large number of users. In previous works, we have investigated a decontamination method called subspace projection (SP). The SP separates interference from co-pilot users by using the orthogonality of the principal components of the users' channel subspaces. Non-overloaded p…

    Submitted 10 October, 2022; v1 submitted 23 July, 2022; originally announced July 2022.

    Comments: 7 pages, 2 figures, this paper was submitted to IEEE WCNC 2023

  16. arXiv:2206.12972

    cs.CV

    VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

    Authors: Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le

    Abstract: In this paper, we leverage the human perceiving process, that involves vision and language interaction, to generate a coherent paragraph description of untrimmed videos. We propose vision-language (VL) features consisting of two modalities, i.e., (i) vision modality to capture global visual content of the entire scene and (ii) language modality to extract scene elements description of both human a…

    Submitted 6 August, 2022; v1 submitted 26 June, 2022; originally announced June 2022.

    Comments: accepted by The 29th IEEE International Conference on Image Processing (IEEE ICIP) 2022

  17. arXiv:2206.10920

    cs.RO cs.AI

    Recognising Affordances in Predicted Futures to Plan with Consideration of Non-canonical Affordance Effects

    Authors: Solvi Arnold, Mami Kuroishi, Tadashi Adachi, Kimitoshi Yamazaki

    Abstract: We propose a novel system for action sequence planning based on a combination of affordance recognition and a neural forward model predicting the effects of affordance execution. By performing affordance recognition on predicted futures, we avoid reliance on explicit affordance effect definitions for multi-step planning. Because the system learns affordance effects from experience data, the system…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 8 pages, 8 figures, video: http://youtu.be/4naJ5IghHcg

    ACM Class: I.2.9; I.2.6

  18. arXiv:2206.03801

    cs.IT eess.SP

    Robust PCA for Subspace Estimation in User-Centric Cell-Free Wireless Networks

    Authors: Fabian Göttsch, Noboru Osawa, Takeo Ohseki, Kosuke Yamazaki, Giuseppe Caire

    Abstract: We consider a scalable user-centric cell-free massive MIMO network with distributed remote radio units (RUs), enabling macrodiversity and joint processing. Due to the limited uplink (UL) pilot dimension, multiuser interference in the UL pilot transmission phase makes channel estimation a non-trivial problem. We make use of two types of UL pilot signals, sounding reference signal (SRS) and demodula…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.00714

  19. arXiv:2206.03800

    cs.IT eess.SP

    Optimal User Load and Energy Efficiency in User-Centric Cell-Free Wireless Networks

    Authors: Fabian Göttsch, Noboru Osawa, Takeo Ohseki, Kosuke Yamazaki, Giuseppe Caire

    Abstract: Cell-free massive MIMO is a variant of multiuser MIMO and massive MIMO, in which the total number of antennas $LM$ is distributed among the $L$ remote radio units (RUs) in the system, enabling macrodiversity and joint processing. Due to pilot contamination and system scalability, each RU can only serve a limited number of users. Obtaining the optimal number of users simultaneously served on one re…

    Submitted 8 June, 2022; originally announced June 2022.

  20. arXiv:2203.08951

    cs.LG cs.CV eess.IV

    Meta-Learning of NAS for Few-shot Learning in Medical Image Applications

    Authors: Viet-Khoa Vo-Ho, Kashu Yamazaki, Hieu Hoang, Minh-Triet Tran, Ngan Le

    Abstract: Deep learning methods have been successful in solving tasks in machine learning and have made breakthroughs in many sectors owing to their ability to automatically extract features from unstructured data. However, their performance relies on manual trial-and-error processes for selecting an appropriate network architecture, hyperparameters for training, and pre-/post-procedures. Even though it has…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: book chapter, in Meta-Learning with Medical Imaging and Health Informatics Applications

  21. ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation

    Authors: Khoa Vo, Kashu Yamazaki, Sang Truong, Minh-Triet Tran, Akihiro Sugimoto, Ngan Le

    Abstract: Temporal action proposal generation (TAPG) aims to estimate temporal intervals of actions in untrimmed videos, which is a challenging yet plays an important role in many tasks of video analysis and understanding. Despite the great achievement in TAPG, most existing works ignore the human perception of interaction between agents and the surrounding environment by applying a deep learning model as a…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted in the journal of IEEE Access Vol. 9

  22. arXiv:2203.00714

    cs.IT eess.SP

    Subspace-Based Pilot Decontamination in User-Centric Scalable Cell-Free Wireless Networks

    Authors: Fabian Göttsch, Noboru Osawa, Takeo Ohseki, Kosuke Yamazaki, Giuseppe Caire

    Abstract: We consider a cell-free wireless system operated in Time Division Duplex (TDD) mode with user-centric clusters of remote radio units (RUs). Since the uplink pilot dimensions per channel coherence slot is limited, co-pilot users might incur mutual pilot contamination. In the current literature, it is assumed that the long-term statistical knowledge of all user channels is available. This enables Mi…

    Submitted 17 November, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

  23. arXiv:2201.04922

    cs.IT eess.SP

    Uplink-Downlink Duality and Precoding Strategies with Partial CSI in Cell-Free Wireless Networks

    Authors: Fabian Göttsch, Noboru Osawa, Takeo Ohseki, Kosuke Yamazaki, Giuseppe Caire

    Abstract: We consider a scalable user-centric wireless network with dynamic cluster formation as defined by Björnsson and Sanguinetti. After having shown the importance of dominant channel subspace information for uplink (UL) pilot decontamination and having examined different UL combining schemes in our previous work, here we investigate precoding strategies for the downlink (DL). Distributed scalable DL p…

    Submitted 17 January, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:2108.04579

  24. Energy Efficiency of Uplink Cell-Free Massive MIMO With Transmit Power Control in Measured Propagation Channel

    Authors: Thomas Choi, Masaaki Ito, Issei Kanno, Jorge Gomez-Ponce, Colton Bullard, Takeo Ohseki, Kosuke Yamazaki, Andreas F. Molisch

    Abstract: Cell-free massive MIMO (CF-mMIMO) provides wireless connectivity for a large number of user equipments (UEs) using access points (APs) distributed across a wide area with high spectral efficiency (SE). The energy efficiency (EE) of the uplink is determined by (i) the transmit power control (TPC) algorithms, (ii) the numbers, configurations, and locations of the APs and the UEs, and (iii) the propa…

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 12 pages, 12 figures, IEEE Open Journal of Circuits and Systems. arXiv admin note: text overlap with arXiv:2108.02130

    Journal ref: 2021 IEEE Open Journal of Circuits and Systems

  25. arXiv:2110.11474

    cs.CV

    AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation

    Authors: Khoa Vo, Hyekang Joo, Kashu Yamazaki, Sang Truong, Kris Kitani, Minh-Triet Tran, Ngan Le

    Abstract: Humans typically perceive the establishment of an action in a video through the interaction between an actor and the surrounding environment. An action only starts when the main actor in the video begins to interact with the environment, while it ends when the main actor stops the interaction. Despite the great progress in temporal action proposal generation, most existing works ignore the aforeme…

    Submitted 24 October, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: Accepted in BMVC 2021 (Oral Session)

  26. arXiv:2108.11510

    cs.CV cs.AI

    Deep Reinforcement Learning in Computer Vision: A Comprehensive Survey

    Authors: Ngan Le, Vidhiwar Singh Rathour, Kashu Yamazaki, Khoa Luu, Marios Savvides

    Abstract: Deep reinforcement learning augments the reinforcement learning framework and utilizes the powerful representation of deep neural networks. Recent works have demonstrated the remarkable successes of deep reinforcement learning in various domains including finance, medicine, healthcare, video games, robotics, and computer vision. In this work, we provide a detailed review of recent and state-of-the…

    Submitted 25 August, 2021; originally announced August 2021.

  27. arXiv:2108.07936

    eess.IV cs.CV

    Calibration Method of the Monocular Omnidirectional Stereo Camera

    Authors: Ryota Kawamata, Keiichi Betsui, Kazuyoshi Yamazaki, Rei Sakakibara, Takeshi Shimano

    Abstract: Compact and low-cost devices are needed for autonomous driving to image and measure distances to objects 360-degree around. We have been developing an omnidirectional stereo camera exploiting two hyperbolic mirrors and a single set of a lens and sensor, which makes this camera compact and cost efficient. We establish a new calibration method for this camera considering higher-order radial distorti…

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: 8 pages, 8 figures, 2 tables, accepted for publication in International Journal of Automotive Engineering

  28. arXiv:2108.04579

    cs.IT eess.SP

    The Impact of Subspace-Based Pilot Decontamination in User-Centric Scalable Cell-Free Wireless Networks

    Authors: Fabian Göttsch, Noboru Osawa, Takeo Ohseki, Kosuke Yamazaki, Giuseppe Caire

    Abstract: We consider a scalable user-centric wireless network with dynamic cluster formation as defined by Björnsson and Sanguinetti. Several options for scalable uplink (UL) processing are examined including: i) cluster size and SNR threshold criterion for cluster formation; ii) UL pilot dimension; iii) local detection and global (per cluster) combining. We use a simple model for the channel vector spatia…

    Submitted 10 August, 2021; originally announced August 2021.

  29. arXiv:2107.08323

    cs.CV

    Agent-Environment Network for Temporal Action Proposal Generation

    Authors: Viet-Khoa Vo-Ho, Ngan Le, Kashu Yamazaki, Akihiro Sugimoto, Minh-Triet Tran

    Abstract: Temporal action proposal generation is an essential and challenging task that aims at localizing temporal intervals containing human actions in untrimmed videos. Most of existing approaches are unable to follow the human cognitive process of understanding the video context due to lack of attention mechanism to express the concept of an action or an agent who performs the action or the interaction…

    Submitted 16 March, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: Accepted in ICASSP 2021

  30. arXiv:2103.09042

    eess.IV cs.CV

    Invertible Residual Network with Regularization for Effective Medical Image Segmentation

    Authors: Kashu Yamazaki, Vidhiwar Singh Rathour, T. Hoang Ngan Le

    Abstract: Deep Convolutional Neural Networks (CNNs) i.e. Residual Networks (ResNets) have been used successfully for many computer vision tasks, but are difficult to scale to 3D volumetric medical data. Memory is increasingly often the bottleneck when training 3D Convolutional Neural Networks (CNNs). Recently, invertible neural networks have been applied to significantly reduce activation memory footprint w…

    Submitted 16 March, 2021; originally announced March 2021.

  31. arXiv:2103.08137

    cs.RO cs.AI

    Cloth Manipulation Planning on Basis of Mesh Representations with Incomplete Domain Knowledge and Voxel-to-Mesh Estimation

    Authors: Solvi Arnold, Daisuke Tanaka, Kimitoshi Yamazaki

    Abstract: We consider the problem of open-goal planning for robotic cloth manipulation. Core of our system is a neural network trained as a forward model of cloth behaviour under manipulation, with planning performed through backpropagation. We introduce a neural network-based routine for estimating mesh representations from voxel input, and perform planning in mesh format internally. We address the problem…

    Submitted 12 November, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: 27 pages, 13 figures

  32. arXiv:2012.02463

    eess.IV cs.CV

    Offset Curves Loss for Imbalanced Problem in Medical Segmentation

    Authors: Ngan Le, Trung Le, Kashu Yamazaki, Toan Duc Bui, Khoa Luu, Marios Savides

    Abstract: Medical image segmentation has played an important role in medical analysis and widely developed for many clinical applications. Deep learning-based approaches have achieved high performance in semantic segmentation but they are limited to pixel-wise setting and imbalanced classes data problem. In this paper, we tackle those limitations by developing a new deep learning-based model which takes int…

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: ICPR 2020

  33. arXiv:2012.02073

    cs.CV

    A Multi-task Contextual Atrous Residual Network for Brain Tumor Detection & Segmentation

    Authors: Ngan Le, Kashu Yamazaki, Dat Truong, Kha Gia Quach, Marios Savvides

    Abstract: In recent years, deep neural networks have achieved state-of-the-art performance in a variety of recognition and segmentation tasks in medical imaging including brain tumor segmentation. We investigate that segmenting a brain tumor is facing to the imbalanced data problem where the number of pixels belonging to the background class (non tumor pixel) is much larger than the number of pixels belongi…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted in ICPR 2020

  34. arXiv:2010.15396

    cs.IT eess.SP

    Channel Estimation and Equalization for CP-OFDM-based OTFS in Fractional Doppler Channels

    Authors: Noriyuki Hashimoto, Noboru Osawa, Kosuke Yamazaki, Shinsuke Ibi

    Abstract: Orthogonal time frequency and space (OTFS) modulation is a promising technology that satisfies high Doppler requirements for future mobile systems. OTFS modulation encodes information symbols and pilot symbols into the two-dimensional (2D) delay-Doppler (DD) domain. The received symbols suffer from inter-Doppler interference (IDI) in the fading channels with fractional Doppler shifts that are samp…

    Submitted 21 January, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

  35. arXiv:1906.09391

    stat.ML cs.LG

    Model Bridging: Connection between Simulation Model and Neural Network

    Authors: Keiichi Kisamori, Keisuke Yamazaki, Yuto Komori, Hiroshi Tokieda

    Abstract: The interpretability of machine learning, particularly for deep neural networks, is crucial for decision making in real-world applications. One approach is replacing the un-interpretable machine learning model with a surrogate model, which has a simple structure for interpretation. Another approach is understanding the target system by using a simulation modeled by human knowledge with interpretab…

    Submitted 21 July, 2020; v1 submitted 22 June, 2019; originally announced June 2019.

  36. arXiv:1809.08159

    stat.ML cs.LG

    Simulator Calibration under Covariate Shift with Kernels

    Authors: Keiichi Kisamori, Motonobu Kanagawa, Keisuke Yamazaki

    Abstract: We propose a novel calibration method for computer simulators, dealing with the problem of covariate shift. Covariate shift is the situation where input distributions for training and test are different, and ubiquitous in applications of simulations. Our approach is based on Bayesian inference with kernel mean embedding of distributions, and on the use of an importance-weighted reproducing kernel…

    Submitted 18 March, 2020; v1 submitted 21 September, 2018; originally announced September 2018.

  37. arXiv:1408.5661

    stat.ML cs.LG

    Asymptotic Accuracy of Bayesian Estimation for a Single Latent Variable

    Authors: Keisuke Yamazaki

    Abstract: In data science and machine learning, hierarchical parametric models, such as mixture models, are often used. They contain two kinds of variables: observable variables, which represent the parts of the data that can be directly measured, and latent variables, which represent the underlying processes that generate the data. Although there has been an increase in research on the estimation accuracy…

    Submitted 17 April, 2015; v1 submitted 25 August, 2014; originally announced August 2014.

    Comments: 28 pages, 3 figures

  38. arXiv:1212.2511

    cs.LG stat.ML

    Stochastic complexity of Bayesian networks

    Authors: Keisuke Yamazaki, Sumio Watanbe

    Abstract: Bayesian networks are now being used in enormous fields, for example, diagnosis of a system, data mining, clustering and so on. In spite of their wide range of applications, the statistical properties have not yet been clarified, because the models are nonidentifiable and non-regular. In a Bayesian network, the set of its parameter for a smaller model is an analytic set with singularities in the…

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-592-599

  39. arXiv:1211.2293

    cs.DC cs.CE cs.PF

    Performance Evaluation of Treecode Algorithm for N-Body Simulation Using GridRPC System

    Authors: Truong Vinh Truong Duy, Katsuhiro Yamazaki, Shigeru Oyanagi

    Abstract: This paper is aimed at improving the performance of the treecode algorithm for N-Body simulation by employing the NetSolve GridRPC programming model to exploit the use of multiple clusters. N-Body is a classical problem, and appears in many areas of science and engineering, including astrophysics, molecular dynamics, and graphics. In the simulation of N-Body, the specific routine for calculating t…

    Submitted 10 November, 2012; originally announced November 2012.

    Comments: 4 pages, 9 figures

  40. arXiv:1211.2292

    cs.DC cs.CE cs.PF

    Hybrid MPI-OpenMP Paradigm on SMP Clusters: MPEG-2 Encoder and N-Body Simulation

    Authors: Truong Vinh Truong Duy, Katsuhiro Yamazaki, Kosai Ikegami, Shigeru Oyanagi

    Abstract: Clusters of SMP nodes provide support for a wide diversity of parallel programming paradigms. Combining both shared memory and message passing parallelizations within the same application, the hybrid MPI-OpenMP paradigm is an emerging trend for parallel programming to fully exploit distributed shared-memory architecture. In this paper, we improve the performance of MPEG-2 encoder and n-body simula…

    Submitted 10 November, 2012; originally announced November 2012.

    Comments: 8 pages, 9 figures, 6 tables

  41. arXiv:1204.2069

    stat.ML cs.LG

    Asymptotic Accuracy of Distribution-Based Estimation for Latent Variables

    Authors: Keisuke Yamazaki

    Abstract: Hierarchical statistical models are widely employed in information science and data engineering. The models consist of two types of variables: observable variables that represent the given data and latent variables for the unobservable labels. An asymptotic analysis of the models plays an important role in evaluating the learning process; the result of the analysis is applied not only to theoretic…

    Submitted 19 February, 2014; v1 submitted 10 April, 2012; originally announced April 2012.

    Comments: 25 pages, 2 figures