Showing 1–23 of 23 results for author: Fujita, K

Searching in archive cs.
  1. arXiv:2407.01291  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Lightweight Zero-shot Text-to-Speech with Mixture of Adapters

    Authors: Kenichi Fujita, Takanori Ashihara, Marc Delcroix, Yusuke Ijima

    Abstract: The advancements in zero-shot text-to-speech (TTS) methods, based on large-scale models, have demonstrated high fidelity in reproducing speaker characteristics. However, these models are too large for practical daily use. We propose a lightweight zero-shot TTS method using a mixture of adapters (MoA). Our proposed method incorporates MoA modules into the decoder and the variance adapter of a non-a…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 5 pages, 3 figures, Accepted to INTERSPEECH 2024

  2. Development of an Estimation Method for the Seismic Motion Reproducibility of a Three-dimensional Ground Structure Model by combining Surface-observed Seismic Motion and Three-dimensional Seismic Motion Analysis

    Authors: Tsuyoshi Ichimura, Kohei Fujita, Ryota Kusakabe, Hiroyuki Fujiwara, Muneo Hori, Maddegedara Lalith

    Abstract: The ground structure can substantially influence seismic ground motion underscoring the need to develop a ground structure model with sufficient reliability in terms of ground motion estimation for earthquake damage mitigation. While many methods for generating ground structure models have been proposed and used in practice, there remains room for enhancing their reliability. In this study, amid m…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 16 pages, 10 figures, accepted for IHPCES/ICCS 2024 (14th International Workshop on Advances in High-Performance Computational Earth Sciences: Numerical Methods, Frameworks & Applications / 24th International Conference on Computational Science)

    Journal ref: ICCS 2024. Lecture Notes in Computer Science, vol 14834. Springer, Cham

  3. Low-ordered Orthogonal Voxel Finite Element with INT8 Tensor Cores for GPU-based Explicit Elastic Wave Propagation Analysis

    Authors: Tsuyoshi Ichimura, Kohei Fujita, Muneo Hori, Maddegedara Lalith

    Abstract: Faster explicit elastic wavefield simulations are required for large and complex three-dimensional media using a structured finite element method. Such wavefield simulations are suitable for GPUs, which have exhibited improved computational performance in recent years, and the use of GPUs is expected to speed up such simulations. However, available computational performance on GPUs is typically no…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 16 pages, 6 figures, accepted for IHPCES/ICCS 2024 (14th International Workshop on Advances in High-Performance Computational Earth Sciences: Numerical Methods, Frameworks & Applications / 24th International Conference on Computational Science)

    Journal ref: ICCS 2024. Lecture Notes in Computer Science, vol 14834. Springer, Cham

  4. arXiv:2402.07085  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis

    Authors: Kenichi Fujita, Atsushi Ando, Yusuke Ijima

    Abstract: This paper proposes a speech rhythm-based method for speaker embeddings to model phoneme duration using a few utterances by the target speaker. Speech rhythm is one of the essential factors among speaker characteristics, along with acoustic features such as F0, for reproducing individual utterances in speech synthesis. A novel feature of the proposed method is the rhythm-based embeddings extracted…

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: 11 pages, 9 figures, Accepted to IEICE TRANSACTIONS on Information and Systems

    Journal ref: IEICE TRANSACTIONS on Information and Systems 107.1 (2024): 93-104

  5. arXiv:2401.05111  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters

    Authors: Kenichi Fujita, Hiroshi Sato, Takanori Ashihara, Hiroki Kanagawa, Marc Delcroix, Takafumi Moriya, Yusuke Ijima

    Abstract: The zero-shot text-to-speech (TTS) method, based on speaker embeddings extracted from reference speech using self-supervised learning (SSL) speech representations, can reproduce speaker characteristics very accurately. However, this approach suffers from degradation in speech synthesis quality when the reference speech contains noise. In this paper, we propose a noise-robust zero-shot TTS method.…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 5 pages, 3 figures, Accepted to IEEE ICASSP 2024

  6. Characteristics of networks generated by kernel growing neural gas

    Authors: Kazuhisa Fujita

    Abstract: This research aims to develop kernel GNG, a kernelized version of the growing neural gas (GNG) algorithm, and to investigate the features of the networks generated by the kernel GNG. The GNG is an unsupervised artificial neural network that can transform a dataset into an undirected graph, thereby extracting the features of the dataset as a graph. The GNG is widely used in vector quantization, clu…

    Submitted 25 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Journal ref: International Journal of Artificial Intelligence & Applications, 14(5), 25-39 (2023)

  7. An efficient and straightforward online quantization method for a data stream through remove-birth updating

    Authors: Kazuhisa Fujita

    Abstract: The growth of network-connected devices has led to an exponential increase in data generation, creating significant challenges for efficient data analysis. This data is generated continuously, creating a dynamic flow known as a data stream. The characteristics of a data stream may change dynamically, and this change is known as concept drift. Consequently, a method for handling data streams must e…

    Submitted 25 December, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  8. Zero-shot text-to-speech synthesis conditioned using self-supervised speech representation model

    Authors: Kenichi Fujita, Takanori Ashihara, Hiroki Kanagawa, Takafumi Moriya, Yusuke Ijima

    Abstract: This paper proposes a zero-shot text-to-speech (TTS) conditioned by a self-supervised speech-representation model acquired through self-supervised learning (SSL). Conventional methods with embedding vectors from x-vector or global style tokens still have a gap in reproducing the speaker characteristics of unseen speakers. A novel point of the proposed method is the direct use of the SSL model to o…

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: 5 pages, 3 figures, Accepted to IEEE ICASSP 2023 workshop Self-supervision in Audio, Speech and Beyond

    Journal ref: 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), 2023, pp. 1-5

  9. arXiv:2212.13157  [pdf, other]

    cs.LG

    Gaussian Process Classification Bandits

    Authors: Tatsuya Hayashi, Naoki Ito, Koji Tabata, Atsuyoshi Nakamura, Katsumasa Fujita, Yoshinori Harada, Tamiki Komatsuzaki

    Abstract: Classification bandits are multi-armed bandit problems whose task is to classify a given set of arms into either positive or negative class depending on whether the rate of the arms with the expected reward of at least h is not less than w for given thresholds h and w. We study a special classification bandit problem in which arms correspond to points x in d-dimensional real space with expected re…

    Submitted 26 December, 2022; originally announced December 2022.

  10. arXiv:2112.11323  [pdf]

    physics.acc-ph cs.LG math.NA

    Physics-informed neural network method for modelling beam-wall interactions

    Authors: Kazuhiro Fujita

    Abstract: A mesh-free approach for modelling beam-wall interactions in particle accelerators is proposed. The key idea of our method is to use a deep neural network as a surrogate for the solution to a set of partial differential equations involving the particle beam, and the surface impedance concept. The proposed approach is applied to the coupling impedance of an accelerator vacuum chamber with thin cond…

    Submitted 4 January, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: 3 pages, 3 figures, submitted for IET possible publications

    MSC Class: 35J05 ACM Class: J.2

    Journal ref: Electronics Letters, 2022

  11. arXiv:2111.06266  [pdf, ps, other]

    cs.LG cs.AI

    AlphaDDA: Strategies for Adjusting the Playing Strength of a Fully Trained AlphaZero System to a Suitable Human Training Partner

    Authors: Kazuhisa Fujita

    Abstract: Artificial intelligence (AI) has achieved superhuman performance in board games such as Go, chess, and Othello (Reversi). In other words, the AI system surpasses the level of a strong human expert player in such games. In this context, it is difficult for a human player to enjoy playing the games with the AI. To keep human players entertained and immersed in a game, the AI is required to dynamical…

    Submitted 20 September, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: 24 pages, 10 figures

    MSC Class: 68T05 ACM Class: I.2.m

  12. Estimation of the number of clusters on d-dimensional sphere

    Authors: Kazuhisa Fujita

    Abstract: Spherical data is distributed on the sphere. The data appears in various fields such as meteorology, biology, and natural language processing. However, a method for analysis of spherical data does not develop enough yet. One of the important issues is an estimation of the number of clusters in spherical data. To address the issue, I propose a new method called the Spherical X-means (SX-means) that…

    Submitted 13 May, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

    Journal ref: Artificial Intelligence Research, 10, 57-63 (2021)

  13. arXiv:2009.07101  [pdf, ps, other]

    cs.LG cs.NE stat.ML

    Approximate spectral clustering using both reference vectors and topology of the network generated by growing neural gas

    Authors: Kazuhisa Fujita

    Abstract: Spectral clustering (SC) is one of the most popular clustering methods and often outperforms traditional clustering methods. SC uses the eigenvectors of a Laplacian matrix calculated from a similarity matrix of a dataset. SC has serious drawbacks: the significant increases in the time complexity derived from the computation of eigenvectors and the memory space complexity to store the similarity ma…

    Submitted 12 August, 2021; v1 submitted 15 September, 2020; originally announced September 2020.

    Journal ref: PeerJ Computer Science, 7, e679 (2021)

  14. arXiv:2005.08870  [pdf, other]

    cs.CE

    Topology design of two-fluid heat exchange

    Authors: Hiroki Kobayashi, Kentaro Yaji, Shintaro Yamasaki, Kikuo Fujita

    Abstract: Heat exchangers are devices that typically transfer heat between two fluids. The performance of a heat exchanger such as heat transfer rate and pressure loss strongly depends on the flow regime in the heat transfer system. In this paper, we present a density-based topology optimization method for a two-fluid heat exchange system, which achieves a maximum heat transfer rate under fixed pressure los…

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: 19 pages, 17 figures

  15. A Feedback Shift Correction in Predicting Conversion Rates under Delayed Feedback

    Authors: Shota Yasui, Gota Morishita, Komei Fujita, Masashi Shibata

    Abstract: In display advertising, predicting the conversion rate, that is, the probability that a user takes a predefined action on an advertiser's website, such as purchasing goods is fundamental in estimating the value of displaying the advertisement. However, there is a relatively long time delay between a click and its resultant conversion. Because of the delayed feedback, some positive instances at the…

    Submitted 5 February, 2020; originally announced February 2020.

    Comments: The Web Conference 2020 (WWW '20)

  16. Deep learning generates custom-made logistic regression models for explaining how breast cancer subtypes are classified

    Authors: Takuma Shibahara, Chisa Wada, Yasuho Yamashita, Kazuhiro Fujita, Masamichi Sato, Junichi Kuwata, Atsushi Okamoto, Yoshimasa Ono

    Abstract: Differentiating the intrinsic subtypes of breast cancer is crucial for deciding the best treatment strategy. Deep learning can predict the subtypes from genetic information more accurately than conventional statistical methods, but to date, deep learning has not been directly utilized to examine which genes are associated with which subtypes. To clarify the mechanisms embedded in the intrinsic sub…

    Submitted 18 July, 2022; v1 submitted 20 January, 2020; originally announced January 2020.

    Comments: 25 pages, 5 figures

  17. arXiv:1908.08936  [pdf, other]

    cs.CY cs.LG stat.ML

    Fatigue-Aware Ad Creative Selection

    Authors: Daisuke Moriwaki, Komei Fujita, Shota Yasui, Takahiro Hoshino

    Abstract: In online display advertising, selecting the most effective ad creative (ad image) for each impression is a crucial task for DSPs (Demand-Side Platforms) to fulfill their goals (click-through rate, number of conversions, revenue, and brand improvement). As widely recognized in the marketing literature, the effect of ad creative changes with the number of repetitive ad exposures. In this study, we…

    Submitted 14 January, 2020; v1 submitted 20 August, 2019; originally announced August 2019.

    Comments: The previous version was uploaded under the title of "A Contextual Bandit for Ad Creative Selection under Ad Fatigue"

  18. arXiv:1902.10895  [pdf]

    cs.CV

    What you get is not always what you see: pitfalls in solar array assessment using overhead imagery

    Authors: Wei Hu, Kyle Bradbury, Jordan M. Malof, Boning Li, Bohao Huang, Artem Streltsov, K. Sydny Fujita, Ben Hoen

    Abstract: Effective integration planning for small, distributed solar photovoltaic (PV) arrays into electric power grids requires access to high quality data: the location and power capacity of individual solar PV arrays. Unfortunately, national databases of small-scale solar PV do not exist; those that do are limited in their spatial resolution, typically aggregated up to state or national levels. While se…

    Submitted 25 July, 2022; v1 submitted 28 February, 2019; originally announced February 2019.

    Comments: 25 pages

  19. arXiv:1809.00563  [pdf, ps, other]

    cs.CL

    End-to-End Argument Mining for Discussion Threads Based on Parallel Constrained Pointer Architecture

    Authors: Gaku Morio, Katsuhide Fujita

    Abstract: Argument Mining (AM) is a relatively recent discipline, which concentrates on extracting claims or premises from discourses, and inferring their structures. However, many existing works do not consider micro-level AM studies on discussion threads sufficiently. In this paper, we tackle AM for discussion threads. Our main contributions are follows: (1) A novel combination scheme focusing on micro-le…

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: accepted at the 5th Workshop on Argument Mining at EMNLP 2018

  20. arXiv:1808.00810  [pdf, other]

    physics.soc-ph cs.SI

    Identifying exogenous and endogenous activity in social media

    Authors: Kazuki Fujita, Alexey Medvedev, Shinsuke Koyama, Renaud Lambiotte, Shigeru Shinomoto

    Abstract: The occurrence of new events in a system is typically driven by external causes and by previous events taking place inside the system. This is a general statement, applying to a range of situations including, more recently, to the activity of users in Online social networks (OSNs). Here we develop a method for extracting from a series of posting times the relative contributions of exogenous, e.g.…

    Submitted 2 August, 2018; originally announced August 2018.

    Comments: 5 figures

    Journal ref: Phys. Rev. E 98, 052304 (2018)

  21. arXiv:1710.08679  [pdf, ps, other]

    cs.DC cs.MS

    Implicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation using OpenACC

    Authors: Takuma Yamaguchi, Kohei Fujita, Tsuyoshi Ichimura, Muneo Hori, Maddegedara Lalith, Kengo Nakajima

    Abstract: In this paper, we develop a low-order three-dimensional finite-element solver for fast multiple-case crust deformation analysis on GPU-based systems. Based on a high-performance solver designed for massively parallel CPU based systems, we modify the algorithm to reduce random data access, and then insert OpenACC directives. The developed solver on ten Reedbush-H nodes (20 P100 GPUs) attained speed…

    Submitted 24 October, 2017; originally announced October 2017.

    Comments: 18 pages, 10 figures, accepted for WACCPD2017

  22. On Upper Bounds on the Church-Rosser Theorem

    Authors: Ken-etsu Fujita

    Abstract: The Church-Rosser theorem in the type-free lambda-calculus is well investigated both for beta-equality and beta-reduction. We provide a new proof of the theorem for beta-equality with no use of parallel reductions, but simply with Takahashi's translation (Gross-Knuth strategy). Based on this, upper bounds for reduction sequences on the theorem are obtained as the fourth level of the Grzegorc…

    Submitted 3 January, 2017; originally announced January 2017.

    Comments: In Proceedings WPTE 2016, arXiv:1701.00233

    Journal ref: EPTCS 235, 2017, pp. 16-31

  23. arXiv:1506.05068  [pdf, ps, other]

    cs.CV

    Extract an essential skeleton of a character as a graph from a character image

    Authors: Kazuhisa Fujita

    Abstract: This paper aims to make a graph representing an essential skeleton of a character from an image that includes a machine printed or a handwritten character using growing neural gas (GNG) method and relative network graph (RNG) algorithm. The visual system in our brain can recognize printed characters and handwritten characters easily, robustly, and precisely. How does our brain robustly recognize c…

    Submitted 31 January, 2022; v1 submitted 13 June, 2015; originally announced June 2015.

    Journal ref: International Journal of Computer Science Issues 10, 5, 35-39, 2013