Skip to main content

Showing 1–21 of 21 results for author: Washio, T

.
  1. arXiv:2109.14198  [pdf, other

    cs.LG

    Breaking the curse of dimensionality with Isolation Kernel

    Authors: Kai Ming Ting, Takashi Washio, Ye Zhu, Yang Xu

    Abstract: The curse of dimensionality has been studied in different aspects. However, breaking the curse has been elusive. We show for the first time that it is possible to break the curse using the recently introduced Isolation Kernel. We show that only Isolation Kernel performs consistently well in indexed search, spectral & density peaks clustering, SVM classification and t-SNE visualization in both low… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

  2. arXiv:2009.12196  [pdf, other

    cs.LG stat.ML

    Isolation Distributional Kernel: A New Tool for Point & Group Anomaly Detection

    Authors: Kai Ming Ting, Bi-Cun Xu, Takashi Washio, Zhi-Hua Zhou

    Abstract: We introduce Isolation Distributional Kernel as a new way to measure the similarity between two distributions. Existing approaches based on kernel mean embedding, which convert a point kernel to a distributional kernel, have two key issues: the point kernel employed has a feature map with intractable dimensionality; and it is {\em data independent}. This paper shows that Isolation Distributional K… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 14 pages

  3. arXiv:1907.01104  [pdf, other

    cs.LG stat.ML

    Isolation Kernel: The X Factor in Efficient and Effective Large Scale Online Kernel Learning

    Authors: Kai Ming Ting, Jonathan R. Wells, Takashi Washio

    Abstract: Large scale online kernel learning aims to build an efficient and scalable kernel-based predictive model incrementally from a sequence of potentially infinite data points. A current key approach focuses on ways to produce an approximate finite-dimensional feature map, assuming that the kernel used has a feature map with intractable dimensionality---an assumption traditionally held in kernel-based… ▽ More

    Submitted 24 September, 2019; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: Textural updates. Restructured section 8.4 including additional experimental results

  4. arXiv:1902.03402  [pdf, ps, other

    cs.IR

    A new simple and effective measure for bag-of-word inter-document similarity measurement

    Authors: Sunil Aryal, Kai Ming Ting, Takashi Washio, Gholamreza Haffari

    Abstract: To measure the similarity of two documents in the bag-of-words (BoW) vector representation, different term weighting schemes are used to improve the performance of cosine similarity---the most widely used inter-document similarity measure in text mining. In this paper, we identify the shortcomings of the underlying assumptions of term weighting in the inter-document similarity measurement task; an… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.

  5. Free-hand gas identification based on transfer function ratios without gas flow control

    Authors: Gaku Imamura, Kota Shiba, Genki Yoshikawa, Takashi Washio

    Abstract: Gas identification is one of the most important functions of gas sensor systems. To identify gas species from sensing signals, however, gas input patterns (e.g. the gas flow sequence) must be controlled or monitored precisely with additional instruments such as pumps or mass flow controllers; otherwise, effective signal features for analysis are difficult to be extracted. Toward a compact and easy… ▽ More

    Submitted 12 December, 2018; originally announced December 2018.

    Comments: 19 pages, 8 figures, 3 tables

  6. arXiv:1812.03395  [pdf, other

    cs.LG stat.ML

    Learning Graph Representation via Formal Concept Analysis

    Authors: Yuka Yoneda, Mahito Sugiyama, Takashi Washio

    Abstract: We present a novel method that can learn a graph representation from multivariate data. In our representation, each node represents a cluster of data points and each edge represents the subset-superset relationship between clusters, which can be mutually overlapped. The key to our method is to use formal concept analysis (FCA), which can extract hierarchical relationships between clusters based on… ▽ More

    Submitted 8 December, 2018; originally announced December 2018.

    Comments: 5 pages, 2 figures, Relational Representation Learning Workshop (NeurIPS 2018)

  7. Analysis of cause-effect inference by comparing regression errors

    Authors: Patrick Blöbaum, Dominik Janzing, Takashi Washio, Shohei Shimizu, Bernhard Schölkopf

    Abstract: We address the problem of inferring the causal direction between two variables by comparing the least-squares errors of the predictions in both possible directions. Under the assumption of an independence between the function relating cause and effect, the conditional noise distribution, and the distribution of the cause, we show that the errors are smaller in causal direction if both variables ar… ▽ More

    Submitted 24 January, 2019; v1 submitted 19 February, 2018; originally announced February 2018.

    Comments: This is an extended version of the AISTATS 2018 paper

    Journal ref: PeerJ, 2019

  8. arXiv:1610.03263  [pdf, other

    cs.AI cs.LG stat.ML

    Error Asymmetry in Causal and Anticausal Regression

    Authors: Patrick Blöbaum, Takashi Washio, Shohei Shimizu

    Abstract: It is generally difficult to make any statements about the expected prediction error in an univariate setting without further knowledge about how the data were generated. Recent work showed that knowledge about the real underlying causal structure of a data generation process has implications for various machine learning settings. Assuming an additive noise and an independence between data generat… ▽ More

    Submitted 17 April, 2017; v1 submitted 11 October, 2016; originally announced October 2016.

    Journal ref: Behaviormetrika, 2017, 10.1007/s41237-017-0022-z

  9. arXiv:1408.0337  [pdf, ps, other

    stat.ML

    A Bayesian estimation approach to analyze non-Gaussian data-generating processes with latent classes

    Authors: Naoki Tanaka, Shohei Shimizu, Takashi Washio

    Abstract: A large amount of observational data has been accumulated in various fields in recent times, and there is a growing need to estimate the generating processes of these data. A linear non-Gaussian acyclic model (LiNGAM) based on the non-Gaussianity of external influences has been proposed to estimate the data-generating processes of variables. However, the results of the estimation can be biased if… ▽ More

    Submitted 2 August, 2014; originally announced August 2014.

    Comments: 10 pages, 1 figures

  10. arXiv:1401.5636  [pdf, ps, other

    stat.ML cs.LG

    Causal Discovery in a Binary Exclusive-or Skew Acyclic Model: BExSAM

    Authors: Takanori Inazumi, Takashi Washio, Shohei Shimizu, Joe Suzuki, Akihiro Yamamoto, Yoshinobu Kawahara

    Abstract: Discovering causal relations among observed variables in a given data set is a major objective in studies of statistics and artificial intelligence. Recently, some techniques to discover a unique causal model have been explored based on non-Gaussianity of the observed data distribution. However, most of these are limited to continuous data. In this paper, we present a novel causal model for binary… ▽ More

    Submitted 22 January, 2014; originally announced January 2014.

    Comments: 10 pages. A longer version of our UAI2011 paper (Inazumi et al., 2011). arXiv admin note: text overlap with arXiv:1202.3736

  11. arXiv:1401.5625  [pdf, ps, other

    stat.ML

    Identifiability of an Integer Modular Acyclic Additive Noise Model and its Causal Structure Discovery

    Authors: Joe Suzuki, Takanori Inazumi, Takashi Washio, Shohei Shimizu

    Abstract: The notion of causality is used in many situations dealing with uncertainty. We consider the problem whether causality can be identified given data set generated by discrete random variables rather than continuous ones. In particular, for non-binary data, thus far it was only known that causality can be identified except rare cases. In this paper, we present necessary and sufficient condition for… ▽ More

    Submitted 22 January, 2014; originally announced January 2014.

    Comments: 30 pages, 4 figures

  12. arXiv:1401.4785  [pdf, ps, other

    quant-ph stat.AP stat.ML

    Anomaly detection in reconstructed quantum states using a machine-learning technique

    Authors: Satoshi Hara, Takafumi Ono, Ryo Okamoto, Takashi Washio, Shigeki Takeuchi

    Abstract: The accurate detection of small deviations in given density matrices is important for quantum information processing. Here we propose a new method based on the concept of data mining. We demonstrate that the proposed method can more accurately detect small erroneous deviations in reconstructed density matrices, which contain intrinsic fluctuations due to the limited number of samples, than a naive… ▽ More

    Submitted 19 January, 2014; originally announced January 2014.

    Comments: Accepted for Physical Review A

    MSC Class: 81V80

  13. arXiv:1303.7410  [pdf, ps, other

    stat.ML

    ParceLiNGAM: A causal ordering method robust against latent confounders

    Authors: Tatsuya Tashiro, Shohei Shimizu, Aapo Hyvarinen, Takashi Washio

    Abstract: We consider learning a causal ordering of variables in a linear non-Gaussian acyclic model called LiNGAM. Several existing methods have been shown to consistently estimate a causal ordering assuming that all the model assumptions are correct. But, the estimation results could be distorted if some assumptions actually are violated. In this paper, we propose a new algorithm for learning causal order… ▽ More

    Submitted 28 July, 2013; v1 submitted 29 March, 2013; originally announced March 2013.

    Comments: A revised version of this was accepted in Neural Computation. 18 pages and 5 figures. arXiv admin note: substantial text overlap with arXiv:1204.1795

  14. arXiv:1204.1795  [pdf, ps, other

    stat.ML

    Estimation of causal orders in a linear non-Gaussian acyclic model: a method robust against latent confounders

    Authors: Tatsuya Tashiro, Shohei Shimizu, Aapo Hyvarinen, Takashi Washio

    Abstract: We consider to learn a causal ordering of variables in a linear non-Gaussian acyclic model called LiNGAM. Several existing methods have been shown to consistently estimate a causal ordering assuming that all the model assumptions are correct. But, the estimation results could be distorted if some assumptions actually are violated. In this paper, we propose a new algorithm for learning causal order… ▽ More

    Submitted 9 April, 2012; originally announced April 2012.

    Comments: 8 pages, 2 figures

  15. arXiv:1203.0117  [pdf, ps, other

    stat.ML

    Learning a Common Substructure of Multiple Graphical Gaussian Models

    Authors: Satoshi Hara, Takashi Washio

    Abstract: Properties of data are frequently seen to vary depending on the sampled situations, which usually changes along a time evolution or owing to environmental effects. One way to analyze such data is to find invariances, or representative features kept constant over changes. The aim of this paper is to identify one such feature, namely interactions or dependencies among variables that are common acros… ▽ More

    Submitted 24 September, 2012; v1 submitted 1 March, 2012; originally announced March 2012.

    Comments: 47 pages, 6 figures, elsarticle.cls

  16. arXiv:1202.3736  [pdf

    cs.LG stat.ML

    Discovering causal structures in binary exclusive-or skew acyclic models

    Authors: Takanori Inazumi, Takashi Washio, Shohei Shimizu, Joe Suzuki, Akihiro Yamamoto, Yoshinobu Kawahara

    Abstract: Discovering causal relations among observed variables in a given data set is a main topic in studies of statistics and artificial intelligence. Recently, some techniques to discover an identifiable causal structure have been explored based on non-Gaussianity of the observed data distribution. However, most of these are limited to continuous data. In this paper, we present a novel causal model for… ▽ More

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-373-382

  17. GTRACE-RS: Efficient Graph Sequence Mining using Reverse Search

    Authors: Akihiro Inokuchi, Hiroaki Ikuta, Takashi Washio

    Abstract: The mining of frequent subgraphs from labeled graph data has been studied extensively. Furthermore, much attention has recently been paid to frequent pattern mining from graph sequences. A method, called GTRACE, has been proposed to mine frequent patterns from graph sequences under the assumption that changes in graphs are gradual. Although GTRACE mines the frequent patterns efficiently, it still… ▽ More

    Submitted 18 October, 2011; originally announced October 2011.

  18. arXiv:1108.4217  [pdf, ps, other

    cs.DS

    Prismatic Algorithm for Discrete D.C. Programming Problems

    Authors: Yoshinobu Kawahara, Takashi Washio

    Abstract: In this paper, we propose the first exact algorithm for minimizing the difference of two submodular functions (D.S.), i.e., the discrete version of the D.C. programming problem. The developed algorithm is a branch-and-bound-based algorithm which responds to the structure of this problem through the relationship between submodularity and convexity. The D.S. programming problem covers a broad range… ▽ More

    Submitted 21 August, 2011; originally announced August 2011.

  19. arXiv:1101.2489  [pdf, ps, other

    stat.ML

    DirectLiNGAM: A direct method for learning a linear non-Gaussian structural equation model

    Authors: Shohei Shimizu, Takanori Inazumi, Yasuhiro Sogawa, Aapo Hyvarinen, Yoshinobu Kawahara, Takashi Washio, Patrik O. Hoyer, Kenneth Bollen

    Abstract: Structural equation models and Bayesian networks have been widely used to analyze causal relations between continuous variables. In such frameworks, linear acyclic models are typically used to model the data-generating process of variables. Recently, it was shown that use of non-Gaussianity identifies the full structure of a linear acyclic model, i.e., a causal ordering of variables and their conn… ▽ More

    Submitted 7 April, 2011; v1 submitted 12 January, 2011; originally announced January 2011.

    Comments: A revised version of this was accepted in Journal of Machine Learning Research

  20. arXiv:1006.5041  [pdf, ps, other

    cs.AI

    GroupLiNGAM: Linear non-Gaussian acyclic models for sets of variables

    Authors: Yoshinobu Kawahara, Kenneth Bollen, Shohei Shimizu, Takashi Washio

    Abstract: Finding the structure of a graphical model has been received much attention in many fields. Recently, it is reported that the non-Gaussianity of data enables us to identify the structure of a directed acyclic graph without any prior knowledge on the structure. In this paper, we propose a novel non-Gaussianity based algorithm for more general type of models; chain graphs. The algorithm finds an ord… ▽ More

    Submitted 24 June, 2010; originally announced June 2010.

  21. Finding Exogenous Variables in Data with Many More Variables than Observations

    Authors: Shohei Shimizu, Takashi Washio, Aapo Hyvarinen, Seiya Imoto

    Abstract: Many statistical methods have been proposed to estimate causal models in classical situations with fewer variables than observations (p<n, p: the number of variables and n: the number of observations). However, modern datasets including gene expression data need high-dimensional causal modeling in challenging situations with orders of magnitude more variables than observations (p>>n). In this pape… ▽ More

    Submitted 7 April, 2011; v1 submitted 5 April, 2009; originally announced April 2009.

    Comments: A revised version of this was published in Proc. ICANN2010

    Journal ref: ARTIFICIAL NEURAL NETWORKS - ICANN 2010. Lecture Notes in Computer Science, 2010, Volume 6352/2010, 67-76