Showing 1–22 of 22 results for author: Nakajima, S

  1. arXiv:2404.10935  [pdf, other]

    physics.chem-ph cs.LG physics.comp-ph stat.ML

    Molecular relaxation by reverse diffusion with time step prediction

    Authors: Khaled Kahouli, Stefaan Simon Pierre Hessmann, Klaus-Robert Müller, Shinichi Nakajima, Stefan Gugler, Niklas Wolf Andreas Gebauer

    Abstract: Molecular relaxation, finding the equilibrium state of a non-equilibrium structure, is an essential component of computational chemistry to understand reactivity. Classical force field methods often rely on insufficient local energy minimization, while neural network force field models require large labeled datasets encompassing both equilibrium and non-equilibrium structures. As a remedy, we prop…

    Submitted 16 April, 2024; originally announced April 2024.

  2. arXiv:2311.13594  [pdf, other]

    cs.LG cs.AI stat.ML

    Labeling Neural Representations with Inverse Recognition

    Authors: Kirill Bykov, Laura Kopf, Shinichi Nakajima, Marius Kloft, Marina M. -C. Höhne

    Abstract: Deep Neural Networks (DNNs) demonstrate remarkable capabilities in learning complex hierarchical data representations, but the nature of these representations remains largely unknown. Existing global explainability methods, such as Network Dissection, face limitations such as reliance on segmentation masks, lack of statistical significance testing, and high computational demands. We propose Invers…

    Submitted 18 January, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: 25 pages, 16 figures

    Journal ref: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  3. arXiv:2310.17638  [pdf, other]

    cs.LG stat.ML

    Generative Fractional Diffusion Models

    Authors: Gabriel Nobis, Maximilian Springenberg, Marco Aversa, Michael Detzel, Rembert Daems, Roderick Murray-Smith, Shinichi Nakajima, Sebastian Lapuschkin, Stefano Ermon, Tolga Birdal, Manfred Opper, Christoph Knochenhauer, Luis Oala, Wojciech Samek

    Abstract: We introduce the first continuous-time score-based generative model that leverages fractional diffusion processes for its underlying dynamics. Although diffusion models have excelled at capturing data distributions, they still suffer from various limitations such as slow convergence, mode-collapse on imbalanced data, and lack of diversity. These issues are partially linked to the use of light-tail…

    Submitted 24 June, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    ACM Class: I.2.4; F.4.1; G.3

  4. arXiv:2302.14112  [pdf, other]

    cond-mat.dis-nn cs.LG math.PR stat.ML

    Injectivity of ReLU networks: perspectives from statistical physics

    Authors: Antoine Maillard, Afonso S. Bandeira, David Belius, Ivan Dokmanić, Shuta Nakajima

    Abstract: When can the input of a ReLU neural network be inferred from its output? In other words, when is the network injective? We consider a single layer, $x \mapsto \mathrm{ReLU}(Wx)$, with a random Gaussian $m \times n$ matrix $W$, in a high-dimensional setting where $n, m \to \infty$. Recent work connects this problem to spherical integral geometry giving rise to a conjectured sharp injectivity thresh…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 60 pages

  5. arXiv:2207.08219  [pdf, other]

    cs.LG stat.ML

    Gradients should stay on Path: Better Estimators of the Reverse- and Forward KL Divergence for Normalizing Flows

    Authors: Lorenz Vaitl, Kim A. Nicoli, Shinichi Nakajima, Pan Kessel

    Abstract: We propose an algorithm to estimate the path-gradient of both the reverse and forward Kullback-Leibler divergence for an arbitrary manifestly invertible normalizing flow. The resulting path-gradient estimators are straightforward to implement, have lower variance, and lead not only to faster convergence of training but also to better overall approximation results compared to standard total gradien…

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: 29 pages, 8 figures

  6. arXiv:2206.09016  [pdf, other]

    cs.LG stat.ML

    Path-Gradient Estimators for Continuous Normalizing Flows

    Authors: Lorenz Vaitl, Kim A. Nicoli, Shinichi Nakajima, Pan Kessel

    Abstract: Recent work has established a path-gradient estimator for simple variational Gaussian distributions and has argued that the path-gradient is particularly beneficial in the regime in which the variational distribution approaches the exact target distribution. In many applications, this regime can however not be reached by a simple Gaussian variational distribution. In this work, we overcome this cr…

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 8 pages, 5 figures, 39th International Conference on Machine Learning

  7. arXiv:2204.05229  [pdf, other]

    cs.LG stat.ML

    Mixture-of-experts VAEs can disregard variation in surjective multimodal data

    Authors: Jannik Wolff, Tassilo Klein, Moin Nabi, Rahul G. Krishnan, Shinichi Nakajima

    Abstract: Machine learning systems are often deployed in domains that entail data from multiple modalities, for example, phenotypic and genotypic characteristics describe patients in healthcare. Previous works have developed multimodal variational autoencoders (VAEs) that generate several modalities. We consider surjective data, where single datapoints from one modality (such as class labels) describe multi…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted at the NeurIPS 2021 workshop on Bayesian Deep Learning

  8. arXiv:2201.10859  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Visualizing the Diversity of Representations Learned by Bayesian Neural Networks

    Authors: Dennis Grinwald, Kirill Bykov, Shinichi Nakajima, Marina M. -C. Höhne

    Abstract: Explainable Artificial Intelligence (XAI) aims to make learning machines less opaque, and offers researchers and practitioners various tools to reveal the decision-making strategies of neural networks. In this work, we investigate how XAI methods can be used for exploring and visualizing the diversity of feature representations learned by Bayesian Neural Networks (BNNs). Our goal is to provide a g…

    Submitted 14 November, 2023; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: 16 pages, 18 figures

    Journal ref: Published in Transactions on Machine Learning Research (11/2023)

  9. arXiv:2108.10346  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Explaining Bayesian Neural Networks

    Authors: Kirill Bykov, Marina M. -C. Höhne, Adelaida Creosteanu, Klaus-Robert Müller, Frederick Klauschen, Shinichi Nakajima, Marius Kloft

    Abstract: To make advanced learning machines such as Deep Neural Networks (DNNs) more transparent in decision making, explainable AI (XAI) aims to provide interpretations of DNNs' predictions. These interpretations are usually given in the form of heatmaps, each one illustrating relevant patterns regarding the prediction for a given instance. Bayesian approaches such as Bayesian Neural Networks (BNNs) so fa…

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: 16 pages, 8 figures

  10. arXiv:2008.13723  [pdf, other]

    cs.LG stat.ML

    Langevin Cooling for Domain Translation

    Authors: Vignesh Srinivasan, Klaus-Robert Müller, Wojciech Samek, Shinichi Nakajima

    Abstract: Domain translation is the task of finding correspondence between two domains. Several Deep Neural Network (DNN) models, e.g., CycleGAN and cross-lingual language models, have shown remarkable successes on this task under the unsupervised setting---the mappings between the domains are learned from two independent sets of training data in both domains (without paired samples). However, those methods…

    Submitted 31 August, 2020; originally announced August 2020.

  11. arXiv:2006.09000  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    How Much Can I Trust You? -- Quantifying Uncertainties in Explaining Neural Networks

    Authors: Kirill Bykov, Marina M. -C. Höhne, Klaus-Robert Müller, Shinichi Nakajima, Marius Kloft

    Abstract: Explainable AI (XAI) aims to provide interpretations for predictions made by learning machines, such as deep neural networks, in order to make the machines more transparent for the user and furthermore trustworthy also for applications in e.g. safety-critical areas. So far, however, no methods for quantifying uncertainties of explanations have been conceived, which is problematic in domains where…

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 12 pages, 10 figures

  12. arXiv:2006.03589  [pdf, other]

    cs.LG cs.AI stat.ML

    Higher-Order Explanations of Graph Neural Networks via Relevant Walks

    Authors: Thomas Schnake, Oliver Eberle, Jonas Lederer, Shinichi Nakajima, Kristof T. Schütt, Klaus-Robert Müller, Grégoire Montavon

    Abstract: Graph Neural Networks (GNNs) are a popular approach for predicting graph structured data. As GNNs tightly entangle the input graph into the neural network structure, common explainable AI approaches are not applicable. To a large extent, GNNs have remained black-boxes for the user so far. In this paper, we show that GNNs can in fact be naturally explained using higher-order expansions, i.e. by ide…

    Submitted 26 November, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: 14 pages + 6 pages supplement

  13. arXiv:2003.09136  [pdf]

    cs.LG cs.CL stat.ML

    Automatic Identification of Types of Alterations in Historical Manuscripts

    Authors: David Lassner, Anne Baillot, Sergej Dogadov, Klaus-Robert Müller, Shinichi Nakajima

    Abstract: Alterations in historical manuscripts such as letters represent a promising field of research. On the one hand, they help understand the construction of text. On the other hand, topics that are being considered sensitive at the time of the manuscript gain coherence and contextuality when taking alterations into account, especially in the case of deletions. The analysis of alterations in manuscript…

    Submitted 4 November, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

    Comments: Accepted for publication in Digital Humanities Quarterly

  14. arXiv:1910.13496  [pdf, other]

    cond-mat.stat-mech cs.LG stat.ML

    Asymptotically unbiased estimation of physical observables with neural samplers

    Authors: Kim A. Nicoli, Shinichi Nakajima, Nils Strodthoff, Wojciech Samek, Klaus-Robert Müller, Pan Kessel

    Abstract: We propose a general framework for the estimation of observables with generative neural samplers focusing on modern deep generative neural networks that provide an exact sampling probability. In this framework, we present asymptotically unbiased estimators for generic observables, including those that explicitly depend on the partition function such as free energy or entropy, and derive correspond…

    Submitted 13 February, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: 5 figures

    Journal ref: Phys. Rev. E 101, 023304 (2020)

  15. arXiv:1910.09840  [pdf, ps, other]

    cs.LG cs.CV stat.ML

    Towards Best Practice in Explaining Neural Network Decisions with LRP

    Authors: Maximilian Kohlbrenner, Alexander Bauer, Shinichi Nakajima, Alexander Binder, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Within the last decade, neural network based predictors have demonstrated impressive - and at times super-human - capabilities. This performance is often paid for with an intransparent prediction process and thus has sparked numerous contributions in the novel field of explainable artificial intelligence (XAI). In this paper, we focus on a popular and widely used method of XAI, the Layer-wise Rele…

    Submitted 13 July, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 7 pages, 4 figures, 1 table. fixed table row compared to v2. Presented virtually at IJCNN 2020

  16. arXiv:1903.11048  [pdf, other]

    cond-mat.stat-mech cs.LG stat.ML

    Comment on "Solving Statistical Mechanics Using VANs": Introducing saVANt - VANs Enhanced by Importance and MCMC Sampling

    Authors: Kim Nicoli, Pan Kessel, Nils Strodthoff, Wojciech Samek, Klaus-Robert Müller, Shinichi Nakajima

    Abstract: In this comment on "Solving Statistical Mechanics Using Variational Autoregressive Networks" by Wu et al., we propose a subtle yet powerful modification of their approach. We show that the inherent sampling error of their method can be corrected by using neural network-based MCMC or importance sampling which leads to asymptotically unbiased estimators for physical quantities. This modification is…

    Submitted 26 March, 2019; originally announced March 2019.

    Comments: 6 pages, 4 figures

  17. arXiv:1902.10664  [pdf, other]

    cs.LG math.NA stat.ML

    Local Function Complexity for Active Learning via Mixture of Gaussian Processes

    Authors: Danny Panknin, Stefan Chmiela, Klaus-Robert Müller, Shinichi Nakajima

    Abstract: Inhomogeneities in real-world data, e.g., due to changes in the observation noise level or variations in the structural complexity of the source function, pose a unique set of challenges for statistical inference. Accounting for them can greatly improve predictive power when physical resources or computation time is limited. In this paper, we draw on recent theoretical results on the estimation of…

    Submitted 12 December, 2023; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: 30 pages (+18 pages of references and appendices), 20 figures

    Journal ref: Transactions on Machine Learning Research, December 2023

  18. arXiv:1806.11326  [pdf, other]

    stat.ML cs.LG

    Unsupervised Detection and Explanation of Latent-class Contextual Anomalies

    Authors: Jacob Kauffmann, Grégoire Montavon, Luiz Alberto Lima, Shinichi Nakajima, Klaus-Robert Müller, Nico Görnitz

    Abstract: Detecting and explaining anomalies is a challenging effort. This holds especially true when data exhibits strong dependencies and single measurements need to be assessed and analyzed in their respective context. In this work, we consider scenarios where measurements are non-i.i.d, i.e. where samples are dependent on corresponding discrete latent variables which are connected through some given dep…

    Submitted 29 June, 2018; originally announced June 2018.

  19. arXiv:1805.12017  [pdf, other]

    cs.LG stat.ML

    Robustifying Models Against Adversarial Attacks by Langevin Dynamics

    Authors: Vignesh Srinivasan, Arturo Marban, Klaus-Robert Müller, Wojciech Samek, Shinichi Nakajima

    Abstract: Adversarial attacks on deep learning models have compromised their performance considerably. As remedies, a lot of defense methods were proposed, which however, have been circumvented by newer attacking strategies. In the midst of this ensuing arms race, the problem of robustness against adversarial attacks still remains unsolved. This paper proposes a novel, simple yet effective defense strategy…

    Submitted 6 June, 2019; v1 submitted 30 May, 2018; originally announced May 2018.

  20. arXiv:1609.03219  [pdf, other]

    stat.ML cs.LG

    Sharing Hash Codes for Multiple Purposes

    Authors: Wikor Pronobis, Danny Panknin, Johannes Kirschnick, Vignesh Srinivasan, Wojciech Samek, Volker Markl, Manohar Kaul, Klaus-Robert Mueller, Shinichi Nakajima

    Abstract: Locality sensitive hashing (LSH) is a powerful tool for sublinear-time approximate nearest neighbor search, and a variety of hashing schemes have been proposed for different dissimilarity measures. However, hash codes significantly depend on the dissimilarity, which prohibits users from adjusting the dissimilarity at query time. In this paper, we propose multiple purpose LSH (mp-LSH) which shares…

    Submitted 1 June, 2017; v1 submitted 11 September, 2016; originally announced September 2016.

  21. arXiv:1609.00626  [pdf, other]

    cs.CL stat.AP

    SynsetRank: Degree-adjusted Random Walk for Relation Identification

    Authors: Shinichi Nakajima, Sebastian Krause, Dirk Weissenborn, Sven Schmeier, Nico Goernitz, Feiyu Xu

    Abstract: In relation extraction, a key process is to obtain good detectors that find relevant sentences describing the target relation. To minimize the necessity of labeled data for refining detectors, previous work successfully made use of BabelNet, a semantic graph structure expressing relationships between synsets, as side information or prior knowledge. The goal of this paper is to enhance the use of g…

    Submitted 15 September, 2016; v1 submitted 2 September, 2016; originally announced September 2016.

  22. Sparse Probit Linear Mixed Model

    Authors: Stephan Mandt, Florian Wenzel, Shinichi Nakajima, John P. Cunningham, Christoph Lippert, Marius Kloft

    Abstract: Linear Mixed Models (LMMs) are important tools in statistical genetics. When used for feature selection, they allow to find a sparse set of genetic traits that best predict a continuous phenotype of interest, while simultaneously correcting for various confounding factors such as age, ethnicity and population structure. Formulated as models for linear regression, LMMs have been restricted to conti…

    Submitted 17 July, 2017; v1 submitted 16 July, 2015; originally announced July 2015.

    Comments: Published version, 21 pages, 6 figures

    Journal ref: Machine Learning, 106(9), 1621-1642 (2017)