Skip to main content

Showing 1–42 of 42 results for author: Nakajima, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06150  [pdf, other

    cs.LG quant-ph

    Physics-Informed Bayesian Optimization of Variational Quantum Circuits

    Authors: Kim A. Nicoli, Christopher J. Anders, Lena Funcke, Tobias Hartung, Karl Jansen, Stefan Kühn, Klaus-Robert Müller, Paolo Stornati, Pan Kessel, Shinichi Nakajima

    Abstract: In this paper, we propose a novel and powerful method to harness Bayesian optimization for Variational Quantum Eigensolvers (VQEs) -- a hybrid quantum-classical protocol used to approximate the ground state of a quantum Hamiltonian. Specifically, we derive a VQE-kernel which incorporates important prior information about quantum circuits: the kernel feature map of the VQE-kernel exactly matches th… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 36 pages, 17 figures, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  2. arXiv:2404.10935  [pdf, other

    physics.chem-ph cs.LG physics.comp-ph stat.ML

    Molecular relaxation by reverse diffusion with time step prediction

    Authors: Khaled Kahouli, Stefaan Simon Pierre Hessmann, Klaus-Robert Müller, Shinichi Nakajima, Stefan Gugler, Niklas Wolf Andreas Gebauer

    Abstract: Molecular relaxation, finding the equilibrium state of a non-equilibrium structure, is an essential component of computational chemistry to understand reactivity. Classical force field methods often rely on insufficient local energy minimization, while neural network force field models require large labeled datasets encompassing both equilibrium and non-equilibrium structures. As a remedy, we prop… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  3. arXiv:2403.03333  [pdf, other

    cs.LG cs.DC

    Federated Learning over Connected Modes

    Authors: Dennis Grinwald, Philipp Wiesner, Shinichi Nakajima

    Abstract: Statistical heterogeneity in federated learning poses two major challenges: slow global training due to conflicting gradient signals, and the need of personalization for local distributions. In this work, we tackle both challenges by leveraging recent advances in \emph{linear mode connectivity} -- identifying a linearly connected low-loss region in the weight space of neural networks, which we cal… ▽ More

    Submitted 21 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  4. arXiv:2311.13594  [pdf, other

    cs.LG cs.AI stat.ML

    Labeling Neural Representations with Inverse Recognition

    Authors: Kirill Bykov, Laura Kopf, Shinichi Nakajima, Marius Kloft, Marina M. -C. Höhne

    Abstract: Deep Neural Networks (DNNs) demonstrate remarkable capabilities in learning complex hierarchical data representations, but the nature of these representations remains largely unknown. Existing global explainability methods, such as Network Dissection, face limitations such as reliance on segmentation masks, lack of statistical significance testing, and high computational demands. We propose Invers… ▽ More

    Submitted 18 January, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: 25 pages, 16 figures

    Journal ref: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  5. arXiv:2310.17638  [pdf, other

    cs.LG stat.ML

    Generative Fractional Diffusion Models

    Authors: Gabriel Nobis, Maximilian Springenberg, Marco Aversa, Michael Detzel, Rembert Daems, Roderick Murray-Smith, Shinichi Nakajima, Sebastian Lapuschkin, Stefano Ermon, Tolga Birdal, Manfred Opper, Christoph Knochenhauer, Luis Oala, Wojciech Samek

    Abstract: We introduce the first continuous-time score-based generative model that leverages fractional diffusion processes for its underlying dynamics. Although diffusion models have excelled at capturing data distributions, they still suffer from various limitations such as slow convergence, mode-collapse on imbalanced data, and lack of diversity. These issues are partially linked to the use of light-tail… ▽ More

    Submitted 24 June, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    ACM Class: I.2.4; F.4.1; G.3

  6. arXiv:2302.14112  [pdf, other

    cond-mat.dis-nn cs.LG math.PR stat.ML

    Injectivity of ReLU networks: perspectives from statistical physics

    Authors: Antoine Maillard, Afonso S. Bandeira, David Belius, Ivan Dokmanić, Shuta Nakajima

    Abstract: When can the input of a ReLU neural network be inferred from its output? In other words, when is the network injective? We consider a single layer, $x \mapsto \mathrm{ReLU}(Wx)$, with a random Gaussian $m \times n$ matrix $W$, in a high-dimensional setting where $n, m \to \infty$. Recent work connects this problem to spherical integral geometry giving rise to a conjectured sharp injectivity thresh… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 60 pages

  7. arXiv:2302.14082  [pdf, other

    hep-lat cs.LG physics.comp-ph

    Detecting and Mitigating Mode-Collapse for Flow-based Sampling of Lattice Field Theories

    Authors: Kim A. Nicoli, Christopher J. Anders, Tobias Hartung, Karl Jansen, Pan Kessel, Shinichi Nakajima

    Abstract: We study the consequences of mode-collapse of normalizing flows in the context of lattice field theory. Normalizing flows allow for independent sampling. For this reason, it is hoped that they can avoid the tunneling problem of local-update MCMC algorithms for multi-modal distributions. In this work, we first point out that the tunneling problem is also present for normalizing flows but is shifted… ▽ More

    Submitted 3 November, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 10 pages, 7 figures, 6 pages of supplement material

  8. arXiv:2210.04962  [pdf, other

    cs.CL

    Domain-Specific Word Embeddings with Structure Prediction

    Authors: Stephanie Brandl, David Lassner, Anne Baillot, Shinichi Nakajima

    Abstract: Complementary to finding good general word embeddings, an important question for representation learning is to find dynamic word embeddings, e.g., across time or domain. Current methods do not offer a way to use or predict information on structure between sub-corpora, time or domain and dynamic embeddings can only be compared after post-alignment. We propose novel word embedding methods that provi… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: accepted at TACL 13 pages, 4 figures

  9. arXiv:2207.08219  [pdf, other

    cs.LG stat.ML

    Gradients should stay on Path: Better Estimators of the Reverse- and Forward KL Divergence for Normalizing Flows

    Authors: Lorenz Vaitl, Kim A. Nicoli, Shinichi Nakajima, Pan Kessel

    Abstract: We propose an algorithm to estimate the path-gradient of both the reverse and forward Kullback-Leibler divergence for an arbitrary manifestly invertible normalizing flow. The resulting path-gradient estimators are straightforward to implement, have lower variance, and lead not only to faster convergence of training but also to better overall approximation results compared to standard total gradien… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: 29 pages, 8 figures

  10. arXiv:2206.11723  [pdf, other

    cs.CV cs.LG

    Self-Supervised Training with Autoencoders for Visual Anomaly Detection

    Authors: Alexander Bauer, Shinichi Nakajima, Klaus-Robert Müller

    Abstract: We focus on a specific use case in anomaly detection where the distribution of normal samples is supported by a lower-dimensional manifold. Here, regularized autoencoders provide a popular approach by learning the identity map** on the set of normal examples, while trying to prevent good reconstruction on points outside of the manifold. Typically, this goal is implemented by controlling the capa… ▽ More

    Submitted 13 May, 2024; v1 submitted 23 June, 2022; originally announced June 2022.

  11. arXiv:2206.09016  [pdf, other

    cs.LG stat.ML

    Path-Gradient Estimators for Continuous Normalizing Flows

    Authors: Lorenz Vaitl, Kim A. Nicoli, Shinichi Nakajima, Pan Kessel

    Abstract: Recent work has established a path-gradient estimator for simple variational Gaussian distributions and has argued that the path-gradient is particularly beneficial in the regime in which the variational distribution approaches the exact target distribution. In many applications, this regime can however not be reached by a simple Gaussian variational distribution. In this work, we overcome this cr… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 8 pages, 5 figures, 39th International Conference on Machine Learning

  12. arXiv:2204.05229  [pdf, other

    cs.LG stat.ML

    Mixture-of-experts VAEs can disregard variation in surjective multimodal data

    Authors: Jannik Wolff, Tassilo Klein, Moin Nabi, Rahul G. Krishnan, Shinichi Nakajima

    Abstract: Machine learning systems are often deployed in domains that entail data from multiple modalities, for example, phenotypic and genotypic characteristics describe patients in healthcare. Previous works have developed multimodal variational autoencoders (VAEs) that generate several modalities. We consider subjective data, where single datapoints from one modality (such as class labels) describe multi… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted at the NeurIPS 2021 workshop on Bayesian Deep Learning

  13. arXiv:2201.10859  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Visualizing the Diversity of Representations Learned by Bayesian Neural Networks

    Authors: Dennis Grinwald, Kirill Bykov, Shinichi Nakajima, Marina M. -C. Höhne

    Abstract: Explainable Artificial Intelligence (XAI) aims to make learning machines less opaque, and offers researchers and practitioners various tools to reveal the decision-making strategies of neural networks. In this work, we investigate how XAI methods can be used for exploring and visualizing the diversity of feature representations learned by Bayesian Neural Networks (BNNs). Our goal is to provide a g… ▽ More

    Submitted 14 November, 2023; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: 16 pages, 18 figures

    Journal ref: Published in Transactions on Machine Learning Research (11/2023)

  14. arXiv:2111.11303  [pdf, ps, other

    hep-lat cs.LG

    Machine Learning of Thermodynamic Observables in the Presence of Mode Collapse

    Authors: Kim A. Nicoli, Christopher Anders, Lena Funcke, Tobias Hartung, Karl Jansen, Pan Kessel, Shinichi Nakajima, Paolo Stornati

    Abstract: Estimating the free energy, as well as other thermodynamic observables, is a key task in lattice field theories. Recently, it has been pointed out that deep generative models can be used in this context [1]. Crucially, these models allow for the direct estimation of the free energy at a given point in parameter space. This is in contrast to existing methods based on Markov chains which generically… ▽ More

    Submitted 30 November, 2021; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: 10 pages, 2 figures, Proceedings of the 38th International Symposium on Lattice Field Theory, 26th-30th July 2021, Zoom/Gather@Massachusetts Institute of Technology

    Report number: MIT-CTP/5353

  15. arXiv:2111.09462  [pdf

    physics.optics cs.ET quant-ph

    Order recognition by Schubert polynomials generated by optical near-field statistics via nanometre-scale photochromism

    Authors: Kazuharu Uchiyama, Sota Nakajima, Hirotsugu Suzui, Nicolas Chauvet, Hayato Saigo, Ryoichi Horisaki, Kingo Uchida, Makoto Naruse, Hirokazu Hori

    Abstract: We have previously observed an irregular spatial distribution of photon transmission through a photochromic crystal photoisomerized by a local optical near-field excitation, manifesting complex branching processes via the interplay of deformation of the material and near-field photon transfer therein. Furthermore, by combining such naturally constructed complex photon transmission with a simple ph… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

  16. arXiv:2108.10346  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Explaining Bayesian Neural Networks

    Authors: Kirill Bykov, Marina M. -C. Höhne, Adelaida Creosteanu, Klaus-Robert Müller, Frederick Klauschen, Shinichi Nakajima, Marius Kloft

    Abstract: To make advanced learning machines such as Deep Neural Networks (DNNs) more transparent in decision making, explainable AI (XAI) aims to provide interpretations of DNNs' predictions. These interpretations are usually given in the form of heatmaps, each one illustrating relevant patterns regarding the prediction for a given instance. Bayesian approaches such as Bayesian Neural Networks (BNNs) so fa… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: 16 pages, 8 figures

  17. arXiv:2107.05045  [pdf, other

    cs.LG

    Positive-Unlabeled Classification under Class-Prior Shift: A Prior-invariant Approach Based on Density Ratio Estimation

    Authors: Shota Nakajima, Masashi Sugiyama

    Abstract: Learning from positive and unlabeled (PU) data is an important problem in various applications. Most of the recent approaches for PU classification assume that the class-prior (the ratio of positive samples) in the training unlabeled dataset is identical to that of the test data, which does not hold in many practical cases. In addition, we usually do not know the class-priors of the training and t… ▽ More

    Submitted 15 December, 2021; v1 submitted 11 July, 2021; originally announced July 2021.

    Comments: 36 pages, 4 figures

  18. arXiv:2106.10185  [pdf, other

    cs.LG cs.AI

    NoiseGrad: Enhancing Explanations by Introducing Stochasticity to Model Weights

    Authors: Kirill Bykov, Anna Hedström, Shinichi Nakajima, Marina M. -C. Höhne

    Abstract: Many efforts have been made for revealing the decision-making process of black-box learning machines such as deep neural networks, resulting in useful local and global explanation methods. For local explanation, stochasticity is known to help: a simple method, called SmoothGrad, has improved the visual quality of gradient-based attribution by adding noise to the input space and averaging the expla… ▽ More

    Submitted 30 May, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 21 pages, 18 figures

  19. arXiv:2105.11990  [pdf, other

    cs.LG

    Optimal Sampling Density for Nonparametric Regression

    Authors: Danny Panknin, Klaus Robert Müller, Shinichi Nakajima

    Abstract: We propose a novel active learning strategy for regression, which is model-agnostic, robust against model mismatch, and interpretable. Assuming that a small number of initial samples are available, we derive the optimal training density that minimizes the generalization error of local polynomial smoothing (LPS) with its kernel bandwidth tuned locally: We adopt the mean integrated squared error (MI… ▽ More

    Submitted 24 July, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: 50 pages, plus 33 pages appendix

  20. arXiv:2008.13723  [pdf, other

    cs.LG stat.ML

    Langevin Cooling for Domain Translation

    Authors: Vignesh Srinivasan, Klaus-Robert Müller, Wojciech Samek, Shinichi Nakajima

    Abstract: Domain translation is the task of finding correspondence between two domains. Several Deep Neural Network (DNN) models, e.g., CycleGAN and cross-lingual language models, have shown remarkable successes on this task under the unsupervised setting---the map**s between the domains are learned from two independent sets of training data in both domains (without paired samples). However, those methods… ▽ More

    Submitted 31 August, 2020; originally announced August 2020.

  21. arXiv:2007.07115  [pdf, other

    hep-lat cs.LG physics.comp-ph

    Estimation of Thermodynamic Observables in Lattice Field Theories with Deep Generative Models

    Authors: Kim A. Nicoli, Christopher J. Anders, Lena Funcke, Tobias Hartung, Karl Jansen, Pan Kessel, Shinichi Nakajima, Paolo Stornati

    Abstract: In this work, we demonstrate that applying deep generative machine learning models for lattice field theory is a promising route for solving problems where Markov Chain Monte Carlo (MCMC) methods are problematic. More specifically, we show that generative models can be used to estimate the absolute value of the free energy, which is in contrast to existing MCMC-based methods which are limited to o… ▽ More

    Submitted 5 January, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: 8 figures

    Journal ref: Phys. Rev. Lett. 126, 032001 (2021)

  22. arXiv:2006.09000  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    How Much Can I Trust You? -- Quantifying Uncertainties in Explaining Neural Networks

    Authors: Kirill Bykov, Marina M. -C. Höhne, Klaus-Robert Müller, Shinichi Nakajima, Marius Kloft

    Abstract: Explainable AI (XAI) aims to provide interpretations for predictions made by learning machines, such as deep neural networks, in order to make the machines more transparent for the user and furthermore trustworthy also for applications in e.g. safety-critical areas. So far, however, no methods for quantifying uncertainties of explanations have been conceived, which is problematic in domains where… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 12 pages, 10 figures

  23. arXiv:2006.03589  [pdf, other

    cs.LG cs.AI stat.ML

    Higher-Order Explanations of Graph Neural Networks via Relevant Walks

    Authors: Thomas Schnake, Oliver Eberle, Jonas Lederer, Shinichi Nakajima, Kristof T. Schütt, Klaus-Robert Müller, Grégoire Montavon

    Abstract: Graph Neural Networks (GNNs) are a popular approach for predicting graph structured data. As GNNs tightly entangle the input graph into the neural network structure, common explainable AI approaches are not applicable. To a large extent, GNNs have remained black-boxes for the user so far. In this paper, we show that GNNs can in fact be naturally explained using higher-order expansions, i.e. by ide… ▽ More

    Submitted 26 November, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: 14 pages + 6 pages supplement

  24. arXiv:2003.09136  [pdf

    cs.LG cs.CL stat.ML

    Automatic Identification of Types of Alterations in Historical Manuscripts

    Authors: David Lassner, Anne Baillot, Sergej Dogadov, Klaus-Robert Müller, Shinichi Nakajima

    Abstract: Alterations in historical manuscripts such as letters represent a promising field of research. On the one hand, they help understand the construction of text. On the other hand, topics that are being considered sensitive at the time of the manuscript gain coherence and contextuality when taking alterations into account, especially in the case of deletions. The analysis of alterations in manuscript… ▽ More

    Submitted 4 November, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

    Comments: Accepted for publication in Digital Humanities Quarterly

  25. arXiv:1912.12090  [pdf, other

    cs.DM cs.LG

    Polynomial-Time Exact MAP Inference on Discrete Models with Global Dependencies

    Authors: Alexander Bauer, Shinichi Nakajima

    Abstract: Considering the worst-case scenario, junction tree algorithm remains the most general solution for exact MAP inference with polynomial run-time guarantees. Unfortunately, its main tractability assumption requires the treewidth of a corresponding MRF to be bounded strongly limiting the range of admissible applications. In fact, many practical problems in the area of structured prediction require mo… ▽ More

    Submitted 9 February, 2022; v1 submitted 27 December, 2019; originally announced December 2019.

  26. arXiv:1911.11596  [pdf, ps, other

    cs.LG

    Distortion and Faults in Machine Learning Software

    Authors: Shin Nakajima

    Abstract: Machine learning software, deep neural networks (DNN) software in particular, discerns valuable information from a large dataset, a set of data. Outcomes of such DNN programs are dependent on the quality of both learning programs and datasets. Unfortunately, the quality of datasets is difficult to be defined, because they are just samples. The quality assurance of DNN software is difficult, becaus… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: Presented at the 9th SOFL+MSVL Workshop in Shenzhen, November 2019

  27. arXiv:1910.13496  [pdf, other

    cond-mat.stat-mech cs.LG stat.ML

    Asymptotically unbiased estimation of physical observables with neural samplers

    Authors: Kim A. Nicoli, Shinichi Nakajima, Nils Strodthoff, Wojciech Samek, Klaus-Robert Müller, Pan Kessel

    Abstract: We propose a general framework for the estimation of observables with generative neural samplers focusing on modern deep generative neural networks that provide an exact sampling probability. In this framework, we present asymptotically unbiased estimators for generic observables, including those that explicitly depend on the partition function such as free energy or entropy, and derive correspond… ▽ More

    Submitted 13 February, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: 5 figures

    Journal ref: Phys. Rev. E 101, 023304 (2020)

  28. arXiv:1910.09840  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Towards Best Practice in Explaining Neural Network Decisions with LRP

    Authors: Maximilian Kohlbrenner, Alexander Bauer, Shinichi Nakajima, Alexander Binder, Wojciech Samek, Sebastian Lapuschkin

    Abstract: Within the last decade, neural network based predictors have demonstrated impressive - and at times super-human - capabilities. This performance is often paid for with an intransparent prediction process and thus has sparked numerous contributions in the novel field of explainable artificial intelligence (XAI). In this paper, we focus on a popular and widely used method of XAI, the Layer-wise Rele… ▽ More

    Submitted 13 July, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 7 pages, 4 figures, 1 table. fixed table row compared to v2. Presented virtually at IJCNN 2020

  29. arXiv:1904.05586  [pdf, other

    cs.CV

    Black-Box Decision based Adversarial Attack with Symmetric $α$-stable Distribution

    Authors: Vignesh Srinivasan, Ercan E. Kuruoglu, Klaus-Robert Müller, Wojciech Samek, Shinichi Nakajima

    Abstract: Develo** techniques for adversarial attack and defense is an important research field for establishing reliable machine learning and its applications. Many existing methods employ Gaussian random variables for exploring the data space to find the most adversarial (for attacking) or least adversarial (for defense) point. However, the Gaussian distribution is not necessarily the optimal choice whe… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

  30. arXiv:1903.11048  [pdf, other

    cond-mat.stat-mech cs.LG stat.ML

    Comment on "Solving Statistical Mechanics Using VANs": Introducing saVANt - VANs Enhanced by Importance and MCMC Sampling

    Authors: Kim Nicoli, Pan Kessel, Nils Strodthoff, Wojciech Samek, Klaus-Robert Müller, Shinichi Nakajima

    Abstract: In this comment on "Solving Statistical Mechanics Using Variational Autoregressive Networks" by Wu et al., we propose a subtle yet powerful modification of their approach. We show that the inherent sampling error of their method can be corrected by using neural network-based MCMC or importance sampling which leads to asymptotically unbiased estimators for physical quantities. This modification is… ▽ More

    Submitted 26 March, 2019; originally announced March 2019.

    Comments: 6 pages, 4 figures

  31. arXiv:1902.10664  [pdf, other

    cs.LG math.NA stat.ML

    Local Function Complexity for Active Learning via Mixture of Gaussian Processes

    Authors: Danny Panknin, Stefan Chmiela, Klaus-Robert Müller, Shinichi Nakajima

    Abstract: Inhomogeneities in real-world data, e.g., due to changes in the observation noise level or variations in the structural complexity of the source function, pose a unique set of challenges for statistical inference. Accounting for them can greatly improve predictive power when physical resources or computation time is limited. In this paper, we draw on recent theoretical results on the estimation of… ▽ More

    Submitted 12 December, 2023; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: 30 pages (+18 pages of references and appendices), 20 figures

    Journal ref: Transactions on Machine Learning Research, December 2023

  32. arXiv:1806.11326  [pdf, other

    stat.ML cs.LG

    Unsupervised Detection and Explanation of Latent-class Contextual Anomalies

    Authors: Jacob Kauffmann, Grégoire Montavon, Luiz Alberto Lima, Shinichi Nakajima, Klaus-Robert Müller, Nico Görnitz

    Abstract: Detecting and explaining anomalies is a challenging effort. This holds especially true when data exhibits strong dependencies and single measurements need to be assessed and analyzed in their respective context. In this work, we consider scenarios where measurements are non-i.i.d, i.e. where samples are dependent on corresponding discrete latent variables which are connected through some given dep… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

  33. arXiv:1806.06126  [pdf, other

    cs.DS

    Tight Bound of Incremental Cover Trees for Dynamic Diversification

    Authors: Hannah Marienwald, Wikor Pronobis, Klaus-Robert Müller, Shinichi Nakajima

    Abstract: Dynamic diversification---finding a set of data points with maximum diversity from a time-dependent sample pool---is an important task in recommender systems, web search, database search, and notification services, to avoid showing users duplicate or very similar items. The incremental cover tree (ICT) with high computational efficiency and flexibility has been applied to this task, and shown good… ▽ More

    Submitted 15 June, 2018; originally announced June 2018.

  34. arXiv:1805.12017  [pdf, other

    cs.LG stat.ML

    Robustifying Models Against Adversarial Attacks by Langevin Dynamics

    Authors: Vignesh Srinivasan, Arturo Marban, Klaus-Robert Müller, Wojciech Samek, Shinichi Nakajima

    Abstract: Adversarial attacks on deep learning models have compromised their performance considerably. As remedies, a lot of defense methods were proposed, which however, have been circumvented by newer attacking strategies. In the midst of this ensuing arms race, the problem of robustness against adversarial attacks still remains unsolved. This paper proposes a novel, simple yet effective defense strategy… ▽ More

    Submitted 6 June, 2019; v1 submitted 30 May, 2018; originally announced May 2018.

  35. arXiv:1805.04636   

    cs.LO cs.SE

    Proceedings Joint Workshop on Handling IMPlicit and EXplicit knowledge in formal system development (IMPEX) and Formal and Model-Driven Techniques for Develo** Trustworthy Systems (FM&MDD)

    Authors: Régine Laleau, Dominique Méry, Shin Nakajima, Elena Troubitsyna

    Abstract: This volume contains the joint proceedings of IMPEX 2017, the first workshop on Handling IMPlicit and EXplicit knowledge in formal system development and FM&MDD, the second workshop on Formal and Model-Driven Techniques for Develo** Trustworthy Systems (FM&MDD) held together on November 16, 2017 in Xi'an, China, as part of ICFEM 2017, 19th International Conference on Formal Engineering Method… ▽ More

    Submitted 11 May, 2018; originally announced May 2018.

    Journal ref: EPTCS 271, 2018

  36. arXiv:1709.01562  [pdf, other

    cs.CL

    Optimizing for Measure of Performance in Max-Margin Parsing

    Authors: Alexander Bauer, Shinichi Nakajima, Nico Görnitz, Klaus-Robert Müller

    Abstract: Many statistical learning problems in the area of natural language processing including sequence tagging, sequence segmentation and syntactic parsing has been successfully approached by means of structured prediction methods. An appealing property of the corresponding discriminative learning algorithms is their ability to integrate the loss function of interest directly into the optimization proce… ▽ More

    Submitted 8 September, 2017; v1 submitted 5 September, 2017; originally announced September 2017.

  37. arXiv:1708.03314  [pdf, other

    cs.DS

    Partial Optimality of Dual Decomposition for MAP Inference in Pairwise MRFs

    Authors: Alexander Bauer, Shinichi Nakajima, Nico Görnitz, Klaus-Robert Müller

    Abstract: Markov random fields (MRFs) are a powerful tool for modelling statistical dependencies for a set of random variables using a graphical representation. An important computational problem related to MRFs, called maximum a posteriori (MAP) inference, is finding a joint variable assignment with the maximal probability. It is well known that the two popular optimisation techniques for this task, linear… ▽ More

    Submitted 9 August, 2017; originally announced August 2017.

  38. arXiv:1609.03219  [pdf, other

    stat.ML cs.LG

    Sharing Hash Codes for Multiple Purposes

    Authors: Wikor Pronobis, Danny Panknin, Johannes Kirschnick, Vignesh Srinivasan, Wojciech Samek, Volker Markl, Manohar Kaul, Klaus-Robert Mueller, Shinichi Nakajima

    Abstract: Locality sensitive hashing (LSH) is a powerful tool for sublinear-time approximate nearest neighbor search, and a variety of hashing schemes have been proposed for different dissimilarity measures. However, hash codes significantly depend on the dissimilarity, which prohibits users from adjusting the dissimilarity at query time. In this paper, we propose {multiple purpose LSH (mp-LSH) which shares… ▽ More

    Submitted 1 June, 2017; v1 submitted 11 September, 2016; originally announced September 2016.

  39. arXiv:1609.00626  [pdf, other

    cs.CL stat.AP

    SynsetRank: Degree-adjusted Random Walk for Relation Identification

    Authors: Shinichi Nakajima, Sebastian Krause, Dirk Weissenborn, Sven Schmeier, Nico Goernitz, Feiyu Xu

    Abstract: In relation extraction, a key process is to obtain good detectors that find relevant sentences describing the target relation. To minimize the necessity of labeled data for refining detectors, previous work successfully made use of BabelNet, a semantic graph structure expressing relationships between synsets, as side information or prior knowledge. The goal of this paper is to enhance the use of g… ▽ More

    Submitted 15 September, 2016; v1 submitted 2 September, 2016; originally announced September 2016.

  40. arXiv:1507.06878  [pdf, other

    quant-ph cs.CC cs.DS

    Quantum Algorithm for Triangle Finding in Sparse Graphs

    Authors: François Le Gall, Shogo Nakajima

    Abstract: This paper presents a quantum algorithm for triangle finding over sparse graphs that improves over the previous best quantum algorithm for this task by Buhrman et al. [SIAM Journal on Computing, 2005]. Our algorithm is based on the recent $\tilde O(n^{5/4})$-query algorithm given by Le Gall [FOCS 2014] for triangle finding over dense graphs (here $n$ denotes the number of vertices in the graph). W… ▽ More

    Submitted 24 July, 2015; originally announced July 2015.

    Comments: 13 pages

    Journal ref: Algorithmica, Vol. 79 No. 3, pp. 941-959, 2017

  41. Sparse Probit Linear Mixed Model

    Authors: Stephan Mandt, Florian Wenzel, Shinichi Nakajima, John P. Cunningham, Christoph Lippert, Marius Kloft

    Abstract: Linear Mixed Models (LMMs) are important tools in statistical genetics. When used for feature selection, they allow to find a sparse set of genetic traits that best predict a continuous phenotype of interest, while simultaneously correcting for various confounding factors such as age, ethnicity and population structure. Formulated as models for linear regression, LMMs have been restricted to conti… ▽ More

    Submitted 17 July, 2017; v1 submitted 16 July, 2015; originally announced July 2015.

    Comments: Published version, 21 pages, 6 figures

    Journal ref: Machine Learning, 106(9), 1621-1642 (2017)

  42. Insights from Classifying Visual Concepts with Multiple Kernel Learning

    Authors: Alexander Binder, Shinichi Nakajima, Marius Kloft, Christina Müller, Wojciech Samek, Ulf Brefeld, Klaus-Robert Müller, Motoaki Kawanabe

    Abstract: Combining information from various image features has become a standard technique in concept recognition tasks. However, the optimal way of fusing the resulting kernel functions is usually unknown in practical applications. Multiple kernel learning (MKL) techniques allow to determine an optimal linear combination of such similarity matrices. Classical approaches to MKL promote sparse mixtures. Unf… ▽ More

    Submitted 15 December, 2011; originally announced December 2011.

    Comments: 18 pages, 8 tables, 4 figures, format deviating from plos one submission format requirements for aesthetic reasons

    Journal ref: PLoS ONE 7(8): e38897, 2012