Skip to main content

Showing 1–23 of 23 results for author: Schmidt, M N

.
  1. arXiv:2406.13584  [pdf, other

    cs.LG

    Explaining time series models using frequency masking

    Authors: Thea Brüsch, Kristoffer K. Wickstrøm, Mikkel N. Schmidt, Tommy S. Alstrøm, Robert Jenssen

    Abstract: Time series data is fundamentally important for describing many critical domains such as healthcare, finance, and climate, where explainable models are necessary for safe automated decision-making. To develop eXplainable AI (XAI) in these domains therefore implies explaining salient information in the time series. Current methods for obtaining saliency maps assumes localized information in the raw… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Submitted to the Next Generation of AI Safety workshop at ICML 2024

  2. arXiv:2405.13367  [pdf, other

    eess.SP

    End-to-End Learning of Pulse-Shaper and Receiver Filter in the Presence of Strong Intersymbol Interference

    Authors: Søren Føns Nielsen, Francesco Da Ros, Mikkel N. Schmidt, Darko Zibar

    Abstract: We numerically demonstrate that joint optimization of FIR based pulse-shaper and receiver filter results in an improved system performance, and shorter filter lengths (lower complexity), for 4-PAM 100 GBd IM/DD systems.

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 4 pages (3 article pages + 1 page for references) and 5 figures. Submitted to European Conference on Optical Communications (ECOC) 2024

  3. arXiv:2312.04174  [pdf, other

    stat.ML cs.LG physics.comp-ph

    Coherent energy and force uncertainty in deep learning force fields

    Authors: Peter Bjørn Jørgensen, Jonas Busk, Ole Winther, Mikkel N. Schmidt

    Abstract: In machine learning energy potentials for atomic systems, forces are commonly obtained as the negative derivative of the energy function with respect to atomic positions. To quantify aleatoric uncertainty in the predicted energies, a widely used modeling approach involves predicting both a mean and variance for each energy value. However, this model is not differentiable under the usual white nois… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Presented at Advancing Molecular Machine Learning - Overcoming Limitations [ML4Molecules], ELLIS workshop, VIRTUAL, December 8, 2023, unofficial NeurIPS 2023 side-event

  4. arXiv:2307.09614  [pdf, other

    stat.ML cs.LG eess.SP

    Multi-view self-supervised learning for multivariate variable-channel time series

    Authors: Thea Brüsch, Mikkel N. Schmidt, Tommy S. Alstrøm

    Abstract: Labeling of multivariate biomedical time series data is a laborious and expensive process. Self-supervised contrastive learning alleviates the need for large, labeled datasets through pretraining on unlabeled data. However, for multivariate time series data, the set of input channels often varies between applications, and most existing work does not allow for transfer between datasets with differe… ▽ More

    Submitted 20 July, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: To appear in proceedings of 2023 IEEE International workshop on Machine Learning for Signal Processing

  5. arXiv:2306.13263  [pdf, other

    cs.LG cs.CV cs.DC

    Synthetic data shuffling accelerates the convergence of federated learning under data heterogeneity

    Authors: Bo Li, Yasin Esfandiari, Mikkel N. Schmidt, Tommy S. Alstrøm, Sebastian U. Stich

    Abstract: In federated learning, data heterogeneity is a critical challenge. A straightforward solution is to shuffle the clients' data to homogenize the distribution. However, this may violate data access rights, and how and when shuffling can accelerate the convergence of a federated optimization algorithm is not theoretically well understood. In this paper, we establish a precise and quantifiable corresp… ▽ More

    Submitted 8 April, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: Accepted at TMLR

  6. arXiv:2305.16325  [pdf, other

    physics.chem-ph cs.LG

    Graph Neural Network Interatomic Potential Ensembles with Calibrated Aleatoric and Epistemic Uncertainty on Energy and Forces

    Authors: Jonas Busk, Mikkel N. Schmidt, Ole Winther, Tejs Vegge, Peter Bjørn Jørgensen

    Abstract: Inexpensive machine learning potentials are increasingly being used to speed up structural optimization and molecular dynamics simulations of materials by iteratively predicting and applying interatomic forces. In these settings, it is crucial to detect when predictions are unreliable to avoid wrong or misleading results. Here, we present a complete framework for training and recalibrating graph n… ▽ More

    Submitted 11 September, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

  7. arXiv:2212.02191  [pdf, other

    cs.LG cs.DC

    On the effectiveness of partial variance reduction in federated learning with heterogeneous data

    Authors: Bo Li, Mikkel N. Schmidt, Tommy S. Alstrøm, Sebastian U. Stich

    Abstract: Data heterogeneity across clients is a key challenge in federated learning. Prior works address this by either aligning client and server models or using control variates to correct client model drift. Although these methods achieve fast convergence in convex or simple non-convex problems, the performance in over-parameterized models such as deep neural networks is lacking. In this paper, we first… ▽ More

    Submitted 9 June, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted to CVPR 2023

  8. End-to-End Learning for VCSEL-based Optical Interconnects: State-of-the-Art, Challenges, and Opportunities

    Authors: Muralikrishnan Srinivasan, **xiang Song, Alexander Grabowski, Krzysztof Szczerba, Holger K. Iversen, Mikkel N. Schmidt, Darko Zibar, Jochen Schröder, Anders Larsson, Christian Häger, Henk Wymeersch

    Abstract: Optical interconnects (OIs) based on vertical-cavity surface-emitting lasers (VCSELs) are the main workhorse within data centers, supercomputers, and even vehicles, providing low-cost, high-rate connectivity. VCSELs must operate under extremely harsh and time-varying conditions, thus requiring adaptive and flexible designs of the communication chain. Such designs can be built based on mathematical… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

  9. Raman Spectrum Matching with Contrastive Representation Learning

    Authors: Bo Li, Mikkel N. Schmidt, Tommy S. Alstrøm

    Abstract: Raman spectroscopy is an effective, low-cost, non-intrusive technique often used for chemical identification. Typical approaches are based on matching observations to a reference database, which requires careful preprocessing, or supervised machine learning, which requires a fairly large number of training observations from each class. We propose a new machine learning technique for Raman spectrum… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: Under review at Analytical Chemistry

  10. arXiv:2201.06863  [pdf, other

    cs.AI

    Programmatic Policy Extraction by Iterative Local Search

    Authors: Rasmus Larsen, Mikkel Nørgaard Schmidt

    Abstract: Reinforcement learning policies are often represented by neural networks, but programmatic policies are preferred in some cases because they are more interpretable, amenable to formal verification, or generalize better. While efficient algorithms for learning neural policies exist, learning programmatic policies is challenging. Combining imitation-projection and dataset aggregation with a local se… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

  11. arXiv:2107.06068  [pdf, ps, other

    cs.LG stat.ML

    Calibrated Uncertainty for Molecular Property Prediction using Ensembles of Message Passing Neural Networks

    Authors: Jonas Busk, Peter Bjørn Jørgensen, Arghya Bhowmik, Mikkel N. Schmidt, Ole Winther, Tejs Vegge

    Abstract: Data-driven methods based on machine learning have the potential to accelerate computational analysis of atomic structures. In this context, reliable uncertainty estimates are important for assessing confidence in predictions and enabling decision making. However, machine learning models can produce badly calibrated uncertainty estimates and it is therefore crucial to detect and handle uncertainty… ▽ More

    Submitted 3 November, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  12. arXiv:1905.06048  [pdf, other

    cond-mat.mtrl-sci stat.ML

    Materials property prediction using symmetry-labeled graphs as atomic-position independent descriptors

    Authors: Peter Bjørn Jørgensen, Estefanía Garijo del Río, Mikkel N. Schmidt, Karsten Wedel Jacobsen

    Abstract: Computational materials screening studies require fast calculation of the properties of thousands of materials. The calculations are often performed with Density Functional Theory (DFT), but the necessary computer time sets limitations for the investigated material space. Therefore, the development of machine learning models for prediction of DFT calculated properties are currently of interest. A… ▽ More

    Submitted 15 October, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: 14 pages including references and 13 figures

    Journal ref: Phys. Rev. B 100, 104114 (2019)

  13. arXiv:1806.08195  [pdf, other

    stat.ML cs.LG

    Probabilistic PARAFAC2

    Authors: Philip J. H. Jørgensen, Søren F. V. Nielsen, Jesper L. Hinrich, Mikkel N. Schmidt, Kristoffer H. Madsen, Morten Mørup

    Abstract: The PARAFAC2 is a multimodal factor analysis model suitable for analyzing multi-way data when one of the modes has incomparable observation units, for example because of differences in signal sampling or batch sizes. A fully probabilistic treatment of the PARAFAC2 is desirable in order to improve robustness to noise and provide a well founded principle for determining the number of factors, but ch… ▽ More

    Submitted 21 June, 2018; originally announced June 2018.

    Comments: 16 pages (incl. 4 pages of supplemental material), 5 figures

  14. arXiv:1806.03146  [pdf, other

    stat.ML cs.LG

    Neural Message Passing with Edge Updates for Predicting Properties of Molecules and Materials

    Authors: Peter Bjørn Jørgensen, Karsten Wedel Jacobsen, Mikkel N. Schmidt

    Abstract: Neural message passing on molecular graphs is one of the most promising methods for predicting formation energy and other properties of molecules and materials. In this work we extend the neural message passing model with an edge update network which allows the information exchanged between atoms to depend on the hidden state of the receiving atom. We benchmark the proposed model on three publicly… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

  15. arXiv:1612.04555  [pdf, ps, other

    stat.AP stat.ML

    Scalable Group Level Probabilistic Sparse Factor Analysis

    Authors: Jesper L. Hinrich, Søren F. V. Nielsen, Nicolai A. B. Riis, Casper T. Eriksen, Jacob Frøsig, Marco D. F. Kristensen, Mikkel N. Schmidt, Kristoffer H. Madsen, Morten Mørup

    Abstract: Many data-driven approaches exist to extract neural representations of functional magnetic resonance imaging (fMRI) data, but most of them lack a proper probabilistic formulation. We propose a group level scalable probabilistic sparse factor analysis (psFA) allowing spatially sparse maps, component pruning using automatic relevance determination (ARD) and subject specific heteroscedastic spatial n… ▽ More

    Submitted 14 December, 2016; originally announced December 2016.

    Comments: 10 pages plus 5 pages appendix, Submitted to ICASSP 17

  16. arXiv:1601.00496  [pdf, other

    stat.AP q-bio.NC stat.ML

    Nonparametric Modeling of Dynamic Functional Connectivity in fMRI Data

    Authors: Søren F. V. Nielsen, Kristoffer H. Madsen, Rasmus Røge, Mikkel N. Schmidt, Morten Mørup

    Abstract: Dynamic functional connectivity (FC) has in recent years become a topic of interest in the neuroimaging community. Several models and methods exist for both functional magnetic resonance imaging (fMRI) and electroencephalography (EEG), and the results point towards the conclusion that FC exhibits dynamic changes. The existing approaches modeling dynamic connectivity have primarily been based on ti… ▽ More

    Submitted 8 June, 2016; v1 submitted 4 January, 2016; originally announced January 2016.

    Comments: 8 pages, 1 figure. Presented at the Machine Learning and Interpretation in Neuroimaging Workshop (MLINI-2015), 2015 (arXiv:1605.04435)

    Report number: MLINI/2015/08

  17. Bayesian Dropout

    Authors: Tue Herlau, Morten Mørup, Mikkel N. Schmidt

    Abstract: Dropout has recently emerged as a powerful and simple method for training neural networks preventing co-adaptation by stochastically omitting neurons. Dropout is currently not grounded in explicit modelling assumptions which so far has precluded its adoption in Bayesian modelling. Using Bayesian entropic reasoning we show that dropout can be interpreted as optimal inference under constraints. We d… ▽ More

    Submitted 17 May, 2022; v1 submitted 12 August, 2015; originally announced August 2015.

    Comments: 21 pages, 3 figures. Manuscript prepared 2014 and awaiting submission

    MSC Class: 68T37 ACM Class: I.5.1

    Journal ref: Procedia Computer Science 201 (2022) 771-776

  18. arXiv:1507.02925  [pdf, other

    stat.ML

    Completely random measures for modelling block-structured networks

    Authors: Tue Herlau, Mikkel N. Schmidt, Morten Mørup

    Abstract: Many statistical methods for network data parameterize the edge-probability by attributing latent traits to the vertices such as block structure and assume exchangeability in the sense of the Aldous-Hoover representation theorem. Empirical studies of networks indicate that many real-world networks have a power-law distribution of the vertices which in turn implies the number of edges scale slower… ▽ More

    Submitted 4 December, 2015; v1 submitted 10 July, 2015; originally announced July 2015.

  19. arXiv:1406.0071  [pdf, other

    stat.ML

    Adaptive Reconfiguration Moves for Dirichlet Mixtures

    Authors: Tue Herlau, Morten Mørup, Yee Whye Teh, Mikkel N. Schmidt

    Abstract: Bayesian mixture models are widely applied for unsupervised learning and exploratory data analysis. Markov chain Monte Carlo based on Gibbs sampling and split-merge moves are widely used for inference in these models. However, both methods are restricted to limited types of transitions and suffer from torpid mixing and low accept rates even for problems of modest size. We propose a method that con… ▽ More

    Submitted 31 May, 2014; originally announced June 2014.

    Comments: Manuscript is in preparation, 26 pages including figures

  20. Non-parametric Bayesian modeling of complex networks

    Authors: Mikkel N. Schmidt, Morten Mørup

    Abstract: Modeling structure in complex networks using Bayesian non-parametrics makes it possible to specify flexible model structures and infer the adequate model complexity from the observed data. This paper provides a gentle introduction to non-parametric Bayesian modeling of complex networks: Using an infinite mixture model as running example we go through the steps of deriving the model as an infinite… ▽ More

    Submitted 20 December, 2013; originally announced December 2013.

    Journal ref: Signal Processing Magazine, IEEE (Volume:30, Issue:3, Year:2013)

  21. The Infinite Degree Corrected Stochastic Block Model

    Authors: Tue Herlau, Mikkel N. Schmidt, Morten Mørup

    Abstract: In Stochastic blockmodels, which are among the most prominent statistical models for cluster analysis of complex networks, clusters are defined as groups of nodes with statistically similar link probabilities within and between groups. A recent extension by Karrer and Newman incorporates a node degree correction to model degree heterogeneity within each group. Although this demonstrably leads to b… ▽ More

    Submitted 30 May, 2014; v1 submitted 11 November, 2013; originally announced November 2013.

    Comments: Originally presented at the Complex Networks workshop NIPS 2013

    Journal ref: Phys. Rev. E 90, 032819 (2014)

  22. arXiv:1311.1033  [pdf, other

    stat.ML

    Nonparametric Bayesian models of hierarchical structure in complex networks

    Authors: Mikkel N. Schmidt, Tue Herlau, Morten Mørup

    Abstract: Analyzing and understanding the structure of complex relational data is important in many applications including analysis of the connectivity in the human brain. Such networks can have prominent patterns on different scales, calling for a hierarchically structured model. We propose two non-parametric Bayesian hierarchical network models based on Gibbs fragmentation tree priors, and demonstrate the… ▽ More

    Submitted 21 November, 2013; v1 submitted 5 November, 2013; originally announced November 2013.

  23. arXiv:1101.5097  [pdf, ps, other

    cs.SI cs.LG physics.soc-ph

    Infinite Multiple Membership Relational Modeling for Complex Networks

    Authors: Morten Mørup, Mikkel N. Schmidt, Lars Kai Hansen

    Abstract: Learning latent structure in complex networks has become an important problem fueled by many types of networked data originating from practically all fields of science. In this paper, we propose a new non-parametric Bayesian multiple-membership latent feature model for networks. Contrary to existing multiple-membership models that scale quadratically in the number of vertices the proposed model sc… ▽ More

    Submitted 26 January, 2011; originally announced January 2011.

    Comments: 8 pages, 4 figures