Skip to main content

Showing 1–42 of 42 results for author: Nielsen, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13469  [pdf, other

    cs.CL cs.AI cs.LG

    Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks

    Authors: Dan Saattrup Nielsen, Kenneth Enevoldsen, Peter Schneider-Kamp

    Abstract: This paper explores the performance of encoder and decoder language models on multilingual Natural Language Understanding (NLU) tasks, with a broad focus on Germanic languages. Building upon the ScandEval benchmark, which initially was restricted to evaluating encoder models, we extend the evaluation framework to include decoder models. We introduce a method for evaluating decoder models on NLU ta… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 14 pages, 2 figures

    ACM Class: I.2.7

  2. arXiv:2403.15409  [pdf, other

    eess.SP cs.LG q-bio.NC

    Coupled generator decomposition for fusion of electro- and magnetoencephalography data

    Authors: Anders Stevnhoved Olsen, Jesper Duemose Nielsen, Morten Mørup

    Abstract: Data fusion modeling can identify common features across diverse data sources while accounting for source-specific variability. Here we introduce the concept of a \textit{coupled generator decomposition} and demonstrate how it generalizes sparse principal component analysis (SPCA) for data fusion. Leveraging data from a multisubject, multimodal (electro- and magnetoencephalography (EEG and MEG)) n… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  3. arXiv:2311.09145  [pdf, other

    cs.LG stat.ML

    Model Agnostic Explainable Selective Regression via Uncertainty Estimation

    Authors: Andrea Pugnana, Carlos Mougan, Dan Saattrup Nielsen

    Abstract: With the wide adoption of machine learning techniques, requirements have evolved beyond sheer high performance, often requiring models to be trustworthy. A common approach to increase the trustworthiness of such systems is to allow them to refrain from predicting. Such a framework is known as selective prediction. While selective prediction for classification tasks has been widely analyzed, the pr… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  4. arXiv:2311.07264  [pdf, other

    cs.CL

    Danish Foundation Models

    Authors: Kenneth Enevoldsen, Lasse Hansen, Dan S. Nielsen, Rasmus A. F. Egebæk, Søren V. Holm, Martin C. Nielsen, Martin Bernstorff, Rasmus Larsen, Peter B. Jørgensen, Malte Højmark-Bertelsen, Peter B. Vahlstrup, Per Møldrup-Dalum, Kristoffer Nielbo

    Abstract: Large language models, sometimes referred to as foundation models, have transformed multiple fields of research. However, smaller languages risk falling behind due to high training costs and small incentives for large companies to train these models. To combat this, the Danish Foundation Models project seeks to provide and maintain open, well-documented, and high-quality foundation models for the… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 4 pages, 2 tables

  5. arXiv:2306.05370  [pdf, other

    cs.CY cs.CL

    Detecting Human Rights Violations on Social Media during Russia-Ukraine War

    Authors: Poli Nemkova, Solomon Ubani, Suleyman Olcay Polat, Nayeon Kim, Rodney D. Nielsen

    Abstract: The present-day Russia-Ukraine military conflict has exposed the pivotal role of social media in enabling the transparent and unbridled sharing of information directly from the frontlines. In conflict zones where freedom of expression is constrained and information warfare is pervasive, social media has emerged as an indispensable lifeline. Anonymous social media platforms, as publicly available s… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: 9 pages

  6. arXiv:2304.00906  [pdf, other

    cs.CL cs.LG

    ScandEval: A Benchmark for Scandinavian Natural Language Processing

    Authors: Dan Saattrup Nielsen

    Abstract: This paper introduces a Scandinavian benchmarking platform, ScandEval, which can benchmark any pretrained model on four different tasks in the Scandinavian languages. The datasets used in two of the tasks, linguistic acceptability and question answering, are new. We develop and release a Python package and command-line interface, scandeval, which can benchmark any model that has been uploaded to t… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 17 pages, 11 figures, camera-ready NoDaLiDa 2023 submission

  7. arXiv:2304.00277  [pdf, other

    cs.NI

    Energy Consumption Optimization in Radio Access Networks (ECO-RAN)

    Authors: Anders Mariegaard, Kim G. Larsen, Marco Muniz, Thomas Dyhre Nielsen

    Abstract: In recent years, mobile network operators are showing interest in reducing energy consumption. Toward this goal, in cooperation with the Danish company 2Operate we have developed a stochastic simulation environment for mobile networks. Our simulator interacts with historical data from 2Operate and allow us to turn on and off network cells, replay traffic loads, etc. We have developed an optimizati… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: Report for Energy Cluster Denmark project of the year. https://www.energycluster.dk/en/eco-ran-wins-innovation-project-of-the-year/

  8. arXiv:2303.11042  [pdf, other

    cs.LG cs.AI

    Hospitalization Length of Stay Prediction using Patient Event Sequences

    Authors: Emil Riis Hansen, Thomas Dyhre Nielsen, Thomas Mulvad, Mads Nibe Strausholm, Tomer Sagi, Katja Hose

    Abstract: Predicting patients hospital length of stay (LOS) is essential for improving resource allocation and supporting decision-making in healthcare organizations. This paper proposes a novel approach for predicting LOS by modeling patient information as sequences of events. Specifically, we present a transformer-based model, termed Medic-BERT (M-BERT), for LOS prediction using the unique features descri… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 11 pages, 5 figures

    MSC Class: 68T07 ACM Class: I.2.7; J.3

  9. arXiv:2210.09014  [pdf

    cs.CY cs.AI cs.LG cs.SI

    Addressing contingency in algorithmic (mis)information classification: Toward a responsible machine learning agenda

    Authors: Andrés Domínguez Hernández, Richard Owen, Dan Saattrup Nielsen, Ryan McConville

    Abstract: Machine learning (ML) enabled classification models are becoming increasingly popular for tackling the sheer volume and speed of online misinformation and other content that could be identified as harmful. In building these models, data scientists need to take a stance on the legitimacy, authoritativeness and objectivity of the sources of ``truth" used for model training and testing. This has poli… ▽ More

    Submitted 13 April, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Andrés Domínguez Hernández, Richard Owen, Dan Saattrup Nielsen and Ryan McConville. 2023. Addressing contingency in algorithmic (mis)information classification: Toward a responsible machine learning agenda. Accepted in 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT '23), June 12-15, 2023, Chicago, United States of America. ACM, New York, NY, USA, 16 pages

  10. arXiv:2208.14645  [pdf, other

    cs.AR cs.DC eess.SY

    PaRTAA: A Real-time Multiprocessor for Mixed-Criticality Airborne Systems

    Authors: Shibarchi Majumder, Jens F D Nielsen, Thomas Bak

    Abstract: Mixed-criticality systems, where multiple systems with varying criticality-levels share a single hardware platform, require isolation between tasks with different criticality-levels. Isolation can be achieved with software-based solutions or can be enforced by a hardware level partitioning. An asymmetric multiprocessor architecture offers hardware-based isolation at the cost of underutilized hardw… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

    Journal ref: in IEEE Transactions on Computers, vol. 69, no. 8, pp. 1221-1232, 1 Aug. 2020

  11. Ærø: A Platform Architecture for Mixed-Criticality Airborne Systems

    Authors: Shibarchi Majumder, Jens Frederik Dalsgaard Nielsen, Thomas Bak

    Abstract: Real-time embedded platforms with resource constraints can take the benefits of mixed-criticality system where applications with different criticality-level share computational resources, with isolation in the temporal and spatial domain. A conventional software-based isolation mechanism adds additional overhead and requires certification with the highest level of criticality present in the system… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Journal ref: in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 39, no. 10, pp. 2307-2318, Oct. 2020

  12. ICLR 2022 Challenge for Computational Geometry and Topology: Design and Results

    Authors: Adele Myers, Saiteja Utpala, Shubham Talbar, Sophia Sanborn, Christian Shewmake, Claire Donnat, Johan Mathe, Umberto Lupo, Rishi Sonthalia, Xinyue Cui, Tom Szwagier, Arthur Pignet, Andri Bergsson, Soren Hauberg, Dmitriy Nielsen, Stefan Sommer, David Klindt, Erik Hermansen, Melvin Vaupel, Benjamin Dunn, Jeffrey Xiong, Noga Aharony, Itsik Pe'er, Felix Ambellan, Martin Hanik , et al. (3 additional authors not shown)

    Abstract: This paper presents the computational challenge on differential geometry and topology that was hosted within the ICLR 2022 workshop ``Geometric and Topological Representation Learning". The competition asked participants to provide implementations of machine learning algorithms on manifolds that would respect the API of the open-source software Geomstats (manifold part) and Scikit-Learn (machine l… ▽ More

    Submitted 26 June, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  13. arXiv:2206.07696  [pdf, other

    cs.CV cs.LG stat.ML

    Diffusion Models for Video Prediction and Infilling

    Authors: Tobias Höppe, Arash Mehrjou, Stefan Bauer, Didrik Nielsen, Andrea Dittadi

    Abstract: Predicting and anticipating future outcomes or reasoning about missing information in a sequence are critical skills for agents to be able to make intelligent decisions. This requires strong, temporally coherent generative capabilities. Diffusion models have shown remarkable success in several generative tasks, but have not been extensively explored in the video domain. We present Random-Mask Vide… ▽ More

    Submitted 14 November, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: Published in TMLR (11/2022)

  14. arXiv:2205.15463  [pdf, other

    cs.CV cs.LG stat.ML

    Few-Shot Diffusion Models

    Authors: Giorgio Giannone, Didrik Nielsen, Ole Winther

    Abstract: Denoising diffusion probabilistic models (DDPM) are powerful hierarchical latent variable models with remarkable sample generation quality and training stability. These properties can be attributed to parameter sharing in the generative hierarchy, as well as a parameter-free diffusion-based inference procedure. In this paper, we present Few-Shot Diffusion Models (FSDM), a framework for few-shot ge… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

  15. arXiv:2204.12270  [pdf, other

    q-bio.GN cs.LG q-bio.QM

    Graph Neural Networks for Microbial Genome Recovery

    Authors: Andre Lamurias, Alessandro Tibo, Katja Hose, Mads Albertsen, Thomas Dyhre Nielsen

    Abstract: Microbes have a profound impact on our health and environment, but our understanding of the diversity and function of microbial communities is severely limited. Through DNA sequencing of microbial communities (metagenomics), DNA fragments (reads) of the individual microbes can be obtained, which through assembly graphs can be combined into long contiguous DNA sequences (contigs). Given the complex… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

  16. arXiv:2204.09889  [pdf, other

    cs.LG stat.ML

    Inducing Gaussian Process Networks

    Authors: Alessandro Tibo, Thomas Dyhre Nielsen

    Abstract: Gaussian processes (GPs) are powerful but computationally expensive machine learning models, requiring an estimate of the kernel covariance matrix for every prediction. In large and complex domains, such as graphs, sets, or images, the choice of suitable kernel can also be non-trivial to determine, providing an additional obstacle to the learning task. Over the last decade, these challenges have r… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  17. arXiv:2202.11684  [pdf, other

    cs.LG cs.CL cs.CY cs.IR cs.SI

    MuMiN: A Large-Scale Multilingual Multimodal Fact-Checked Misinformation Social Network Dataset

    Authors: Dan Saattrup Nielsen, Ryan McConville

    Abstract: Misinformation is becoming increasingly prevalent on social media and in news articles. It has become so widespread that we require algorithmic assistance utilising machine learning to detect such content. Training these machine learning models require datasets of sufficient scale, diversity and quality. However, datasets in the field of automatic misinformation detection are predominantly monolin… ▽ More

    Submitted 8 March, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: 9+3 pages

  18. arXiv:2201.11676  [pdf, other

    cs.LG stat.ML

    Monitoring Model Deterioration with Explainable Uncertainty Estimation via Non-parametric Bootstrap

    Authors: Carlos Mougan, Dan Saattrup Nielsen

    Abstract: Monitoring machine learning models once they are deployed is challenging. It is even more challenging to decide when to retrain models in real-case scenarios when labeled data is beyond reach, and monitoring performance metrics becomes unfeasible. In this work, we use non-parametric bootstrapped uncertainty estimates and SHAP values to provide explainable uncertainty estimation as a technique that… ▽ More

    Submitted 22 November, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: 7+6 pages. Accepted at AAAI'23 Safe and Robust AI track

  19. arXiv:2104.13321  [pdf, other

    cs.LG cs.DB

    UniTE -- The Best of Both Worlds: Unifying Function-Fitting and Aggregation-Based Approaches to Travel Time and Travel Speed Estimation

    Authors: Tobias Skovgaard Jepsen, Christian S. Jensen, Thomas Dyhre Nielsen

    Abstract: Travel time or speed estimation are part of many intelligent transportation applications. Existing estimation approaches rely on either function fitting or aggregation and represent different trade-offs between generalizability and accuracy. Function-fitting approaches learn functions that map feature vectors of, e.g., routes, to travel time or speed estimates, which enables generalization to unse… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

  20. arXiv:2102.05379  [pdf, other

    stat.ML cs.CL cs.LG

    Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions

    Authors: Emiel Hoogeboom, Didrik Nielsen, Priyank Jaini, Patrick Forré, Max Welling

    Abstract: Generative flows and diffusion models have been predominantly trained on ordinal data, for example natural images. This paper introduces two extensions of flows and diffusion for categorical data such as language or image segmentation: Argmax Flows and Multinomial Diffusion. Argmax Flows are defined by a composition of a continuous distribution (such as a normalizing flow), and an argmax function.… ▽ More

    Submitted 22 October, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2021)

  21. arXiv:2102.02374  [pdf, other

    cs.LG

    Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC

    Authors: Priyank Jaini, Didrik Nielsen, Max Welling

    Abstract: Hybrid Monte Carlo is a powerful Markov Chain Monte Carlo method for sampling from complex continuous distributions. However, a major limitation of HMC is its inability to be applied to discrete domains due to the lack of gradient signal. In this work, we introduce a new approach based on augmenting Monte Carlo methods with SurVAE Flows to sample from discrete distributions using a combination of… ▽ More

    Submitted 1 March, 2021; v1 submitted 3 February, 2021; originally announced February 2021.

    Comments: Accepted at AISTATS 2021; added experiments with longer MCMC chains

  22. arXiv:2007.02731  [pdf, other

    cs.LG stat.ML

    SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows

    Authors: Didrik Nielsen, Priyank Jaini, Emiel Hoogeboom, Ole Winther, Max Welling

    Abstract: Normalizing flows and variational autoencoders are powerful generative models that can represent complicated density functions. However, they both impose constraints on the models: Normalizing flows use bijective transformations to model densities whereas VAEs learn stochastic transformations that are non-invertible and thus typically do not provide tractable estimates of the marginal likelihood.… ▽ More

    Submitted 30 October, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

  23. Relational Fusion Networks: Graph Convolutional Networks for Road Networks

    Authors: Tobias Skovgaard Jepsen, Christian S. Jensen, Thomas Dyhre Nielsen

    Abstract: The application of machine learning techniques in the setting of road networks holds the potential to facilitate many important intelligent transportation applications. Graph Convolutional Networks (GCNs) are neural networks that are capable of leveraging the structure of a network. However, many implicit assumptions of GCNs do not apply to road networks. We introduce the Relational Fusion Network… ▽ More

    Submitted 14 September, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: IEEE Transactions on Intelligent Transportation Systems (2020). arXiv admin note: substantial text overlap with arXiv:1908.11567

  24. arXiv:2002.02547  [pdf, other

    cs.LG cs.CV stat.ML

    Closing the Dequantization Gap: PixelCNN as a Single-Layer Flow

    Authors: Didrik Nielsen, Ole Winther

    Abstract: Flow models have recently made great progress at modeling ordinal discrete data such as images and audio. Due to the continuous nature of flow models, dequantization is typically applied when using them for such discrete data, resulting in lower bound estimates of the likelihood. In this paper, we introduce subset flows, a class of flows that can tractably transform finite volumes and thus allow e… ▽ More

    Submitted 30 October, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

  25. On Network Embedding for Machine Learning on Road Networks: A Case Study on the Danish Road Network

    Authors: Tobias Skovgaard Jepsen, Christian S. Jensen, Thomas Dyhre Nielsen

    Abstract: Road networks are a type of spatial network, where edges may be associated with qualitative information such as road type and speed limit. Unfortunately, such information is often incomplete; for instance, OpenStreetMap only has speed limits for 13% of all Danish road segments. This is problematic for analysis tasks that rely on such information for machine learning. To enable machine learning in… ▽ More

    Submitted 15 November, 2019; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: Best Paper at the 3rd IEEE International Workshop on Big Spatial Data (BSD 2018)

    Journal ref: 2018 IEEE International Conference on Big Data (Big Data), 2018, pp. 3422-3431

  26. arXiv:1908.11567  [pdf, other

    cs.LG cs.DB stat.ML

    Graph Convolutional Networks for Road Networks

    Authors: Tobias Skovgaard Jepsen, Christian S. Jensen, Thomas Dyhre Nielsen

    Abstract: Machine learning techniques for road networks hold the potential to facilitate many important transportation applications. Graph Convolutional Networks (GCNs) are neural networks that are capable of leveraging the structure of a road network by utilizing information of, e.g., adjacent road segments. While state-of-the-art GCNs target node classification tasks in social, citation, and biological ne… ▽ More

    Submitted 22 July, 2020; v1 submitted 30 August, 2019; originally announced August 2019.

    Comments: Ten-page pre-print version of a four-page ACM SIGSPATIAL 2019 poster paper

  27. arXiv:1908.03442  [pdf, other

    cs.LG math.ST stat.ML

    Probabilistic Models with Deep Neural Networks

    Authors: Andrés R. Masegosa, Rafael Cabañas, Helge Langseth, Thomas D. Nielsen, Antonio Salmerón

    Abstract: Recent advances in statistical inference have significantly expanded the toolbox of probabilistic modeling. Historically, probabilistic modeling has been constrained to (i) very restricted model classes where exact or approximate probabilistic inference were feasible, and (ii) small or medium-sized data sets which fit within the main memory of the computer. However, developments in variational inf… ▽ More

    Submitted 2 October, 2019; v1 submitted 9 August, 2019; originally announced August 2019.

  28. arXiv:1904.03713  [pdf, other

    cs.HC cs.CL

    AI Meets Austen: Towards Human-Robot Discussions of Literary Metaphor

    Authors: Natalie Parde, Rodney D. Nielsen

    Abstract: Artificial intelligence is revolutionizing formal education, fueled by innovations in learning assessment, content generation, and instructional delivery. Informal, lifelong learning settings have been the subject of less attention. We provide a proof-of-concept for an embodied book discussion companion, designed to stimulate conversations with readers about particularly creative metaphors in fict… ▽ More

    Submitted 7 April, 2019; originally announced April 2019.

    Comments: Accepted to the 20th International Conference on Artificial Intelligence in Education (AIED 2019)

  29. arXiv:1811.04504  [pdf, other

    cs.LG cs.AI stat.ML

    SLANG: Fast Structured Covariance Approximations for Bayesian Deep Learning with Natural Gradient

    Authors: Aaron Mishkin, Frederik Kunstner, Didrik Nielsen, Mark Schmidt, Mohammad Emtiyaz Khan

    Abstract: Uncertainty estimation in large deep-learning models is a computationally challenging task, where it is difficult to form even a Gaussian approximation to the posterior distribution. In such situations, existing methods usually resort to a diagonal approximation of the covariance matrix despite, the fact that these matrices are known to result in poor uncertainty estimates. To address this issue,… ▽ More

    Submitted 11 January, 2019; v1 submitted 11 November, 2018; originally announced November 2018.

    Comments: NeurIPS 2018 final version

  30. arXiv:1807.04489  [pdf, other

    stat.ML cs.IT cs.LG stat.CO

    Fast yet Simple Natural-Gradient Descent for Variational Inference in Complex Models

    Authors: Mohammad Emtiyaz Khan, Didrik Nielsen

    Abstract: Bayesian inference plays an important role in advancing machine learning, but faces computational challenges when applied to complex models such as deep neural networks. Variational inference circumvents these challenges by formulating Bayesian inference as an optimization problem and solving it using gradient-based optimization. In this paper, we argue in favor of natural-gradient approaches whic… ▽ More

    Submitted 2 August, 2018; v1 submitted 12 July, 2018; originally announced July 2018.

    Comments: Camera-ready version

    Journal ref: International Symposium on Information Theory and Its Applications (ISITA), 2018

  31. arXiv:1806.04854  [pdf, other

    stat.ML cs.AI cs.LG stat.CO

    Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam

    Authors: Mohammad Emtiyaz Khan, Didrik Nielsen, Voot Tangkaratt, Wu Lin, Yarin Gal, Akash Srivastava

    Abstract: Uncertainty computation in deep learning is essential to design robust and reliable systems. Variational inference (VI) is a promising approach for such computation, but requires more effort to implement and execute compared to maximum-likelihood methods. In this paper, we propose new natural-gradient algorithms to reduce such efforts for Gaussian mean-field VI. Our algorithms can be implemented w… ▽ More

    Submitted 2 August, 2018; v1 submitted 13 June, 2018; originally announced June 2018.

    Comments: Camera ready version

    Journal ref: Thirty-fifth International Conference on Machine Learning, 2018

  32. arXiv:1806.03369  [pdf, other

    cs.CL

    #SarcasmDetection is soooo general! Towards a Domain-Independent Approach for Detecting Sarcasm

    Authors: Natalie Parde, Rodney D. Nielsen

    Abstract: Automatic sarcasm detection methods have traditionally been designed for maximum performance on a specific domain. This poses challenges for those wishing to transfer those approaches to other existing or novel domains, which may be typified by very different language characteristics. We develop a general set of features and evaluate it under different training scenarios utilizing in-domain and/or… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: Proceedings of the 30th International Florida Artificial Intelligence Research Society Conference

  33. Adaptive User-Oriented Direct Load-Control of Residential Flexible Devices

    Authors: Davide Frazzetto, Bijay Neupane, Torben Bach Pedersen, Thomas Dyhre Nielsen

    Abstract: Demand Response (DR) schemes are effective tools to maintain a dynamic balance in energy markets with higher integration of fluctuating renewable energy sources. DR schemes can be used to harness residential devices' flexibility and to utilize it to achieve social and financial objectives. However, existing DR schemes suffer from low user participation as they fail at taking into account the users… ▽ More

    Submitted 9 May, 2018; originally announced May 2018.

    Comments: 10 pages plus 1 page references, 11 figures, conference: ACM e-Energy 2018

  34. arXiv:1711.05560  [pdf, other

    stat.ML cs.LG

    Variational Adaptive-Newton Method for Explorative Learning

    Authors: Mohammad Emtiyaz Khan, Wu Lin, Voot Tangkaratt, Zuozhu Liu, Didrik Nielsen

    Abstract: We present the Variational Adaptive Newton (VAN) method which is a black-box optimization method especially suitable for explorative-learning tasks such as active learning and reinforcement learning. Similar to Bayesian methods, VAN estimates a distribution that can be used for exploration, but requires computations that are similar to continuous optimization methods. Our theoretical contribution… ▽ More

    Submitted 15 November, 2017; originally announced November 2017.

  35. arXiv:1707.02293  [pdf, other

    cs.LG stat.ML

    Bayesian Models of Data Streams with Hierarchical Power Priors

    Authors: Andres Masegosa, Thomas D. Nielsen, Helge Langseth, Dario Ramos-Lopez, Antonio Salmeron, Anders L. Madsen

    Abstract: Making inferences from data streams is a pervasive problem in many modern data analysis applications. But it requires to address the problem of continuous model updating and adapt to changes or drifts in the underlying data generating distribution. In this paper, we approach these problems from a Bayesian perspective covering general conjugate exponential models. Our proposal makes use of non-conj… ▽ More

    Submitted 7 July, 2017; originally announced July 2017.

    Comments: ICML 2017

  36. AMIDST: a Java Toolbox for Scalable Probabilistic Machine Learning

    Authors: Andrés R. Masegosa, Ana M. Martínez, Darío Ramos-López, Rafael Cabañas, Antonio Salmerón, Thomas D. Nielsen, Helge Langseth, Anders L. Madsen

    Abstract: The AMIDST Toolbox is a software for scalable probabilistic machine learning with a spe- cial focus on (massive) streaming data. The toolbox supports a flexible modeling language based on probabilistic graphical models with latent variables and temporal dependencies. The specified models can be learnt from large data sets using parallel or distributed implementa- tions of Bayesian learning algorit… ▽ More

    Submitted 4 April, 2017; originally announced April 2017.

    ACM Class: I.2.6

  37. Automatic Measurement of Physical Mobility in Get-Up-and-Go Test Using Kinect Sensor

    Authors: Amir H. Kargar B., Ali Mollahosseini, Taylor Struemph, Wilson Pace, Rodney D. Nielsen, Mohammad H. Mahoor

    Abstract: Get-Up-and-Go Test is commonly used for assessing the physical mobility of the elderly by physicians. This paper presents a method for automatic analysis and classification of human gait in the Get-Up-and-Go Test using a Microsoft Kinect sensor. Two types of features are automatically extracted from the human skeleton data provided by the Kinect sensor. The first type of feature is related to the… ▽ More

    Submitted 11 November, 2015; originally announced November 2015.

    Comments: Published in: Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE

  38. arXiv:1301.6729  [pdf

    cs.AI

    Welldefined Decision Scenarios

    Authors: Thomas D. Nielsen, Finn Verner Jensen

    Abstract: Influence diagrams serve as a powerful tool for modelling symmetric decision problems. When solving an influence diagram we determine a set of strategies for the decisions involved. A strategy for a decision variable is in principle a function over its past. However, some of the past may be irrelevant for the decision, and for computational reasons it is important not to deal with redundant variab… ▽ More

    Submitted 23 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI1999)

    Report number: UAI-P-1999-PG-502-511

  39. arXiv:1301.3880  [pdf

    cs.AI

    Using ROBDDs for Inference in Bayesian Networks with Troubleshooting as an Example

    Authors: Thomas D. Nielsen, Pierre-Henri Wuillemin, Finn Verner Jensen, Uffe Kjærulff

    Abstract: When using Bayesian networks for modelling the behavior of man-made machinery, it usually happens that a large part of the model is deterministic. For such Bayesian networks deterministic part of the model can be represented as a Boolean function, and a central part of belief updating reduces to the task of calculating the number of satisfying configurations in a Boolean function. In this paper we… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-426-435

  40. arXiv:1301.3879  [pdf

    cs.AI

    Representing and Solving Asymmetric Bayesian Decision Problems

    Authors: Thomas D. Nielsen, Finn Verner Jensen

    Abstract: This paper deals with the representation and solution of asymmetric Bayesian decision problems. We present a formal framework, termed asymmetric influence diagrams, that is based on the influence diagram and allows an efficient representation of asymmetric decision problems. As opposed to existing frameworks, the asymmetric influece diagram primarily encodes asymmetry at the qualitative level and… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-416-425

  41. arXiv:1212.3873  [pdf, ps, other

    cs.LG cs.LO cs.SE

    Learning Markov Decision Processes for Model Checking

    Authors: Hua Mao, Yingke Chen, Manfred Jaeger, Thomas D. Nielsen, Kim G. Larsen, Brian Nielsen

    Abstract: Constructing an accurate system model for formal model verification can be both resource demanding and time-consuming. To alleviate this shortcoming, algorithms have been proposed for automatically learning system models based on observed system behaviors. In this paper we extend the algorithm on learning probabilistic automata to reactive systems, where the observed system behavior is in the form… ▽ More

    Submitted 16 December, 2012; originally announced December 2012.

    Comments: In Proceedings QFM 2012, arXiv:1212.3454

    Journal ref: EPTCS 103, 2012, pp. 49-63

  42. arXiv:1212.2500  [pdf

    cs.LG cs.AI stat.ML

    On Local Optima in Learning Bayesian Networks

    Authors: Jens D. Nielsen, Tomas Kocka, Jose M. Pena

    Abstract: This paper proposes and evaluates the k-greedy equivalence search algorithm (KES) for learning Bayesian networks (BNs) from complete data. The main characteristic of KES is that it allows a trade-off between greediness and randomness, thus exploring different good local optima. When greediness is set at maximum, KES corresponds to the greedy equivalence search algorithm (GES). When greediness is k… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-435-442