Showing 1–32 of 32 results for author: Andersen, M

  1. arXiv:2406.16658  [pdf, other]

    eess.IV cs.CV math.ST

    Sampling Strategies in Bayesian Inversion: A Study of RTO and Langevin Methods

    Authors: Remi Laumont, Yiqiu Dong, Martin Skovgaard Andersen

    Abstract: This paper studies two classes of sampling methods for the solution of inverse problems, namely Randomize-Then-Optimize (RTO), which is rooted in sensitivity analysis, and Langevin methods, which are rooted in the Bayesian framework. The two classes of methods correspond to different assumptions and yield samples from different target distributions. We highlight the main conceptual and theoretical…

    Submitted 25 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    MSC Class: 65K10; 65K05; 65D18; 62F15; 62C10; 68Q25; 68U10; 90C25; 65C05

  2. arXiv:2406.10133  [pdf, other]

    cs.CL cs.AI

    Evaluation of Large Language Models: STEM education and Gender Stereotypes

    Authors: Smilla Due, Sneha Das, Marianne Andersen, Berta Plandolit López, Sniff Andersen Nexø, Line Clemmensen

    Abstract: Large Language Models (LLMs) have an increasing impact on our lives with use cases such as chatbots, study support, coding support, ideation, writing assistance, and more. Previous studies have revealed linguistic biases in pronouns used to describe professions or adjectives used to describe men vs women. These issues have to some degree been addressed in updated LLM versions, at least to pass exi…

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2404.14906  [pdf, other]

    cs.CV cs.AI cs.LG

    Driver Activity Classification Using Generalizable Representations from Vision-Language Models

    Authors: Ross Greer, Mathias Viborg Andersen, Andreas Møgelmose, Mohan Trivedi

    Abstract: Driver activity classification is crucial for ensuring road safety, with applications ranging from driver assistance systems to autonomous vehicle control transitions. In this paper, we present a novel approach leveraging generalizable representations from vision-language models for driver activity classification. Our method employs a Semantic Representation Late Fusion Neural Network (SRLF-Net) t…

    Submitted 23 April, 2024; originally announced April 2024.

  4. Asset management, condition monitoring and Digital Twins: damage detection and virtual inspection on a reinforced concrete bridge

    Authors: Arnulf Hagen, Trond Michael Andersen

    Abstract: In April 2021 Stava bridge, a main bridge on E6 in Norway, was abruptly closed for traffic. A structural defect had seriously compromised the bridge structural integrity. The Norwegian Public Roads Administration (NPRA) closed it, made a temporary solution and reopened with severe traffic restrictions. The incident was alerted through what constitutes the bridge Digital Twin processing data from I…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Structure and Infrastructure Engineering (2024)

  5. Raw Instinct: Trust Your Classifiers and Skip the Conversion

    Authors: Christos Kantas, Bjørk Antoniussen, Mathias V. Andersen, Rasmus Munksø, Shobhit Kotnala, Simon B. Jensen, Andreas Møgelmose, Lau Nørgaard, Thomas B. Moeslund

    Abstract: Using RAW-images in computer vision problems is surprisingly underexplored considering that converting from RAW to RGB does not introduce any new capture information. In this paper, we show that a sufficiently advanced classifier can yield equivalent results on RAW input compared to RGB and present a new public dataset consisting of RAW images and the corresponding converted RGB images. Classifyin…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: https://www.kaggle.com/datasets/mathiasviborg/raw-instinct

    Journal ref: 2023 IEEE 6th International Conference on Pattern Recognition and Artificial Intelligence (PRAI)

  6. arXiv:2403.00196  [pdf, other]

    cs.CV cs.AI cs.LG

    Learning to Find Missing Video Frames with Synthetic Data Augmentation: A General Framework and Application in Generating Thermal Images Using RGB Cameras

    Authors: Mathias Viborg Andersen, Ross Greer, Andreas Møgelmose, Mohan Trivedi

    Abstract: Advanced Driver Assistance Systems (ADAS) in intelligent vehicles rely on accurate driver perception within the vehicle cabin, often leveraging a combination of sensing modalities. However, these modalities operate at varying rates, posing challenges for real-time, comprehensive driver state monitoring. This paper addresses the issue of missing data due to sensor frame rate mismatches, introducing…

    Submitted 29 February, 2024; originally announced March 2024.

  7. arXiv:2401.16634  [pdf, other]

    cs.CV cs.LG

    The Why, When, and How to Use Active Learning in Large-Data-Driven 3D Object Detection for Safe Autonomous Driving: An Empirical Exploration

    Authors: Ross Greer, Bjørk Antoniussen, Mathias V. Andersen, Andreas Møgelmose, Mohan M. Trivedi

    Abstract: Active learning strategies for 3D object detection in autonomous driving datasets may help to address challenges of data imbalance, redundancy, and high-dimensional data. We demonstrate the effectiveness of entropy querying to select informative samples, aiming to reduce annotation costs and improve model performance. We experiment using the BEVFusion model for 3D object detection on the nuScenes…

    Submitted 29 January, 2024; originally announced January 2024.

  8. arXiv:2311.09389  [pdf, other]

    cs.CL cs.LG

    Neural machine translation for automated feedback on children's early-stage writing

    Authors: Jonas Vestergaard Jensen, Mikkel Jordahn, Michael Riis Andersen

    Abstract: In this work, we address the problem of assessing and constructing feedback for early-stage writing automatically using machine learning. Early-stage writing is typically vastly different from conventional writing due to phonetic spelling and lack of proper grammar, punctuation, spacing etc. Consequently, early-stage writing is highly non-trivial to analyze using common linguistic metrics. We prop…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 9 pages, 1 figure, 1 table, to be published in the proceedings of the Northern Lights Deep Learning Conference 2024

    ACM Class: I.2.7

  9. AdaSub: Stochastic Optimization Using Second-Order Information in Low-Dimensional Subspaces

    Authors: João Victor Galvão da Mata, Martin S. Andersen

    Abstract: We introduce AdaSub, a stochastic optimization algorithm that computes a search direction based on second-order information in a low-dimensional subspace that is defined adaptively based on available current and past information. Compared to first-order methods, second-order methods exhibit better convergence characteristics, but the need to compute the Hessian matrix at each iteration results in…

    Submitted 6 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Published in: 2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA)

  10. arXiv:2308.15191  [pdf, ps, other]

    cs.CR

    State of the Art Report: Verified Computation

    Authors: Jim Woodcock, Mikkel Schmidt Andersen, Diego F. Aranha, Stefan Hallerstede, Simon Thrane Hansen, Nikolaj Kuhne Jakobsen, Tomas Kulik, Peter Gorm Larsen, Hugo Daniel Macedo, Carlos Ignacio Isasa Martin, Victor Alexander Mtsimbe Norrild

    Abstract: This report describes the state of the art in verifiable computation. The problem being solved is the following: The Verifiable Computation Problem (Verifiable Computing Problem) Suppose we have two computing agents. The first agent is the verifier, and the second agent is the prover. The verifier wants the prover to perform a computation. The verifier sends a description of the computation to t…

    Submitted 16 February, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: 54 pages

  11. arXiv:2304.04048  [pdf, other]

    cs.CV cs.LG

    Polygonizer: An auto-regressive building delineator

    Authors: Maxim Khomiakov, Michael Riis Andersen, Jes Frellsen

    Abstract: In geospatial planning, it is often essential to represent objects in a vectorized format, as this format easily translates to downstream tasks such as web development, graphics, or design. While these problems are frequently addressed using semantic segmentation, which requires additional post-processing to vectorize objects in a non-trivial way, we present an Image-to-Sequence model that allows…

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: ICLR 2023 Workshop on Machine Learning in Remote Sensing

  12. arXiv:2303.11215  [pdf, other]

    cs.CV cs.LG

    Learning to Generate 3D Representations of Building Roofs Using Single-View Aerial Imagery

    Authors: Maxim Khomiakov, Alejandro Valverde Mahou, Alba Reinders Sánchez, Jes Frellsen, Michael Riis Andersen

    Abstract: We present a novel pipeline for learning the conditional distribution of a building roof mesh given pixels from an aerial image, under the assumption that roof geometry follows a set of regular patterns. Unlike alternative methods that require multiple images of the same object, our approach enables estimating 3D roof meshes using only a single image for predictions. The approach employs the PolyG…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Copyright 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  13. arXiv:2301.05983  [pdf, other]

    stat.ML cs.LG

    On the role of Model Uncertainties in Bayesian Optimization

    Authors: Jonathan Foldager, Mikkel Jordahn, Lars Kai Hansen, Michael Riis Andersen

    Abstract: Bayesian optimization (BO) is a popular method for black-box optimization, which relies on uncertainty as part of its decision-making process when deciding which experiment to perform next. However, not much work has addressed the effect of uncertainty on the performance of the BO algorithm and to what extent calibrated uncertainties improve the ability to find the global optimum. In this work, we…

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 14 pages, 4 figures, 2 tables

  14. VISEM-Tracking, a human spermatozoa tracking dataset

    Authors: Vajira Thambawita, Steven A. Hicks, Andrea M. Storås, Thu Nguyen, Jorunn M. Andersen, Oliwia Witczak, Trine B. Haugen, Hugo L. Hammer, Pål Halvorsen, Michael A. Riegler

    Abstract: A manual assessment of sperm motility requires microscopy observation, which is challenging due to the fast-moving spermatozoa in the field of view. To obtain correct results, manual evaluation requires extensive training. Therefore, computer-assisted sperm analysis (CASA) has become increasingly used in clinics. Despite this, more data is needed to train supervised machine learning approaches in…

    Submitted 10 May, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Report number: Scientific Data volume 10

    Journal ref: Sci Data 10, 260 (2023)

  15. arXiv:2212.01260  [pdf, other]

    cs.CV cs.LG

    SolarDK: A high-resolution urban solar panel image classification and localization dataset

    Authors: Maxim Khomiakov, Julius Holbech Radzikowski, Carl Anton Schmidt, Mathias Bonde Sørensen, Mads Andersen, Michael Riis Andersen, Jes Frellsen

    Abstract: The body of research on classification of solar panel arrays from aerial imagery is increasing, yet there are still not many public benchmark datasets. This paper introduces two novel benchmark datasets for classifying and localizing solar panel arrays in Denmark: A human annotated dataset for classification and segmentation, as well as a classification dataset acquired using self-reported data fr…

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: 7 pages, 2 figures, to access the dataset, see https://osf.io/aj539/

  16. arXiv:2203.15945  [pdf, other]

    stat.ML cs.LG stat.ME

    A Framework for Improving the Reliability of Black-box Variational Inference

    Authors: Manushi Welandawe, Michael Riis Andersen, Aki Vehtari, Jonathan H. Huggins

    Abstract: Black-box variational inference (BBVI) now sees widespread use in machine learning and statistics as a fast yet flexible alternative to Markov chain Monte Carlo methods for approximate Bayesian inference. However, stochastic optimization methods for BBVI remain unreliable and require substantial expertise and hand-tuning to apply effectively. In this paper, we propose Robust and Automated Black-bo…

    Submitted 16 May, 2024; v1 submitted 29 March, 2022; originally announced March 2022.

  17. arXiv:2201.00270  [pdf, other]

    cs.NI cs.SE

    Towards a secure API client generator for IoT devices

    Authors: Anders Aaen Springborg, Martin Kaldahl Andersen, Kaare Holland Hattel, Michele Albano

    Abstract: Given the success of IoT platforms, more developers and companies want to include the technology in their portfolio. However, in the case of single board microcontrollers, the support for networking operations is not ideal, and different IoT platforms allow access to the networking submodule via different libraries and system calls, leading to a steeper learning curve. Code generators for API clie…

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: The work was accepted as 4-pages paper to ACM SAC 2022. Together with the TPC, I am providing a technical report here with the information I have to cut from the 4-pages version

    ACM Class: C.2; D.2

  18. arXiv:2109.08495  [pdf, other]

    cs.DS cs.DB cs.LG

    Micro-architectural Analysis of a Learned Index

    Authors: Mikkel Møller Andersen, Pınar Tözün

    Abstract: Since the publication of The Case for Learned Index Structures in 2018, there has been a rise in research that focuses on learned indexes for different domains and with different functionalities. While the effectiveness of learned indexes as an alternative to traditional index structures such as B+Trees have already been demonstrated by several studies, previous work tend to focus on higher-level…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Under submission

  19. arXiv:2106.12505  [pdf, other]

    cs.DB cs.HC

    Mr. Plotter: Unifying Data Reduction Techniques in Storage and Visualization Systems

    Authors: Sam Kumar, Michael P Andersen, David E. Culler

    Abstract: As the rate of data collection continues to grow rapidly, developing visualization tools that scale to immense data sets is a serious and ever-increasing challenge. Existing approaches generally seek to decouple storage and visualization systems, performing just-in-time data reduction to transparently avoid overloading the visualizer. We present a new architecture in which the visualizer and data…

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: 14 pages; Originally published in May 2018 as a technical report in the UC Berkeley EECS Technical Report Series (see https://www2.eecs.berkeley.edu/Pubs/TechRpts/2018/EECS-2018-85.html)

    Report number: Technical Report No. UCB/EECS-2018-85

  20. arXiv:2103.13909  [pdf, ps, other]

    math.OC cs.LG math.NA

    Regularization by Denoising Sub-sampled Newton Method for Spectral CT Multi-Material Decomposition

    Authors: Alessandro Perelli, Martin S. Andersen

    Abstract: Spectral Computed Tomography (CT) is an emerging technology that enables to estimate the concentration of basis materials within a scanned object by exploiting different photon energy spectra. In this work, we aim at efficiently solving a model-based maximum-a-posterior problem to reconstruct multi-materials images with application to spectral CT. In particular, we propose to solve a regularized o…

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: Accepted in Philosophical Transactions A, issue "Synergistic tomographic image reconstruction (Part 1)"

  21. arXiv:2103.01085  [pdf, other]

    cs.LG stat.ME stat.ML

    Challenges and Opportunities in High-dimensional Variational Inference

    Authors: Akash Kumar Dhaka, Alejandro Catalina, Manushi Welandawe, Michael Riis Andersen, Jonathan Huggins, Aki Vehtari

    Abstract: Current black-box variational inference (BBVI) methods require the user to make numerous design choices -- such as the selection of variational objective and approximating family -- yet there is little principled guidance on how to do so. We develop a conceptual framework and set of experimental tools to understand the effects of these choices, which we leverage to propose best practices for maxim…

    Submitted 30 June, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

  22. arXiv:2009.00666  [pdf, other]

    cs.LG stat.ME stat.ML

    Robust, Accurate Stochastic Optimization for Variational Inference

    Authors: Akash Kumar Dhaka, Alejandro Catalina, Michael Riis Andersen, Måns Magnusson, Jonathan H. Huggins, Aki Vehtari

    Abstract: We consider the problem of fitting variational posterior approximations using stochastic optimization methods. The performance of these approximations depends on (1) how well the variational family matches the true posterior distribution, (2) the choice of divergence, and (3) the optimization of the variational objective. We show that even in the best-case scenario when the exact posterior belongs…

    Submitted 3 September, 2020; v1 submitted 1 September, 2020; originally announced September 2020.

    Journal ref: NeurIPS 2020

  23. arXiv:2007.05994  [pdf, other]

    stat.ML cs.LG

    State Space Expectation Propagation: Efficient Inference Schemes for Temporal Gaussian Processes

    Authors: William J. Wilkinson, Paul E. Chang, Michael Riis Andersen, Arno Solin

    Abstract: We formulate approximate Bayesian inference in non-conjugate temporal and spatio-temporal Gaussian process models as a simple parameter update rule applied during Kalman smoothing. This viewpoint encompasses most inference schemes, including expectation propagation (EP), the classical (Extended, Unscented, etc.) Kalman smoothers, and variational inference. We provide a unifying perspective on thes…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted to International Conference on Machine Learning (ICML) 2020

  24. arXiv:2003.11435  [pdf, other]

    cs.LG stat.ML

    Preferential Batch Bayesian Optimization

    Authors: Eero Siivola, Akash Kumar Dhaka, Michael Riis Andersen, Javier Gonzalez, Pablo Garcia Moreno, Aki Vehtari

    Abstract: Most research in Bayesian optimization (BO) has focused on \emph{direct feedback} scenarios, where one has access to exact values of some expensive-to-evaluate objective. This direction has been mainly driven by the use of BO in machine learning hyper-parameter configuration problems. However, in domains such as modelling human preferences, A/B tests, or recommender systems, there is a need for me…

    Submitted 31 August, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: 6 pages + 7 pages in supplementary material

  25. arXiv:1910.13327  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Machine Learning-Based Analysis of Sperm Videos and Participant Data for Male Fertility Prediction

    Authors: Steven A. Hicks, Jorunn M. Andersen, Oliwia Witczak, Vajira Thambawita, Påll Halvorsen, Hugo L. Hammer, Trine B. Haugen, Michael A. Riegler

    Abstract: Methods for automatic analysis of clinical data are usually targeted towards a specific modality and do not make use of all relevant data available. In the field of male human reproduction, clinical and biological data are not used to its fullest potential. Manual evaluation of a semen sample using a microscope is time-consuming and requires extensive training. Furthermore, the validity of manual…

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Preprint, accepted by Nature Scientific Reports for publication 24.10.2019

  26. arXiv:1905.13369  [pdf, other]

    cs.CR

    JEDI: Many-to-Many End-to-End Encryption and Key Delegation for IoT

    Authors: Sam Kumar, Yuncong Hu, Michael P Andersen, Raluca Ada Popa, David E. Culler

    Abstract: As the Internet of Things (IoT) emerges over the next decade, developing secure communication for IoT devices is of paramount importance. Achieving end-to-end encryption for large-scale IoT systems, like smart buildings or smart cities, is challenging because multiple principals typically interact indirectly via intermediaries, meaning that the recipient of a message is not known in advance. This…

    Submitted 3 March, 2020; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: Extended version of a paper accepted at USENIX Security 2019

  27. arXiv:1904.10679  [pdf, other]

    stat.ML cs.LG

    Bayesian leave-one-out cross-validation for large data

    Authors: Måns Magnusson, Michael Riis Andersen, Johan Jonasson, Aki Vehtari

    Abstract: Model inference, such as model comparison, model checking, and model selection, is an important part of model development. Leave-one-out cross-validation (LOO) is a general approach for assessing the generalizability of a model, but unfortunately, LOO does not scale well to large datasets. We propose a combination of using approximate inference techniques and probability-proportional-to-size-sampl…

    Submitted 24 April, 2019; originally announced April 2019.

    Comments: Accepted to ICML 2019. This version is the submitted paper

    Journal ref: Thirty-sixth International Conference on Machine Learning, PMLR 97:4244-4253, 2019

  28. arXiv:1901.11436  [pdf, other]

    stat.ML cs.LG cs.SD eess.AS eess.SP

    End-to-End Probabilistic Inference for Nonstationary Audio Analysis

    Authors: William J. Wilkinson, Michael Riis Andersen, Joshua D. Reiss, Dan Stowell, Arno Solin

    Abstract: A typical audio signal processing pipeline includes multiple disjoint analysis stages, including calculation of a time-frequency representation followed by spectrogram-based feature analysis. We show how time-frequency analysis and nonnegative matrix factorisation can be jointly formulated as a spectral mixture Gaussian process model with nonstationary priors over the amplitude variance parameters…

    Submitted 27 April, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: Accepted to the Thirty-sixth International Conference on Machine Learning (ICML) 2019

  29. arXiv:1811.02721  [pdf, other]

    cs.NI

    Performant TCP for Low-Power Wireless Networks

    Authors: Sam Kumar, Michael P Andersen, Hyung-Sin Kim, David E. Culler

    Abstract: Low-power and lossy networks (LLNs) enable diverse applications integrating many resource-constrained embedded devices, often requiring interconnectivity with existing TCP/IP networks as part of the Internet of Things. But TCP has received little attention in LLNs due to concerns about its overhead and performance, leading to LLN-specific protocols that require specialized gateways for interoperab…

    Submitted 28 February, 2020; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: 22 pages; Accepted at NSDI 2020; Updated Table 6

  30. arXiv:1811.02489  [pdf, other]

    eess.SP cs.LG cs.SD eess.AS stat.ML

    Unifying Probabilistic Models for Time-Frequency Analysis

    Authors: William J. Wilkinson, Michael Riis Andersen, Joshua D. Reiss, Dan Stowell, Arno Solin

    Abstract: In audio signal processing, probabilistic time-frequency models have many benefits over their non-probabilistic counterparts. They adapt to the incoming signal, quantify uncertainty, and measure correlation between the signal's amplitude and phase information, making time domain resynthesis straightforward. However, these models are still not widely used since they come at a high computational cos…

    Submitted 12 February, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: Accepted to International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019

  31. Bisimulation and expressivity for conditional belief, degrees of belief, and safe belief

    Authors: Mikkel Birkegaard Andersen, Thomas Bolander, Hans van Ditmarsch, Martin Holm Jensen

    Abstract: Plausibility models are Kripke models that agents use to reason about knowledge and belief, both of themselves and of each other. Such models are used to interpret the notions of conditional belief, degrees of belief, and safe belief. The logic of conditional belief contains that modality and also the knowledge modality, and similarly for the logic of degrees of belief and the logic of safe belief…

    Submitted 25 February, 2016; v1 submitted 26 June, 2015; originally announced June 2015.

    Journal ref: Synthese 194(7): 2447-2487 (2017)

  32. arXiv:1503.01993  [pdf, other]

    cs.CV math.NA

    Tomographic Image Reconstruction using Training images

    Authors: Sara Soltani, Martin S. Andersen, Per Christian Hansen

    Abstract: We describe and examine an algorithm for tomographic image reconstruction where prior knowledge about the solution is available in the form of training images. We first construct a nonnegative dictionary based on prototype elements from the training images; this problem is formulated as a regularized non-negative matrix factorization. Incorporating the dictionary as a prior in a convex reconstruct…

    Submitted 17 August, 2015; v1 submitted 6 March, 2015; originally announced March 2015.

    Comments: 25 pages, 12 figures

    MSC Class: 65F22; 65K10