Skip to main content

Showing 1–47 of 47 results for author: Hansen, L K

.
  1. arXiv:2406.09981  [pdf, other

    cs.LG cs.AI cs.CV

    Challenges in explaining deep learning models for data with biological variation

    Authors: Lenka Tětková, Erik Schou Dreier, Robin Malm, Lars Kai Hansen

    Abstract: Much machine learning research progress is based on develo** models and evaluating them on a benchmark dataset (e.g., ImageNet for images). However, applying such benchmark-successful methods to real-world data often does not work as expected. This is particularly the case for biological data where we expect variability at multiple time and spatial scales. In this work, we are using grain data a… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2404.07008  [pdf, other

    cs.LG cs.AI

    Knowledge graphs for empirical concept retrieval

    Authors: Lenka Tětková, Teresa Karen Scheidt, Maria Mandrup Fogh, Ellen Marie Gaunby Jørgensen, Finn Årup Nielsen, Lars Kai Hansen

    Abstract: Concept-based explainable AI is promising as a tool to improve the understanding of complex models at the premises of a given user, viz.\ as a tool for personalized explainability. An important class of concept-based explainability methods is constructed with empirically defined concepts, indirectly defined through a set of positive and negative examples, as in the TCAV approach (Kim et al., 2018)… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Preprint. Accepted to The 2nd World Conference on eXplainable Artificial Intelligence

  3. arXiv:2311.18364  [pdf, other

    cs.CL cs.LG cs.SI

    Hubness Reduction Improves Sentence-BERT Semantic Spaces

    Authors: Beatrix M. G. Nielsen, Lars Kai Hansen

    Abstract: Semantic representations of text, i.e. representations of natural language which capture meaning by geometry, are essential for areas such as information retrieval and document grou**. High-dimensional trained dense vectors have received much attention in recent years as such representations. We investigate the structure of semantic spaces that arise from embeddings made with Sentence-BERT and f… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted at NLDL 2024

  4. arXiv:2307.12745  [pdf, ps, other

    cs.LG eess.SP stat.ML

    Concept-based explainability for an EEG transformer model

    Authors: Anders Gjølbye Madsen, William Theodor Lehn-Schiøler, Áshildur Jónsdóttir, Bergdís Arnardóttir, Lars Kai Hansen

    Abstract: Deep learning models are complex due to their size, structure, and inherent randomness in training procedures. Additional complexity arises from the selection of datasets and inductive biases. Addressing these challenges for explainability, Kim et al. (2018) introduced Concept Activation Vectors (CAVs), which aim to understand deep models' internal states in terms of human-aligned concepts. These… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: To appear in proceedings of 2023 IEEE International workshop on Machine Learning for Signal Processing

  5. arXiv:2306.03009  [pdf, other

    stat.ML cs.LG stat.AP

    Using Sequences of Life-events to Predict Human Lives

    Authors: Germans Savcisens, Tina Eliassi-Rad, Lars Kai Hansen, Laust Mortensen, Lau Lilleholt, Anna Rogers, Ingo Zettler, Sune Lehmann

    Abstract: Over the past decade, machine learning has revolutionized computers' ability to analyze text through flexible computational models. Due to their structural similarity to written language, transformer-based architectures have also shown promise as tools to make sense of a range of multi-variate sequences from protein-structures, music, electronic health records to weather-forecasts. We can also rep… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  6. arXiv:2306.00561  [pdf, other

    cs.SD cs.AI eess.AS

    Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

    Authors: Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan

    Abstract: In this work, we propose a Multi-Window Masked Autoencoder (MW-MAE) fitted with a novel Multi-Window Multi-Head Attention (MW-MHA) module that facilitates the modelling of local-global interactions in every decoder transformer block through attention heads of several distinct local and global windows. Empirical results on ten downstream audio tasks show that MW-MAEs consistently outperform standar… ▽ More

    Submitted 1 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  7. arXiv:2305.17154  [pdf, other

    cs.LG cs.AI

    On convex decision regions in deep network representations

    Authors: Lenka Tětková, Thea Brüsch, Teresa Karen Scheidt, Fabian Martin Mager, Rasmus Ørtoft Aagaard, Jonathan Foldager, Tommy Sonne Alstrøm, Lars Kai Hansen

    Abstract: Current work on human-machine alignment aims at understanding machine-learned latent spaces and their correspondence to human representations. G{ä}rdenfors' conceptual spaces is a prominent framework for understanding human representations. Convexity of object regions in conceptual spaces is argued to promote generalizability, few-shot learning, and interpersonal alignment. Based on these insights… ▽ More

    Submitted 6 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  8. arXiv:2304.08984  [pdf, other

    cs.CV cs.LG

    Robustness of Visual Explanations to Common Data Augmentation

    Authors: Lenka Tětková, Lars Kai Hansen

    Abstract: As the use of deep neural networks continues to grow, understanding their behaviour has become more crucial than ever. Post-hoc explainability methods are a potential solution, but their reliability is being called into question. Our research investigates the response of post-hoc visual explanations to naturally occurring transformations, often referred to as augmentations. We anticipate explanati… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: Accepted to The 2nd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2023

  9. arXiv:2301.05983  [pdf, other

    stat.ML cs.LG

    On the role of Model Uncertainties in Bayesian Optimization

    Authors: Jonathan Foldager, Mikkel Jordahn, Lars Kai Hansen, Michael Riis Andersen

    Abstract: Bayesian optimization (BO) is a popular method for black-box optimization, which relies on uncertainty as part of its decision-making process when deciding which experiment to perform next. However, not much work has addressed the effect of uncertainty on the performance of the BO algorithm and to what extent calibrated uncertainties improve the ability to find the global optimum. In this work, we… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 14 pages, 4 figures, 2 tables

  10. arXiv:2111.03935  [pdf, other

    quant-ph physics.comp-ph

    Noise-Assisted Variational Quantum Thermalization

    Authors: Jonathan Foldager, Arthur Pesah, Lars Kai Hansen

    Abstract: Preparing thermal states on a quantum computer can have a variety of applications, from simulating many-body quantum systems to training machine learning models. Variational circuits have been proposed for this task on near-term quantum computers, but several challenges remain, such as finding a scalable cost-function, avoiding the need of purification, and mitigating noise effects. We propose a n… ▽ More

    Submitted 6 November, 2021; originally announced November 2021.

    Comments: 13 pages, 7 figures. Submitted to Scientific Reports

  11. arXiv:2109.12306  [pdf, other

    cs.IR cs.LG

    Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts

    Authors: Raluca Alexandra Fetic, Mikkel Jordahn, Lucas Chaves Lima, Rasmus Arpe Fogh Egebæk, Martin Carsten Nielsen, Benjamin Biering, Lars Kai Hansen

    Abstract: For a multilingual podcast streaming service, it is critical to be able to deliver relevant content to all users independent of language. Podcast content relevance is conventionally determined using various metadata sources. However, with the increasing quality of speech recognition in many languages, utilizing automatic transcriptions to provide better content recommendations becomes possible. In… ▽ More

    Submitted 25 September, 2021; originally announced September 2021.

  12. arXiv:2107.02253  [pdf, other

    cs.LG math.DG math.PR

    Generalization by design: Shortcuts to Generalization in Deep Learning

    Authors: Petr Taborsky, Lars Kai Hansen

    Abstract: We take a geometrical viewpoint and present a unifying view on supervised deep learning with the Bregman divergence loss function - this entails frequent classification and prediction tasks. Motivated by simulations we suggest that there is principally no implicit bias of vanilla stochastic gradient descent training of deep models towards "simpler" functions. Instead, we show that good generalizat… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 16 pages + 9 pages supplementary

  13. arXiv:2010.15718  [pdf, other

    cs.CR cs.DC eess.IV

    Minimal Model Structure Analysis for Input Reconstruction in Federated Learning

    Authors: Jia Qian, Hiba Nassar, Lars Kai Hansen

    Abstract: \ac{fl} proposed a distributed \ac{ml} framework where every distributed worker owns a complete copy of global model and their own data. The training is occurred locally, which assures no direct transmission of training data. However, the recent work \citep{zhu2019deep} demonstrated that input data from a neural network may be reconstructed only using knowledge of gradients of that network, which… ▽ More

    Submitted 5 November, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

  14. arXiv:2007.06381  [pdf, other

    cs.LG cs.AI stat.ML

    A simple defense against adversarial attacks on heatmap explanations

    Authors: Laura Rieger, Lars Kai Hansen

    Abstract: With machine learning models being used for more sensitive applications, we rely on interpretability methods to prove that no discriminating attributes were used for classification. A potential concern is the so-called "fair-washing" - manipulating a model such that the features used in reality are hidden and more innocuous features are shown to be important instead. In our work we present an ef… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: Accepted at 2020 Workshop on Human Interpretability in Machine Learning (WHI)

  15. arXiv:2007.04806  [pdf, other

    cs.LG cs.CV stat.ML

    Client Adaptation improves Federated Learning with Simulated Non-IID Clients

    Authors: Laura Rieger, Rasmus M. Th. Høegh, Lars K. Hansen

    Abstract: We present a federated learning approach for learning a client adaptable, robust model when data is non-identically and non-independently distributed (non-IID) across clients. By simulating heterogeneous clients, we show that adding learned client-specific conditioning improves model performance, and the approach is shown to work on balanced and imbalanced data set from both audio and image domain… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: 11 pages, 11 figures. To appear at International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with ICML 2020

  16. arXiv:2006.09046  [pdf, other

    cs.LG stat.ML

    Probabilistic Decoupling of Labels in Classification

    Authors: Jeppe Nørregaard, Lars Kai Hansen

    Abstract: In this paper we develop a principled, probabilistic, unified approach to non-standard classification tasks, such as semi-supervised, positive-unlabelled, multi-positive-unlabelled and noisy-label learning. We train a classifier on the given labels to predict the label-distribution. We then infer the underlying class-distributions by variationally optimizing a model of label-class transitions.

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: Submitted to ICML 2020 (not accepted)

  17. On the Limits to Multi-Modal Popularity Prediction on Instagram -- A New Robust, Efficient and Explainable Baseline

    Authors: Christoffer Riis, Damian Konrad Kowalczyk, Lars Kai Hansen

    Abstract: Our global population contributes visual content on platforms like Instagram, attempting to express themselves and engage their audiences, at an unprecedented and increasing rate. In this paper, we revisit the popularity prediction on Instagram. We present a robust, efficient, and explainable baseline for population-based popularity prediction, achieving strong ranking performance. We employ the l… ▽ More

    Submitted 20 February, 2021; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: Presented at ICAART 2021

    Journal ref: Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-484-8, pages 1200-1209, 2021

  18. arXiv:2003.08747  [pdf, other

    cs.CV

    IROF: a low resource evaluation metric for explanation methods

    Authors: Laura Rieger, Lars Kai Hansen

    Abstract: The adoption of machine learning in health care hinges on the transparency of the used algorithms, necessitating the need for explanation methods. However, despite a growing literature on explaining neural networks, no consensus has been reached on how to evaluate those explanation methods. We propose IROF, a new approach to evaluating explanation methods that circumvents the need for manual evalu… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

  19. arXiv:2002.05038  [pdf, other

    cs.DC cs.LG

    Robustness analytics to data heterogeneity in edge computing

    Authors: Jia Qian, Lars Kai Hansen, Xenofon Fafoutis, Prayag Tiwari, Hari Mohan Pandey

    Abstract: Federated Learning is a framework that jointly trains a model \textit{with} complete knowledge on a remotely placed centralized server, but \textit{without} the requirement of accessing the data stored in distributed machines. Some work assumes that the data generated from edge devices are identically and independently sampled from a common population distribution. However, such ideal sampling may… ▽ More

    Submitted 24 October, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

  20. The Complexity of Social Media Response: Statistical Evidence For One-Dimensional Engagement Signal in Twitter

    Authors: Damian Konrad Kowalczyk, Lars Kai Hansen

    Abstract: Many years after online social networks exceeded our collective attention, social influence is still built on attention capital. Quality is not a prerequisite for viral spreading, yet large diffusion cascades remain the hallmark of a social influencer. Consequently, our exposure to low-quality content and questionable influence is expected to increase. Since the conception of influence maximizatio… ▽ More

    Submitted 15 February, 2020; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Presented at ICAART 2020

    Report number: ICAART20-RP-238

    Journal ref: Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART (2020) 918-925

  21. arXiv:1906.10718  [pdf, other

    cs.DC cs.LG

    Active Learning Solution on Distributed Edge Computing

    Authors: Jia Qian, Sayantan Sengupta, Lars Kai Hansen

    Abstract: Industry 4.0 becomes possible through the convergence between Operational and Information Technologies. All the requirements to realize the convergence is integrated on the Fog Platform. Fog Platform is introduced between the cloud server and edge devices when the unprecedented generation of data causes the burden of the cloud server, leading the ineligible latency. In this new paradigm, we divide… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

  22. arXiv:1905.12403  [pdf, other

    cs.LG stat.ML

    Probabilistic Decoupling of Labels in Classification

    Authors: Jeppe Nørregaard, Lars Kai Hansen

    Abstract: We investigate probabilistic decoupling of labels supplied for training, from the underlying classes for prediction. Decoupling enables an inference scheme general enough to implement many classification problems, including supervised, semi-supervised, positive-unlabelled, noisy-label and suggests a general solution to the multi-positive-unlabelled learning problem. We test the method on the Fashi… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: 8 pages + 10 pages of supplementary material. NeurIPS preprint

  23. arXiv:1905.00709  [pdf, ps, other

    stat.ML cs.LG

    Phase transition in PCA with missing data: Reduced signal-to-noise ratio, not sample size!

    Authors: Niels Bruun Ipsen, Lars Kai Hansen

    Abstract: How does missing data affect our ability to learn signal structures? It has been shown that learning signal structure in terms of principal components is dependent on the ratio of sample size and dimensionality and that a critical number of observations is needed before learning starts (Biehl and Mietzner, 1993). Here we generalize this analysis to include missing data. Probabilistic principal com… ▽ More

    Submitted 2 May, 2019; originally announced May 2019.

    Comments: Accepted to ICML 2019. This version is the submitted paper

    Journal ref: International Conference on Machine Learning. 2019. pp. 2951-2960

  24. arXiv:1903.00519  [pdf, other

    cs.LG cs.AI stat.ML

    Aggregating explanation methods for stable and robust explainability

    Authors: Laura Rieger, Lars Kai Hansen

    Abstract: Despite a growing literature on explaining neural networks, no consensus has been reached on how to explain a neural network decision or how to evaluate an explanation. Our contributions in this paper are twofold. First, we investigate schemes to combine explanation methods and reduce model uncertainty to obtain a single aggregated explanation. We provide evidence that the aggregation is better at… ▽ More

    Submitted 20 March, 2020; v1 submitted 1 March, 2019; originally announced March 2019.

  25. Multi-View Bayesian Correlated Component Analysis

    Authors: Simon Kamronn, Andreas Trier Poulsen, Lars Kai Hansen

    Abstract: Correlated component analysis as proposed by Dmochowski et al. (2012) is a tool for investigating brain process similarity in the responses to multiple views of a given stimulus. Correlated components are identified under the assumption that the involved spatial networks are identical. Here we propose a hierarchical probabilistic model that can infer the level of universality in such multi-view da… ▽ More

    Submitted 7 February, 2018; originally announced February 2018.

    Journal ref: Neural Computation, 27, (10):220730, 2015

  26. arXiv:1710.11379  [pdf, other

    stat.ML

    Latent Space Oddity: on the Curvature of Deep Generative Models

    Authors: Georgios Arvanitidis, Lars Kai Hansen, Søren Hauberg

    Abstract: Deep generative models provide a systematic way to learn nonlinear data distributions, through a set of latent variables and a nonlinear "generator" function that maps latent points into the input space. The nonlinearity of the generator imply that the latent space gives a distorted view of the input space. Under mild conditions, we show that this distortion can be characterized by a stochastic Ri… ▽ More

    Submitted 13 December, 2021; v1 submitted 31 October, 2017; originally announced October 2017.

    Comments: Published at International Conference on Learning Representations (ICLR) 2018

  27. arXiv:1710.00633  [pdf, other

    cs.CV stat.ML

    Deep Convolutional Neural Networks for Interpretable Analysis of EEG Sleep Stage Scoring

    Authors: Albert Vilamala, Kristoffer H. Madsen, Lars K. Hansen

    Abstract: Sleep studies are important for diagnosing sleep disorders such as insomnia, narcolepsy or sleep apnea. They rely on manual scoring of sleep stages from raw polisomnography signals, which is a tedious visual task requiring the workload of highly trained professionals. Consequently, research efforts to purse for an automatic stage scoring based on machine learning techniques have been carried out o… ▽ More

    Submitted 2 October, 2017; originally announced October 2017.

    Comments: 8 pages, 1 figure, 2 tables, IEEE 2017 International Workshop on Machine Learning for Signal Processing

  28. Adaptive Smoothing in fMRI Data Processing Neural Networks

    Authors: Albert Vilamala, Kristoffer Hougaard Madsen, Lars Kai Hansen

    Abstract: Functional Magnetic Resonance Imaging (fMRI) relies on multi-step data processing pipelines to accurately determine brain activity; among them, the crucial step of spatial smoothing. These pipelines are commonly suboptimal, given the local optimisation strategy they use, treating each step in isolation. With the advent of new tools for deep learning, recent work has proposed to turn these pipeline… ▽ More

    Submitted 2 October, 2017; originally announced October 2017.

    Comments: 4 pages, 3 figures, 1 table, IEEE 2017 International Workshop on Pattern Recognition in Neuroimaging (PRNI)

  29. arXiv:1704.05748  [pdf, other

    q-bio.NC

    EEG source imaging assists decoding in a face recognition task

    Authors: Rasmus S. Andersen, Anders U. Eliasen, Nicolai Pedersen, Michael Riis Andersen, Sofie Therese Hansen, Lars Kai Hansen

    Abstract: EEG based brain state decoding has numerous applications. State of the art decoding is based on processing of the multivariate sensor space signal, however evidence is mounting that EEG source reconstruction can assist decoding. EEG source imaging leads to high-dimensional representations and rather strong a priori information must be invoked. Recent work by Edelman et al. (2016) has demonstrated… ▽ More

    Submitted 17 April, 2017; originally announced April 2017.

  30. arXiv:1610.04079  [pdf, other

    cs.CV q-bio.NC stat.ML

    Towards end-to-end optimisation of functional image analysis pipelines

    Authors: Albert Vilamala, Kristoffer Hougaard Madsen, Lars Kai Hansen

    Abstract: The study of neurocognitive tasks requiring accurate localisation of activity often rely on functional Magnetic Resonance Imaging, a widely adopted technique that makes use of a pipeline of data processing modules, each involving a variety of parameters. These parameters are frequently set according to the local goal of each specific module, not accounting for the rest of the pipeline. Given recen… ▽ More

    Submitted 13 October, 2016; originally announced October 2016.

    Comments: 7 pages, 2 figures

  31. arXiv:1606.02518  [pdf, other

    stat.ML

    A Locally Adaptive Normal Distribution

    Authors: Georgios Arvanitidis, Lars Kai Hansen, Søren Hauberg

    Abstract: The multivariate normal density is a monotonic function of the distance to the mean, and its ellipsoidal shape is due to the underlying Euclidean metric. We suggest to replace this metric with a locally adaptive, smoothly changing (Riemannian) metric that favors regions of high local density. The resulting locally adaptive normal distribution (LAND) is a generalization of the normal distribution t… ▽ More

    Submitted 23 September, 2016; v1 submitted 8 June, 2016; originally announced June 2016.

  32. arXiv:1604.03019  [pdf, other

    q-bio.NC cs.HC stat.AP

    EEG in the classroom: Synchronised neural recordings during video presentation

    Authors: Andreas Trier Poulsen, Simon Kamronn, Jacek Dmochowski, Lucas C. Parra, Lars Kai Hansen

    Abstract: We performed simultaneous recordings of electroencephalography (EEG) from multiple students in a classroom, and measured the inter-subject correlation (ISC) of activity evoked by a common video stimulus. The neural reliability, as quantified by ISC, has been linked to engagement and attentional modulation in earlier studies that used high-grade equipment in laboratory settings. Here we reproduce m… ▽ More

    Submitted 27 December, 2016; v1 submitted 11 April, 2016; originally announced April 2016.

    Comments: 14 pages, 5 figures, 3 tables. Preprint version. Revision of original preprint. Supplementary materials added as ancillary file

  33. arXiv:1510.02795  [pdf, other

    cs.CV

    Dreaming More Data: Class-dependent Distributions over Diffeomorphisms for Learned Data Augmentation

    Authors: Søren Hauberg, Oren Freifeld, Anders Boesen Lindbo Larsen, John W. Fisher III, Lars Kai Hansen

    Abstract: Data augmentation is a key element in training high-dimensional models. In this approach, one synthesizes new observations by applying pre-specified transformations to the original training data; e.g.~new images are formed by rotating old ones. Current augmentation schemes, however, rely on manual specification of the applied transformations, making data augmentation an implicit form of feature en… ▽ More

    Submitted 30 June, 2016; v1 submitted 9 October, 2015; originally announced October 2015.

    Journal ref: Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, pp. 342-350, 2016

  34. arXiv:1509.04752  [pdf, other

    stat.ML stat.CO stat.ME

    Bayesian inference for spatio-temporal spike-and-slab priors

    Authors: Michael Riis Andersen, Aki Vehtari, Ole Winther, Lars Kai Hansen

    Abstract: In this work, we address the problem of solving a series of underdetermined linear inverse problems subject to a sparsity constraint. We generalize the spike-and-slab prior distribution to encode a priori correlation of the support of the solution in both space and time by imposing a transformed Gaussian process on the spike-and-slab probabilities. An expectation propagation (EP) algorithm for pos… ▽ More

    Submitted 1 December, 2017; v1 submitted 15 September, 2015; originally announced September 2015.

    Comments: 58 pages, 17 figures

    Journal ref: Journal of Machine Learning Research, 18(139):1-58, 2017

  35. arXiv:1508.04556  [pdf, ps, other

    stat.ML

    Spatio-temporal Spike and Slab Priors for Multiple Measurement Vector Problems

    Authors: Michael Riis Andersen, Ole Winther, Lars Kai Hansen

    Abstract: We are interested in solving the multiple measurement vector (MMV) problem for instances, where the underlying sparsity pattern exhibit spatio-temporal structure motivated by the electroencephalogram (EEG) source localization problem. We propose a probabilistic model that takes this structure into account by generalizing the structured spike and slab prior and the associated Expectation Propagatio… ▽ More

    Submitted 19 August, 2015; originally announced August 2015.

    Comments: 6 pages, 6 figures, accepted for presentation at SPARS 2015

  36. arXiv:1405.6886  [pdf, other

    cs.IR stat.ML

    A Topic Model Approach to Multi-Modal Similarity

    Authors: Rasmus Troelsgård, Bjørn Sand Jensen, Lars Kai Hansen

    Abstract: Calculating similarities between objects defined by many heterogeneous data modalities is an important challenge in many multimedia applications. We use a multi-modal topic model as a basis for defining such a similarity between objects. We propose to compare the resulting similarities from different model realizations using the non-parametric Mantel test. The approach is evaluated on a music data… ▽ More

    Submitted 27 May, 2014; originally announced May 2014.

    Comments: topic modelling workshop at NIPS 2013

  37. arXiv:1403.2745  [pdf, other

    cs.CY

    Privacy for Personal Neuroinformatics

    Authors: Arkadiusz Stopczynski, Dazza Greenwood, Lars Kai Hansen, Alex Pentland

    Abstract: Human brain activity collected in the form of Electroencephalography (EEG), even with low number of sensors, is an extremely rich signal. Traces collected from multiple channels and with high sampling rates capture many important aspects of participants' brain activity and can be used as a unique personal identifier. The motivation for sharing EEG signals is significant, as a mean to understand th… ▽ More

    Submitted 11 March, 2014; originally announced March 2014.

  38. arXiv:1311.6976  [pdf, ps, other

    stat.ML cs.LG stat.AP stat.ME

    Dimensionality reduction for click-through rate prediction: Dense versus sparse representation

    Authors: Bjarne Ørum Fruergaard, Toke Jansen Hansen, Lars Kai Hansen

    Abstract: In online advertising, display ads are increasingly being placed based on real-time auctions where the advertiser who wins gets to serve the ad. This is called real-time bidding (RTB). In RTB, auctions have very tight time constraints on the order of 100ms. Therefore mechanisms for bidding intelligently such as clickthrough rate prediction need to be sufficiently fast. In this work, we propose to… ▽ More

    Submitted 13 May, 2014; v1 submitted 27 November, 2013; originally announced November 2013.

    Comments: Presented at the Probabilistic Models for Big Data workshop at NIPS 2013

  39. Kernel Multivariate Analysis Framework for Supervised Subspace Learning: A Tutorial on Linear and Kernel Multivariate Methods

    Authors: Jerónimo Arenas-García, Kaare Brandt Petersen, Gustavo Camps-Valls, Lars Kai Hansen

    Abstract: Feature extraction and dimensionality reduction are important tasks in many fields of science dealing with signal processing and analysis. The relevance of these techniques is increasing as current sensory devices are developed with ever higher resolution, and problems involving multimodal data sources become more common. A plethora of feature extraction methods are available in the literature col… ▽ More

    Submitted 18 October, 2013; originally announced October 2013.

    Journal ref: IEEE Signal Processing Magazine, 30(4), 16-29, 2013

  40. The Smartphone Brain Scanner: A Mobile Real-time Neuroimaging System

    Authors: Arkadiusz Stopczynski, Carsten Stahlhut, Jakob Eg Larsen, Michael Kai Petersen, Lars Kai Hansen

    Abstract: Combining low cost wireless EEG sensors with smartphones offers novel opportunities for mobile brain imaging in an everyday context. We present a framework for building multi-platform, portable EEG applications with real-time 3D source reconstruction. The system - Smartphone Brain Scanner - combines an off-the-shelf neuroheadset or EEG cap with a smartphone or tablet, and as such represents the fi… ▽ More

    Submitted 1 April, 2013; originally announced April 2013.

  41. FindZebra: A search engine for rare diseases

    Authors: Radu Dragusin, Paula Petcu, Christina Lioma, Birger Larsen, Henrik L. Jørgensen, Ingemar J. Cox, Lars Kai Hansen, Peter Ingwersen, Ole Winther

    Abstract: Background: The web has become a primary information resource about illnesses and treatments for both medical and non-medical users. Standard web search is by far the most common interface for such information. It is therefore of interest to find out how well web search engines work for diagnostic queries and what factors contribute to successes and failures. Among diseases, rare (or orphan) disea… ▽ More

    Submitted 13 March, 2013; originally announced March 2013.

    Journal ref: International Journal of Medical Informatics, Available online 23 February 2013, ISSN 1386-5056

  42. arXiv:1101.5097  [pdf, ps, other

    cs.SI cs.LG physics.soc-ph

    Infinite Multiple Membership Relational Modeling for Complex Networks

    Authors: Morten Mørup, Mikkel N. Schmidt, Lars Kai Hansen

    Abstract: Learning latent structure in complex networks has become an important problem fueled by many types of networked data originating from practically all fields of science. In this paper, we propose a new non-parametric Bayesian multiple-membership latent feature model for networks. Contrary to existing multiple-membership models that scale quadratically in the number of vertices the proposed model sc… ▽ More

    Submitted 26 January, 2011; originally announced January 2011.

    Comments: 8 pages, 4 figures

  43. arXiv:1101.0510  [pdf, ps, other

    cs.SI cs.CL physics.soc-ph

    Good Friends, Bad News - Affect and Virality in Twitter

    Authors: Lars Kai Hansen, Adam Arvidsson, Finn Årup Nielsen, Elanor Colleoni, Michael Etter

    Abstract: The link between affect, defined as the capacity for sentimental arousal on the part of a message, and virality, defined as the probability that it be sent along, is of significant theoretical and practical importance, e.g. for viral marketing. A quantitative study of emailing of articles from the NY Times finds a strong link between positive affect and virality, and, based on psychological theori… ▽ More

    Submitted 3 January, 2011; originally announced January 2011.

    Comments: 14 pages, 1 table. Submitted to The 2011 International Workshop on Social Computing, Network, and Services (SocialComNet 2011)

    MSC Class: 1D30 ACM Class: H.4.3; J.4

  44. arXiv:1008.1398  [pdf, ps, other

    cs.LG

    Semi-Supervised Kernel PCA

    Authors: Christian Walder, Ricardo Henao, Morten Mørup, Lars Kai Hansen

    Abstract: We present three generalisations of Kernel Principal Components Analysis (KPCA) which incorporate knowledge of the class labels of a subset of the data points. The first, MV-KPCA, penalises within class variances similar to Fisher discriminant analysis. The second, LSKPCA is a hybrid of least squares regression and kernel PCA. The final LR-KPCA is an iteratively reweighted version of the previous… ▽ More

    Submitted 8 August, 2010; originally announced August 2010.

  45. arXiv:0903.0687  [pdf, ps, other

    physics.soc-ph physics.data-an

    Second-Order Assortative Mixing in Social Networks

    Authors: Shi Zhou, Ingemar J. Cox, Lars K. Hansen

    Abstract: In a social network, the number of links of a node, or node degree, is often assumed as a proxy for the node's importance or prominence within the network. It is known that social networks exhibit the (first-order) assortative mixing, i.e. if two nodes are connected, they tend to have similar node degrees, suggesting that people tend to mix with those of comparable prominence. In this paper, we re… ▽ More

    Submitted 23 October, 2017; v1 submitted 3 March, 2009; originally announced March 2009.

    Comments: Cite as: Zhou S., Cox I.J., Hansen L.K. (2017) Second-Order Assortative Mixing in Social Networks. In: Goncalves B., Menezes R., Sinatra R., Zlatic V. (eds) Complex Networks VIII. CompleNet 2017. Springer Proceedings in Complexity. Springer, Cham. https://doi.org/10.1007/978-3-319-54241-6_1

  46. arXiv:0710.4867  [pdf, ps, other

    physics.data-an physics.soc-ph

    Bi-clique Communities

    Authors: Sune Lehmann, Martin Schwartz, Lars Kai Hansen

    Abstract: We present a novel method for detecting communities in bipartite networks. Based on an extension of the $k$-clique community detection algorithm, we demonstrate how modular structure in bipartite networks presents itself as overlap** bicliques. If bipartite information is available, the bi-clique community detection algorithm retains all of the advantages of the $k$-clique algorithm, but avoid… ▽ More

    Submitted 7 July, 2008; v1 submitted 25 October, 2007; originally announced October 2007.

    Comments: 10 pages, 6 figures

    Journal ref: Phys. Rev. E, v78, p016108 (2008)

  47. Deterministic Modularity Optimization

    Authors: Sune Lehmann, Lars Kai Hansen

    Abstract: We study community structure of networks. We have developed a scheme for maximizing the modularity Q based on mean field methods. Further, we have defined a simple family of random networks with community structure; we understand the behavior of these networks analytically. Using these networks, we show how the mean field methods display better performance than previously known deterministic met… ▽ More

    Submitted 1 March, 2007; v1 submitted 31 January, 2007; originally announced January 2007.

    Comments: 7 pages, 4 figures, minor changes