Showing 1–50 of 54 results for author: Kjellstrom, H

  1. arXiv:2407.01244  [pdf, other]

    cs.CV

    CLHOP: Combined Audio-Video Learning for Horse 3D Pose and Shape Estimation

    Authors: Ci Li, Elin Hernlund, Hedvig Kjellström, Silvia Zuffi

    Abstract: In the monocular setting, predicting 3D pose and shape of animals typically relies solely on visual information, which is highly under-constrained. In this work, we explore using audio to enhance 3D shape and motion recovery of horses from monocular video. We test our approach on two datasets: an indoor treadmill dataset for 3D evaluation and an outdoor dataset capturing diverse horse movements, t…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: CVPR CV4Animals Workshop 2024

  2. arXiv:2406.08311  [pdf, other]

    cs.LG cs.AI

    Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework

    Authors: Ruibo Tu, Zineb Senane, Lele Cao, Cheng Zhang, Hedvig Kjellström, Gustav Eje Henter

    Abstract: Tabular synthesis models remain ineffective at capturing complex dependencies, and the quality of synthetic data is still insufficient for comprehensive downstream tasks, such as prediction under distribution shifts, automated decision-making, and cross-table understanding. A major challenge is the lack of prior knowledge about underlying structures and high-order relationships in tabular data. We…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2405.04161  [pdf, other]

    cs.LG cs.AI

    Opportunities for machine learning in scientific discovery

    Authors: Ricardo Vinuesa, Jean Rabault, Hossein Azizpour, Stefan Bauer, Bingni W. Brunton, Arne Elofsson, Elias Jarlebring, Hedvig Kjellstrom, Stefano Markidis, David Marlevi, Paola Cinnella, Steven L. Brunton

    Abstract: Technological advancements have substantially increased computational power and data availability, enabling the application of powerful machine-learning (ML) techniques across various fields. However, our ability to leverage ML methods for scientific discovery, i.e. to obtain fundamental and formalized knowledge about natural processes, is still in its infancy. In this review, we explore how…

    Submitted 7 May, 2024; originally announced May 2024.

  4. arXiv:2307.15063  [pdf, other]

    cs.CV

    To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation

    Authors: Marc Botet Colomer, Pier Luigi Dovesi, Theodoros Panagiotakopoulos, Joao Frederico Carvalho, Linus Härenstam-Nielsen, Hossein Azizpour, Hedvig Kjellström, Daniel Cremers, Matteo Poggi

    Abstract: The goal of Online Domain Adaptation for semantic segmentation is to handle unforeseeable domain changes that occur during deployment, like sudden weather events. However, the high computational costs associated with brute-force adaptation make this paradigm unfeasible for real-world applications. In this paper we propose HAMLET, a Hardware-Aware Modular Least Expensive Training framework for real…

    Submitted 7 August, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: ICCV 2023. The first two authors contributed equally. Project page: https://marcbotet.github.io/hamlet-web/

  5. arXiv:2306.05311  [pdf, other]

    cs.CV

    Predictive Modeling of Equine Activity Budgets Using a 3D Skeleton Reconstructed from Surveillance Recordings

    Authors: Ernest Pokropek, Sofia Broomé, Pia Haubro Andersen, Hedvig Kjellström

    Abstract: In this work, we present a pipeline to reconstruct the 3D pose of a horse from 4 simultaneous surveillance camera recordings. Our environment poses interesting challenges to tackle, such as limited field of view of the cameras and a relatively closed and small environment. The pipeline consists of training a 2D markerless pose estimation model to work on every viewpoint, then applying it to the video…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 3rd Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (in conjunction with CVPR 2023) [POSTER]

  6. arXiv:2304.04681  [pdf, other]

    cs.CV cs.LG

    Controllable Motion Synthesis and Reconstruction with Autoregressive Diffusion Models

    Authors: Wenjie Yin, Ruibo Tu, Hang Yin, Danica Kragic, Hedvig Kjellström, Mårten Björkman

    Abstract: Data-driven and controllable human motion synthesis and prediction are active research areas with various applications in interactive media and social robotics. Challenges remain in these fields for generating diverse motions given past observations and dealing with imperfect poses. This paper introduces MoDiff, an autoregressive probabilistic diffusion model over motion sequences conditioned on c…

    Submitted 3 April, 2023; originally announced April 2023.

  7. arXiv:2209.08660  [pdf, ps, other]

    cs.LG cs.CV

    Learn the Time to Learn: Replay Scheduling in Continual Learning

    Authors: Marcus Klasson, Hedvig Kjellström, Cheng Zhang

    Abstract: Replay methods are known to be successful at mitigating catastrophic forgetting in continual learning scenarios despite having limited access to historical data. However, storing historical data is cheap in many real-world settings, yet replaying all historical data is often prohibited due to processing time constraints. In such settings, we propose that continual learning systems should learn the…

    Submitted 20 November, 2023; v1 submitted 18 September, 2022; originally announced September 2022.

    Comments: Published in TMLR (2023)

  8. arXiv:2206.08405  [pdf, ps, other]

    cs.CV

    Going Deeper than Tracking: a Survey of Computer-Vision Based Recognition of Animal Pain and Affective States

    Authors: Sofia Broomé, Marcelo Feighelstein, Anna Zamansky, Gabriel Carreira Lencioni, Pia Haubro Andersen, Francisca Pessanha, Marwa Mahmoud, Hedvig Kjellström, Albert Ali Salah

    Abstract: Advances in animal motion tracking and pose recognition have been a game changer in the study of animal behavior. Recently, an increasing number of works go 'deeper' than tracking, and address automated recognition of animals' internal states such as emotions and pain with the aim of improving animal welfare, making this a timely moment for a systematization of the field. This paper provides a com…

    Submitted 16 June, 2022; originally announced June 2022.

  9. arXiv:2201.09366  [pdf, other]

    cs.LG stat.ME

    Optimal transport for causal discovery

    Authors: Ruibo Tu, Kun Zhang, Hedvig Kjellström, Cheng Zhang

    Abstract: To determine causal relationships between two variables, approaches based on Functional Causal Models (FCMs) have been proposed by properly restricting model classes; however, the performance is sensitive to the model assumptions, which makes it difficult to use. In this paper, we provide a novel dynamical-system view of FCMs and propose a new framework for identifying causal direction in the biva…

    Submitted 29 March, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

  10. arXiv:2112.12175  [pdf, other]

    cs.CV

    Recur, Attend or Convolve? On Whether Temporal Modeling Matters for Cross-Domain Robustness in Action Recognition

    Authors: Sofia Broomé, Ernest Pokropek, Boyu Li, Hedvig Kjellström

    Abstract: Most action recognition models today are highly parameterized, and evaluated on datasets with appearance-wise distinct classes. It has also been shown that 2D Convolutional Neural Networks (CNNs) tend to be biased toward texture rather than shape in still image recognition tasks, in contrast to humans. Taken together, this raises suspicion that large video models partly learn spurious spatial text…

    Submitted 11 October, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

  11. arXiv:2110.15761  [pdf, other]

    stat.ML cs.LG

    Aligned Multi-Task Gaussian Process

    Authors: Olga Mikheeva, Ieva Kazlauskaite, Adam Hartshorne, Hedvig Kjellström, Carl Henrik Ek, Neill D. F. Campbell

    Abstract: Multi-task learning requires accurate identification of the correlations between tasks. In real-world time-series, tasks are rarely perfectly temporally aligned; traditional multi-task models do not account for this and subsequent errors in correlation estimation will result in poor predictive performance and uncertainty quantification. We introduce a method that automatically accounts for tempora…

    Submitted 29 October, 2021; originally announced October 2021.

  12. arXiv:2110.06257  [pdf, other]

    cs.LG stat.ML

    Causal Discovery from Conditionally Stationary Time Series

    Authors: Carles Balsells-Rodas, Ruibo Tu, Hedvig Kjellstrom, Yingzhen Li

    Abstract: Causal discovery, i.e., inferring underlying causal relationships from observational data, has been shown to be highly challenging for AI systems. In time series modeling context, traditional causal discovery methods mainly consider constrained scenarios with fully observed variables and/or data from stationary time-series. We develop a causal discovery approach to handle a wide class of non-stati…

    Submitted 23 February, 2024; v1 submitted 12 October, 2021; originally announced October 2021.

  13. arXiv:2108.13258  [pdf, other]

    cs.CV

    Equine Pain Behavior Classification via Self-Supervised Disentangled Pose Representation

    Authors: Maheen Rashid, Sofia Broomé, Katrina Ask, Elin Hernlund, Pia Haubro Andersen, Hedvig Kjellström, Yong Jae Lee

    Abstract: Timely detection of horse pain is important for equine welfare. Horses express pain through their facial and body behavior, but may hide signs of pain from unfamiliar human observers. In addition, collecting visual data with detailed annotation of horse behavior and pain state is both cumbersome and not scalable. Consequently, a pragmatic equine pain classification system would use video of the un…

    Submitted 30 August, 2021; originally announced August 2021.

  14. arXiv:2108.05762  [pdf, other]

    cs.HC cs.LG cs.MM

    Multimodal analysis of the predictability of hand-gesture properties

    Authors: Taras Kucherenko, Rajmund Nagy, Michael Neff, Hedvig Kjellström, Gustav Eje Henter

    Abstract: Embodied conversational agents benefit from being able to accompany their speech with gestures. Although many data-driven approaches to gesture generation have been proposed in recent years, it is still unclear whether such systems can consistently generate gestures that convey meaning. We investigate which gesture properties (phase, category, and semantics) can be predicted from speech text and/o…

    Submitted 14 January, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: Accepted at the International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2022

  15. arXiv:2106.14736  [pdf, other]

    cs.HC cs.CV cs.GR cs.LG

    Speech2Properties2Gestures: Gesture-Property Prediction as a Tool for Generating Representational Gestures from Speech

    Authors: Taras Kucherenko, Rajmund Nagy, Patrik Jonell, Michael Neff, Hedvig Kjellström, Gustav Eje Henter

    Abstract: We propose a new framework for gesture generation, aiming to allow data-driven approaches to produce more semantically rich gestures. Our approach first predicts whether to gesture, followed by a prediction of the gesture properties. Those properties are then used as conditioning for a modern probabilistic gesture-generation model capable of high-quality output. This empowers the approach to gener…

    Submitted 13 August, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at the ACM International Conference on Intelligent Virtual Agents (IVA 2021)

    ACM Class: I.2.7; I.2.6; I.3.7

    Journal ref: International Conference on Intelligent Virtual Agents 2021

  16. arXiv:2106.10102  [pdf, other]

    cs.CV

    hSMAL: Detailed Horse Shape and Pose Reconstruction for Motion Pattern Recognition

    Authors: Ci Li, Nima Ghorbani, Sofia Broomé, Maheen Rashid, Michael J. Black, Elin Hernlund, Hedvig Kjellström, Silvia Zuffi

    Abstract: In this paper we present our preliminary work on model-based behavioral analysis of horse motion. Our approach is based on the SMAL model, a 3D articulated statistical model of animal shape. We define a novel SMAL model for horses based on a new template, skeleton and shape space learned from 37 horse toys. We test the accuracy of our hSMAL model in reconstructing a horse from 3D mocap data and…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: CV4Animals Workshop in CVPR 2021

  17. Sharing Pain: Using Pain Domain Transfer for Video Recognition of Low Grade Orthopedic Pain in Horses

    Authors: Sofia Broomé, Katrina Ask, Maheen Rashid, Pia Haubro Andersen, Hedvig Kjellström

    Abstract: Orthopedic disorders are common among horses, often leading to euthanasia, which often could have been avoided with earlier detection. These conditions often create varying degrees of subtle long-term pain. It is challenging to train a visual pain recognition method with video data depicting such pain, since the resulting pain behavior also is subtle, sparsely appearing, and varying, making it cha…

    Submitted 7 January, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

  18. arXiv:2102.12302  [pdf, other]

    cs.HC cs.GR cs.LG

    A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents

    Authors: Rajmund Nagy, Taras Kucherenko, Birger Moell, André Pereira, Hedvig Kjellström, Ulysses Bernardet

    Abstract: Embodied conversational agents (ECAs) benefit from non-verbal behavior for natural and efficient interaction with users. Gesticulation - hand and arm movements accompanying speech - is an essential part of non-verbal behavior. Gesture generation models have been developed for several decades: starting with rule-based and ending with mainly data-driven methods. To date, recent end-to-end gesture ge…

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: Rajmund Nagy and Taras Kucherenko contributed equally to this work. To be published in the Proceedings of the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021), Online, May 3-7, 2021, IFAA-MAS, 3 pages, 1 figure

  19. arXiv:2102.08983  [pdf, other]

    cs.CV

    Automated Detection of Equine Facial Action Units

    Authors: Zhenghong Li, Sofia Broomé, Pia Haubro Andersen, Hedvig Kjellström

    Abstract: The recently developed Equine Facial Action Coding System (EquiFACS) provides a precise and exhaustive, but laborious, manual labelling method of facial action units of the horse. To automate parts of this process, we propose a Deep Learning-based method to detect EquiFACS units automatically from images. We use a cascade framework; we firstly train several object detectors to detect the predefine…

    Submitted 4 May, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

  20. arXiv:2102.02642  [pdf, other]

    stat.ML cs.LG stat.AP stat.CO

    Asymptotically Exact and Fast Gaussian Copula Models for Imputation of Mixed Data Types

    Authors: Benjamin Christoffersen, Mark Clements, Keith Humphreys, Hedvig Kjellström

    Abstract: Missing values with mixed data types is a common problem in a large number of machine learning applications such as processing of surveys and in different medical applications. Recently, Gaussian copula models have been suggested as a means of performing imputation of missing values using a probabilistic framework. While the present Gaussian copula models have shown to yield state of the art perfo…

    Submitted 1 July, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: 20 pages, 1 figures, and 4 tables

  21. arXiv:2101.05851  [pdf, other]

    cs.AI

    A Subjective Model of Human Decision Making Based on Quantum Decision Theory

    Authors: Chenda Zhang, Hedvig Kjellström

    Abstract: Computer modeling of human decision making is of large importance for, e.g., sustainable transport, urban development, and online recommendation systems. In this paper we present a model for predicting the behavior of an individual during a binary game under different amounts of risk, gain, and time pressure. The model is based on Quantum Decision Theory (QDT), which has been shown to enable model…

    Submitted 14 January, 2021; originally announced January 2021.

  22. arXiv:2012.05846  [pdf, other]

    cs.CV cs.LG

    Full-Glow: Fully conditional Glow for more realistic image generation

    Authors: Moein Sorkhei, Gustav Eje Henter, Hedvig Kjellström

    Abstract: Autonomous agents, such as driverless cars, require large amounts of labeled visual data for their training. A viable approach for acquiring such data is training a generative model with collected real data, and then augmenting the collected real dataset with synthetic images from the model, generated with control of the scene layout and ground truth labeling. In this paper we propose Full-Glow, a…

    Submitted 7 October, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: Accepted to DAGM GCPR 2021

    MSC Class: 68T07 ACM Class: I.4.0; I.2.9; I.2.6; G.3; I.3.3

  23. arXiv:2010.11300  [pdf, ps, other]

    cs.LG cs.CY

    How Do Fair Decisions Fare in Long-term Qualification?

    Authors: Xueru Zhang, Ruibo Tu, Yang Liu, Mingyan Liu, Hedvig Kjellström, Kun Zhang, Cheng Zhang

    Abstract: Although many fairness criteria have been proposed for decision making, their long-term impact on the well-being of a population remains unclear. In this work, we study the dynamics of population qualification and algorithmic decisions under a partially observed Markov decision problem setting. By characterizing the equilibrium of such dynamics, we analyze the long-term impact of static fairness c…

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: Accepted to the 34th Conference on Neural Information Processing Systems (NeurIPS)

  24. arXiv:2007.09170  [pdf, other]

    cs.CV cs.GR cs.HC cs.LG

    Moving fast and slow: Analysis of representations and post-processing in speech-driven automatic gesture generation

    Authors: Taras Kucherenko, Dai Hasegawa, Naoshi Kaneko, Gustav Eje Henter, Hedvig Kjellström

    Abstract: This paper presents a novel framework for speech-driven gesture production, applicable to virtual agents to enhance human-computer interaction. Specifically, we extend recent deep-learning-based, data-driven methods for speech-driven gesture generation by incorporating representation learning. Our model takes speech as input and produces gestures as output, in the form of a sequence of 3D coordina…

    Submitted 28 January, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Extension of our IVA'19 paper. Accepted at the International Journal of Human-Computer Interaction. See more at https://svito-zar.github.io/audio2gestures/. arXiv admin note: substantial text overlap with arXiv:1903.03369

    ACM Class: I.2.7; I.2.6; I.3.7

    Journal ref: Int. J. Hum. Comput. Interact. (2021)

  25. arXiv:2002.01449  [pdf, other]

    cs.CV

    Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks

    Authors: Maheen Rashid, Hedvig Kjellström, Yong Jae Lee

    Abstract: We present a method for weakly-supervised action localization based on graph convolutions. In order to find and classify video time segments that correspond to relevant action classes, a system must be able to both identify discriminative time segments in each video, and identify the full extent of each action. Achieving this with weak video level labels requires the system to use similarity and d…

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: Accepted at WACV 2020

  26. arXiv:2002.00367  [pdf, other]

    cs.CV

    Interpreting video features: a comparison of 3D convolutional networks and convolutional LSTM networks

    Authors: Joonatan Mänttäri, Sofia Broomé, John Folkesson, Hedvig Kjellström

    Abstract: A number of techniques for interpretability have been presented for deep learning in computer vision, typically with the goal of understanding what the networks have based their classification on. However, interpretability for deep video architectures is still in its infancy and we do not yet have a clear concept of how to decode spatiotemporal features. In this paper, we present a study comparing…

    Submitted 10 July, 2020; v1 submitted 2 February, 2020; originally announced February 2020.

  27. arXiv:2001.09886  [pdf, other]

    cs.LG stat.ML

    Bayesian nonparametric shared multi-sequence time series segmentation

    Authors: Olga Mikheeva, Ieva Kazlauskaite, Hedvig Kjellström, Carl Henrik Ek

    Abstract: In this paper, we introduce a method for segmenting time series data using tools from Bayesian nonparametrics. We consider the task of temporal segmentation of a set of time series data into representative stationary segments. We use Gaussian process (GP) priors to impose our knowledge about the characteristics of the underlying stationary segments, and use a nonparametric distribution to partitio…

    Submitted 27 January, 2020; originally announced January 2020.

  28. arXiv:2001.09326  [pdf, other]

    cs.HC cs.LG eess.AS

    Gesticulator: A framework for semantically-aware speech-driven gesture generation

    Authors: Taras Kucherenko, Patrik Jonell, Sanne van Waveren, Gustav Eje Henter, Simon Alexanderson, Iolanda Leite, Hedvig Kjellström

    Abstract: During speech, people spontaneously gesticulate, which plays a key role in conveying information. Similarly, realistic co-speech gestures are crucial to enable natural and smooth interactions with social agents. Current end-to-end co-speech gesture generation systems use a single modality for representing speech: either audio or text. These systems are therefore confined to producing either acoust…

    Submitted 14 January, 2021; v1 submitted 25 January, 2020; originally announced January 2020.

    Comments: ICMI 2020 Best Paper Award. Code is available. 9 pages, 6 figures

    ACM Class: I.2.7; I.2.6; I.3.7

    Journal ref: Proceedings of the 2020 International Conference on Multimodal Interaction (ICMI '20)

  29. arXiv:1910.00541  [pdf, other]

    cs.CV cs.RO

    Real-Time Semantic Stereo Matching

    Authors: Pier Luigi Dovesi, Matteo Poggi, Lorenzo Andraghetti, Miquel Martí, Hedvig Kjellström, Alessandro Pieropan, Stefano Mattoccia

    Abstract: Scene understanding is paramount in robotics, self-navigation, augmented reality, and many other fields. To fully accomplish this task, an autonomous agent has to infer the 3D structure of the sensed scene (to know where it looks at) and its content (to know what it sees). To tackle the two tasks, deep neural networks trained to infer semantic segmentation and depth from stereo images are often th…

    Submitted 24 February, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: 8 pages, 3 figures. Accepted to ICRA 2020

  30. arXiv:1906.04933  [pdf, other]

    cs.LG cs.CV stat.ML

    Non-Parametric Calibration for Classification

    Authors: Jonathan Wenger, Hedvig Kjellström, Rudolph Triebel

    Abstract: Many applications of classification methods not only require high accuracy but also reliable estimation of predictive uncertainty. However, while many current classification frameworks, in particular deep neural networks, achieve high accuracy, they tend to incorrectly estimate uncertainty. In this paper, we propose a method that adjusts the confidence estimates of a general classifier such that t…

    Submitted 27 February, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

  31. arXiv:1906.01732  [pdf, other]

    cs.LG stat.ML

    Neuropathic Pain Diagnosis Simulator for Causal Discovery Algorithm Evaluation

    Authors: Ruibo Tu, Kun Zhang, Bo Christer Bertilson, Hedvig Kjellström, Cheng Zhang

    Abstract: Discovery of causal relations from observational data is essential for many disciplines of science and real-world applications. However, unlike other machine learning algorithms, whose development has been greatly fostered by a large amount of available benchmark datasets, causal discovery algorithms are notoriously difficult to be systematically evaluated because few datasets with known ground-tr…

    Submitted 28 October, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted by NeurIPS 2019, 6 figures, 10 tables

  32. Analyzing Input and Output Representations for Speech-Driven Gesture Generation

    Authors: Taras Kucherenko, Dai Hasegawa, Gustav Eje Henter, Naoshi Kaneko, Hedvig Kjellström

    Abstract: This paper presents a novel framework for automatic speech-driven gesture generation, applicable to human-agent interaction including both virtual agents and robots. Specifically, we extend recent deep-learning-based, data-driven methods for speech-driven gesture generation by incorporating representation learning. Our model takes speech as input and produces gestures as output, in the form of a s…

    Submitted 11 June, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: Accepted at IVA '19. Shorter version published at AAMAS '19. The code is available at https://github.com/GestureGeneration/Speech_driven_gesture_generation_with_autoencoder

    ACM Class: I.2.6; I.5.1; J.4

  33. arXiv:1901.02106  [pdf, other]

    cs.CV

    Dynamics are Important for the Recognition of Equine Pain in Video

    Authors: Sofia Broomé, Karina Bech Gleerup, Pia Haubro Andersen, Hedvig Kjellström

    Abstract: A prerequisite to successfully alleviate pain in animals is to recognize it, which is a great challenge in non-verbal species. Furthermore, prey animals such as horses tend to hide their pain. In this study, we propose a deep recurrent two-stream architecture for the task of distinguishing pain from non-pain in videos of horses. Different models are evaluated on a unique dataset showing horses und…

    Submitted 24 May, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

    Comments: CVPR 2019: IEEE Conference on Computer Vision and Pattern Recognition

  34. arXiv:1901.00711  [pdf, other]

    cs.CV

    A Hierarchical Grocery Store Image Dataset with Visual and Semantic Labels

    Authors: Marcus Klasson, Cheng Zhang, Hedvig Kjellström

    Abstract: Image classification models built into visual support systems and other assistive devices need to provide accurate predictions about their environment. We focus on an application of assistive technology for people with visual impairments, for daily activities such as shopping or cooking. In this paper, we provide a new benchmark dataset for a challenging task in this application - classification o…

    Submitted 3 January, 2019; originally announced January 2019.

    Comments: To appear in IEEE Winter Conference on Applications of Computer Vision (WACV) 2019

  35. arXiv:1811.07627  [pdf, ps, other]

    cs.LG stat.ML

    Mixed Likelihood Gaussian Process Latent Variable Model

    Authors: Samuel Murray, Hedvig Kjellström

    Abstract: We present the Mixed Likelihood Gaussian process latent variable model (GP-LVM), capable of modeling data with attributes of different types. The standard formulation of GP-LVM assumes that each observation is drawn from a Gaussian distribution, which makes the model unsuited for data with e.g. categorical or nominal attributes. Our model, for which we use a sampling based variational inference, i…

    Submitted 19 November, 2018; originally announced November 2018.

  36. arXiv:1810.03435  [pdf, ps, other]

    q-bio.QM cs.LG stat.ML

    Simultaneous Measurement Imputation and Outcome Prediction for Achilles Tendon Rupture Rehabilitation

    Authors: Charles Hamesse, Ruibo Tu, Paul Ackermann, Hedvig Kjellström, Cheng Zhang

    Abstract: Achilles Tendon Rupture (ATR) is one of the typical soft tissue injuries. Rehabilitation after such a musculoskeletal injury remains a prolonged process with a very variable outcome. Accurately predicting rehabilitation outcome is crucial for treatment decision support. However, it is challenging to train an automatic method for predicting the ATR rehabilitation outcome from treatment data, due to…

    Submitted 13 August, 2019; v1 submitted 8 September, 2018; originally announced October 2018.

  37. arXiv:1809.08875  [pdf, ps, other]

    cs.CV

    A Probabilistic Semi-Supervised Approach to Multi-Task Human Activity Modeling

    Authors: Judith Bütepage, Hedvig Kjellström, Danica Kragic

    Abstract: Human behavior is a continuous stochastic spatio-temporal process which is governed by semantic actions and affordances as well as latent factors. Therefore, video-based human activity modeling is concerned with a number of tasks such as inferring current and future semantic labels, predicting future continuous observations as well as imagining possible future label and feature sequences. In this…

    Submitted 14 March, 2019; v1 submitted 24 September, 2018; originally announced September 2018.

  38. arXiv:1807.04010  [pdf, ps, other]

    cs.LG stat.ML

    Causal Discovery in the Presence of Missing Data

    Authors: Ruibo Tu, Kun Zhang, Paul Ackermann, Bo Christer Bertilson, Clark Glymour, Hedvig Kjellström, Cheng Zhang

    Abstract: Missing data are ubiquitous in many domains including healthcare. When these data entries are not missing completely at random, the (conditional) independence relations in the observed data may be different from those in the complete data generated by the underlying causal process. Consequently, simply applying existing causal discovery methods to the observed data may lead to wrong conclusions. I…

    Submitted 12 July, 2020; v1 submitted 11 July, 2018; originally announced July 2018.

  39. arXiv:1803.02665  [pdf, other]

    cs.LG

    A Neural Network Approach to Missing Marker Reconstruction in Human Motion Capture

    Authors: Taras Kucherenko, Jonas Beskow, Hedvig Kjellström

    Abstract: Optical motion capture systems have become a widely used technology in various fields, such as augmented reality, robotics, movie production, etc. Such systems use a large number of cameras to triangulate the position of optical markers. The marker positions are estimated with high accuracy. However, especially when tracking articulated bodies, a fraction of the markers in each timestep is missing…

    Submitted 25 September, 2018; v1 submitted 7 March, 2018; originally announced March 2018.

    Comments: 7 pages, 6 figures

    MSC Class: 68T05

  40. Active Perception and Modeling of Deformable Surfaces using Gaussian Processes and Position-based Dynamics

    Authors: Sergio Caccamo, Püren Güler, Hedvig Kjellström, Danica Kragic

    Abstract: Exploring and modeling heterogeneous elastic surfaces requires multiple interactions with the environment and a complex selection of physical material parameters. The most common approaches model deformable properties from sets of offline observations using computationally expensive force-based simulators. In this work we present an online probabilistic framework for autonomous estimation of a def…

    Submitted 13 February, 2018; originally announced February 2018.

    Comments: 8 pages, video of an experiment available at https://youtu.be/mDNSDZz7Qzs

    Journal ref: 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids)

  41. arXiv:1711.10915  [pdf, other]

    cs.LG stat.ML

    Causality Refined Diagnostic Prediction

    Authors: Marcus Klasson, Kun Zhang, Bo C. Bertilson, Cheng Zhang, Hedvig Kjellström

    Abstract: Applying machine learning in the health care domain has shown promising results in recent years. Interpretable outputs from learning algorithms are desirable for decision making by health care personnel. In this work, we explore the possibility of utilizing causal relationships to refine diagnostic prediction. We focus on the task of diagnostic prediction using discomfort drawings, and explore two…

    Submitted 29 November, 2017; originally announced November 2017.

    Comments: NIPS 2017 Workshop on Machine Learning for Health (ML4H)

  42. arXiv:1711.05597  [pdf, other]

    cs.LG stat.ML

    Advances in Variational Inference

    Authors: Cheng Zhang, Judith Butepage, Hedvig Kjellstrom, Stephan Mandt

    Abstract: Many modern unsupervised or semi-supervised machine learning algorithms rely on Bayesian probabilistic models. These models are usually intractable and thus require approximate inference. Variational inference (VI) lets us approximate a high-dimensional Bayesian posterior with a simpler variational distribution by solving an optimization problem. This approach has been successfully used in various…

    Submitted 23 October, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

  43. arXiv:1709.01613  [pdf, other]

    cs.HC cs.AI cs.CY

    Machine Learning and Social Robotics for Detecting Early Signs of Dementia

    Authors: Patrik Jonell, Joseph Mendelson, Thomas Storskog, Goran Hagman, Per Ostberg, Iolanda Leite, Taras Kucherenko, Olga Mikheeva, Ulrika Akenine, Vesna Jelic, Alina Solomon, Jonas Beskow, Joakim Gustafson, Miia Kivipelto, Hedvig Kjellstrom

    Abstract: This paper presents the EACare project, an ambitious multi-disciplinary collaboration with the aim to develop an embodied system, capable of carrying out neuropsychological tests to detect early signs of dementia, e.g., due to Alzheimer's disease. The system will use methods from Machine Learning and Social Robotics, and be trained with examples of recorded clinician-patient interactions. The inte…

    Submitted 5 September, 2017; originally announced September 2017.

  44. arXiv:1705.00607  [pdf, other]

    cs.LG stat.ML

    Determinantal Point Processes for Mini-Batch Diversification

    Authors: Cheng Zhang, Hedvig Kjellstrom, Stephan Mandt

    Abstract: We study a mini-batch diversification scheme for stochastic gradient descent (SGD). While classical SGD relies on uniformly sampling data points to form a mini-batch, we propose a non-uniform sampling scheme based on the Determinantal Point Process (DPP). The DPP relies on a similarity measure between data points and gives low probabilities to mini-batches which contain redundant data, and higher…

    Submitted 23 August, 2017; v1 submitted 1 May, 2017; originally announced May 2017.

  45. arXiv:1702.08212  [pdf, other]

    cs.RO cs.CV cs.HC

    Anticipating many futures: Online human motion prediction and synthesis for human-robot collaboration

    Authors: Judith Bütepage, Hedvig Kjellström, Danica Kragic

    Abstract: Fluent and safe interactions of humans and robots require both partners to anticipate the others' actions. A common approach to human intention inference is to model specific trajectories towards known goals with supervised classifiers. However, these approaches do not take possible future movements into account nor do they make use of kinematic cues, such as legible and predictable motion. The bo…

    Submitted 27 February, 2017; originally announced February 2017.

  46. arXiv:1702.07486  [pdf, other]

    cs.CV

    Deep representation learning for human motion prediction and classification

    Authors: Judith Bütepage, Michael Black, Danica Kragic, Hedvig Kjellström

    Abstract: Generative models of 3D human motion are often restricted to a small number of activities and can therefore not generalize well to novel movements or applications. In this work we propose a deep learning framework for human motion capture data that learns a generic representation from a large corpus of motion capture data and generalizes well to new, unseen, motions. Using an encoding-decoding net…

    Submitted 13 April, 2017; v1 submitted 24 February, 2017; originally announced February 2017.

    Comments: This paper is published at the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017

  47. arXiv:1612.02490  [pdf, other]

    cs.LG stat.AP

    Bridging Medical Data Inference to Achilles Tendon Rupture Rehabilitation

    Authors: An Qu, Cheng Zhang, Paul Ackermann, Hedvig Kjellström

    Abstract: Imputing incomplete medical tests and predicting patient outcomes are crucial for guiding the decision making for therapy, such as after an Achilles Tendon Rupture (ATR). We formulate the problem of data imputation and prediction for ATR relevant medical measurements into a recommender system framework. By applying MatchBox, which is a collaborative filtering approach, on a real dataset collected…

    Submitted 7 December, 2016; originally announced December 2016.

    Comments: Workshop on Machine Learning for Healthcare, NIPS 2016, Barcelona, Spain

  48. arXiv:1612.01356  [pdf, other]

    cs.LG

    Diagnostic Prediction Using Discomfort Drawings

    Authors: Cheng Zhang, Hedvig Kjellstrom, Bo C. Bertilson

    Abstract: In this paper, we explore the possibility to apply machine learning to make diagnostic predictions using discomfort drawings. A discomfort drawing is an intuitive way for patients to express discomfort and pain related symptoms. These drawings have proven to be an effective method to collect patient data and make diagnostic decisions in real-life practice. A dataset from real-world patient cases i…

    Submitted 5 December, 2016; originally announced December 2016.

    Comments: NIPS 2016 Workshop on Machine Learning for Health

  49. arXiv:1611.05915  [pdf, other]

    cs.CV

    Generative One-Class Models for Text-based Person Retrieval in Forensic Applications

    Authors: David Gerónimo, Hedvig Kjellström

    Abstract: Automatic forensic image analysis assists criminal investigation experts in the search for suspicious persons, abnormal behaviors detection and identity matching in images. In this paper we propose a person retrieval system that uses textual queries (e.g., "black trousers and green shirt") as descriptions and a one-class generative color model with outlier filtering to represent the images both to…

    Submitted 17 November, 2016; originally announced November 2016.

  50. arXiv:1607.08206  [pdf, other]

    cs.LG

    Diagnostic Prediction Using Discomfort Drawings with IBTM

    Authors: Cheng Zhang, Hedvig Kjellstrom, Carl Henrik Ek, Bo C. Bertilson

    Abstract: In this paper, we explore the possibility to apply machine learning to make diagnostic predictions using discomfort drawings. A discomfort drawing is an intuitive way for patients to express discomfort and pain related symptoms. These drawings have proven to be an effective method to collect patient data and make diagnostic decisions in real-life practice. A dataset from real-world patient cases i…

    Submitted 13 September, 2016; v1 submitted 27 July, 2016; originally announced July 2016.

    Comments: Presented at 2016 Machine Learning and Healthcare Conference (MLHC 2016), Los Angeles, CA