Skip to main content

Showing 1–50 of 110 results for author: Nadine

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02156  [pdf, other

    cs.SD cs.AI cs.IR cs.LG eess.AS

    Towards Training Music Taggers on Synthetic Data

    Authors: Nadine Kroher, Steven Manangu, Aggelos Pikrakis

    Abstract: Most contemporary music tagging systems rely on large volumes of annotated data. As an alternative, we investigate the extent to which synthetically generated music excerpts can improve tagging systems when only small annotated collections are available. To this end, we release GTZAN-synth, a synthetic dataset that follows the taxonomy of the well-known GTZAN dataset while being ten times larger i… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, accepted to 21st International Conference on Content-based Multimedia Indexing (CBMI) 2024, code available https://github.com/NadineKroher/music-tagging-synthetic-data-cbmi-2024

    ACM Class: I.2

  2. arXiv:2406.19081  [pdf, other

    eess.IV cs.CV

    Unsupervised Latent Stain Adaptation for Computational Pathology

    Authors: Daniel Reisenbüchler, Lucas Luttner, Nadine S. Schaadt, Friedrich Feuerhake, Dorit Merhof

    Abstract: In computational pathology, deep learning (DL) models for tasks such as segmentation or tissue classification are known to suffer from domain shifts due to different staining techniques. Stain adaptation aims to reduce the generalization error between different stains by training a model on source stains that generalizes to target stains. Despite the abundance of target stain data, a key challenge… ▽ More

    Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted MICCAI2024

  3. arXiv:2406.16659  [pdf, other

    cs.LG eess.SP

    Data-driven Modeling in Metrology -- A Short Introduction, Current Developments and Future Perspectives

    Authors: Linda-Sophie Schneider, Patrick Krauss, Nadine Schiering, Christopher Syben, Richard Schielein, Andreas Maier

    Abstract: Mathematical models are vital to the field of metrology, playing a key role in the derivation of measurement results and the calculation of uncertainties from measurement data, informed by an understanding of the measurement process. These models generally represent the correlation between the quantity being measured and all other pertinent quantities. Such relationships are used to construct meas… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 31 pages, Preprint

  4. Mind Mansion: Exploring Metaphorical Interactions to Engage with Negative Thoughts in Virtual Reality

    Authors: Julian Rasch, Michelle Johanna Zender, Sophia Sakel, Nadine Wagener

    Abstract: Recurrent negative thoughts can significantly disrupt daily life and contribute to negative emotional states. Facing, confronting, and noticing such thoughts without support can be challenging. To provide a playful setting and leverage the technical maturation of Virtual Reality (VR), our VR experience, Mind Mansion, places the user in an initially cluttered virtual apartment. Here we utilize esta… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: To appear in Proceedings of the Designing Interactive Systems Conference (DIS '24), July 1-5, 2024, IT University of Copenhagen, Denmark

  5. arXiv:2405.01533  [pdf, other

    cs.CV

    OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

    Authors: Shihao Wang, Zhiding Yu, Xiaohui Jiang, Shiyi Lan, Min Shi, Nadine Chang, Jan Kautz, Ying Li, Jose M. Alvarez

    Abstract: The advances in multimodal large language models (MLLMs) have led to growing interests in LLM-based autonomous driving agents to leverage their strong reasoning capabilities. However, capitalizing on MLLMs' strong reasoning capabilities for improved planning behavior is challenging since planning requires full 3D situational awareness beyond 2D reasoning. To address this challenge, our work propos… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  6. arXiv:2404.02529  [pdf, other

    cs.CL

    A School Student Essay Corpus for Analyzing Interactions of Argumentative Structure and Quality

    Authors: Maja Stahl, Nadine Michel, Sebastian Kilsbach, Julian Schmidtke, Sara Rezat, Henning Wachsmuth

    Abstract: Learning argumentative writing is challenging. Besides writing fundamentals such as syntax and grammar, learners must select and arrange argument components meaningfully to create high-quality essays. To support argumentative writing computationally, one step is to mine the argumentative structure. When combined with automatic essay scoring, interactions of the argumentative structure and quality… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024

  7. arXiv:2403.11784  [pdf, other

    cs.RO cs.SE eess.SY

    ForzaETH Race Stack -- Scaled Autonomous Head-to-Head Racing on Fully Commercial off-the-Shelf Hardware

    Authors: Nicolas Baumann, Edoardo Ghignone, Jonas Kühne, Niklas Bastuck, Jonathan Becker, Nadine Imholz, Tobias Kränzlin, Tian Yi Lim, Michael Lötscher, Luca Schwarzenbach, Luca Tognoni, Christian Vogt, Andrea Carron, Michele Magno

    Abstract: Autonomous racing in robotics combines high-speed dynamics with the necessity for reliability and real-time decision-making. While such racing pushes software and hardware to their limits, many existing full-system solutions necessitate complex, custom hardware and software, and usually focus on Time-Trials rather than full unrestricted Head-to-Head racing, due to financial and safety constraints.… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  8. arXiv:2402.18194  [pdf

    cs.SE eess.SY

    Formalized Identification Of Key Factors In Safety-Relevant Failure Scenarios

    Authors: Tim Maurice Julitz, Nadine Schlüter, Manuel Löwer

    Abstract: This research article presents a methodical data-based approach to systematically identify key factors in safety-related failure scenarios, with a focus on complex product-environmental systems in the era of Industry 4.0. The study addresses the uncertainty arising from the growing complexity of modern products. The method uses scenario analysis and focuses on failure analysis within technical pro… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  9. arXiv:2402.12880  [pdf, other

    cs.CL

    Autism Detection in Speech -- A Survey

    Authors: Nadine Probol, Margot Mieskes

    Abstract: There has been a range of studies of how autism is displayed in voice, speech, and language. We analyse studies from the biomedical, as well as the psychological domain, but also from the NLP domain in order to find linguistic, prosodic and acoustic cues that could indicate autism. Our survey looks at all three domains. We define autism and which comorbidities might influence the correct detection… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024 Findings

  10. arXiv:2402.11820  [pdf

    cs.HC

    A critical analysis of cognitive load measurement methods for evaluating the usability of different types of interfaces: guidelines and framework for Human-Computer Interaction

    Authors: Ali Darejeh, Nadine Marcusa, Gelareh Mohammadi, John Sweller

    Abstract: Usability testing is an essential part of product design, particularly for user interfaces. To enhance the reliability of usability evaluations, employing cognitive load measurement methods can be highly effective in assessing the mental effort required to complete tasks during user testing. This review aims to provide an overview of the most suitable cognitive load measurement methods for evaluat… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  11. arXiv:2401.15022  [pdf

    eess.IV cs.CV cs.LG

    Applications of artificial intelligence in the analysis of histopathology images of gliomas: a review

    Authors: Jan-Philipp Redlich, Friedrich Feuerhake, Joachim Weis, Nadine S. Schaadt, Sarah Teuber-Hanselmann, Christoph Buck, Sabine Luttmann, Andrea Eberle, Stefan Nikolin, Arno Appenzeller, Andreas Portmann, André Homeyer

    Abstract: In recent years, the diagnosis of gliomas has become increasingly complex. Analysis of glioma histopathology images using artificial intelligence (AI) offers new opportunities to support diagnosis and outcome prediction. To give an overview of the current state of research, this review examines 70 publicly available research studies that have proposed AI-based methods for whole-slide histopatholog… ▽ More

    Submitted 5 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Journal ref: npj Imaging 2024

  12. arXiv:2401.05450  [pdf

    cs.HC

    Reorienting Learning Game Design in Design-Based Research: a Case Study

    Authors: Nadine Mandran, Estelle Prior, Eric Sanchez, Mathieu Vermeulen

    Abstract: One of the main difficulties remains the collaboration between the various experts involved in designing the Learning Games (LG). Our literature review focuses on the pitfalls and principles that have been identified by various authors in learning games design. Based on this review, a prototype was designed to support the LG design process and to study more precisely the collaboration between acto… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  13. arXiv:2311.13898  [pdf

    cs.HC

    HandiMathKey-Device

    Authors: Frédéric Vella, Nathalie Dubus, Eloise Grolleau, Marjorie Deleau, Cécile Malet, Christine Gallard, Véronique Ades, Nadine Vigouroux

    Abstract: Ty** mathematics is sometimes difficult with text editor functions for students with motor impairment and other associated impairments (visual, cognitive). Based on the HandiMathKey software keyboard, a user-centred design method involving the ecosytem of disabled students was applied to design the HMK-D physical keyboard for mathematical input. We opted for the Stream Deck device because of its… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Universal Access in Human-Computer Interaction. HCII 2023, Jul 2023, Copenhagen (Virtual), Denmark

  14. arXiv:2311.13894  [pdf

    cs.HC

    A first step towards an ecosystem meta-model for humancentered design in case of disabled users

    Authors: Christophe Kolski, Nadine Vigouroux, Yohan Guerrier, Frédéric Vella, Marine Guffroy

    Abstract: The involvement of the ecosystem or social environment of the disabled user is considered as very useful and even essential for the human-centered design of assistive technologies. In the era of model-based approaches, the modeling of the ecosystem is therefore to be considered. The first version of a metamodel of ecosystem is proposed. It is illustrated through a first case study. It concerns a p… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Journal ref: Disab2023 Engineering Interactive Computing Systems for People with Disabilities, Jun 2023, Swansea, United Kingdom

  15. Design Recommendations Based on Speech Analysis for Disability-Friendly Interfaces for the Control of a Home Automation Environment

    Authors: Nadine Vigouroux, Frédéric Vella, Gaëlle Lepage, Éric Campo

    Abstract: The objective of this paper is to describe the study on speech interaction mode for home automation control of equipment by impaired people for an inclusive housing. The study is related to the HIP HOPE project concerning a building of 19 inclusive housing units. 7 participants with different types of disabilities were invited to carry out use cases using voice and touch control. Only the results… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Journal ref: Universal Access in Human-Computer Interaction. HCII 2023, Jul 2023, Copenhagen (Virtual), Denmark. pp.197-211

  16. arXiv:2311.09094  [pdf, other

    cs.SD cs.AI eess.AS

    Can MusicGen Create Training Data for MIR Tasks?

    Authors: Nadine Kroher, Helena Cuesta, Aggelos Pikrakis

    Abstract: We are investigating the broader concept of using AI-based generative music systems to generate training data for Music Information Retrieval (MIR) tasks. To kick off this line of work, we ran an initial experiment in which we trained a genre classifier on a fully artificial music dataset created with MusicGen. We constructed over 50 000 genre- conditioned textual descriptions and generated a coll… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: This is an extended abstract presented at the Late-Breaking / Demo Session of the International Society for Music Information Retrieval Conference (ISMIR) 2023 (Milan, Italy)

  17. arXiv:2311.04780  [pdf, other

    eess.IV cs.LG

    FetMRQC: an open-source machine learning framework for multi-centric fetal brain MRI quality control

    Authors: Thomas Sanchez, Oscar Esteban, Yvan Gomez, Alexandre Pron, Mériam Koob, Vincent Dunet, Nadine Girard, Andras Jakab, Elisenda Eixarch, Guillaume Auzias, Meritxell Bach Cuadra

    Abstract: Fetal brain MRI is becoming an increasingly relevant complement to neurosonography for perinatal diagnosis, allowing fundamental insights into fetal brain development throughout gestation. However, uncontrolled fetal motion and heterogeneity in acquisition protocols lead to data of variable quality, potentially biasing the outcome of subsequent studies. We present FetMRQC, an open-source machine-l… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 22 pages, 10 Figures

  18. arXiv:2310.12956  [pdf, other

    cs.LG cs.AI cs.CV

    Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems

    Authors: David T. Hoffmann, Simon Schrodi, Jelena Bratulić, Nadine Behrmann, Volker Fischer, Thomas Brox

    Abstract: In this work, we study rapid improvements of the training loss in transformers when being confronted with multi-step decision tasks. We found that transformers struggle to learn the intermediate task and both training and validation loss saturate for hundreds of epochs. When transformers finally learn the intermediate task, they do this rapidly and unexpectedly. We call these abrupt improvements E… ▽ More

    Submitted 6 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted at ICML 2024

  19. arXiv:2310.02932  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Assessing Large Language Models on Climate Information

    Authors: Jannis Bulian, Mike S. Schäfer, Afra Amini, Heidi Lam, Massimiliano Ciaramita, Ben Gaiarin, Michelle Chen Hübscher, Christian Buck, Niels G. Mede, Markus Leippold, Nadine Strauß

    Abstract: As Large Language Models (LLMs) rise in popularity, it is necessary to assess their capability in critically relevant domains. We present a comprehensive evaluation framework, grounded in science communication research, to assess LLM responses to questions about climate change. Our framework emphasizes both presentational and epistemological adequacy, offering a fine-grained analysis of LLM genera… ▽ More

    Submitted 28 May, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

  20. ProofBuddy: A Proof Assistant for Learning and Monitoring

    Authors: Nadine Karsten, Frederik Krogsdal Jacobsen, Kim Jana Eiken, Uwe Nestmann, Jørgen Villadsen

    Abstract: Proof competence, i.e. the ability to write and check (mathematical) proofs, is an important skill in Computer Science, but for many students it represents a difficult challenge. The main issues are the correct use of formal language and the ascertainment of whether proofs, especially the students' own, are complete and correct. Many authors have suggested using proof assistants to assist in teach… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: In Proceedings TFPIE 2023, arXiv:2308.06110

    ACM Class: K.3.2; D.1.1; F.3.1; D.2.4; D.2.6; G.4; H.5.2

    Journal ref: EPTCS 382, 2023, pp. 1-21

  21. arXiv:2308.00420  [pdf, other

    cs.CC

    The complexity of the Timetable-Based Railway Network Design Problem

    Authors: Nadine Friesen, Tim Sander, Karl Nachtigall, Nils Nießen

    Abstract: Because of the long planning periods and their long life cycle, railway infrastructure has to be outlined long ahead. At the present, the infrastructure is designed while only little about the intended operation is known. Hence, the timetable and the operation are adjusted to the infrastructure. Since space, time and money for extension measures of railway infrastructure are limited, each modifica… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  22. arXiv:2307.02916  [pdf, other

    cs.CY

    The impact of an employee's psychological contract breach on compliance with information security policies: intrinsic and extrinsic motivation

    Authors: Daeun Lee, Har**der Singh Lallie, Nadine Michaelides

    Abstract: Despite the rapid rise in social engineering attacks, not all employees are as compliant with information security policies (ISPs) to the extent that organisations expect them to be. ISP non-compliance is caused by a variety of psychological motivation. This study investigates the effect of psychological contract breach (PCB) of employees on ISP compliance intention (ICI) by dividing them into int… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 27 pages, 3 figures

    Journal ref: Cognition, Technology & Work, pp.1-17 (2023)

  23. arXiv:2306.15694  [pdf, other

    cs.SE

    Scenario-based Failure Analysis of Product Systems and their Environment

    Authors: Tim Maurice Julitz, Nadine Schlüter, Manuel Löwer

    Abstract: During the usage phase, a technical product system is in permanent interaction with its environment. This interaction can lead to failures that significantly endanger the safety of the user and negatively affect the quality and reliability of the product. Conventional methods of failure analysis focus on the technical product system. The interaction of the product with its environment in the usage… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

  24. arXiv:2306.14035  [pdf, other

    cs.CV

    Thinking Like an Annotator: Generation of Dataset Labeling Instructions

    Authors: Nadine Chang, Francesco Ferroni, Michael J. Tarr, Martial Hebert, Deva Ramanan

    Abstract: Large-scale datasets are essential to modern day deep learning. Advocates argue that understanding these methods requires dataset transparency (e.g. "dataset curation, motivation, composition, collection process, etc..."). However, almost no one has suggested the release of the detailed definitions and visual category examples provided to annotators - information critical to understanding the stru… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  25. Toward Mixed Reality Hybrid Objects with IoT Avatar Agents

    Authors: Alexis Morris, Jie Guan, Nadine Lessio, Yiyi Shao

    Abstract: The internet-of-things (IoT) refers to the growing field of interconnected pervasive computing devices and the networking that supports smart, embedded applications. The IoT has multiple human-computer interaction challenges due to its many formats and interlinked components, and central to these is the need to provide sensory information and situational context pertaining to users in a more human… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  26. arXiv:2305.03041  [pdf, other

    cs.LG q-bio.QM

    Are VAEs Bad at Reconstructing Molecular Graphs?

    Authors: Hagen Muenkler, Hubert Misztela, Michal Pikusa, Marwin Segler, Nadine Schneider, Krzysztof Maziarz

    Abstract: Many contemporary generative models of molecules are variational auto-encoders of molecular graphs. One term in their training loss pertains to reconstructing the input, yet reconstruction capabilities of state-of-the-art models have not yet been thoroughly compared on a large and chemically diverse dataset. In this work, we show that when several state-of-the-art generative models are evaluated u… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Published at the ELLIS Workshop on Machine Learning for Molecules (ML4Molecules 2022)

  27. arXiv:2304.03639  [pdf, other

    cs.LG cs.CL cs.FL cs.NE

    Theoretical Conditions and Empirical Failure of Bracket Counting on Long Sequences with Linear Recurrent Networks

    Authors: Nadine El-Naggar, Pranava Madhyastha, Tillman Weyde

    Abstract: Previous work has established that RNNs with an unbounded activation function have the capacity to count exactly. However, it has also been shown that RNNs are challenging to train effectively and generally do not learn exact counting behaviour. In this paper, we focus on this problem by studying the simplest possible RNN, a linear single-cell network. We conduct a theoretical analysis of linear R… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 17th Conference of the European Chapter of the Association for Computational Linguistics Student Research Workshop (EACL 2023 SRW)

  28. arXiv:2301.06078  [pdf

    cs.SD eess.AS

    Training one model to detect heart and lung sound events from single point auscultations

    Authors: Leander Melms, Robert R. Ilesan, Ulrich Köhler, Olaf Hildebrandt, Regina Conradt, Jens Eckstein, Cihan Atila, Sami Matrood, Bernhard Schieffer, Jürgen R. Schaefer, Tobias Müller, Julius Obergassel, Nadine Schlicker, Martin C. Hirsch

    Abstract: Objective: This work proposes a semi-supervised training approach for detecting lung and heart sounds simultaneously with only one trained model and in invariance to the auscultation point. Methods: We use open-access data from the 2016 Physionet/CinC Challenge, the 2022 George Moody Challenge, and from the lung sound database HF_V1. We first train specialist single-task models using foreground gr… ▽ More

    Submitted 15 January, 2023; originally announced January 2023.

    Comments: 14 pages, 8 figures

  29. arXiv:2211.16429  [pdf, other

    cs.NE cs.FL cs.LG

    Exploring the Long-Term Generalization of Counting Behavior in RNNs

    Authors: Nadine El-Naggar, Pranava Madhyastha, Tillman Weyde

    Abstract: In this study, we investigate the generalization of LSTM, ReLU and GRU models on counting tasks over long sequences. Previous theoretical work has established that RNNs with ReLU activation and LSTMs have the capacity for counting with suitable configuration, while GRUs have limitations that prevent correct counting over longer sequences. Despite this and some positive empirical results for LSTMs… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Published in I Can't Believe It's Not Better: Understanding Deep Learning Through Empirical Falsification Workshop at NeurIPS 2022

  30. arXiv:2211.13079  [pdf

    cs.HC

    User Centred Method to Design a Platform to Design Augmentative and Alternative Communication Assistive Technologies

    Authors: Frédéric Vella, Flavien Clastres-Babou, Nadine Vigouroux, Philippe Truillet, Charline Calmels, Caroline Mercadier, Karine Gigaud, Margot Issanchou, Kristina Gourinovitch, Anne Garaix

    Abstract: We describe a co-design approach to design the online WebSoKeyTo used to design AAC. This co-design was carried out between a team of therapists and a team of human-computer interaction researchers. Our approach begins with the use and evaluation of an existing SoKeyTo AAC design application. This step was essential in the awareness and definition of the needs by the therapists and in the understa… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Journal ref: HCI INTERNATIONAL 2022 24TH INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION, Jun 2022, Virtual conference, France. pp.559-571, \&\#x27E8;10.1007/978-3-031-17902-0\_40\&\#x27E9

  31. arXiv:2211.13078  [pdf

    cs.HC

    Participation of Stakeholder in the Design of a Conception Application of Augmentative and Alternative Communication

    Authors: Frédéric Vella, Flavien Clastres-Babou, Frédéric Vella, Nadine Vigouroux, Philippe Truillet, Nadine Vigouroux, Charline Calmels, Caroline Mercadier, Karine Gigaud, Margot Issanchou, Kristina Gourinovitch, Anne Garaix

    Abstract: The objective of this paper is to describe the implication of an interdisciplinary team involved during a user-centered design methodology to design the platform (WebSoKeyTo) that meets the needs of therapists to design augmentative and alternative communication (AAC) aids for disabled users. We describe the processes of the design process and the role of the various actors (therapists and human c… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Journal ref: ICCHP-AAATE 2022 Open Access Compendium ''Assistive Technology, Accessibility and (e)Inclusion'', Jul 2022, Lecco, Italy. \&\#x27E8;10.35011/icchp-aaate22-p1-17\&\#x27E9

  32. arXiv:2211.13058  [pdf

    cs.HC cs.NI

    IDEALI: intuitively localising connected devices in order to support autonomy

    Authors: Frédéric Vella, Réjane Dalcé, Antonio Serpa, Thierry Val, Adrien van Den Bossche, Frédéric Vella, Nadine Vigouroux

    Abstract: The ability to localise a smart device is very useful to visually or cognitively impaired people. Localisation-capable technologies are becoming more readily available as off-the-shelf components. In this paper, we highlight the need for such a service in the field of health and autonomy, especially for disabled people. We introduce a model for Semantic Position Description (SPD) (e.g. "The pill o… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  33. arXiv:2211.13042  [pdf

    cs.HC

    Usability Study of Tactile and Voice Interaction Modes by People with Disabilities for Home Automation Controls

    Authors: Nadine Vigouroux, Frédéric Vella, Gaëlle Lepage, Eric Campo

    Abstract: This paper presents a comparative usability study on tactile and vocal interaction modes for home automation control of equipment at home for different profiles of disabled people. The study is related to the HIP HOPE project concerning the construction of 19 inclusive housing in the Toulouse metropolitan area in France. The experimentation took place in a living lab with 7 different disabled peop… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Journal ref: ICCHP-AAATE 2022 Open Access Compendium ''Assistive Technology, Accessibility and (e)Inclusion'', Jul 2022, Lecco, Italy. pp.139-147, \&\#x27E8;10.1007/978-3-031-08645-8\_17\&\#x27E9

  34. Histopathological Image Classification based on Self-Supervised Vision Transformer and Weak Labels

    Authors: Ahmet Gokberk Gul, Oezdemir Cetin, Christoph Reich, Tim Prangemeier, Nadine Flinner, Heinz Koeppl

    Abstract: Whole Slide Image (WSI) analysis is a powerful method to facilitate the diagnosis of cancer in tissue samples. Automating this diagnosis poses various issues, most notably caused by the immense image resolution and limited annotations. WSIs commonly exhibit resolutions of 100Kx100K pixels. Annotating cancerous areas in WSIs on the pixel level is prohibitively labor-intensive and requires a high le… ▽ More

    Submitted 17 April, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Journal ref: Proc. SPIE 12039, Medical Imaging 2022: Digital and Computational Pathology, 120391O (4 April 2022)

  35. arXiv:2209.13598  [pdf, other

    cs.SD cs.IR eess.AS

    Computing Melodic Templates in Oral Music Traditions

    Authors: Sergey Bereg, José-Miguel Díaz-Báñez, Nadine Kroher, Inmaculada Ventura

    Abstract: The term melodic template or skeleton refers to a basic melody which is subject to variation during a music performance. In many oral music tradition, these templates are implicitly passed throughout generations without ever being formalized in a score. In this work, we introduce a new geometric optimization problem, the spanning tube problem, to approximate a melodic template for a set of labeled… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  36. arXiv:2209.10970  [pdf, other

    cs.SD cs.CG eess.AS

    Maths, Computation and Flamenco: overview and challenges

    Authors: José-Miguel Díaz-Báñez, Nadine Kroher

    Abstract: Flamenco is a rich performance-oriented art music genre from Southern Spain which attracts a growing community of aficionados around the globe. Due to its improvisational and expressive nature, its unique musical characteristics, and the fact that the genre is largely undocumented, flamenco poses a number of interesting mathematical and computational challenges. Most existing approaches in Musical… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  37. Model- and Acceleration-based Pursuit Controller for High-Performance Autonomous Racing

    Authors: Jonathan Becker, Nadine Imholz, Luca Schwarzenbach, Edoardo Ghignone, Nicolas Baumann, Michele Magno

    Abstract: Autonomous racing is a research field gaining large popularity, as it pushes autonomous driving algorithms to their limits and serves as a catalyst for general autonomous driving. For scaled autonomous racing platforms, the computational constraint and complexity often limit the use of Model Predictive Control (MPC). As a consequence, geometric controllers are the most frequently deployed controll… ▽ More

    Submitted 7 July, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: 6 pages, 6 figures, 1 table

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  38. arXiv:2209.00638  [pdf, other

    cs.CV

    Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation

    Authors: Nadine Behrmann, S. Alireza Golestaneh, Zico Kolter, Juergen Gall, Mehdi Noroozi

    Abstract: This paper introduces a unified framework for video action segmentation via sequence to sequence (seq2seq) translation in a fully and timestamp supervised setup. In contrast to current state-of-the-art frame-level prediction methods, we view action segmentation as a seq2seq translation task, i.e., map** a sequence of video frames to a sequence of action segments. Our proposed method involves a s… ▽ More

    Submitted 11 October, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 (Main Conference)

  39. arXiv:2205.07575  [pdf, other

    cs.CV q-bio.QM

    An automatic pipeline for atlas-based fetal and neonatal brain segmentation and analysis

    Authors: Urru, Andrea, Nakaki, Ayako, Benkarim, Oualid, Crovetto, Francesca, Segales, Laura, Comte, Valentin, Hahner, Nadine, Eixarch, Elisenda, Gratacós, Eduard, Crispi, Fàtima, Piella, Gemma, González Ballester, Miguel A

    Abstract: The automatic segmentation of perinatal brain structures in magnetic resonance imaging (MRI) is of utmost importance for the study of brain growth and related complications. While different methods exist for adult and pediatric MRI data, there is a lack for automatic tools for the analysis of perinatal imaging. In this work, a new pipeline for fetal and neonatal segmentation has been developed. We… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

  40. arXiv:2204.11550  [pdf, other

    cs.CL cs.SD eess.AS

    Speech Detection For Child-Clinician Conversations In Danish For Low-Resource In-The-Wild Conditions: A Case Study

    Authors: Sneha Das, Nicole Nadine Lønfeldt, Anne Katrine Pagsberg, Line. H. Clemmensen

    Abstract: Use of speech models for automatic speech processing tasks can improve efficiency in the screening, analysis, diagnosis and treatment in medicine and psychiatry. However, the performance of pre-processing speech tasks like segmentation and diarization can drop considerably on in-the-wild clinical data, specifically when the target dataset comprises of atypical speech. In this paper we study the pe… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 5 pages. Submitted to Interspeech 2022

  41. arXiv:2203.15536  [pdf, other

    cs.CV cs.AI cs.LG

    BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information

    Authors: Nadine Rueegg, Silvia Zuffi, Konrad Schindler, Michael J. Black

    Abstract: Our goal is to recover the 3D shape and pose of dogs from a single image. This is a challenging task because dogs exhibit a wide range of shapes and appearances, and are highly articulated. Recent work has proposed to directly regress the SMAL animal model, with additional limb scale parameters, from images. Our method, called BARC (Breed-Augmented Regression using Classification), goes beyond pri… ▽ More

    Submitted 18 June, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: accepted for publication at CVPR 2022

    ACM Class: I.4; I.2

  42. arXiv:2203.14867  [pdf, other

    eess.AS cs.SD

    Continuous Metric Learning For Transferable Speech Emotion Recognition and Embedding Across Low-resource Languages

    Authors: Sneha Das, Nicklas Leander Lund, Nicole Nadine Lønfeldt, Anne Katrine Pagsberg, Line H. Clemmensen

    Abstract: Speech emotion recognition~(SER) refers to the technique of inferring the emotional state of an individual from speech signals. SERs continue to garner interest due to their wide applicability. Although the domain is mainly founded on signal processing, machine learning, and deep learning, generalizing over languages continues to remain a challenge. However, develo** generalizable and transferab… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Preprint of paper accepted to be presented at the Northern Lights Deep Learning Conference (NLDL), 2022. The labels are available at: https://bit.ly/3rg6VsA

  43. arXiv:2203.14865  [pdf, other

    eess.AS cs.SD

    Towards Transferable Speech Emotion Representation: On loss functions for cross-lingual latent representations

    Authors: Sneha Das, Nicole Nadine Lønfeldt, Anne Katrine Pagsberg, Line H. Clemmensen

    Abstract: In recent years, speech emotion recognition (SER) has been used in wide ranging applications, from healthcare to the commercial sector. In addition to signal processing approaches, methods for SER now also use deep learning techniques which provide transfer learning possibilities. However, generalizing over languages, corpora and recording conditions is still an open challenge. In this work we add… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Preprint of paper accepted to be presented at the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022. Source code at https://bit.ly/34CgkSZ. arXiv admin note: text overlap with arXiv:2105.02055

  44. arXiv:2203.01429   

    cs.SD eess.AS

    SMTNet: Hierarchical cavitation intensity recognition based on sub-main transfer network

    Authors: Yu Sha, Johannes Faber, Shui** Gou, Bo Liu, Wei Li, Stefan Schramm, Horst Stoecker, Thomas Steckenreiter, Domagoj Vnucec, Nadine Wetzstein, Andreas Widl, Kai Zhou

    Abstract: With the rapid development of smart manufacturing, data-driven machinery health management has been of growing attention. In situations where some classes are more difficult to be distinguished compared to others and where classes might be organised in a hierarchy of categories, current DL methods can not work well. In this study, a novel hierarchical cavitation intensity recognition framework usi… ▽ More

    Submitted 12 July, 2023; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: we need update this paper

  45. A multi-task learning for cavitation detection and cavitation intensity recognition of valve acoustic signals

    Authors: Yu Sha, Johannes Faber, Shui** Gou, Bo Liu, Wei Li, Stefan Schramm, Horst Stoecker, Thomas Steckenreiter, Domagoj Vnucec, Nadine Wetzstein, Andreas Widl, Kai Zhou

    Abstract: With the rapid development of smart manufacturing, data-driven machinery health management has received a growing attention. As one of the most popular methods in machinery health management, deep learning (DL) has achieved remarkable successes. However, due to the issues of limited samples and poor separability of different cavitation states of acoustic signals, which greatly hinder the eventual… ▽ More

    Submitted 20 April, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2202.13226

    Journal ref: Engineering Applications of Artificial Intelligence, 113 (2022), 104904

  46. Regional-Local Adversarially Learned One-Class Classifier Anomalous Sound Detection in Global Long-Term Space

    Authors: Yu Sha, Johannes Faber, Shui** Gou, Bo Liu, Wei Li, Stefan Schramm, Horst Stoecker, Thomas Steckenreiter, Domagoj Vnucec, Nadine Wetzstein, Andreas Widl, Kai Zhou

    Abstract: Anomalous sound detection (ASD) is one of the most significant tasks of mechanical equipment monitoring and maintaining in complex industrial systems. In practice, it is vital to precisely identify abnormal status of the working mechanical system, which can further facilitate the failure troubleshooting. In this paper, we propose a multi-pattern adversarial learning one-class classification framew… ▽ More

    Submitted 26 February, 2022; originally announced February 2022.

    Journal ref: KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 2022

  47. An acoustic signal cavitation detection framework based on XGBoost with adaptive selection feature engineering

    Authors: Yu Sha, Johannes Faber, Shui** Gou, Bo Liu, Wei Li, Stefan Schramm, Horst Stoecker, Thomas Steckenreiter, Domagoj Vnucec, Nadine Wetzstein, Andreas Widl, Kai Zhou

    Abstract: Valves are widely used in industrial and domestic pipeline systems. However, during their operation, they may suffer from the occurrence of the cavitation, which can cause loud noise, vibration and damage to the internal components of the valve. Therefore, monitoring the flow status inside valves is significantly beneficial to prevent the additional cost induced by cavitation. In this paper, a nov… ▽ More

    Submitted 1 March, 2022; v1 submitted 26 February, 2022; originally announced February 2022.

    Journal ref: Measurement 192 (2022), 110897

  48. arXiv:2201.11736  [pdf, other

    cs.CV

    Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives

    Authors: David T. Hoffmann, Nadine Behrmann, Juergen Gall, Thomas Brox, Mehdi Noroozi

    Abstract: This paper introduces Ranking Info Noise Contrastive Estimation (RINCE), a new member in the family of InfoNCE losses that preserves a ranked ordering of positive samples. In contrast to the standard InfoNCE loss, which requires a strict binary separation of the training pairs into similar and dissimilar samples, RINCE can exploit information about a similarity ranking for learning a corresponding… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: AAAI 2022 (Main Track)

  49. arXiv:2109.11593  [pdf, other

    cs.CV

    Long Short View Feature Decomposition via Contrastive Video Representation Learning

    Authors: Nadine Behrmann, Mohsen Fayyaz, Juergen Gall, Mehdi Noroozi

    Abstract: Self-supervised video representation methods typically focus on the representation of temporal attributes in videos. However, the role of stationary versus non-stationary attributes is less explored: Stationary features, which remain similar throughout the video, enable the prediction of video-level action classes. Non-stationary features, which represent temporally varying attributes, are more be… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: ICCV 2021 (Main Conference)

  50. arXiv:2106.03170  [pdf

    cs.LG cs.AI cs.CL

    FlexParser -- the adaptive log file parser for continuous results in a changing world

    Authors: Nadine Ruecker, Andreas Maier

    Abstract: Any modern system writes events into files, called log files. Those contain crucial information which are subject to various analyses. Examples range from cybersecurity, intrusion detection over usage analyses to trouble shooting. Before data analysis is possible, desired information needs to be extracted first out of the semi-structured log messages. State-of-the-art event parsing often assumes s… ▽ More

    Submitted 1 February, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: 17 pages, 9 figures, 3 tables