Skip to main content

Showing 1–15 of 15 results for author: Draxler, F

.
  1. arXiv:2404.12549  [pdf, other

    cs.HC

    "If the Machine Is As Good As Me, Then What Use Am I?" -- How the Use of ChatGPT Changes Young Professionals' Perception of Productivity and Accomplishment

    Authors: Charlotte Kobiella, Yarhy Said Flores López, Fiona Draxler, Albrecht Schmidt

    Abstract: Large language models (LLMs) like ChatGPT have been widely adopted in work contexts. We explore the impact of ChatGPT on young professionals' perception of productivity and sense of accomplishment. We collected LLMs' main use cases in knowledge work through a preliminary study, which served as the basis for a two-week diary study with 21 young professionals reflecting on their ChatGPT use. Finding… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  2. arXiv:2402.06578  [pdf, other

    cs.LG stat.ML

    On the Universality of Coupling-based Normalizing Flows

    Authors: Felix Draxler, Stefan Wahl, Christoph Schnörr, Ullrich Köthe

    Abstract: We present a novel theoretical framework for understanding the expressive power of normalizing flows. Despite their prevalence in scientific applications, a comprehensive understanding of flows remains elusive due to their restricted architectures. Existing theorems fall short as they require the use of arbitrarily ill-conditioned neural networks, limiting practical applicability. We propose a dis… ▽ More

    Submitted 5 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  3. arXiv:2312.09852  [pdf, other

    cs.LG stat.ML

    Learning Distributions on Manifolds with Free-form Flows

    Authors: Peter Sorrenson, Felix Draxler, Armand Rousselot, Sander Hummerich, Ullrich Köthe

    Abstract: Many real world data, particularly in the natural sciences and computer vision, lie on known Riemannian manifolds such as spheres, tori or the group of rotation matrices. The predominant approaches to learning a distribution on such a manifold require solving a differential equation in order to sample from the model and evaluate densities. The resulting sampling times are slowed down by a high num… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Preprint, under review

  4. arXiv:2310.16624  [pdf, other

    cs.LG stat.ML

    Free-form Flows: Make Any Architecture a Normalizing Flow

    Authors: Felix Draxler, Peter Sorrenson, Lea Zimmermann, Armand Rousselot, Ullrich Köthe

    Abstract: Normalizing Flows are generative models that directly maximize the likelihood. Previously, the design of normalizing flows was largely constrained by the need for analytical invertibility. We overcome this constraint by a training procedure that uses an efficient estimator for the gradient of the change of variables formula. This enables any dimension-preserving neural network to serve as a genera… ▽ More

    Submitted 24 April, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Camera-ready version: accepted at AISTATS 2024

  5. arXiv:2310.06556  [pdf, other

    cs.CY cs.HC

    Gender, Age, and Technology Education Influence the Adoption and Appropriation of LLMs

    Authors: Fiona Draxler, Daniel Buschek, Mikke Tavast, Perttu Hämäläinen, Albrecht Schmidt, Juhi Kulshrestha, Robin Welsch

    Abstract: Large Language Models (LLMs) such as ChatGPT have become increasingly integrated into critical activities of daily life, raising concerns about equitable access and utilization across diverse demographics. This study investigates the usage of LLMs among 1,500 representative US citizens. Remarkably, 42% of participants reported utilizing an LLM. Our findings reveal a gender gap in LLM technology ad… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    ACM Class: H.1.2; I.2.7

  6. arXiv:2307.05870  [pdf, other

    cs.HC

    Useful but Distracting: Keyword Highlights and Time-Synchronization in Captions for Language Learning

    Authors: Fiona Draxler, Henrike Weingärtner, Maximiliane Windl, Albrecht Schmidt, Lewis L. Chuang

    Abstract: Captions provide language learners with a scaffold for comprehension and vocabulary acquisition. Past work has proposed several enhancements such as keyword highlights for increased learning gains. However, little is known about learners' experience with enhanced captions, although this is critical for adoption in everyday life. We conducted a survey and focus group to elicit learner preferences a… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    ACM Class: H.5.2; K.3.1

  7. arXiv:2306.13520  [pdf, other

    cs.LG stat.ML

    On the Convergence Rate of Gaussianization with Random Rotations

    Authors: Felix Draxler, Lars Kühmichel, Armand Rousselot, Jens Müller, Christoph Schnörr, Ullrich Köthe

    Abstract: Gaussianization is a simple generative model that can be trained without backpropagation. It has shown compelling performance on low dimensional data. As the dimension increases, however, it has been observed that the convergence speed slows down. We show analytically that the number of required layers scales linearly with the dimension for Gaussian input. We argue that this is because the model i… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  8. arXiv:2306.01843  [pdf, other

    cs.LG

    Lifting Architectural Constraints of Injective Flows

    Authors: Peter Sorrenson, Felix Draxler, Armand Rousselot, Sander Hummerich, Lea Zimmermann, Ullrich Köthe

    Abstract: Normalizing Flows explicitly maximize a full-dimensional likelihood on the training data. However, real data is typically only supported on a lower-dimensional manifold leading the model to expend significant compute on modeling noise. Injective Flows fix this by jointly learning a manifold and the distribution on it. So far, they have been limited by restrictive architectures and/or high computat… ▽ More

    Submitted 27 June, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Camera-ready version: accepted to ICLR 2024

  9. arXiv:2304.14905  [pdf, other

    cond-mat.quant-gas quant-ph

    Bose Einstein condensate as nonlinear block of a Machine Learning pipeline

    Authors: Maurus Hans, Elinor Kath, Marius Sparn, Nikolas Liebster, Felix Draxler, Christoph Schnörr, Helmut Strobel, Markus K. Oberthaler

    Abstract: Physical systems can be used as an information processing substrate and with that extend traditional computing architectures. For such an application the experimental platform must guarantee pristine control of the initial state, the temporal evolution and readout. All these ingredients are provided by modern experimental realizations of atomic Bose Einstein condensates. By embedding the nonlinear… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

  10. arXiv:2303.09989  [pdf, other

    cs.LG stat.ML

    Finding Competence Regions in Domain Generalization

    Authors: Jens Müller, Stefan T. Radev, Robert Schmier, Felix Draxler, Carsten Rother, Ullrich Köthe

    Abstract: We investigate a "learning to reject" framework to address the problem of silent failures in Domain Generalization (DG), where the test distribution differs from the training distribution. Assuming a mild distribution shift, we wish to accept out-of-distribution (OOD) data from a new domain whenever a model's estimated competence foresees trustworthy responses, instead of rejecting OOD data outrig… ▽ More

    Submitted 21 June, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: The paper has been published at TMLR (see https://openreview.net/forum?id=TSy0vuwQFN)

    Journal ref: Transactions on Machine Learning Research (06/2023)

  11. arXiv:2303.03283  [pdf, other

    cs.HC cs.CL

    The AI Ghostwriter Effect: When Users Do Not Perceive Ownership of AI-Generated Text But Self-Declare as Authors

    Authors: Fiona Draxler, Anna Werner, Florian Lehmann, Matthias Hoppe, Albrecht Schmidt, Daniel Buschek, Robin Welsch

    Abstract: Human-AI interaction in text production increases complexity in authorship. In two empirical studies (n1 = 30 & n2 = 96), we investigate authorship and ownership in human-AI collaboration for personalized language generation. We show an AI Ghostwriter Effect: Users do not consider themselves the owners and authors of AI-generated text but refrain from publicly declaring AI authorship. Personalizat… ▽ More

    Submitted 7 November, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Pre-print; currently under review

  12. arXiv:2210.14032  [pdf, other

    cs.LG stat.ML

    Whitening Convergence Rate of Coupling-based Normalizing Flows

    Authors: Felix Draxler, Christoph Schnörr, Ullrich Köthe

    Abstract: Coupling-based normalizing flows (e.g. RealNVP) are a popular family of normalizing flow architectures that work surprisingly well in practice. This calls for theoretical understanding. Existing work shows that such flows weakly converge to arbitrary data distributions. However, they make no statement about the stricter convergence criterion used in practice, the maximum likelihood loss. For the f… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Proceedings of 36th Conference on Neural Information Processing System (NeurIPS 2022)

  13. arXiv:1806.08734  [pdf, other

    stat.ML cs.LG

    On the Spectral Bias of Neural Networks

    Authors: Nasim Rahaman, Aristide Baratin, Devansh Arpit, Felix Draxler, Min Lin, Fred A. Hamprecht, Yoshua Bengio, Aaron Courville

    Abstract: Neural networks are known to be a class of highly expressive functions able to fit even random input-output map**s with $100\%$ accuracy. In this work, we present properties of neural networks that complement this aspect of expressivity. By using tools from Fourier analysis, we show that deep ReLU networks are biased towards low frequency functions, meaning that they cannot have local fluctuatio… ▽ More

    Submitted 31 May, 2019; v1 submitted 22 June, 2018; originally announced June 2018.

    Comments: 23 pages

    Journal ref: ICML 2019

  14. arXiv:1803.00885  [pdf, other

    stat.ML cs.AI cs.LG

    Essentially No Barriers in Neural Network Energy Landscape

    Authors: Felix Draxler, Kambis Veschgini, Manfred Salmhofer, Fred A. Hamprecht

    Abstract: Training neural networks involves finding minima of a high-dimensional non-convex loss function. Knowledge of the structure of this energy landscape is sparse. Relaxing from linear interpolations, we construct continuous paths between minima of recent neural network architectures on CIFAR10 and CIFAR100. Surprisingly, the paths are essentially flat in both the training and test landscapes. This im… ▽ More

    Submitted 22 February, 2019; v1 submitted 2 March, 2018; originally announced March 2018.

    Comments: In Proceedings of 35th International Conference on Machine Learning (ICML 2018)

    Journal ref: Proceedings of the 35th International Conference on Machine Learning, PMLR 80:1308-1317, 2018

  15. arXiv:1606.06620  [pdf, ps, other

    math.CO

    Equiangular Lines and Spherical Codes in Euclidean Space

    Authors: Igor Balla, Felix Dräxler, Peter Keevash, Benny Sudakov

    Abstract: A family of lines through the origin in Euclidean space is called equiangular if any pair of lines defines the same angle. The problem of estimating the maximum cardinality of such a family in $\mathbb{R}^n$ was extensively studied for the last 70 years. Motivated by a question of Lemmens and Seidel from 1973, in this paper we prove that for every fixed angle $θ$ and sufficiently large $n$ there a… ▽ More

    Submitted 28 June, 2017; v1 submitted 21 June, 2016; originally announced June 2016.

    Comments: 24 pages, 0 figures