Showing 1–50 of 64 results for author: Hahn, M

Searching in archive cs.
  1. arXiv:2406.09347  [pdf, other]

    cs.LG stat.ML

    Separations in the Representational Capabilities of Transformers and Recurrent Architectures

    Authors: Satwik Bhattamishra, Michael Hahn, Phil Blunsom, Varun Kanade

    Abstract: Transformer architectures have been widely adopted in foundation models. Due to their high inference costs, there is renewed interest in exploring the potential of efficient recurrent architectures (RNNs). In this paper, we analyze the differences in the representational capabilities of Transformers and RNNs across several tasks of practical relevance, including index lookup, nearest neighbor, rec…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Preprint

  2. arXiv:2405.17653  [pdf, other]

    cs.LG cs.AI cs.CL

    InversionView: A General-Purpose Method for Reading Information from Neural Activations

    Authors: Xinting Huang, Madhur Panwar, Navin Goyal, Michael Hahn

    Abstract: The inner workings of neural networks can be better understood if we can fully decipher the information encoded in neural activations. In this paper, we argue that this information is embodied by the subset of inputs that give rise to similar activations. Computing such subsets is nontrivial as the input space is exponentially large. We propose InversionView, which allows us to practically inspect…

    Submitted 2 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2405.17394  [pdf, other]

    cs.CL cs.FL cs.LG

    The Expressive Capacity of State Space Models: A Formal Language Perspective

    Authors: Yash Sarrof, Yana Veitsman, Michael Hahn

    Abstract: Recently, recurrent models based on linear state space models (SSMs) have shown promising performance in language modeling (LM), competitive with transformers. However, there is little understanding of the in-principle abilities of such models, which could provide useful guidance to the search for better LM architectures. We present a comprehensive theoretical study of the capacity of such SSMs a…

    Submitted 2 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2405.13583  [pdf, other]

    cs.LO

    Tools at the Frontiers of Quantitative Verification

    Authors: Roman Andriushchenko, Alexander Bork, Carlos E. Budde, Milan Češka, Kush Grover, Ernst Moritz Hahn, Arnd Hartmanns, Bryant Israelsen, Nils Jansen, Joshua Jeppson, Sebastian Junges, Maximilian A. Köhl, Bettina Könighofer, Jan Křetínský, Tobias Meggendorfer, David Parker, Stefan Pranger, Tim Quatmann, Enno Ruijters, Landon Taylor, Matthias Volk, Maximilian Weininger, Zhen Zhang

    Abstract: The analysis of formal models that include quantitative aspects such as timing or probabilistic choices is performed by quantitative verification tools. Broad and mature tool support is available for computing basic properties such as expected rewards on basic models such as Markov chains. Previous editions of QComp, the comparison of tools for the analysis of quantitative formal models, focused o…

    Submitted 22 May, 2024; originally announced May 2024.

  5. arXiv:2405.12109  [pdf, other]

    cs.CL cs.IT

    Linguistic Structure from a Bottleneck on Sequential Information Processing

    Authors: Richard Futrell, Michael Hahn

    Abstract: Human language is a unique form of communication in the natural world, distinguished by its structured nature. Most fundamentally, it is systematic, meaning that signals can be broken down into component parts that are individually meaningful -- roughly, words -- which are combined in a regular way to form sentences. Furthermore, the way in which these parts are combined maintains a kind of locali…

    Submitted 20 May, 2024; originally announced May 2024.

  6. arXiv:2403.11295  [pdf, other]

    cs.CV math.AG math.NA

    Order-One Rolling Shutter Cameras

    Authors: Marvin Anas Hahn, Kathlén Kohn, Orlando Marigliano, Tomas Pajdla

    Abstract: Rolling shutter (RS) cameras dominate consumer and smartphone markets. Several methods for computing the absolute pose of RS cameras have appeared in the last 20 years, but the relative pose problem has not been fully solved yet. We provide a unified theory for the important class of order-one rolling shutter (RS$_1$) cameras. These cameras generalize the perspective projection to RS cameras, proj…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 36 pages, 6 figures, 3 ancillary files

    MSC Class: 14M20; 14Q15; 14N99; 15A69; 65H20; 68T45; 13P10; 13P25

  7. exploreCOSMOS: Interactive Exploration of Conditional Statistical Shape Models in the Web-Browser

    Authors: Maximilian Hahn, Bernhard Egger

    Abstract: Statistical Shape Models of faces and various body parts are heavily used in medical image analysis, computer vision and visualization. Whilst the field is well explored with many existing tools, all of them aim at experts, which limits their applicability. We demonstrate the first tool that enables the convenient exploration of statistical shape models in the browser, with the capability to manip…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: This is a preprint of the following contribution, published in BVM 2024, edited by Maier, A. et al., 2024, Springer Nature, reproduced with permission of Springer Nature. The final authenticated version is available online at: https://doi.org/10.1007/978-3-658-44037-4_32

  8. arXiv:2402.09963  [pdf, other]

    cs.LG

    Why are Sensitive Functions Hard for Transformers?

    Authors: Michael Hahn, Mark Rofin

    Abstract: Empirical studies have identified a range of learnability biases and limitations of transformers, such as a persistent difficulty in learning to compute simple formal languages such as PARITY, and a bias towards low-degree functions. However, theoretical understanding remains limited, with existing expressiveness theory either overpredicting or underpredicting realistic learning abilities. We prov…

    Submitted 27 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  9. arXiv:2401.16015  [pdf]

    cs.LO

    Querying Fault and Attack Trees: Property Specification on a Water Network

    Authors: Stefano M. Nicoletti, Milan Lopuhaä-Zwakenberg, E. Moritz Hahn, Mariëlle Stoelinga

    Abstract: We provide an overview of three different query languages whose objective is to specify properties on the highly popular formalisms of fault trees (FTs) and attack trees (ATs). These are BFL, a Boolean Logic for FTs; PFL, a probabilistic extension of BFL; and ATM, a logic for security metrics on ATs. We validate the framework composed of these three logics by applying them to the case study of a wa…

    Submitted 29 January, 2024; originally announced January 2024.

  10. arXiv:2312.14930  [pdf]

    cs.NI

    A Data-Driven Digital Twin Network Architecture in the Industrial Internet of Things (IIoT) Applications

    Authors: Abubakar Isah, Hyeju Shin, Ibrahim Aliyu, Sangwon Oh, Sangjoon Lee, Jaehyung Park, Minsoo Hahn, Jinsul Kim

    Abstract: A new network named the "Digital Twin Network" (DTN) uses the "Digital Twin" (DT) technology to produce virtual twins of real things. The network load and size continue to grow as a result of the development of 5G, the Internet of Things, and cloud computing technology as well as the advent of new network services. As a result, network operation and maintenance are becoming more difficult. A digit…

    Submitted 17 July, 2023; originally announced December 2023.

  11. arXiv:2312.14125  [pdf, other]

    cs.CV cs.AI

    VideoPoet: A Large Language Model for Zero-Shot Video Generation

    Authors: Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Josh Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam, et al. (6 additional authors not shown)

    Abstract: We present VideoPoet, a language model capable of synthesizing high-quality video, with matching audio, from a large variety of conditioning signals. VideoPoet employs a decoder-only transformer architecture that processes multimodal inputs -- including images, videos, text, and audio. The training protocol follows that of Large Language Models (LLMs), consisting of two stages: pretraining and tas…

    Submitted 4 June, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear at ICML 2024; Project page: http://sites.research.google/videopoet/

  12. arXiv:2312.08602  [pdf, other]

    cs.LO cs.LG

    Omega-Regular Decision Processes

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: Regular decision processes (RDPs) are a subclass of non-Markovian decision processes where the transition and reward functions are guarded by some regular property of the past (a lookback). While RDPs enable intuitive and succinct representation of non-Markovian decision processes, their expressive power coincides with finite-state Markov decision processes (MDPs). We introduce omega-regular decis…

    Submitted 13 December, 2023; originally announced December 2023.

  13. arXiv:2312.06662  [pdf, other]

    cs.CV cs.AI cs.LG

    Photorealistic Video Generation with Diffusion Models

    Authors: Agrim Gupta, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, José Lezama

    Abstract: We present W.A.L.T, a transformer-based approach for photorealistic video generation via diffusion modeling. Our approach has two key design decisions. First, we use a causal encoder to jointly compress images and videos within a unified latent space, enabling training and generation across modalities. Second, for memory and training efficiency, we use a window attention architecture tailored for…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Project website https://walt-video-diffusion.github.io/

  14. arXiv:2312.00151  [pdf, other]

    cs.CV cs.AI

    Which way is `right'?: Uncovering limitations of Vision-and-Language Navigation model

    Authors: Meera Hahn, Amit Raj, James M. Rehg

    Abstract: The challenging task of Vision-and-Language Navigation (VLN) requires embodied agents to follow natural language instructions to reach a goal location or object (e.g. `walk down the hallway and turn left at the piano'). For agents to complete this task successfully, they must be able to ground objects referenced into the instruction (e.g. `piano') into the visual scene as well as ground directional…

    Submitted 30 November, 2023; originally announced December 2023.

  15. arXiv:2311.14822  [pdf, other]

    cs.CV

    Text and Click inputs for unambiguous open vocabulary instance segmentation

    Authors: Nikolai Warner, Meera Hahn, Jonathan Huang, Irfan Essa, Vighnesh Birodkar

    Abstract: Segmentation localizes objects in an image on a fine-grained per-pixel scale. Segmentation benefits from humans-in-the-loop to provide additional input of objects to segment using a combination of foreground or background clicks. Tasks include photoediting or novel dataset annotation, where human annotators leverage an existing segmentation model instead of drawing raw pixel level annotations. We pr…

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 20 pages, 9 figures, 8 tables

  16. arXiv:2311.04009  [pdf, other]

    cs.LG cs.CR cs.CV

    AGNES: Abstraction-guided Framework for Deep Neural Networks Security

    Authors: Akshay Dhonthi, Marcello Eiermann, Ernst Moritz Hahn, Vahid Hashemi

    Abstract: Deep Neural Networks (DNNs) are becoming widespread, particularly in safety-critical areas. One prominent application is image recognition in autonomous driving, where the correct classification of objects, such as traffic signs, is essential for safe driving. Unfortunately, DNNs are prone to backdoors, meaning that they concentrate on attributes of the image that should be irrelevant for their co…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 14 pages, 6 Figures, 4 Tables, Accepted at 25th International Conference on Verification, Model Checking, and Abstract Interpretation (VMCAI 2024)

  17. arXiv:2309.09231  [pdf]

    cs.CR cs.LO

    ATM: a Logic for Quantitative Security Properties on Attack Trees

    Authors: Stefano M. Nicoletti, Milan Lopuhaä-Zwakenberg, E. Moritz Hahn, Mariëlle Stoelinga

    Abstract: Critical infrastructure systems - for which high reliability and availability are paramount - must operate securely. Attack trees (ATs) are hierarchical diagrams that offer a flexible modelling language used to assess how systems can be attacked. ATs are widely employed both in industry and academia but - in spite of their popularity - little work has been done to give practitioners instruments to…

    Submitted 17 May, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

  18. arXiv:2308.07469  [pdf, other]

    cs.LG cs.AI cs.FL

    Omega-Regular Reward Machines

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: Reinforcement learning (RL) is a powerful approach for training agents to perform tasks, but designing an appropriate reward mechanism is critical to its success. However, in many cases, the complexity of the learning objectives goes beyond the capabilities of the Markovian assumption, necessitating a more sophisticated reward mechanism. Reward machines and omega-regular languages are two formalis…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: To appear in ECAI-2023

  19. arXiv:2306.03734  [pdf, other]

    cs.CL

    A Cross-Linguistic Pressure for Uniform Information Density in Word Order

    Authors: Thomas Hikaru Clark, Clara Meister, Tiago Pimentel, Michael Hahn, Ryan Cotterell, Richard Futrell, Roger Levy

    Abstract: While natural languages differ widely in both canonical word order and word order flexibility, their word orders still follow shared cross-linguistic statistical patterns, often attributed to functional pressures. In the effort to identify these pressures, prior work has compared real and counterfactual word orders. Yet one functional pressure has been overlooked in such investigations: the unifor…

    Submitted 9 July, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

  20. arXiv:2304.08330  [pdf, other]

    cs.LO

    Scenario Approach for Parametric Markov Models

    Authors: Ying Liu, Andrea Turrini, Moritz Hahn, Bai Xue, Lijun Zhang

    Abstract: In this paper, we propose an approximating framework for analyzing parametric Markov models. Instead of computing complex rational functions encoding the reachability probability and the reward values of the parametric model, we exploit the scenario approach to synthesize a relatively simple polynomial approximation. The approximation is probably approximately correct (PAC), meaning that with high…

    Submitted 13 November, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 24 pages, 8 figures; updated to add acknowledgements and data availability

  21. PFL: a Probabilistic Logic for Fault Trees

    Authors: Stefano M. Nicoletti, Milan Lopuhaä-Zwakenberg, E. Moritz Hahn, Mariëlle Stoelinga

    Abstract: Safety-critical infrastructures must operate in a safe and reliable way. Fault tree analysis is a widespread method used for risk assessment of these systems: fault trees (FTs) are required by, e.g., the Federal Aviation Administration and the Nuclear Regulatory Commission. In spite of their popularity, little work has been done on formulating structural queries about FT and analyzing these, e.g.,…

    Submitted 1 June, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2208.13424

    Journal ref: In: Chechik, M., Katoen, JP., Leucker, M. (eds) Formal Methods. FM 2023. Lecture Notes in Computer Science, vol 14000. Springer, Cham

  22. arXiv:2303.07971  [pdf, other]

    cs.CL cs.LG

    A Theory of Emergent In-Context Learning as Implicit Structure Induction

    Authors: Michael Hahn, Navin Goyal

    Abstract: Scaling large language models (LLMs) leads to an emergent capacity to learn in-context from example demonstrations. Despite progress, theoretical understanding of this phenomenon remains limited. We argue that in-context learning relies on recombination of compositional operations found in natural language data. We derive an information-theoretic bound showing how in-context learning abilities ari…

    Submitted 14 March, 2023; originally announced March 2023.

  23. arXiv:2212.07278  [pdf, other]

    cs.CR cs.LG cs.LO

    Backdoor Mitigation in Deep Neural Networks via Strategic Retraining

    Authors: Akshay Dhonthi, Ernst Moritz Hahn, Vahid Hashemi

    Abstract: Deep Neural Networks (DNN) are becoming increasingly more important in assisted and automated driving. Using such entities which are obtained using machine learning is inevitable: tasks such as recognizing traffic signs cannot be developed reasonably using traditional software development methods. DNN however do have the problem that they are mostly black boxes and therefore hard to understand and…

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 13 Pages, 7 Tables, 4 Figures. Accepted at the International Symposium of Formal Methods 2023 (FM 2023)

  24. arXiv:2210.04864  [pdf, other]

    cs.CV cs.AI cs.CL

    Transformer-based Localization from Embodied Dialog with Large-scale Pre-training

    Authors: Meera Hahn, James M. Rehg

    Abstract: We address the challenging task of Localization via Embodied Dialog (LED). Given a dialog from two agents, an Observer navigating through an unknown environment and a Locator who is attempting to identify the Observer's location, the goal is to predict the Observer's final location in a map. We develop a novel LED-Bert architecture and present an effective pretraining strategy. We show that a grap…

    Submitted 10 October, 2022; originally announced October 2022.

    Journal ref: International Joint Conference on Natural Language Processing (2022)

  25. arXiv:2210.03787  [pdf, other]

    cs.CV cs.HC

    Learning a Visually Grounded Memory Assistant

    Authors: Meera Hahn, Kevin Carlberg, Ruta Desai, James Hillis

    Abstract: We introduce a novel interface for large scale collection of human memory and assistance. Using the 3D Matterport simulator we create realistic indoor environments in which we have people perform specific embodied memory tasks that mimic household daily activities. This interface was then deployed on Amazon Mechanical Turk allowing us to test and record human memory, navigation and needs for ass…

    Submitted 7 October, 2022; originally announced October 2022.

  26. BFL: a Logic to Reason about Fault Trees

    Authors: Stefano M. Nicoletti, E. Moritz Hahn, Marielle Stoelinga

    Abstract: Safety-critical infrastructures must operate safely and reliably. Fault tree analysis is a widespread method used to assess risks in these systems: fault trees (FTs) are required - among others - by the Federal Aviation Authority, the Nuclear Regulatory Commission, in the ISO26262 standard for autonomous driving and for software development in aerospace systems. Although popular both in industry a…

    Submitted 1 June, 2024; v1 submitted 29 August, 2022; originally announced August 2022.

  27. arXiv:2206.11430  [pdf, other]

    cs.LG cs.AI

    Recursive Reinforcement Learning

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: Recursion is the fundamental paradigm to finitely describe potentially infinite objects. As state-of-the-art reinforcement learning (RL) algorithms cannot directly reason about recursion, they must rely on the practitioner's ingenuity in designing a suitable "flat" representation of the environment. The resulting manual feature constructions and approximations are cumbersome and error-prone; their…

    Submitted 22 June, 2022; originally announced June 2022.

  28. Crosslinguistic word order variation reflects evolutionary pressures of dependency and information locality

    Authors: Michael Hahn, Yang Xu

    Abstract: Languages vary considerably in syntactic structure. About 40% of the world's languages have subject-verb-object order, and about 40% have subject-object-verb order. Extensive work has sought to explain this word order variation across languages. However, the existing approaches are not able to explain coherently the frequency distribution and evolution of word order in individual languages. We pro…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Preprint of peer-reviewed paper published in PNAS. Final copyedited version is available at: https://www.pnas.org/doi/10.1073/pnas.2122604119

    Journal ref: Proceedings of the National Academy of Sciences of the United States of America, 119(2022):24 e2122604119

  29. arXiv:2205.03243  [pdf, other]

    cs.FL cs.AI cs.LO

    Alternating Good-for-MDP Automata

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: When omega-regular objectives were first proposed in model-free reinforcement learning (RL) for controlling MDPs, deterministic Rabin automata were used in an attempt to provide a direct translation from their transitions to scalar values. While these translations failed, it has turned out that it is possible to repair them by using good-for-MDPs (GFM) Büchi automata instead. These are nondetermin…

    Submitted 6 May, 2022; originally announced May 2022.

  30. arXiv:2110.09470  [pdf, other]

    cs.CV

    No RL, No Simulation: Learning to Navigate without Navigating

    Authors: Meera Hahn, Devendra Chaplot, Shubham Tulsiani, Mustafa Mukadam, James M. Rehg, Abhinav Gupta

    Abstract: Most prior methods for learning navigation policies require access to simulation environments, as they need online policy interaction and rely on ground-truth maps for rewards. However, building simulators is expensive (requires manual effort for each and every scene) and creates challenges in transferring learned policies to robotic platforms in the real-world, due to the sim-to-real domain gap.…

    Submitted 22 October, 2021; v1 submitted 18 October, 2021; originally announced October 2021.

  31. arXiv:2106.09161  [pdf, other]

    cs.LG cs.LO eess.SY

    Mungojerrie: Reinforcement Learning of Linear-Time Objectives

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: Reinforcement learning synthesizes controllers without prior knowledge of the system. At each timestep, a reward is given. The controllers optimize the discounted sum of these rewards. Applying this class of algorithms requires designing a reward scheme, which is typically done manually. The designer must ensure that their intent is accurately captured. This may not be trivial, and is prone to err…

    Submitted 17 June, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Mungojerrie is available at https://plv.colorado.edu/mungojerrie/

  32. arXiv:2106.06777  [pdf, other]

    cs.LG cs.LO eess.SY

    Model-free Reinforcement Learning for Branching Markov Decision Processes

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDPs), a natural extension of (multitype) Branching Markov Chains (BMCs). The state of a (discrete-time) BMC is a collection of entities of various types that, while spawning other entities, generate a payoff. In comparison with BMCs, where the evolution of each entity of the same type follows the s…

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: to appear in CAV 2021

  33. arXiv:2104.10343  [pdf, other]

    cs.CL cs.CC cs.LG

    Sensitivity as a Complexity Measure for Sequence Classification Tasks

    Authors: Michael Hahn, Dan Jurafsky, Richard Futrell

    Abstract: We introduce a theoretical framework for understanding and predicting the complexity of sequence classification tasks, using a novel extension of the theory of Boolean function sensitivity. The sensitivity of a function, given a distribution over input sequences, quantifies the number of disjoint subsets of the input sequence that can each be individually changed to change the output. We argue tha…

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted by TACL. This is a pre-MIT Press publication version

  34. arXiv:2104.03216  [pdf, ps, other]

    math.NT cs.IT math.CO

    Valued rank-metric codes

    Authors: Yassine El Maazouz, Marvin Anas Hahn, Alessandro Neri, Mima Stanojkovski

    Abstract: In this paper, we study linear spaces of matrices defined over discretely valued fields and discuss their dimension and minimal rank drops over the associated residue fields. To this end, we take first steps into the theory of rank-metric codes over discrete valuation rings by means of skew algebras derived from Galois extensions of rings. Additionally, we model projectivizations of rank-metric co…

    Submitted 13 October, 2023; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: 31 pages

    MSC Class: 05E14; 11T71; 94B05 (Primary); 05B25; 14D06; 16S35 (Secondary)

  35. arXiv:2104.02493  [pdf, other]

    cs.LG

    RadarScenes: A Real-World Radar Point Cloud Data Set for Automotive Applications

    Authors: Ole Schumann, Markus Hahn, Nicolas Scheiner, Fabio Weishaupt, Julius F. Tilly, Jürgen Dickmann, Christian Wöhler

    Abstract: A new automotive radar data set with measurements and point-wise annotations from more than four hours of driving is presented. Data provided by four series radar sensors mounted on one test vehicle were recorded and the individual detections of dynamic objects were manually grouped to clusters and labeled afterwards. The purpose of this data set is to enable the development of novel (machine lear…

    Submitted 18 February, 2024; v1 submitted 6 April, 2021; originally announced April 2021.

  36. Motion Classification and Height Estimation of Pedestrians Using Sparse Radar Data

    Authors: Markus Horn, Ole Schumann, Markus Hahn, Jürgen Dickmann, Klaus Dietmayer

    Abstract: A complete overview of the surrounding vehicle environment is important for driver assistance systems and highly autonomous driving. Fusing results of multiple sensor types like camera, radar and lidar is crucial for increasing the robustness. The detection and classification of objects like cars, bicycles or pedestrians has been analyzed in the past for many sensor types. Beyond that, it is also…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 6 pages, 6 figures, 1 table

    Journal ref: 2018 Sensor Data Fusion: Trends, Solutions, Applications (SDF)

  37. arXiv:2011.08277  [pdf, other]

    cs.CV cs.CL

    Where Are You? Localization from Embodied Dialog

    Authors: Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James M. Rehg, Stefan Lee, Peter Anderson

    Abstract: We present Where Are You? (WAY), a dataset of ~6k dialogs in which two humans -- an Observer and a Locator -- complete a cooperative localization task. The Observer is spawned at random in a 3D environment and can navigate from first-person views while answering questions from the Locator. The Locator must localize the Observer in a detailed top-down map by asking questions and giving instructions…

    Submitted 3 September, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

    Journal ref: EMNLP 2020

  38. arXiv:2010.07515  [pdf, other]

    cs.CL

    RNNs can generate bounded hierarchical languages with optimal memory

    Authors: John Hewitt, Michael Hahn, Surya Ganguli, Percy Liang, Christopher D. Manning

    Abstract: Recurrent neural networks empirically generate natural language with high syntactic fidelity. However, their success is not well-understood theoretically. We provide theoretical insight into this success, proving in a finite-precision setting that RNNs can efficiently generate bounded hierarchical languages that reflect the scaffolding of natural language syntax. We introduce Dyck-($k$,$m$), the l…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: EMNLP2020 + appendix typo fixes

  39. Demand Forecasting of Individual Probability Density Functions with Machine Learning

    Authors: F. Wick, U. Kerzel, M. Hahn, M. Wolf, T. Singhal, D. Stemmer, J. Ernst, M. Feindt

    Abstract: Demand forecasting is a central component of the replenishment process for retailers, as it provides crucial input for subsequent decision making like ordering processes. In contrast to point estimates, such as the conditional mean of the underlying probability distribution, or confidence intervals, forecasting complete probability density functions makes it possible to investigate the impact on operational…

    Submitted 22 July, 2021; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: final version published in Springer Nature Operations Research Forum

    Journal ref: SN Oper. Res. Forum 2, 37 (2021)

  40. arXiv:2008.10117  [pdf, other]

    cs.LG stat.ML

    Collaborative Filtering under Model Uncertainty

    Authors: Robin M. Schmidt, Moritz Hahn

    Abstract: In their work, Dean, Rich, and Recht create a model to research recourse and availability of items in a recommender system. We used the definition of predictive multiplicity by Marx, Pin Calmon, and Ustun to examine different variations of this model, using different values for two model parameters. Pairwise comparison of their models shows that most of these models produce very similar results in…

    Submitted 24 August, 2020; v1 submitted 23 August, 2020; originally announced August 2020.

    Comments: v2: small display fix in affiliation

  41. arXiv:2001.05977  [pdf, ps, other]

    cs.LO cs.LG

    Reward Shaping for Reinforcement Learning with Omega-Regular Objectives

    Authors: E. M. Hahn, M. Perez, S. Schewe, F. Somenzi, A. Trivedi, D. Wojtczak

    Abstract: Recently, successful approaches have been made to exploit good-for-MDPs automata (Büchi automata with a restricted form of nondeterminism) for model free reinforcement learning, a class of automata that subsumes good for games automata and the most widespread class of limit deterministic automata. The foundation of using these Büchi automata is that the Büchi condition can, for good-for-MDP automa…

    Submitted 16 January, 2020; originally announced January 2020.

  42. arXiv:2001.04289  [pdf, ps, other]

    cs.LO

    Symblicit Exploration and Elimination for Probabilistic Model Checking

    Authors: Ernst Moritz Hahn, Arnd Hartmanns

    Abstract: Binary decision diagrams can compactly represent vast sets of states, mitigating the state space explosion problem in model checking. Probabilistic systems, however, require multi-terminal diagrams storing rational numbers. They are inefficient for models with many distinct probabilities and for iterative numeric algorithms like value iteration. In this paper, we present a new "symblicit" approach…

    Submitted 8 January, 2020; originally announced January 2020.

  43. arXiv:1912.10442  [pdf

    eess.AS cs.SD

    End-Point Detection with State Transition Model based on Chunk-Wise Classification

    Authors: Juntae Kim, Jaesung Bae, Minsoo Hahn

    Abstract: A state transition model (STM) based on chunk-wise classification was proposed for end-point detection (EPD). In general, EPD is developed using frame-wise voice activity detection (VAD) with additional STM, in which the state transition is conducted based on VAD's frame-level decision (speech or non-speech). However, VAD errors frequently occur in noisy environments, even though we use state-of-t…

    Submitted 22 December, 2019; originally announced December 2019.

  44. Wreath Products of Distributive Forest Algebras

    Authors: Michael Hahn, Andreas Krebs, Howard Straubing

    Abstract: It is an open problem whether definability in Propositional Dynamic Logic (PDL) on forests is decidable. Based on an algebraic characterization by Bojańczyk et al. (2012) in terms of forest algebras, Straubing (2013) described an approach to PDL based on a k-fold iterated distributive law. A proof that all languages satisfying such a k-fold iterated distributive law are in PDL would settle decid…

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: Appeared in: LICS '18 Proceedings of the 33rd Annual ACM/IEEE Symposium on Logic in Computer Science, pages 512-520

  45. arXiv:1909.05081  [pdf, other

    cs.FL

    Good-for-MDPs Automata for Probabilistic Analysis and Reinforcement Learning

    Authors: Ernst Moritz Hahn, Mateo Perez, Fabio Somenzi, Ashutosh Trivedi, Sven Schewe, Dominik Wojtczak

    Abstract: We characterize the class of nondeterministic $ω$-automata that can be used for the analysis of finite Markov decision processes (MDPs). We call these automata `good-for-MDPs' (GFM). We show that GFM automata are closed under classic simulation as well as under more powerful simulation relations that leverage properties of optimal control strategies for MDPs. This closure enables us to exploit sta…

    Submitted 30 October, 2019; v1 submitted 11 September, 2019; originally announced September 2019.

  46. arXiv:1906.07285  [pdf, other

    cs.CL

    Tabula nearly rasa: Probing the Linguistic Knowledge of Character-Level Neural Language Models Trained on Unsegmented Text

    Authors: Michael Hahn, Marco Baroni

    Abstract: Recurrent neural networks (RNNs) have reached striking performance in many natural language processing tasks. This has renewed interest in whether these generic sequence processing devices are inducing genuine linguistic knowledge. Nearly all current analytical studies, however, initialize the RNNs with a vocabulary of known words, and feed them tokenized input during training. We present a multi-…

    Submitted 17 June, 2019; originally announced June 2019.

    Comments: Accepted by Transactions of the Association for Computational Linguistics

  47. arXiv:1906.06755  [pdf, other

    cs.CL cs.FL cs.LG

    Theoretical Limitations of Self-Attention in Neural Sequence Models

    Authors: Michael Hahn

    Abstract: Transformers are emerging as the new workhorse of NLP, showing great success across tasks. Unlike LSTMs, transformers process input sequences entirely through self-attention. Previous work has suggested that the computational capabilities of self-attention to process hierarchical structures are limited. In this work, we mathematically investigate the computational power of self-attention to model…

    Submitted 12 February, 2020; v1 submitted 16 June, 2019; originally announced June 2019.

    Comments: Accepted by: Transactions of the Association for Computational Linguistics

  48. arXiv:1904.09936  [pdf, other

    cs.CV

    Tripping through time: Efficient Localization of Activities in Videos

    Authors: Meera Hahn, Asim Kadav, James M. Rehg, Hans Peter Graf

    Abstract: Localizing moments in untrimmed videos via language queries is a new and interesting task that requires the ability to accurately ground language into video. Previous works have approached this task by processing the entire video, often more than once, to localize relevant activities. In real-world applications of this approach, such as video surveillance, efficiency is a key system requiremen…

    Submitted 18 August, 2020; v1 submitted 22 April, 2019; originally announced April 2019.

    Comments: Presented at BMVC, 2020

  49. arXiv:1902.00595  [pdf, other

    cs.CL

    Character-based Surprisal as a Model of Reading Difficulty in the Presence of Error

    Authors: Michael Hahn, Frank Keller, Yonatan Bisk, Yonatan Belinkov

    Abstract: Intuitively, human readers cope easily with errors in text; typos, misspelling, word substitutions, etc. do not unduly disrupt natural reading. Previous work indicates that letter transpositions result in increased reading times, but it is unclear if this effect generalizes to more natural errors. In this paper, we report an eye-tracking study that compares two error types (letter transpositions a…

    Submitted 19 May, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: Published in Proceedings of CogSci 2019

  50. arXiv:1901.00484  [pdf, other

    cs.CV

    Action2Vec: A Crossmodal Embedding Approach to Action Learning

    Authors: Meera Hahn, Andrew Silva, James M. Rehg

    Abstract: We describe a novel cross-modal embedding space for actions, named Action2Vec, which combines linguistic cues from class labels with spatio-temporal features derived from video clips. Our approach uses a hierarchical recurrent network to capture the temporal structure of video features. We train our embedding using a joint loss that combines classification accuracy with similarity to Word2Vec sema…

    Submitted 2 January, 2019; originally announced January 2019.