Skip to main content

Showing 1–25 of 25 results for author: Franke, H

.
  1. arXiv:2407.06040  [pdf, other

    cs.CR

    Enabling Performant and Secure EDA as a Service in Public Clouds Using Confidential Containers

    Authors: Mengmei Ye, Derren Dunn, Daniele Buono, Angelo Ruocco, Claudio Carvalho, Tobin Feldman-fitzthum, Hubertus Franke, James Bottomley

    Abstract: Increasingly, business opportunities available to fabless design teams in the semiconductor industry far exceed those addressable with on-prem compute resources. An attractive option to capture these electronic design automation (EDA) design opportunities is through public cloud bursting. However, security concerns with public cloud bursting arise from having to protect process design kits, third… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2405.10299  [pdf, other

    cs.LG cs.AI

    HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models

    Authors: Rhea Sanjay Sukthanker, Arber Zela, Benedikt Staffler, Aaron Klein, Lennart Purucker, Joerg K. H. Franke, Frank Hutter

    Abstract: The increasing size of language models necessitates a thorough analysis across multiple dimensions to assess trade-offs among crucial hardware metrics such as latency, energy consumption, GPU memory usage, and performance. Identifying optimal model configurations under specific hardware constraints is becoming essential but remains challenging due to the computational load of exhaustive training a… ▽ More

    Submitted 21 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 48 pages, 69 figures, 10 tables

  3. arXiv:2404.08509  [pdf, other

    cs.DC cs.CL cs.LG

    Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction

    Authors: Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Başar, Ravishankar K. Iyer

    Abstract: Large language models (LLMs) have been driving a new wave of interactive AI applications across numerous domains. However, efficiently serving LLM inference requests is challenging due to their unpredictable execution times originating from the autoregressive nature of generative models. Existing LLM serving systems exploit first-come-first-serve (FCFS) scheduling, suffering from head-of-line bloc… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted at AIOps'24

  4. arXiv:2403.07890  [pdf, other

    cs.GT cs.AI cs.LG

    $\widetilde{O}(T^{-1})$ Convergence to (Coarse) Correlated Equilibria in Full-Information General-Sum Markov Games

    Authors: Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Başar

    Abstract: No-regret learning has a long history of being closely connected to game theory. Recent works have devised uncoupled no-regret learning dynamics that, when adopted by all the players in normal-form games, converge to various equilibrium solutions at a near-optimal rate of $\widetilde{O}(T^{-1})$, a significant improvement over the $O(1/\sqrt{T})$ rate of classic no-regret learners. However, analog… ▽ More

    Submitted 23 April, 2024; v1 submitted 2 February, 2024; originally announced March 2024.

  5. arXiv:2401.05351  [pdf, other

    q-bio.BM cs.LG

    Rethinking Performance Measures of RNA Secondary Structure Problems

    Authors: Frederic Runge, Jörg K. H. Franke, Daniel Fertmann, Frank Hutter

    Abstract: Accurate RNA secondary structure prediction is vital for understanding cellular regulation and disease mechanisms. Deep learning (DL) methods have surpassed traditional algorithms by predicting complex features like pseudoknots and multi-interacting base pairs. However, traditional distance measures can hardly deal with such tertiary interactions and the currently used evaluation measures (F1 scor… ▽ More

    Submitted 4 December, 2023; originally announced January 2024.

    Comments: 12 pages, Accepted at the Machine Learning for Structural Biology Workshop, NeurIPS 2023

  6. arXiv:2312.06808  [pdf, other

    cs.OS

    BPF-oF: Storage Function Pushdown Over the Network

    Authors: Ioannis Zarkadas, Tal Zussman, Jeremy Carin, Sheng Jiang, Yuhong Zhong, Jonas Pfefferle, Hubertus Franke, Junfeng Yang, Kostis Kaffes, Ryan Stutsman, Asaf Cidon

    Abstract: Storage disaggregation, wherein storage is accessed over the network, is popular because it allows applications to independently scale storage capacity and bandwidth based on dynamic application demand. However, the added network processing introduced by disaggregation can consume significant CPU resources. In many storage systems, logical storage operations (e.g., lookups, aggregations) involve a… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  7. arXiv:2311.09058  [pdf, other

    cs.LG

    Constrained Parameter Regularization

    Authors: Jörg K. H. Franke, Michael Hefenbrock, Gregor Koehler, Frank Hutter

    Abstract: Regularization is a critical component in deep learning training, with weight decay being a commonly used approach. It applies a constant penalty coefficient uniformly across all parameters. This may be unnecessarily restrictive for some parameters, while insufficiently restricting others. To dynamically adjust penalty coefficients for different parameter groups, we present constrained parameter r… ▽ More

    Submitted 6 December, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

  8. arXiv:2310.03940  [pdf, other

    cs.CV cs.AI

    Beyond Random Augmentations: Pretraining with Hard Views

    Authors: Fabio Ferreira, Ivo Rapant, Jörg K. H. Franke, Frank Hutter

    Abstract: Many Self-Supervised Learning (SSL) methods aim for model invariance to different image augmentations known as views. To achieve this invariance, conventional approaches make use of random sampling operations within the image augmentation pipeline. We hypothesize that the efficacy of pretraining pipelines based on conventional random view sampling can be enhanced by explicitly selecting views that… ▽ More

    Submitted 27 May, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  9. arXiv:2309.07513  [pdf, other

    cs.CV

    RecycleNet: Latent Feature Recycling Leads to Iterative Decision Refinement

    Authors: Gregor Koehler, Tassilo Wald, Constantin Ulrich, David Zimmerer, Paul F. Jaeger, Jörg K. H. Franke, Simon Kohl, Fabian Isensee, Klaus H. Maier-Hein

    Abstract: Despite the remarkable success of deep learning systems over the last decade, a key difference still remains between neural network and human decision-making: As humans, we cannot only form a decision on the spot, but also ponder, revisiting an initial guess from different angles, distilling relevant information, arriving at a better decision. Here, we propose RecycleNet, a latent feature recyclin… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted at 2024 Winter Conference on Applications of Computer Vision (WACV)

  10. arXiv:2307.10073  [pdf, other

    cs.LG q-bio.BM

    Scalable Deep Learning for RNA Secondary Structure Prediction

    Authors: Jörg K. H. Franke, Frederic Runge, Frank Hutter

    Abstract: The field of RNA secondary structure prediction has made significant progress with the adoption of deep learning techniques. In this work, we present the RNAformer, a lean deep learning model using axial attention and recycling in the latent space. We gain performance improvements by designing the architecture for modeling the adjacency matrix directly in the latent space and by scaling the size o… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted at the 2023 ICML Workshop on Computational Biology. Honolulu, Hawaii, USA, 2023

  11. arXiv:2307.08801  [pdf, other

    cs.LG q-bio.GN

    Towards Automated Design of Riboswitches

    Authors: Frederic Runge, Jörg K. H. Franke, Frank Hutter

    Abstract: Experimental screening and selection pipelines for the discovery of novel riboswitches are expensive, time-consuming, and inefficient. Using computational methods to reduce the number of candidates for the screen could drastically decrease these costs. However, existing computational approaches do not fully satisfy all requirements for the design of such initial screening libraries. In this work,… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 9 pages, Accepted at the 2023 ICML Workshop on Computational Biology

  12. Remote attestation of SEV-SNP confidential VMs using e-vTPMs

    Authors: Vikram Narayanan, Claudio Carvalho, Angelo Ruocco, Gheorghe Almási, James Bottomley, Mengmei Ye, Tobin Feldman-Fitzthum, Daniele Buono, Hubertus Franke, Anton Burtsev

    Abstract: Trying to address the security challenges of a cloud-centric software deployment paradigm, silicon and cloud vendors are introducing confidential computing - an umbrella term aimed at providing hardware and software mechanisms for protecting cloud workloads from the cloud provider and its software stack. Today, Intel SGX, AMD SEV, Intel TDX, etc., provide a way to shield cloud applications from th… ▽ More

    Submitted 25 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: 12 pages, 4 figures

    Journal ref: In Proceedings of the 39th Annual Computer Security Applications Conference (ACSAC 2023). 732-743

  13. arXiv:2303.15540  [pdf, other

    cs.CR cs.OS

    Intel TDX Demystified: A Top-Down Approach

    Authors: Pau-Chen Cheng, Wojciech Ozga, Enriquillo Valdez, Salman Ahmed, Zhongshu Gu, Hani Jamjoom, Hubertus Franke, James Bottomley

    Abstract: Intel Trust Domain Extensions (TDX) is a new architectural extension in the 4th Generation Intel Xeon Scalable Processor that supports confidential computing. TDX allows the deployment of virtual machines in the Secure-Arbitration Mode (SEAM) with encrypted CPU state and memory, integrity protection, and remote attestation. TDX aims to enforce hardware-assisted isolation for virtual machines and m… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  14. arXiv:2302.10366  [pdf, other

    cs.OS cs.CR

    Programmable System Call Security with eBPF

    Authors: **ghao Jia, YiFei Zhu, Dan Williams, Andrea Arcangeli, Claudio Canella, Hubertus Franke, Tobin Feldman-Fitzthum, Dimitrios Skarlatos, Daniel Gruss, Tianyin Xu

    Abstract: System call filtering is a widely used security mechanism for protecting a shared OS kernel against untrusted user applications. However, existing system call filtering techniques either are too expensive due to the context switch overhead imposed by userspace agents, or lack sufficient programmability to express advanced policies. Seccomp, Linux's system call filtering module, is widely used by m… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  15. arXiv:2210.12610  [pdf, other

    cs.CR

    Partially Trusting the Service Mesh Control Plane

    Authors: Constantin Adam, Abdulhamid Adebayo, Hubertus Franke, Edward Snible, Tobin Feldman-Fitzthum, James Cadden, Nerla Jean-Louis

    Abstract: Zero Trust is a novel cybersecurity model that focuses on continually evaluating trust to prevent the initiation and horizontal spreading of attacks. A cloud-native Service Mesh is an example of Zero Trust Architecture that can filter out external threats. However, the Service Mesh does not shield the Application Owner from internal threats, such as a rogue administrator of the cluster where their… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

  16. arXiv:2205.13927  [pdf, other

    cs.LG q-bio.BM

    Probabilistic Transformer: Modelling Ambiguities and Distributions for RNA Folding and Molecule Design

    Authors: Jörg K. H. Franke, Frederic Runge, Frank Hutter

    Abstract: Our world is ambiguous and this is reflected in the data we use to train our algorithms. This is particularly true when we try to model natural processes where collected data is affected by noisy measurements and differences in measurement techniques. Sometimes, the process itself is ambiguous, such as in the case of RNA folding, where the same nucleotide sequence can fold into different structure… ▽ More

    Submitted 14 November, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: 38 pages, Accepted at 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  17. arXiv:2203.13396  [pdf, other

    cs.AR cs.DC cs.OS

    HetSched: Quality-of-Mission Aware Scheduling for Autonomous Vehicle SoCs

    Authors: Aporva Amarnath, Subhankar Pal, Hiwot Kassa, Augusto Vega, Alper Buyuktosunoglu, Hubertus Franke, John-David Wellman, Ronald Dreslinski, Pradip Bose

    Abstract: Systems-on-Chips (SoCs) that power autonomous vehicles (AVs) must meet stringent performance and safety requirements prior to deployment. With increasing complexity in AV applications, the system needs to meet these real-time demands of multiple safety-critical applications simultaneously. A typical AV-SoC is a heterogeneous multiprocessor consisting of accelerators supported by general-purpose co… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: 14 pages, 11 figures, 4 tables

  18. arXiv:2010.13117  [pdf, other

    cs.LG cs.AI

    Hyperparameter Transfer Across Developer Adjustments

    Authors: Danny Stoll, Jörg K. H. Franke, Diane Wagner, Simon Selg, Frank Hutter

    Abstract: After developer adjustments to a machine learning (ML) algorithm, how can the results of an old hyperparameter optimization (HPO) automatically be used to speedup a new HPO? This question poses a challenging problem, as developer adjustments can change which hyperparameter settings perform well, or even the hyperparameter search space itself. While many approaches exist that leverage knowledge obt… ▽ More

    Submitted 25 October, 2020; originally announced October 2020.

  19. arXiv:2009.01555  [pdf, other

    cs.LG stat.ML

    Sample-Efficient Automated Deep Reinforcement Learning

    Authors: Jörg K. H. Franke, Gregor Köhler, André Biedenkapp, Frank Hutter

    Abstract: Despite significant progress in challenging problems across various domains, applying state-of-the-art deep reinforcement learning (RL) algorithms remains challenging due to their sensitivity to the choice of hyperparameters. This sensitivity can partly be attributed to the non-stationarity of the RL problem, potentially requiring different hyperparameter settings at various stages of the learning… ▽ More

    Submitted 17 March, 2021; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: In Proceedings of the International Conference on Learning Representations (ICLR 2021), 2021

  20. arXiv:2007.14371  [pdf, other

    cs.AR cs.DC

    STOMP: A Tool for Evaluation of Scheduling Policies in Heterogeneous Multi-Processors

    Authors: Augusto Vega, Aporva Amarnath, John-David Wellman, Hiwot Kassa, Subhankar Pal, Hubertus Franke, Alper Buyuktosunoglu, Ronald Dreslinski, Pradip Bose

    Abstract: The proliferation of heterogeneous chip multiprocessors in recent years has reached unprecedented levels. Traditional homogeneous platforms have shown fundamental limitations when it comes to enabling high-performance yet-ultra-low-power computing, in particular in application domains with real-time execution deadlines or criticality constraints. By combining the right set of general purpose cores… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

  21. arXiv:1910.12824  [pdf, other

    cs.LG cs.NE stat.ML

    Neural Architecture Evolution in Deep Reinforcement Learning for Continuous Control

    Authors: Jörg K. H. Franke, Gregor Köhler, Noor Awad, Frank Hutter

    Abstract: Current Deep Reinforcement Learning algorithms still heavily rely on handcrafted neural network architectures. We propose a novel approach to automatically find strong topologies for continuous control tasks while only adding a minor overhead in terms of interactions in the environment. To achieve this, we combine Neuroevolution techniques with off-policy training and propose a novel architecture… ▽ More

    Submitted 27 February, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019 MetaLearn Workshop

  22. arXiv:1602.06804  [pdf, ps, other

    cond-mat.mes-hall cond-mat.quant-gas

    Coexistence of strong and weak coupling in ZnO nanowire cavities

    Authors: Tom Michalsky, Helena Franke, Robert Buschlinger, Ulf Peschel, Marius Grundmann, Rüdiger Schmidt-Grund

    Abstract: We present a high quality two-dimensional cavity structure based on ZnO nanowires coated with concentrical Bragg reflectors. The spatial mode distribution leads to the simultaneous appearance of the weak and strong coupling regime even at room temperature. Photoluminescence measurements agree with FDTD simulations. Furthermore the ZnO core nanowires allow for the observation of middle polariton br… ▽ More

    Submitted 22 February, 2016; originally announced February 2016.

    Comments: 17 pages, 8 figures

  23. arXiv:1503.01416  [pdf, ps, other

    cs.DC cs.AR cs.PF

    Disaggregated and optically interconnected memory: when will it be cost effective?

    Authors: Bulent Abali, Richard J. Eickemeyer, Hubertus Franke, Chung-Sheng Li, Marc A. Taubenblatt

    Abstract: The "Disaggregated Server" concept has been proposed for datacenters where the same type server resources are aggregated in their respective pools, for example a compute pool, memory pool, network pool, and a storage pool. Each server is constructed dynamically by allocating the right amount of resources from these pools according to the workload's requirements. Modularity, higher packaging and co… ▽ More

    Submitted 3 March, 2015; originally announced March 2015.

    Comments: 9 pages, 7 figures

    ACM Class: B.3

  24. arXiv:1501.02644  [pdf, ps, other

    cond-mat.mes-hall

    Discrete relaxation of exciton-polaritons in an inhomogeneous potential

    Authors: T. Michalsky, H. Franke, C. Sturm, M. Grundmann, R. Schmidt-Grund

    Abstract: We present indications, that the wave function-stiffness condition during energy-relaxation as observed in single-phase state quantum systems manifests also in a single particle ensemble. This is demonstrated for exciton-polaritons in the strong coupling regime in a ZnO-based microcavity at T = 10 K for non-resonant excitation. It is well known that the pump-induced spatially inhomogeneous backgro… ▽ More

    Submitted 12 January, 2015; originally announced January 2015.

    Comments: 9 pages, 3 figures

  25. arXiv:1412.8667  [pdf, other

    cond-mat.mes-hall cond-mat.dis-nn cond-mat.quant-gas physics.optics

    Cavity Polariton Condensate in a Disordered Environment

    Authors: Martin Thunert, Alexander Janot, Helena Franke, Chris Sturm, Tom Michalsky, María Dolores Martín, Luis Viña, Bernd Rosenow, Marius Grundmann, Rüdiger Schmidt-Grund

    Abstract: We report on the influence of disorder on an exciton-polariton condensate in a ZnO based bulk planar microcavity and compare experimental results with a theoretical model for a non-equilibrium condensate. Experimentally, we detect intensity fluctuations within the far-field emission pattern even at high condensate densities which indicates a significant impact of disorder. We show that these effec… ▽ More

    Submitted 16 November, 2015; v1 submitted 30 December, 2014; originally announced December 2014.

    Comments: main article and supplementary, 13 pages, 8 figures (main article)

    Journal ref: Phys. Rev. B 93, 064203 (2016)