Skip to main content

Showing 1–16 of 16 results for author: Kramer, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.10999  [pdf, other

    cs.LG cs.CL cs.NE

    Large Language Models for Tuning Evolution Strategies

    Authors: Oliver Kramer

    Abstract: Large Language Models (LLMs) exhibit world knowledge and inference capabilities, making them powerful tools for various applications. This paper proposes a feedback loop mechanism that leverages these capabilities to tune Evolution Strategies (ES) parameters effectively. The mechanism involves a structured process of providing programming instructions, executing the corresponding code, and conduct… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  2. arXiv:2401.09862  [pdf, other

    cs.NE cs.AI cs.CL cs.LG

    Evolutionary Multi-Objective Optimization of Large Language Model Prompts for Balancing Sentiments

    Authors: Jill Baumann, Oliver Kramer

    Abstract: The advent of large language models (LLMs) such as ChatGPT has attracted considerable attention in various domains due to their remarkable performance and versatility. As the use of these models continues to grow, the importance of effective prompt engineering has come to the fore. Prompt optimization emerges as a crucial challenge, as it has a direct impact on model performance and the extraction… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted in EvoApps at EvoStar 2024

  3. arXiv:2110.14535  [pdf, other

    cs.AI

    Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem

    Authors: Stefan Böhm, Martin Neumayer, Oliver Kramer, Alexander Schiendorfer, Alois Knoll

    Abstract: Cutting and Packing problems are occurring in different industries with a direct impact on the revenue of businesses. Generally, the goal in Cutting and Packing is to assign a set of smaller objects to a set of larger objects. To solve Cutting and Packing problems, practitioners can resort to heuristic and exact methodologies. Lately, machine learning is increasingly used for solving such problems… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  4. arXiv:2105.02636  [pdf, other

    cs.CV cs.MM

    Estimating Presentation Competence using Multimodal Nonverbal Behavioral Cues

    Authors: Ömer Sümer, Cigdem Beyan, Fabian Ruth, Olaf Kramer, Ulrich Trautwein, Enkelejda Kasneci

    Abstract: Public speaking and presentation competence plays an essential role in many areas of social interaction in our educational, professional, and everyday life. Since our intention during a speech can differ from what is actually understood by the audience, the ability to appropriately convey our message requires a complex set of skills. Presentation competence is cultivated in the early school years… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

  5. arXiv:2006.03132  [pdf, other

    q-fin.GN cs.LG stat.ML

    Earnings Prediction with Deep Learning

    Authors: Lars Elend, Sebastian A. Tideman, Kerstin Lopatta, Oliver Kramer

    Abstract: In the financial sector, a reliable forecast the future financial performance of a company is of great importance for investors' investment decisions. In this paper we compare long-term short-term memory (LSTM) networks to temporal convolution network (TCNs) in the prediction of future earnings per share (EPS). The experimental analysis is based on quarterly financial reporting data and daily stoc… ▽ More

    Submitted 12 October, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: 7 pages, 4 figures, 2 tables

    Journal ref: LNCS 12325 (2020) 267-274

  6. arXiv:2005.02666  [pdf, other

    cs.NE cs.LG q-bio.BM

    Evolutionary Multi-Objective Design of SARS-CoV-2 Protease Inhibitor Candidates

    Authors: Tim Cofala, Lars Elend, Philip Mirbach, Jonas Prellberg, Thomas Teusch, Oliver Kramer

    Abstract: Computational drug design based on artificial intelligence is an emerging research area. At the time of writing this paper, the world suffers from an outbreak of the coronavirus SARS-CoV-2. A promising way to stop the virus replication is via protease inhibition. We propose an evolutionary multi-objective algorithm (EMOA) to design potential protease inhibitors for SARS-CoV-2's main protease. Base… ▽ More

    Submitted 18 May, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 15 pages, 7 figures, submitted to PPSN 2020

    ACM Class: I.2.8; J.3

    Journal ref: LNCS 12270 (2020) 357-371

  7. arXiv:2003.10159  [pdf, other

    cs.LG stat.ML

    Learned Weight Sharing for Deep Multi-Task Learning by Natural Evolution Strategy and Stochastic Gradient Descent

    Authors: Jonas Prellberg, Oliver Kramer

    Abstract: In deep multi-task learning, weights of task-specific networks are shared between tasks to improve performance on each single one. Since the question, which weights to share between layers, is difficult to answer, human-designed architectures often share everything but a last task-specific layer. In many cases, this simplistic approach severely limits performance. Instead, we propose an algorithm… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: Accepted at IJCNN 2020

  8. arXiv:1906.09020  [pdf, other

    cs.CV

    Acute Lymphoblastic Leukemia Classification from Microscopic Images using Convolutional Neural Networks

    Authors: Jonas Prellberg, Oliver Kramer

    Abstract: Examining blood microscopic images for leukemia is necessary when expensive equipment for flow cytometry is unavailable. Automated systems can ease the burden on medical experts for performing this examination and may be especially helpful to quickly screen a large number of patients. We present a simple, yet effective classification approach using a ResNeXt convolutional neural network with Squee… ▽ More

    Submitted 1 April, 2020; v1 submitted 21 June, 2019; originally announced June 2019.

  9. arXiv:1806.09819  [pdf, other

    cs.NE

    Limited Evaluation Evolutionary Optimization of Large Neural Networks

    Authors: Jonas Prellberg, Oliver Kramer

    Abstract: Stochastic gradient descent is the most prevalent algorithm to train neural networks. However, other approaches such as evolutionary algorithms are also applicable to this task. Evolutionary algorithms bring unique trade-offs that are worth exploring, but computational demands have so far restricted exploration to small networks with few parameters. We implement an evolutionary algorithm that exec… ▽ More

    Submitted 26 June, 2018; originally announced June 2018.

    Comments: Accepted at KI 2018

  10. arXiv:1806.08099  [pdf, other

    cs.NE

    Lamarckian Evolution of Convolutional Neural Networks

    Authors: Jonas Prellberg, Oliver Kramer

    Abstract: Convolutional neural networks belong to the most successul image classifiers, but the adaptation of their network architecture to a particular problem is computationally expensive. We show that an evolutionary algorithm saves training time during the network architecture optimization, if learned network weights are inherited over generations by Lamarckian evolution. Experiments on typical image da… ▽ More

    Submitted 19 December, 2018; v1 submitted 21 June, 2018; originally announced June 2018.

    Comments: Accepted at PPSN 2018

  11. arXiv:1805.05760  [pdf, other

    cs.CV

    Multi-label Classification of Surgical Tools with Convolutional Neural Networks

    Authors: Jonas Prellberg, Oliver Kramer

    Abstract: Automatic tool detection from surgical imagery has a multitude of useful applications, such as real-time computer assistance for the surgeon. Using the successful residual network architecture, a system that can distinguish 21 different tools in cataract surgery videos is created. The videos are provided as part of the 2017 CATARACTS challenge and pose difficulties found in many real-world dataset… ▽ More

    Submitted 15 May, 2018; originally announced May 2018.

    Comments: Accepted at IJCNN 2018

  12. arXiv:1709.03247  [pdf, other

    cs.NE

    Evolution of Convolutional Highway Networks

    Authors: Oliver Kramer

    Abstract: Convolutional highways are deep networks based on multiple stacked convolutional layers for feature preprocessing. We introduce an evolutionary algorithm (EA) for optimization of the structure and hyperparameters of convolutional highways and demonstrate the potential of this optimization setting on the well-known MNIST data set. The (1+1)-EA employs Rechenberg's mutation rate control and a nichin… ▽ More

    Submitted 11 September, 2017; originally announced September 2017.

    Comments: 8 pages, 4 figures

  13. arXiv:1405.1234  [pdf, ps, other

    cs.DS math.CO

    A Novel Approach to the Common Due-Date Problem on Single and Parallel Machines

    Authors: Abhishek Awasthi, Jörg Lässig, Oliver Kramer

    Abstract: This paper presents a novel idea for the general case of the Common Due-Date (CDD) scheduling problem. The problem is about scheduling a certain number of jobs on a single or parallel machines where all the jobs possess different processing times but a common due-date. The objective of the problem is to minimize the total penalty incurred due to earliness or tardiness of the job completions. This… ▽ More

    Submitted 6 May, 2014; originally announced May 2014.

    Comments: Book Chapter 22 pages

  14. arXiv:1311.2880  [pdf, ps, other

    cs.DS math.CO

    Aircraft Landing Problem: Efficient Algorithm for a Given Landing Sequence

    Authors: Abhishek Awasthi, Oliver Kramer, Jörg Lässig

    Abstract: In this paper, we investigate a special case of the static aircraft landing problem (ALP) with the objective to optimize landing sequences and landing times for a set of air planes. The problem is to land the planes on one or multiple runways within a time window as close as possible to the preferable target landing time, maintaining a safety distance constraint. The objective of this well-known N… ▽ More

    Submitted 26 October, 2013; originally announced November 2013.

    Comments: 16th IEEE International Conference on Computational Science and Engineering (CSE 2013)

  15. arXiv:1311.2879  [pdf, ps, other

    cs.DS math.CO

    Common Due-Date Problem: Exact Polynomial Algorithms for a Given Job Sequence

    Authors: Abhishek Awasthi, Jörg Lässig, Oliver Kramer

    Abstract: This paper considers the problem of scheduling jobs on single and parallel machines where all the jobs possess different processing times but a common due date. There is a penalty involved with each job if it is processed earlier or later than the due date. The objective of the problem is to find the assignment of jobs to machines, the processing sequence of jobs and the time at which they are pro… ▽ More

    Submitted 26 October, 2013; originally announced November 2013.

    Comments: 15th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing

  16. arXiv:1107.3600  [pdf, other

    stat.ML cs.LG

    Unsupervised K-Nearest Neighbor Regression

    Authors: Oliver Kramer

    Abstract: In many scientific disciplines structures in high-dimensional data have to be found, e.g., in stellar spectra, in genome data, or in face recognition tasks. In this work we present a novel approach to non-linear dimensionality reduction. It is based on fitting K-nearest neighbor regression to the unsupervised regression framework for learning of low-dimensional manifolds. Similar to related approa… ▽ More

    Submitted 26 September, 2011; v1 submitted 18 July, 2011; originally announced July 2011.

    Comments: 4 pages, 12 figures