Skip to main content

Showing 1–3 of 3 results for author: Mattick, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.00112  [pdf, other

    cs.LG

    Reinforcement Learning for Node Selection in Branch-and-Bound

    Authors: Alexander Mattick, Christopher Mutschler

    Abstract: A big challenge in branch and bound lies in identifying the optimal node within the search tree from which to proceed. Current state-of-the-art selectors utilize either hand-crafted ensembles that automatically switch between naive sub-node selectors, or learned node selectors that rely on individual node data. We propose a novel simulation technique that uses reinforcement learning (RL) while con… ▽ More

    Submitted 5 June, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

  2. arXiv:2304.07327  [pdf, other

    cs.CL cs.AI

    OpenAssistant Conversations -- Democratizing Large Language Model Alignment

    Authors: Andreas Köpf, Yannic Kilcher, Dimitri von Rütte, Sotiris Anagnostidis, Zhi-Rui Tam, Keith Stevens, Abdullah Barhoum, Nguyen Minh Duc, Oliver Stanley, Richárd Nagyfi, Shahul ES, Sameer Suri, David Glushkov, Arnav Dantuluri, Andrew Maguire, Christoph Schuhmann, Huu Nguyen, Alexander Mattick

    Abstract: Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their acce… ▽ More

    Submitted 31 October, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Published in NeurIPS 2023 Datasets and Benchmarks

    Report number: V-02 ACM Class: I.2

  3. SmartPatch: Improving Handwritten Word Imitation with Patch Discriminators

    Authors: Alexander Mattick, Martin Mayr, Mathias Seuret, Andreas Maier, Vincent Christlein

    Abstract: As of recent generative adversarial networks have allowed for big leaps in the realism of generated images in diverse domains, not the least of which being handwritten text generation. The generation of realistic-looking hand-written text is important because it can be used for data augmentation in handwritten text recognition (HTR) systems or human-computer interaction. We propose SmartPatch, a n… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: to be published the in 16th International Conference on Document Analysis and Recognition 2021 ICDAR