Skip to main content

Showing 1–6 of 6 results for author: Große, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03951  [pdf, other

    cs.LG

    Uncertainty-Guided Optimization on Large Language Model Search Trees

    Authors: Julia Grosse, Ruotian Wu, Ahmad Rashid, Philipp Hennig, Pascal Poupart, Agustinus Kristiadi

    Abstract: Beam search is a standard tree search algorithm when it comes to finding sequences of maximum likelihood, for example, in the decoding processes of large language models. However, it is myopic since it does not take the whole path from the root to a leaf into account. Moreover, it is agnostic to prior knowledge available about the process: For example, it does not consider that the objective being… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 10 pages

  2. arXiv:2406.07780  [pdf, other

    cs.LG cs.CL

    A Critical Look At Tokenwise Reward-Guided Text Generation

    Authors: Ahmad Rashid, Ruotian Wu, Julia Grosse, Agustinus Kristiadi, Pascal Poupart

    Abstract: Large language models (LLMs) can significantly be improved by aligning to human preferences -- the so-called reinforcement learning from human feedback (RLHF). However, the cost of fine-tuning an LLM is prohibitive for many users. Due to their ability to bypass LLM finetuning, tokenwise reward-guided text generation (RGTG) methods have recently been proposed. They use a reward model trained on ful… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2209.00895  [pdf, other

    cs.LG

    Optimistic Optimization of Gaussian Process Samples

    Authors: Julia Grosse, Cheng Zhang, Philipp Hennig

    Abstract: Bayesian optimization is a popular formalism for global optimization, but its computational costs limit it to expensive-to-evaluate functions. A competing, computationally more efficient, global optimization framework is optimistic optimization, which exploits prior knowledge about the geometry of the search space in form of a dissimilarity function. We investigate to which degree the conceptual a… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: 10 pages, 6 figures

  4. arXiv:2106.08717  [pdf, other

    cs.LG cs.AI

    Probabilistic DAG Search

    Authors: Julia Grosse, Cheng Zhang, Philipp Hennig

    Abstract: Exciting contemporary machine learning problems have recently been phrased in the classic formalism of tree search -- most famously, the game of Go. Interestingly, the state-space underlying these sequential decision-making problems often posses a more general latent structure than can be captured by a tree. In this work, we develop a probabilistic framework to exploit a search space's latent stru… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 10 pages, 8 figures, to be published at the Conference on Uncertainty in Artificial Intelligence (UAI) 2021

  5. arXiv:0801.2175  [pdf, other

    cs.GR

    MathPSfrag 2: Convenient LaTeX Labels in Mathematica

    Authors: Johannes Große

    Abstract: This article introduces the next version of MathPSfrag. MathPSfrag is a Mathematica package that during export automatically replaces all expressions in a plot by corresponding LaTeX commands. The new version can also produce LaTeX independent images; e.g., PDF files for inclusion in pdfLaTeX. Moreover from these files a preview is generated and shown within Mathematica.

    Submitted 15 January, 2008; originally announced January 2008.

    Comments: 9 pages, package can be found at http://wwwth.mppmu.mpg.de/members/jgrosse/mathpsfrag/

    ACM Class: I.3.4

  6. arXiv:cs/0510087  [pdf, ps, other

    cs.GR

    MathPSfrag: Creating Publication-Quality Labels in Mathematica Plots

    Authors: J. Grosse

    Abstract: This article introduces a Mathematica package providing a graphics export function that automatically replaces Mathematica expressions in a graphic by the corresponding LaTeX constructs and positions them correctly. It thus facilitates the creation of publication-quality Enscapulated PostScript (EPS) graphics.

    Submitted 31 October, 2005; originally announced October 2005.

    Comments: 7 pages, 8 figures, for associated Mathematica package, see http://wwwth.mppmu.mpg.de/members/jgrosse/mathpsfrag/MathPSfrag-1.0.tar.gz

    Report number: LMU-ASC 70/05; MPP-2005-126 ACM Class: I.3.4