-
Re-evaluating sample efficiency in de novo molecule generation
Authors:
Morgan Thomas,
Noel M. O'Boyle,
Andreas Bender,
Chris De Graaf
Abstract:
De novo molecule generation can suffer from data inefficiency; requiring large amounts of training data or many sampled data points to conduct objective optimization. The latter is a particular disadvantage when combining deep generative models with computationally expensive molecule scoring functions (a.k.a. oracles) commonly used in computer-aided drug design. Recent works have therefore focused…
▽ More
De novo molecule generation can suffer from data inefficiency; requiring large amounts of training data or many sampled data points to conduct objective optimization. The latter is a particular disadvantage when combining deep generative models with computationally expensive molecule scoring functions (a.k.a. oracles) commonly used in computer-aided drug design. Recent works have therefore focused on methods to improve sample efficiency in the context of de novo molecule drug design, or to benchmark it. In this work, we discuss and adapt a recent sample efficiency benchmark to better reflect realistic goals also with respect to the quality of chemistry generated, which must always be considered in the context of small-molecule drug design; we then re-evaluate all benchmarked generative models. We find that accounting for molecular weight and LogP with respect to the training data, and the diversity of chemistry proposed, re-orders the ranking of generative models. In addition, we benchmark a recently proposed method to improve sample efficiency (Augmented Hill-Climb) and found it ranked top when considering both the sample efficiency and chemistry of molecules generated. Continual improvements in sample efficiency and chemical desirability enable more routine integration of computationally expensive scoring functions on a more realistic timescale.
△ Less
Submitted 1 December, 2022;
originally announced December 2022.
-
Learning Dense Visual Descriptors using Image Augmentations for Robot Manipulation Tasks
Authors:
Christian Graf,
David B. Adrian,
Joshua Weil,
Miroslav Gabriel,
Philipp Schillinger,
Markus Spies,
Heiko Neumann,
Andras Kupcsik
Abstract:
We propose a self-supervised training approach for learning view-invariant dense visual descriptors using image augmentations. Unlike existing works, which often require complex datasets, such as registered RGBD sequences, we train on an unordered set of RGB images. This allows for learning from a single camera view, e.g., in an existing robotic cell with a fix-mounted camera. We create synthetic…
▽ More
We propose a self-supervised training approach for learning view-invariant dense visual descriptors using image augmentations. Unlike existing works, which often require complex datasets, such as registered RGBD sequences, we train on an unordered set of RGB images. This allows for learning from a single camera view, e.g., in an existing robotic cell with a fix-mounted camera. We create synthetic views and dense pixel correspondences using data augmentations. We find our descriptors are competitive to the existing methods, despite the simpler data recording and setup requirements. We show that training on synthetic correspondences provides descriptor consistency across a broad range of camera views. We compare against training with geometric correspondence from multiple views and provide ablation studies. We also show a robotic bin-picking experiment using descriptors learned from a fix-mounted camera for defining grasp preferences.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
Computational Performance of Deep Reinforcement Learning to find Nash Equilibria
Authors:
Christoph Graf,
Viktor Zobernig,
Johannes Schmidt,
Claude Klöckl
Abstract:
We test the performance of deep deterministic policy gradient (DDPG), a deep reinforcement learning algorithm, able to handle continuous state and action spaces, to learn Nash equilibria in a setting where firms compete in prices. These algorithms are typically considered model-free because they do not require transition probability functions (as in e.g., Markov games) or predefined functional for…
▽ More
We test the performance of deep deterministic policy gradient (DDPG), a deep reinforcement learning algorithm, able to handle continuous state and action spaces, to learn Nash equilibria in a setting where firms compete in prices. These algorithms are typically considered model-free because they do not require transition probability functions (as in e.g., Markov games) or predefined functional forms. Despite being model-free, a large set of parameters are utilized in various steps of the algorithm. These are e.g., learning rates, memory buffers, state-space dimensioning, normalizations, or noise decay rates and the purpose of this work is to systematically test the effect of these parameter configurations on convergence to the analytically derived Bertrand equilibrium. We find parameter choices that can reach convergence rates of up to 99%. The reliable convergence may make the method a useful tool to study strategic behavior of firms even in more complex settings. Keywords: Bertrand Equilibrium, Competition in Uniform Price Auctions, Deep Deterministic Policy Gradient Algorithm, Parameter Sensitivity Analysis
△ Less
Submitted 26 April, 2021;
originally announced April 2021.
-
When redundancy is useful: A Bayesian approach to 'overinformative' referring expressions
Authors:
Judith Degen,
Robert D. Hawkins,
Caroline Graf,
Elisa Kreiss,
Noah D. Goodman
Abstract:
Referring is one of the most basic and prevalent uses of language. How do speakers choose from the wealth of referring expressions at their disposal? Rational theories of language use have come under attack for decades for not being able to account for the seemingly irrational overinformativeness ubiquitous in referring expressions. Here we present a novel production model of referring expressions…
▽ More
Referring is one of the most basic and prevalent uses of language. How do speakers choose from the wealth of referring expressions at their disposal? Rational theories of language use have come under attack for decades for not being able to account for the seemingly irrational overinformativeness ubiquitous in referring expressions. Here we present a novel production model of referring expressions within the Rational Speech Act framework that treats speakers as agents that rationally trade off cost and informativeness of utterances. Crucially, we relax the assumption that informativeness is computed with respect to a deterministic Boolean semantics, in favor of a non-deterministic continuous semantics. This innovation allows us to capture a large number of seemingly disparate phenomena within one unified framework: the basic asymmetry in speakers' propensity to overmodify with color rather than size; the increase in overmodification in complex scenes; the increase in overmodification with atypical features; and the increase in specificity in nominal reference as a function of typicality. These findings cast a new light on the production of referring expressions: rather than being wastefully overinformative, reference is usefully redundant.
△ Less
Submitted 10 December, 2019; v1 submitted 19 March, 2019;
originally announced March 2019.
-
Smart Charging Technologies for Portable Electronic Devices
Authors:
Stefan Hild,
Sean Leavey,
Christian Gräf,
Borja Sorazu
Abstract:
In this article we describe our efforts of extending demand-side control concepts to the application in portable electronic devices, such as laptop computers, mobile phones and tablet computers. As these devices feature built-in energy storage (in the form of batteries) and the ability to run complex control routines, they are ideal for the implementation of smart charging concepts. We developed a…
▽ More
In this article we describe our efforts of extending demand-side control concepts to the application in portable electronic devices, such as laptop computers, mobile phones and tablet computers. As these devices feature built-in energy storage (in the form of batteries) and the ability to run complex control routines, they are ideal for the implementation of smart charging concepts. We developed a prototype of a smart laptop charger that controls the charging process depending on the locally measured frequency of the electricity grid. If this technique is incorporated into millions of devices in UK households, this will contribute significantly to the stability of the electricity grid, help to mitigate the power production fluctuations from renewable energy sources and avoid the high cost of building and maintaining conventional power plants as standby reserve.
△ Less
Submitted 23 March, 2013; v1 submitted 25 September, 2012;
originally announced September 2012.