Skip to main content

Showing 1–5 of 5 results for author: Theisen, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2310.12304  [pdf, other

    stat.ML cs.AI cs.LG

    Preference Optimization for Molecular Language Models

    Authors: Ryan Park, Ryan Theisen, Navriti Sahni, Marcel Patek, Anna CichoĊ„ska, Rayees Rahman

    Abstract: Molecular language modeling is an effective approach to generating novel chemical structures. However, these models do not \emph{a priori} encode certain preferences a chemist may desire. We investigate the use of fine-tuning using Direct Preference Optimization to better align generated molecules with chemist preferences. Our findings suggest that this approach is simple, efficient, and highly ef… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  2. arXiv:2305.12313  [pdf, other

    stat.ML cs.LG

    When are ensembles really effective?

    Authors: Ryan Theisen, Hyunsuk Kim, Yaoqing Yang, Liam Hodgkinson, Michael W. Mahoney

    Abstract: Ensembling has a long history in statistical data analysis, with many impactful applications. However, in many modern machine learning settings, the benefits of ensembling are less ubiquitous and less obvious. We study, both theoretically and empirically, the fundamental question of when ensembling yields significant performance improvements in classification tasks. Theoretically, we prove new res… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

  3. arXiv:2106.03357  [pdf, other

    stat.ML cs.LG

    Evaluating State-of-the-Art Classification Models Against Bayes Optimality

    Authors: Ryan Theisen, Huan Wang, Lav R. Varshney, Caiming Xiong, Richard Socher

    Abstract: Evaluating the inherent difficulty of a given data-driven classification problem is important for establishing absolute benchmarks and evaluating progress in the field. To this end, a natural quantity to consider is the \emph{Bayes error}, which measures the optimal classification error theoretically achievable for a given data distribution. While generally an intractable quantity, we show that we… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  4. arXiv:2006.12625  [pdf, other

    stat.ML cs.LG

    Good Classifiers are Abundant in the Interpolating Regime

    Authors: Ryan Theisen, Jason M. Klusowski, Michael W. Mahoney

    Abstract: Within the machine learning community, the widely-used uniform convergence framework has been used to answer the question of how complex, over-parameterized models can generalize well to new data. This approach bounds the test error of the worst-case model one could have fit to the data, but it has fundamental limitations. Inspired by the statistical mechanics approach to learning, we formally def… ▽ More

    Submitted 4 March, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

  5. arXiv:1910.10245  [pdf, other

    stat.ML cs.LG

    Global Capacity Measures for Deep ReLU Networks via Path Sampling

    Authors: Ryan Theisen, Jason M. Klusowski, Huan Wang, Nitish Shirish Keskar, Caiming Xiong, Richard Socher

    Abstract: Classical results on the statistical complexity of linear models have commonly identified the norm of the weights $\|w\|$ as a fundamental capacity measure. Generalizations of this measure to the setting of deep networks have been varied, though a frequently identified quantity is the product of weight norms of each layer. In this work, we show that for a large class of networks possessing a posit… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.