Showing 1–2 of 2 results for author: Wunder, M

  1. arXiv:2402.17327  [pdf, other]

    cs.LG cs.DS

    Data-Efficient Learning via Clustering-Based Sensitivity Sampling: Foundation Models and Beyond

    Authors: Kyriakos Axiotis, Vincent Cohen-Addad, Monika Henzinger, Sammy Jerome, Vahab Mirrokni, David Saulpic, David Woodruff, Michael Wunder

    Abstract: We study the data selection problem, whose aim is to select a small representative subset of data that can be used to efficiently train a machine learning model. We present a new data selection approach based on $k$-means clustering and sensitivity sampling. Assuming access to an embedding representation of the data with respect to which the model loss is Hölder continuous, our approach provably a…

    Submitted 27 February, 2024; originally announced February 2024.

  2. arXiv:2306.11903  [pdf, other]

    cs.LG

    Deep Fusion: Efficient Network Training via Pre-trained Initializations

    Authors: Hanna Mazzawi, Xavi Gonzalvo, Michael Wunder, Sammy Jerome, Benoit Dherin

    Abstract: In recent years, deep learning has made remarkable progress in a wide range of domains, with a particularly notable impact on natural language processing tasks. One of the challenges associated with training deep neural networks in the context of LLMs is the need for large amounts of computational resources and time. To mitigate this, network growing algorithms offer potential cost savings, but th…

    Submitted 26 June, 2024; v1 submitted 20 June, 2023; originally announced June 2023.