Showing 1–6 of 6 results for author: Dery, L M

Searching in archive cs.
  1. arXiv:2302.05738  [pdf, other]

    cs.LG

    Cross-Modal Fine-Tuning: Align then Refine

    Authors: Junhong Shen, Liam Li, Lucio M. Dery, Corey Staten, Mikhail Khodak, Graham Neubig, Ameet Talwalkar

    Abstract: Fine-tuning large-scale pretrained models has led to tremendous progress in well-studied modalities such as vision and NLP. However, similar gains have not been observed in many other modalities due to a lack of relevant pretrained models. In this work, we propose ORCA, a general cross-modal fine-tuning framework that extends the applicability of a single large-scale pretrained model to diverse mo…

    Submitted 18 March, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

  2. arXiv:2210.04971  [pdf, other]

    cs.LG cs.AI

    Multi-step Planning for Automated Hyperparameter Optimization with OptFormer

    Authors: Lucio M. Dery, Abram L. Friesen, Nando De Freitas, Marc'Aurelio Ranzato, Yutian Chen

    Abstract: As machine learning permeates more industries and models become more expensive and time consuming to train, the need for efficient automated hyperparameter optimization (HPO) has never been more pressing. Multi-step planning based approaches to hyperparameter optimization promise improved efficiency over myopic alternatives by more effectively balancing out exploration and exploitation. However, t…

    Submitted 16 November, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 8 pages, 7 figures

  3. arXiv:2205.14082  [pdf, other]

    cs.LG cs.AI

    AANG: Automating Auxiliary Learning

    Authors: Lucio M. Dery, Paul Michel, Mikhail Khodak, Graham Neubig, Ameet Talwalkar

    Abstract: Auxiliary objectives, supplementary learning signals that are introduced to help aid learning on data-starved or highly complex end-tasks, are commonplace in machine learning. Whilst much work has been done to formulate useful auxiliary objectives, their construction is still an art which proceeds by slow and tedious hand-design. Intuition for how and when these objectives improve end-task perform…

    Submitted 27 February, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to ICLR 2023. 22 pages, 7 tables and 5 figures

  4. arXiv:2109.07437  [pdf, other]

    cs.LG cs.CL

    Should We Be Pre-training? An Argument for End-task Aware Training as an Alternative

    Authors: Lucio M. Dery, Paul Michel, Ameet Talwalkar, Graham Neubig

    Abstract: In most settings of practical concern, machine learning practitioners know in advance what end-task they wish to boost with auxiliary tasks. However, widely used methods for leveraging auxiliary data like pre-training and its continued-pretraining variant are end-task agnostic: they rarely, if ever, exploit knowledge of the target task. We study replacing end-task agnostic continued training of pr…

    Submitted 6 February, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: 18 pages, 4 figures

  5. arXiv:2108.11346  [pdf, other]

    cs.LG

    Auxiliary Task Update Decomposition: The Good, The Bad and The Neutral

    Authors: Lucio M. Dery, Yann Dauphin, David Grangier

    Abstract: While deep learning has been very beneficial in data-rich settings, tasks with smaller training set often resort to pre-training or multitask learning to leverage data from other tasks. In this case, careful consideration is needed to select tasks and model parameterizations such that updates from the auxiliary tasks actually help the primary task. We seek to alleviate this burden by formulating a…

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: 15 pages, 3 figures. Accepted to International Conference on Learning Representations (ICLR) 2021. See https://github.com/ldery/ATTITTUD for associated code

  6. arXiv:1712.09382  [pdf, other]

    eess.AS cs.CV cs.SD

    Audio to Body Dynamics

    Authors: Eli Shlizerman, Lucio M. Dery, Hayden Schoen, Ira Kemelmacher-Shlizerman

    Abstract: We present a method that gets as input an audio of violin or piano playing, and outputs a video of skeleton predictions which are further used to animate an avatar. The key idea is to create an animation of an avatar that moves their hands similarly to how a pianist or violinist would do, just from audio. Aiming for a fully detailed correct arms and fingers motion is a goal, however, it's not clea…

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: Link with videos: https://arviolin.github.io/AudioBodyDynamics/

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018