Skip to main content

Showing 1–20 of 20 results for author: Dery, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.05406  [pdf, other

    cs.LG cs.CL

    Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes

    Authors: Lucio Dery, Steven Kolawole, Jean-François Kagy, Virginia Smith, Graham Neubig, Ameet Talwalkar

    Abstract: Given the generational gap in available hardware between lay practitioners and the most endowed institutions, LLMs are becoming increasingly inaccessible as they grow in size. Whilst many approaches have been proposed to compress LLMs to make their resource consumption manageable, these methods themselves tend to be resource intensive, putting them out of the reach of the very user groups they tar… ▽ More

    Submitted 9 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 15 pages, 4 fiigures, 15 tables

  2. arXiv:2312.03151  [pdf, other

    cs.LG

    Multitask Learning Can Improve Worst-Group Outcomes

    Authors: Atharva Kulkarni, Lucio Dery, Amrith Setlur, Aditi Raghunathan, Ameet Talwalkar, Graham Neubig

    Abstract: In order to create machine learning systems that serve a variety of users well, it is vital to not only achieve high average performance but also ensure equitable outcomes across diverse groups. However, most machine learning methods are designed to improve a model's average performance on a chosen end task without consideration for their impact on worst group error. Multitask learning (MTL) is on… ▽ More

    Submitted 28 February, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 20 pages, 7 tables, 6 Figures

  3. arXiv:2311.06382  [pdf, other

    cs.CL cs.LG

    Transfer Learning for Structured Pruning under Limited Task Data

    Authors: Lucio Dery, David Grangier, Awni Hannun

    Abstract: Large, pre-trained models are problematic to use in resource constrained applications. Fortunately, task-aware structured pruning methods offer a solution. These approaches reduce model size by drop** structural units like layers and attention heads in a manner that takes into account the end-task. However, these pruning algorithms require more task-specific data than is typically available. We… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 8 pages, 7 figures and 3 tables

  4. arXiv:2311.06379  [pdf, other

    cs.CL

    DeMuX: Data-efficient Multilingual Learning

    Authors: Simran Khanuja, Srinivas Gowriraj, Lucio Dery, Graham Neubig

    Abstract: We consider the task of optimally fine-tuning pre-trained multilingual models, given small amounts of unlabelled target data and an annotation budget. In this paper, we introduce DEMUX, a framework that prescribes the exact data-points to label from vast amounts of unlabelled multilingual data, having unknown degrees of overlap with the target set. Unlike most prior works, our end-to-end framework… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  5. arXiv:2309.09684  [pdf, other

    cs.AI cs.GT

    Distributed course allocation with asymmetric friendships

    Authors: Ilya Khakhiashvili, Lihi Dery, Tal Grinshpoun

    Abstract: Students' decisions on whether to take a class are strongly affected by whether their friends plan to take the class with them. A student may prefer to be assigned to a course they likes less, just to be with their friends, rather than taking a more preferred class alone. It has been shown that taking classes with friends positively affects academic performance. Thus, academic institutes should pr… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  6. arXiv:2302.05738  [pdf, other

    cs.LG

    Cross-Modal Fine-Tuning: Align then Refine

    Authors: Junhong Shen, Liam Li, Lucio M. Dery, Corey Staten, Mikhail Khodak, Graham Neubig, Ameet Talwalkar

    Abstract: Fine-tuning large-scale pretrained models has led to tremendous progress in well-studied modalities such as vision and NLP. However, similar gains have not been observed in many other modalities due to a lack of relevant pretrained models. In this work, we propose ORCA, a general cross-modal fine-tuning framework that extends the applicability of a single large-scale pretrained model to diverse mo… ▽ More

    Submitted 18 March, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

  7. arXiv:2212.13774  [pdf, other

    cs.GT

    Forming a ranking from tied evaluations: a case of an online, interactive student peer assessment system

    Authors: Lihi Dery

    Abstract: In higher education courses, peer assessment activities are common for kee** students engaged during presentations. Defining precisely how students assess the work of others requires careful consideration. Asking the student for numeric grades is the most common method. However, students tend to assign high grades to most projects. Aggregating peer assessments, therefore, results in all projects… ▽ More

    Submitted 30 August, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

  8. arXiv:2210.04971  [pdf, other

    cs.LG cs.AI

    Multi-step Planning for Automated Hyperparameter Optimization with OptFormer

    Authors: Lucio M. Dery, Abram L. Friesen, Nando De Freitas, Marc'Aurelio Ranzato, Yutian Chen

    Abstract: As machine learning permeates more industries and models become more expensive and time consuming to train, the need for efficient automated hyperparameter optimization (HPO) has never been more pressing. Multi-step planning based approaches to hyperparameter optimization promise improved efficiency over myopic alternatives by more effectively balancing out exploration and exploitation. However, t… ▽ More

    Submitted 16 November, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 8 pages, 7 figures

  9. arXiv:2205.14082  [pdf, other

    cs.LG cs.AI

    AANG: Automating Auxiliary Learning

    Authors: Lucio M. Dery, Paul Michel, Mikhail Khodak, Graham Neubig, Ameet Talwalkar

    Abstract: Auxiliary objectives, supplementary learning signals that are introduced to help aid learning on data-starved or highly complex end-tasks, are commonplace in machine learning. Whilst much work has been done to formulate useful auxiliary objectives, their construction is still an art which proceeds by slow and tedious hand-design. Intuition for how and when these objectives improve end-task perform… ▽ More

    Submitted 27 February, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to ICLR 2023 22 pages, 7 tables and 5 figures

  10. arXiv:2205.10580  [pdf, other

    cs.CR

    Towards Secure Virtual Elections: Multiparty Computation of Order Based Voting Rules

    Authors: Tamir Tassa, Lihi Dery

    Abstract: Electronic voting systems are essential for holding virtual elections, and the need for such systems increases due to the COVID-19 pandemic and the social distancing that it mandates. One of the main challenges in e-voting systems is to secure the voting process: namely, to certify that the computed results are consistent with the cast ballots, and that the privacy of the voters is preserved. We p… ▽ More

    Submitted 8 May, 2024; v1 submitted 21 May, 2022; originally announced May 2022.

  11. arXiv:2201.05071  [pdf, other

    cs.CR cs.CV cs.LG cs.NE

    Evaluation of Neural Networks Defenses and Attacks using NDCG and Reciprocal Rank Metrics

    Authors: Haya Brama, Lihi Dery, Tal Grinshpoun

    Abstract: The problem of attacks on neural networks through input modification (i.e., adversarial examples) has attracted much attention recently. Being relatively easy to generate and hard to detect, these attacks pose a security breach that many suggested defenses try to mitigate. However, the evaluation of the effect of attacks and defenses commonly relies on traditional classification metrics, without a… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: 12 pages, 5 figures

    Journal ref: International Journal of Information Security 2022

  12. arXiv:2109.07437  [pdf, other

    cs.LG cs.CL

    Should We Be Pre-training? An Argument for End-task Aware Training as an Alternative

    Authors: Lucio M. Dery, Paul Michel, Ameet Talwalkar, Graham Neubig

    Abstract: In most settings of practical concern, machine learning practitioners know in advance what end-task they wish to boost with auxiliary tasks. However, widely used methods for leveraging auxiliary data like pre-training and its continued-pretraining variant are end-task agnostic: they rarely, if ever, exploit knowledge of the target task. We study replacing end-task agnostic continued training of pr… ▽ More

    Submitted 6 February, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: 18 pages, 4 figures

  13. arXiv:2108.11346  [pdf, other

    cs.LG

    Auxiliary Task Update Decomposition: The Good, The Bad and The Neutral

    Authors: Lucio M. Dery, Yann Dauphin, David Grangier

    Abstract: While deep learning has been very beneficial in data-rich settings, tasks with smaller training set often resort to pre-training or multitask learning to leverage data from other tasks. In this case, careful consideration is needed to select tasks and model parameterizations such that updates from the auxiliary tasks actually help the primary task. We seek to alleviate this burden by formulating a… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: 15 pages, 3 figures, Accepted to International Conference on Learning Representations (ICLR) 2021 See https://github.com/ldery/ATTITTUD}{https://github.com/ldery/ATTITTUD for associated code

  14. Multi-label Ranking: Mining Multi-label and Label Ranking Data

    Authors: Lihi Dery

    Abstract: We survey multi-label ranking tasks, specifically multi-label classification and label ranking classification. We highlight the unique challenges, and re-categorize the methods, as they no longer fit into the traditional categories of transformation and adaptation. We survey developments in the last demi-decade, with a special focus on state-of-the-art methods in deep learning multi-label mining,… ▽ More

    Submitted 3 January, 2021; originally announced January 2021.

  15. arXiv:2001.07744  [pdf, other

    cs.LG stat.ML

    Improving Label Ranking Ensembles using Boosting Techniques

    Authors: Lihi Dery, Erez Shmueli

    Abstract: Label ranking is a prediction task which deals with learning a map** between an instance and a ranking (i.e., order) of labels from a finite set, representing their relevance to the instance. Boosting is a well-known and reliable ensemble technique that was shown to often outperform other learning algorithms. While boosting algorithms were developed for a multitude of machine learning tasks, lab… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

    Journal ref: IEEE Access 2020

  16. arXiv:1905.09116  [pdf, other

    econ.TH cs.CY

    Cheating in Ranking Systems

    Authors: Lihi Dery, Dror Hermel, Artyom Jelnov

    Abstract: Consider an application sold on an on-line platform, with the app paying a commission fee and, henceforth, offered for sale on the platform. The ability to sell the application depends on its customer ranking. Therefore, developers may have an incentive to promote their applications ranking in a dishonest manner. One way to do this is by faking positive customer reviews. However, the platform is a… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

  17. Reaching Consensus Under a Deadline

    Authors: Marina Bannikova, Lihi Dery, Svetlana Obraztsova, Zinovi Rabinovich, Jeffrey S. Rosenschein

    Abstract: Committee decisions are complicated by a deadline, e.g., the next start of a budget, or the beginning of a semester. In committee hiring decisions, it may be that if no candidate is supported by a strong majority, the default is to hire no one - an option that may cost dearly. As a result, committee members might prefer to agree on a reasonable, if not necessarily the best, candidate, to avoid unf… ▽ More

    Submitted 26 January, 2021; v1 submitted 17 May, 2019; originally announced May 2019.

    Journal ref: Autonomous Agents and Multi-Agent Systems, 35(1), 1-42 (2021)

  18. Lie on the Fly: Strategic Voting in an Iterative Preference Elicitation Process

    Authors: Lihi Dery, Svetlana Obraztsova, Zinovi Rabinovich, Meir Kalech

    Abstract: A voting center is in charge of collecting and aggregating voter preferences. In an iterative process, the center sends comparison queries to voters, requesting them to submit their preference between two items. Voters might discuss the candidates among themselves, figuring out during the elicitation process which candidates stand a chance of winning and which do not. Consequently, strategic voter… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

  19. Fear Not, Vote Truthfully: Secure Multiparty Computation of Score Based Rules

    Authors: Lihi Dery, Tamir Tassa, Avishay Yanai

    Abstract: We propose a secure voting protocol for score-based voting rules, where independent talliers perform the tallying procedure. The protocol outputs the winning candidate(s) while preserving the privacy of the voters and the secrecy of the ballots. It offers perfect secrecy, in the sense that apart from the desired output, all other information -- the ballots, intermediate values, and the final score… ▽ More

    Submitted 27 January, 2022; v1 submitted 15 March, 2019; originally announced March 2019.

    Journal ref: Expert Systems with Applications, 168, 114434 (2021)

  20. arXiv:1712.09382  [pdf, other

    eess.AS cs.CV cs.SD

    Audio to Body Dynamics

    Authors: Eli Shlizerman, Lucio M. Dery, Hayden Schoen, Ira Kemelmacher-Shlizerman

    Abstract: We present a method that gets as input an audio of violin or piano playing, and outputs a video of skeleton predictions which are further used to animate an avatar. The key idea is to create an animation of an avatar that moves their hands similarly to how a pianist or violinist would do, just from audio. Aiming for a fully detailed correct arms and fingers motion is a goal, however, it's not clea… ▽ More

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: Link with videos https://arviolin.github.io/AudioBodyDynamics/

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018