Skip to main content

Showing 1–19 of 19 results for author: Navon, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02649  [pdf, other

    eess.AS cs.LG cs.SD

    Keyword-Guided Adaptation of Automatic Speech Recognition

    Authors: Aviv Shamsian, Aviv Navon, Neta Glazer, Gill Hetz, Joseph Keshet

    Abstract: Automatic Speech Recognition (ASR) technology has made significant progress in recent years, providing accurate transcription across various domains. However, some challenges remain, especially in noisy environments and specialized jargon. In this paper, we propose a novel approach for improved jargon word recognition by contextual biasing Whisper-based models. We employ a keyword spotting model t… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to InterSpeech 2024

  2. arXiv:2403.00590  [pdf, other

    cs.NI

    Hercules: Heterogeneous Requirements Congestion Control Protocol

    Authors: Neta Rozen-Schiff, Itzcak Pechtalt, Amit Navon, Leon Bruckman

    Abstract: Future network services present a significant challenge for network providers due to high number and high variety of co-existing requirements. Despite many advancements in network architectures and management schemes, congested network links continue to constrain the Quality of Service (QoS) for critical applications like tele-surgery and autonomous driving. A prominent, complimentary approach con… ▽ More

    Submitted 4 June, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  3. arXiv:2402.11367  [pdf, other

    cs.LG

    Multi Task Inverse Reinforcement Learning for Common Sense Reward

    Authors: Neta Glazer, Aviv Navon, Aviv Shamsian, Ethan Fetaya

    Abstract: One of the challenges in applying reinforcement learning in a complex real-world environment lies in providing the agent with a sufficiently detailed reward function. Any misalignment between the reward and the desired behavior can result in unwanted outcomes. This may lead to issues like "reward hacking" where the agent maximizes rewards by unintended behavior. In this work, we propose to disenta… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  4. arXiv:2402.04081  [pdf, other

    cs.LG cs.AI

    Improved Generalization of Weight Space Networks via Augmentations

    Authors: Aviv Shamsian, Aviv Navon, David W. Zhang, Yan Zhang, Ethan Fetaya, Gal Chechik, Haggai Maron

    Abstract: Learning in deep weight spaces (DWS), where neural networks process the weights of other neural networks, is an emerging research direction, with applications to 2D and 3D neural fields (INRs, NeRFs), as well as making inferences about other types of neural networks. Unfortunately, weight space models tend to suffer from substantial overfitting. We empirically analyze the reasons for this overfitt… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Under Review

  5. arXiv:2402.02428  [pdf, other

    eess.SY cs.GT

    On the Stability of Strategic Energy Storage Operation in Wholesale Electricity Markets (Extended Version)

    Authors: Aviad Navon, Juri Belikov, Ariel Orda, Yoash Levron

    Abstract: High shares of variable renewable energy necessitate substantial energy storage capacity. However, it remains unclear how to design a market that, on the one hand, ensures a stable and sufficient income for storage firms, and, on the other hand, maintains stable and affordable electricity costs for the consumers. Here, we use a game theoretic model to study storage competition in wholesale electri… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  6. arXiv:2311.08851  [pdf, other

    cs.LG cs.CV

    Data Augmentations in Deep Weight Spaces

    Authors: Aviv Shamsian, David W. Zhang, Aviv Navon, Yan Zhang, Miltiadis Kofinas, Idan Achituve, Riccardo Valperga, Gertjan J. Burghouts, Efstratios Gavves, Cees G. M. Snoek, Ethan Fetaya, Gal Chechik, Haggai Maron

    Abstract: Learning in weight spaces, where neural networks process the weights of other deep neural networks, has emerged as a promising research direction with applications in various fields, from analyzing and editing neural fields and implicit neural representations, to network pruning and quantization. Recent works designed architectures for effective learning in that space, which takes into account its… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023 Workshop on Symmetry and Geometry in Neural Representations

  7. arXiv:2310.19708  [pdf, other

    cs.CL cs.LG

    Combining Language Models For Specialized Domains: A Colorful Approach

    Authors: Daniel Eitan, Menachem Pirchi, Neta Glazer, Shai Meital, Gil Ayach, Gidon Krendel, Aviv Shamsian, Aviv Navon, Gil Hetz, Joseph Keshet

    Abstract: General purpose language models (LMs) encounter difficulties when processing domain-specific jargon and terminology, which are frequently utilized in specialized fields such as medicine or industrial settings. Moreover, they often find it challenging to interpret mixed speech that blends general language with specialized jargon. This poses a challenge for automatic speech recognition systems opera… ▽ More

    Submitted 1 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Under Review

  8. arXiv:2310.13397  [pdf, other

    cs.LG

    Equivariant Deep Weight Space Alignment

    Authors: Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik, Nadav Dym, Haggai Maron

    Abstract: Permutation symmetries of deep networks make basic operations like model merging and similarity estimation challenging. In many cases, aligning the weights of the networks, i.e., finding optimal permutations between their weights, is necessary. Unfortunately, weight alignment is an NP-hard problem. Prior research has mainly focused on solving relaxed versions of the alignment problem, leading to e… ▽ More

    Submitted 31 May, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  9. arXiv:2309.08561  [pdf, other

    eess.AS cs.LG cs.SD

    Open-vocabulary Keyword-spotting with Adaptive Instance Normalization

    Authors: Aviv Navon, Aviv Shamsian, Neta Glazer, Gill Hetz, Joseph Keshet

    Abstract: Open vocabulary keyword spotting is a crucial and challenging task in automatic speech recognition (ASR) that focuses on detecting user-defined keywords within a spoken utterance. Keyword spotting methods commonly map the audio utterance and keyword into a joint embedding space to obtain some affinity score. In this work, we propose AdaKWS, a novel method for keyword spotting in which a text encod… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Under Review

  10. arXiv:2307.01683  [pdf, other

    cs.LG cs.AI cs.CV

    Learning Discrete Weights and Activations Using the Local Reparameterization Trick

    Authors: Guy Berger, Aviv Navon, Ethan Fetaya

    Abstract: In computer vision and machine learning, a crucial challenge is to lower the computation and memory demands for neural network inference. A commonplace solution to address this challenge is through the use of binarization. By binarizing the network weights and activations, one can significantly reduce computational complexity by substituting the computationally expensive floating operations with f… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  11. arXiv:2301.13501  [pdf, other

    cs.LG

    Auxiliary Learning as an Asymmetric Bargaining Game

    Authors: Aviv Shamsian, Aviv Navon, Neta Glazer, Kenji Kawaguchi, Gal Chechik, Ethan Fetaya

    Abstract: Auxiliary learning is an effective method for enhancing the generalization capabilities of trained models, particularly when dealing with small datasets. However, this approach may present several difficulties: (i) optimizing multiple objectives can be more challenging, and (ii) how to balance the auxiliary tasks to best assist the main task is unclear. In this work, we propose a novel approach, n… ▽ More

    Submitted 5 June, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  12. arXiv:2301.12780  [pdf, other

    cs.LG

    Equivariant Architectures for Learning in Deep Weight Spaces

    Authors: Aviv Navon, Aviv Shamsian, Idan Achituve, Ethan Fetaya, Gal Chechik, Haggai Maron

    Abstract: Designing machine learning architectures for processing neural networks in their raw weight matrix form is a newly introduced research direction. Unfortunately, the unique symmetry structure of deep weight spaces makes this design very challenging. If successful, such architectures would be capable of performing a wide range of intriguing tasks, from adapting a pre-trained network to a new domain… ▽ More

    Submitted 31 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  13. arXiv:2206.10935  [pdf, other

    cs.LG cs.CV

    A Study on the Evaluation of Generative Models

    Authors: Eyal Betzalel, Coby Penso, Aviv Navon, Ethan Fetaya

    Abstract: Implicit generative models, which do not return likelihood values, such as generative adversarial networks and diffusion models, have become prevalent in recent years. While it is true that these models have shown remarkable results, evaluating their performance is challenging. This issue is of vital importance to push research forward and identify meaningful gains from random noise. Currently, he… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 13 pages

  14. arXiv:2202.01017  [pdf, other

    cs.LG cs.GT

    Multi-Task Learning as a Bargaining Game

    Authors: Aviv Navon, Aviv Shamsian, Idan Achituve, Haggai Maron, Kenji Kawaguchi, Gal Chechik, Ethan Fetaya

    Abstract: In Multi-task learning (MTL), a joint model is trained to simultaneously make predictions for several tasks. Joint training reduces computation costs and improves data efficiency; however, since the gradients of these different tasks may conflict, training a joint model for MTL often yields lower performance than its corresponding single-task counterparts. A common method for alleviating this issu… ▽ More

    Submitted 8 July, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  15. arXiv:2106.15482  [pdf, other

    cs.LG stat.ML

    Personalized Federated Learning with Gaussian Processes

    Authors: Idan Achituve, Aviv Shamsian, Aviv Navon, Gal Chechik, Ethan Fetaya

    Abstract: Federated learning aims to learn a global model that performs well on client devices with limited cross-client communication. Personalized federated learning (PFL) further extends this setup to handle data heterogeneity between clients by learning personalized models. A key challenge in this setting is to learn effectively across clients even though each client has unique data that is often limite… ▽ More

    Submitted 26 October, 2021; v1 submitted 29 June, 2021; originally announced June 2021.

  16. arXiv:2103.04628  [pdf, other

    cs.LG

    Personalized Federated Learning using Hypernetworks

    Authors: Aviv Shamsian, Aviv Navon, Ethan Fetaya, Gal Chechik

    Abstract: Personalized federated learning is tasked with training machine learning models for multiple clients, each with its own data distribution. The goal is to train personalized models in a collaborative way while accounting for data disparities across clients and reducing communication costs. We propose a novel approach to this problem using hypernetworks, termed pFedHN for personalized Federated Hype… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

  17. arXiv:2102.07868  [pdf, other

    cs.LG

    GP-Tree: A Gaussian Process Classifier for Few-Shot Incremental Learning

    Authors: Idan Achituve, Aviv Navon, Yochai Yemini, Gal Chechik, Ethan Fetaya

    Abstract: Gaussian processes (GPs) are non-parametric, flexible, models that work well in many tasks. Combining GPs with deep learning methods via deep kernel learning (DKL) is especially compelling due to the strong representational power induced by the network. However, inference in GPs, whether with or without DKL, can be computationally challenging on large datasets. Here, we propose GP-Tree, a novel me… ▽ More

    Submitted 13 July, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  18. arXiv:2010.04104  [pdf, other

    cs.LG

    Learning the Pareto Front with Hypernetworks

    Authors: Aviv Navon, Aviv Shamsian, Gal Chechik, Ethan Fetaya

    Abstract: Multi-objective optimization (MOO) problems are prevalent in machine learning. These problems have a set of optimal solutions, called the Pareto front, where each point on the front represents a different trade-off between possibly conflicting objectives. Recent MOO methods can target a specific desired ray in loss space however, most approaches still face two grave limitations: (i) A separate mod… ▽ More

    Submitted 26 April, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: Accepted to ICLR 2021

  19. arXiv:2007.02693  [pdf, other

    cs.CV cs.LG stat.ML

    Auxiliary Learning by Implicit Differentiation

    Authors: Aviv Navon, Idan Achituve, Haggai Maron, Gal Chechik, Ethan Fetaya

    Abstract: Training neural networks with auxiliary tasks is a common practice for improving the performance on a main task of interest. Two main challenges arise in this multi-task learning setting: (i) designing useful auxiliary tasks; and (ii) combining auxiliary tasks into a single coherent loss. Here, we propose a novel framework, AuxiLearn, that targets both challenges based on implicit differentiation.… ▽ More

    Submitted 11 May, 2021; v1 submitted 22 June, 2020; originally announced July 2020.

    Comments: Published at ICLR 2021