-
TAME: Task Agnostic Continual Learning using Multiple Experts
Authors:
Haoran Zhu,
Maryam Majzoubi,
Arihant Jain,
Anna Choromanska
Abstract:
The goal of lifelong learning is to continuously learn from non-stationary distributions, where the non-stationarity is typically imposed by a sequence of distinct tasks. Prior works have mostly considered idealistic settings, where the identity of tasks is known at least at training. In this paper we focus on a fundamentally harder, so-called task-agnostic setting where the task identities are not known and the learning machine needs to infer them from the observations. Our algorithm, which we call TAME (Task-Agnostic continual learning using Multiple Experts), automatically detects the shift in data distributions and switches between task expert networks in an online manner. At training, the strategy for switching between tasks hinges on an extremely simple observation: each newly arriving task produces a statistically significant deviation in the value of the loss function that marks the onset of this new task. At inference, the switching between experts is governed by the selector network, which forwards each test sample to its relevant expert network. The selector network is trained on a small subset of data drawn uniformly at random. We control the growth of the task expert networks as well as the selector network by employing online pruning. Our experimental results show the efficacy of our approach on benchmark continual learning data sets, outperforming the previous task-agnostic methods and even the techniques that admit task identities at both training and testing, while at the same time using a comparable model size.
Submitted 2 June, 2024; v1 submitted 7 October, 2022;
originally announced October 2022.
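The training-time switching idea in the TAME abstract above (a new task shows up as a statistically significant jump in the loss) can be illustrated with a minimal sketch. The abstract does not say which statistical test is used, so everything here is an assumption for illustration: the class name `LossShiftDetector`, the z-score rule, and the `threshold` and `warmup` parameters are hypothetical and are not taken from the paper.

```python
import math


class LossShiftDetector:
    """Flags a possible task switch when a loss value sits far above the
    running mean of recent losses (assumed z-score rule, not the paper's test)."""

    def __init__(self, threshold=3.0, warmup=10):
        self.threshold = threshold  # hypothetical cutoff, in standard deviations
        self.warmup = warmup        # losses to observe before testing for a shift
        self.reset()

    def reset(self):
        # Running statistics, maintained with Welford's online algorithm.
        self.n = 0
        self.mean = 0.0
        self.m2 = 0.0

    def update(self, loss):
        """Return True if `loss` looks like the onset of a new task.

        On a detected shift the statistics are reset, and `loss` becomes the
        first observation counted toward the new task.
        """
        shifted = False
        if self.n >= self.warmup:
            std = math.sqrt(self.m2 / (self.n - 1))
            if loss > self.mean + self.threshold * max(std, 1e-8):
                self.reset()
                shifted = True
        # Welford update of the running mean and sum of squared deviations.
        self.n += 1
        delta = loss - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (loss - self.mean)
        return shifted


if __name__ == "__main__":
    detector = LossShiftDetector(threshold=3.0, warmup=10)
    stream = [1.0, 1.1, 0.9, 1.0, 1.05, 0.95, 1.0, 1.1, 0.9, 1.0, 5.0, 5.1]
    for step, loss in enumerate(stream):
        if detector.update(loss):
            print(f"possible task switch at step {step}")
```

In a multi-expert setup, a detected shift would be the point at which training moves to a different expert network; how TAME chooses or creates that expert is not covered by this sketch.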
-
Efficient Contextual Bandits with Continuous Actions
Authors:
Maryam Majzoubi,
Chicheng Zhang,
Rajan Chari,
Akshay Krishnamurthy,
John Langford,
Aleksandrs Slivkins
Abstract:
We create a computationally tractable algorithm for contextual bandits with continuous actions having unknown structure. Our reduction-style algorithm composes with most supervised learning representations. We prove that it works in a general sense and verify the new functionality with large-scale experiments.
Submitted 3 December, 2020; v1 submitted 10 June, 2020;
originally announced June 2020.
-
Analytical Modeling of k33 Mode Partial Electrode Configuration for Loss Characterization
Authors:
Yoonsang Park,
Maryam Majzoubi,
Yuxuan Zhang,
Timo Scholehwar,
Eberhard Hennig,
Kenji Uchino
Abstract:
Accurate determination of the three types of losses (dielectric, elastic and piezoelectric) in piezoelectric materials is critical, since they are closely related to the performance of high-power piezoelectric devices. The standard k33 mode has a number of serious deficits that hinder researchers from determining accurate physical parameters and losses. To overcome these deficits, a partial electrode configuration has been devised and proposed. This study provides the analytical derivation and proposes a parameter determination method that uses the analytical solutions. Compared to finite element analysis, the analytical solutions show a 0.1 % difference in resonance frequencies and a 2 % difference in mechanical quality factors, which supports their validity as a model. The analytical solutions are fitted to experimental data to determine physical parameters and losses. The k33 (electromechanical coupling factor) values were calculated from the curve-fitted values in two different ways, and the results show good agreement with each other.
Submitted 3 January, 2020; v1 submitted 31 December, 2019;
originally announced December 2019.
-
Improvement of the Standard Characterization Method on k33 Mode Piezoelectric Specimens
Authors:
Yoonsang Park,
Yuxuan Zhang,
Maryam Majzoubi,
Timo Scholehwar,
Eberhard Hennig,
Kenji Uchino
Abstract:
Even though the standard method for determining the physical parameters of piezoelectric materials has been established for several decades, little attention has been paid to loss determination methods. Furthermore, several deficits have been recognized in the standard method for the k33 mode. This study discusses the deficits of the IEEE Standard k33 method in detail and introduces a method to resolve them. The standard k33 specimen suffers from small capacitance, which causes large experimental error, as well as intrinsic electrical energy leakage, specimen setup issues, and an inability to directly determine intensive elastic properties. To resolve these issues, the partial electrode method was introduced, and curve fitting was demonstrated to determine physical parameters and losses.
Submitted 3 January, 2020; v1 submitted 31 December, 2019;
originally announced December 2019.
-
LdSM: Logarithm-depth Streaming Multi-label Decision Trees
Authors:
Maryam Majzoubi,
Anna Choromanska
Abstract:
We consider multi-label classification where the goal is to annotate each data point with the most relevant $\textit{subset}$ of labels from an extremely large label set. Efficient annotation can be achieved with balanced tree predictors, i.e. trees with logarithmic depth in the label complexity, whose leaves correspond to labels. Designing a prediction mechanism with such trees for real data applications is non-trivial, as it needs to accommodate sending examples to multiple leaves while at the same time sustaining high prediction accuracy. In this paper we develop the LdSM algorithm for the construction and training of multi-label decision trees, where in every node of the tree we optimize a novel objective function that favors balanced splits, maintains high class purity of children nodes, and allows sending examples to multiple directions but with a penalty that prevents tree over-growth. Each node of the tree is trained once the previous node is completed, leading to a streaming approach for training. We analyze the proposed objective theoretically and show that minimizing it leads to pure and balanced data splits. Furthermore, we show a boosting theorem that captures its connection to the multi-label classification error. Experimental results on benchmark data sets demonstrate that our approach achieves high prediction accuracy and low prediction time and position LdSM as a competitive tool among existing state-of-the-art approaches.
Submitted 10 June, 2020; v1 submitted 24 May, 2019;
originally announced May 2019.