Showing 1–2 of 2 results for author: Newell, A

Search v0.5.6 released 2020-02-24

arXiv:1908.04339 [pdf, other]

cs.LG cs.CV stat.ML

Feature Partitioning for Efficient Multi-Task Architectures

Authors: Alejandro Newell, Lu Jiang, Chong Wang, Li-Jia Li, Jia Deng

Abstract: Multi-task learning holds the promise of less data, parameters, and time than training of separate models. We propose a method to automatically search over multi-task architectures while taking resource constraints into consideration. We propose a search space that compactly represents different parameter sharing strategies. This provides more effective coverage and sampling of the space of multi-… ▽ More Multi-task learning holds the promise of less data, parameters, and time than training of separate models. We propose a method to automatically search over multi-task architectures while taking resource constraints into consideration. We propose a search space that compactly represents different parameter sharing strategies. This provides more effective coverage and sampling of the space of multi-task architectures. We also present a method for quick evaluation of different architectures by using feature distillation. Together these contributions allow us to quickly optimize for efficient multi-task models. We benchmark on Visual Decathlon, demonstrating that we can automatically search for and identify multi-task architectures that effectively make trade-offs between task resource requirements while achieving a high level of final performance. △ Less

Submitted 12 August, 2019; originally announced August 2019.
arXiv:1401.1608 [pdf, ps, other]

stat.AP

doi 10.1214/13-AOAS671

An algorithm for deciding the number of clusters and validation using simulated data with application to exploring crop population structure

Authors: Mark A. Newell, Dianne Cook, Heike Hofmann, Jean-Luc Jannink

Abstract: A first step in exploring population structure in crop plants and other organisms is to define the number of subpopulations that exist for a given data set. The genetic marker data sets being generated have become increasingly large over time and commonly are of the high-dimension, low sample size (HDLSS) situation. An algorithm for deciding the number of clusters is proposed, and is validated on… ▽ More A first step in exploring population structure in crop plants and other organisms is to define the number of subpopulations that exist for a given data set. The genetic marker data sets being generated have become increasingly large over time and commonly are of the high-dimension, low sample size (HDLSS) situation. An algorithm for deciding the number of clusters is proposed, and is validated on simulated data sets varying in both the level of structure and the number of clusters covering the range of variation observed empirically. The algorithm was then tested on six empirical data sets across three small grain species. The algorithm uses bootstrap**, three methods of clustering, and defines the optimum number of clusters based on a common criterion, the Hubert's gamma statistic. Validation on simulated sets coupled with testing on empirical sets suggests that the algorithm can be used for a wide variety of genetic data sets. △ Less

Submitted 8 January, 2014; originally announced January 2014.

Comments: Published in at http://dx.doi.org/10.1214/13-AOAS671 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS671

Journal ref: Annals of Applied Statistics 2013, Vol. 7, No. 4, 1898-1916

Search v0.5.6 released 2020-02-24