Skip to main content

Showing 1–27 of 27 results for author: Vladymyrov, M

.
  1. arXiv:2402.14180  [pdf, other

    cs.LG

    Linear Transformers are Versatile In-Context Learners

    Authors: Max Vladymyrov, Johannes von Oswald, Mark Sandler, Rong Ge

    Abstract: Recent research has demonstrated that transformers, particularly linear attention models, implicitly execute gradient-descent-like algorithms on data provided in-context during their forward inference step. However, their capability in handling more complex problems remains unexplored. In this paper, we prove that any linear transformer maintains an implicit linear model and can be interpreted as… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  2. arXiv:2309.05858  [pdf, other

    cs.LG cs.AI

    Uncovering mesa-optimization algorithms in Transformers

    Authors: Johannes von Oswald, Eyvind Niklasson, Maximilian Schlegel, Sei** Kobayashi, Nicolas Zucchet, Nino Scherrer, Nolan Miller, Mark Sandler, Blaise Agüera y Arcas, Max Vladymyrov, Razvan Pascanu, João Sacramento

    Abstract: Transformers have become the dominant model in deep learning, but the reason for their superior performance is poorly understood. Here, we hypothesize that the strong performance of Transformers stems from an architectural bias towards mesa-optimization, a learned process running within the forward pass of a model consisting of the following two steps: (i) the construction of an internal learning… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  3. arXiv:2301.04584  [pdf, other

    cs.LG cs.CV

    Continual Few-Shot Learning Using HyperTransformers

    Authors: Max Vladymyrov, Andrey Zhmoginov, Mark Sandler

    Abstract: We focus on the problem of learning without forgetting from multiple tasks arriving sequentially, where each task is defined using a few-shot episode of novel or already seen classes. We approach this problem using the recently published HyperTransformer (HT), a Transformer-based hypernetwork that generates specialized task-specific CNN weights directly from the support set. In order to learn from… ▽ More

    Submitted 12 January, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

  4. arXiv:2301.02312  [pdf, other

    cs.LG

    Training trajectories, mini-batch losses and the curious role of the learning rate

    Authors: Mark Sandler, Andrey Zhmoginov, Max Vladymyrov, Nolan Miller

    Abstract: Stochastic gradient descent plays a fundamental role in nearly all applications of deep learning. However its ability to converge to a global minimum remains shrouded in mystery. In this paper we propose to study the behavior of the loss function on fixed mini-batches along SGD trajectories. We show that the loss function on a fixed batch appears to be remarkably convex-like. In particular for Res… ▽ More

    Submitted 1 February, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: 21 pages, 14 figures

  5. arXiv:2212.07677  [pdf, other

    cs.LG cs.AI cs.CL

    Transformers learn in-context by gradient descent

    Authors: Johannes von Oswald, Eyvind Niklasson, Ettore Randazzo, João Sacramento, Alexander Mordvintsev, Andrey Zhmoginov, Max Vladymyrov

    Abstract: At present, the mechanisms of in-context learning in Transformers are not well understood and remain mostly an intuition. In this paper, we suggest that training Transformers on auto-regressive objectives is closely related to gradient-based meta-learning formulations. We start by providing a simple weight construction that shows the equivalence of data transformations induced by 1) a single linea… ▽ More

    Submitted 31 May, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  6. arXiv:2211.15774  [pdf, other

    cs.LG cs.CV

    Decentralized Learning with Multi-Headed Distillation

    Authors: Andrey Zhmoginov, Mark Sandler, Nolan Miller, Gus Kristiansen, Max Vladymyrov

    Abstract: Decentralized learning with private data is a central problem in machine learning. We propose a novel distillation-based decentralized learning technique that allows multiple agents with private non-iid data to learn from each other, without having to share their data, weights or weight updates. Our approach is communication efficient, utilizes an unlabeled public dataset and uses multiple auxilia… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  7. arXiv:2203.15243  [pdf, other

    cs.CV

    Fine-tuning Image Transformers using Learnable Memory

    Authors: Mark Sandler, Andrey Zhmoginov, Max Vladymyrov, Andrew Jackson

    Abstract: In this paper we propose augmenting Vision Transformer models with learnable memory tokens. Our approach allows the model to adapt to new tasks, using few parameters, while optionally preserving its capabilities on previously learned tasks. At each layer we introduce a set of learnable embedding vectors that provide contextual information useful for specific datasets. We call these "memory tokens"… ▽ More

    Submitted 29 March, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: CVPR 2022, to appear

  8. arXiv:2201.05125  [pdf, other

    cs.LG cs.CV

    GradMax: Growing Neural Networks using Gradient Information

    Authors: Utku Evci, Bart van Merriënboer, Thomas Unterthiner, Max Vladymyrov, Fabian Pedregosa

    Abstract: The architecture and the parameters of neural networks are often optimized independently, which requires costly retraining of the parameters whenever the architecture is modified. In this work we instead focus on growing the architecture without requiring costly retraining. We present a method that adds new neurons during training without impacting what is already learned, while improving the trai… ▽ More

    Submitted 7 June, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: ICLR 2022

    Journal ref: International Conference on Learning Representations, 2022

  9. arXiv:2201.04182  [pdf, other

    cs.LG cs.CV

    HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning

    Authors: Andrey Zhmoginov, Mark Sandler, Max Vladymyrov

    Abstract: In this work we propose a HyperTransformer, a Transformer-based model for supervised and semi-supervised few-shot learning that generates weights of a convolutional neural network (CNN) directly from support samples. Since the dependence of a small generated CNN model on a specific task is encoded by a high-capacity Transformer model, we effectively decouple the complexity of the large task space… ▽ More

    Submitted 13 July, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

  10. arXiv:2104.04657  [pdf, other

    cs.LG cs.NE

    Meta-Learning Bidirectional Update Rules

    Authors: Mark Sandler, Max Vladymyrov, Andrey Zhmoginov, Nolan Miller, Andrew Jackson, Tom Madams, Blaise Aguera y Arcas

    Abstract: In this paper, we introduce a new type of generalized neural network where neurons and synapses maintain multiple states. We show that classical gradient-based backpropagation in neural networks can be seen as a special case of a two-state network where one state is used for activations and another for gradients, with update rules derived from the chain rule. In our generalized framework, networks… ▽ More

    Submitted 11 June, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: ICML 2021, 17 pages

  11. arXiv:2011.03395  [pdf, other

    cs.LG stat.ML

    Underspecification Presents Challenges for Credibility in Modern Machine Learning

    Authors: Alexander D'Amour, Katherine Heller, Dan Moldovan, Ben Adlam, Babak Alipanahi, Alex Beutel, Christina Chen, Jonathan Deaton, Jacob Eisenstein, Matthew D. Hoffman, Farhad Hormozdiari, Neil Houlsby, Shaobo Hou, Ghassen Jerfel, Alan Karthikesalingam, Mario Lucic, Yian Ma, Cory McLean, Diana Mincu, Akinori Mitani, Andrea Montanari, Zachary Nado, Vivek Natarajan, Christopher Nielson, Thomas F. Osborne , et al. (15 additional authors not shown)

    Abstract: ML models often exhibit unexpectedly poor behavior when they are deployed in real-world domains. We identify underspecification as a key reason for these failures. An ML pipeline is underspecified when it can return many predictors with equivalently strong held-out performance in the training domain. Underspecification is common in modern ML pipelines, such as those based on deep learning. Predict… ▽ More

    Submitted 24 November, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Updates: Updated statistical analysis in Section 6; Additional citations

  12. Novel tracking approach based on fully-unsupervised disentanglement of the geometrical factors of variation

    Authors: Mykhailo Vladymyrov, Akitaka Ariga

    Abstract: Efficient tracking algorithms are a crucial part of particle tracking detectors. While a lot of work has been done in designing a plethora of algorithms, these usually require tedious tuning for each use case. (Weakly) supervised Machine Learning-based approaches can leverage the actual raw data for maximal performance. Yet in realistic scenarios, sufficient high-quality labeled data is not availa… ▽ More

    Submitted 13 February, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: Accepted for publication in JINST

  13. arXiv:1906.11389  [pdf, ps, other

    cs.LG stat.ML

    No Pressure! Addressing the Problem of Local Minima in Manifold Learning Algorithms

    Authors: Max Vladymyrov

    Abstract: Nonlinear embedding manifold learning methods provide invaluable visual insights into the structure of high-dimensional data. However, due to a complicated nonconvex objective function, these methods can easily get stuck in local minima and their embedding quality can be poor. We propose a natural extension to several manifold learning methods aimed at identifying pressured points, i.e. points stu… ▽ More

    Submitted 27 December, 2019; v1 submitted 26 June, 2019; originally announced June 2019.

    Comments: 10 pages, NeurIPS 2019

  14. DsTau: Study of tau neutrino production with 400 GeV protons from the CERN-SPS

    Authors: Shigeki Aoki, Akitaka Ariga, Tomoko Ariga, Sergey Dmitrievsky, Elena Firu, Dean Forshaw, Tsutomu Fukuda, Yuri Gornushkin, Ali Murat Guler, Maria Haiduc, Koichi Kodama, Masahiro Komatsu, Muhtesem Akif Korkmaz, Umut Kose, Madalina Miloi, Antonio Miucci, Motoaki Miyanishi, Mitsuhiro Nakamura, Toshiyuki Nakano, Alina Neagu, Hiroki Rokujo, Osamu Sato, Elizaveta Sitnikova, Yosuke Suzuki, Tomoki Takao , et al. (5 additional authors not shown)

    Abstract: In the DsTau experiment at the CERN SPS, an independent and direct way to measure tau neutrino production following high energy proton interactions was proposed. As the main source of tau neutrinos is a decay of Ds mesons, produced in proton-nucleus interactions, the project aims at measuring a differential cross section of this reaction. The experimental method is based on a use of high resolutio… ▽ More

    Submitted 8 June, 2019; originally announced June 2019.

    Journal ref: J. High Energ. Phys. 2020, 33 (2020)

  15. arXiv:1802.04074  [pdf, other

    physics.ins-det hep-ex

    Nuclear emulsions for the detection of micrometric-scale fringe patterns: an application to positron interferometry

    Authors: S. Aghion, A. Ariga, M. Bollani, A. Ereditato, R. Ferragut, M. Giammarchi, M. Lodari, C. Pistillo, S. Sala, P. Scampoli, M. Vladymyrov

    Abstract: Nuclear emulsions are capable of very high position resolution in the detection of ionizing particles. This feature can be exploited to directly resolve the micrometric-scale fringe pattern produced by a matter-wave interferometer for low energy positrons (in the 10-20 keV range). We have tested the performance of emulsion films in this specific scenario. Exploiting silicon nitride diffraction gra… ▽ More

    Submitted 11 April, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: 15 pages, 10 figures

  16. arXiv:1708.08700  [pdf, other

    hep-ex physics.ins-det

    Study of tau-neutrino production at the CERN SPS

    Authors: S. Aoki, A. Ariga, T. Ariga, E. Firu, T. Fukuda, Y. Gornushkin, A. M. Guler, M. Haiduc, K. Kodama, M. A. Korkmaz, U. Kose, M. Nakamura, T. Nakano, A. T. Neagu, H. Rokujo, O. Sato, S. Vasina, M. Vladymyrov, M. Yoshimoto

    Abstract: The DsTau project proposes to study tau-neutrino production in high-energy proton interactions. The outcome of this experiment are prerequisite for measuring the $ν_τ$ charged-current cross section that has never been well measured. Precisely measuring the cross section would enable testing of lepton universality in $ν_τ$ scattering and it also has practical implications for neutrino oscillation e… ▽ More

    Submitted 29 August, 2017; originally announced August 2017.

    Comments: CERN-SPSC-2017-029 (SPSC-P-354)

  17. arXiv:1701.06306  [pdf, other

    hep-ex physics.ins-det

    Measurement of antiproton annihilation on Cu, Ag and Au with emulsion films

    Authors: S. Aghion, C. Amsler, A. Ariga, T. Ariga, G. Bonomi, P. Braunig, R. S. Brusa, L. Cabaret, M. Caccia, R. Caravita, F. Castelli, G. Cerchiari, D. Comparat, G. Consolati, A. Demetrio, L. Di Noto, M. Doser, A. Ereditato, C. Evans, R. Ferragut, J. Fesel, A. Fontana, S. Gerber, M. Giammarchi, A. Gligorova , et al. (47 additional authors not shown)

    Abstract: The characteristics of low energy antiproton annihilations on nuclei (e.g. hadronization and product multiplicities) are not well known, and Monte Carlo simulation packages that use different models provide different descriptions of the annihilation events. In this study, we measured the particle multiplicities resulting from antiproton annihilations on nuclei. The results were compared with predi… ▽ More

    Submitted 23 April, 2017; v1 submitted 23 January, 2017; originally announced January 2017.

    Journal ref: 2017 JINST 12 P04021

  18. arXiv:1604.04199  [pdf, other

    astro-ph.IM physics.ins-det

    NEWS: Nuclear Emulsions for WIMP Search

    Authors: A. Aleksandrov, A. Anokhina, T. Asada, D. Bender, I. Bodnarchuk, A. Buonaura, S. Buontempo, M. Chernyavskii, A. Chukanov, L. Consiglio, N. D'Ambrosio, G. De Lellis, M. De Serio, A. Di Crescenzo, N. Di Marco, S. Dmitrievski, T. Dzhatdoev, R. A. Fini, S. Furuya, G. Galati, V. Gentile, S. Gorbunov, Y. Gornushkin, A. M. Guler, H. Ichiki , et al. (34 additional authors not shown)

    Abstract: Nowadays there is compelling evidence for the existence of dark matter in the Universe. A general consensus has been expressed on the need for a directional sensitive detector to confirm, with a complementary approach, the candidates found in conventional searches and to finally extend their sensitivity beyond the limit of neutrino-induced background. We propose here the use of a detector based on… ▽ More

    Submitted 14 April, 2016; originally announced April 2016.

  19. Extra-large crystal emulsion detectors for future large-scale experiments

    Authors: T. Ariga, A. Ariga, K. Kuwabara, K. Morishima, M. Moto, A. Nishio, P. Scampoli, M. Vladymyrov

    Abstract: Photographic emulsion is a particle tracking device which features the best spatial resolution among particle detectors. For certain applications, for example muon radiography, large-scale detectors are required. Therefore, a huge surface has to be analyzed by means of automated optical microscopes. An improvement of the readout speed is then a crucial point to make these applications possible and… ▽ More

    Submitted 11 February, 2016; v1 submitted 4 December, 2015; originally announced December 2015.

    Comments: Version accepted for publication in JINST

    Journal ref: JINST 11 (2016) P03003

  20. Discovery of tau neutrino appearance in the CNGS neutrino beam with the OPERA experiment

    Authors: OPERA Collaboration, N. Agafonova, A. Aleksandrov, A. Anokhina, S. Aoki, A. Ariga, T. Ariga, D. Bender, A. Bertolin, I. Bodnarchuk, C. Bozza, R. Brugnera, A. Buonaura, S. Buontempo, B. Büttner, M. Chernyavsky, A. Chukanov, L. Consiglio, N. D'Ambrosio, G. De Lellis, M. De Serio, P. Del Amo Sanchez, A. Di Crescenzo, D. Di Ferdinando, N. Di Marco , et al. (117 additional authors not shown)

    Abstract: The OPERA experiment was designed to search for $ν_μ \rightarrow ν_τ$ oscillations in appearance mode, i.e. by detecting the $τ$-leptons produced in charged current $ν_τ$ interactions. The experiment took data from 2008 to 2012 in the CERN Neutrinos to Gran Sasso beam. The observation of $ν_μ \rightarrow ν_τ$ appearance, achieved with four candidate events in a sub-sample of the data, was previous… ▽ More

    Submitted 2 November, 2015; v1 submitted 6 July, 2015; originally announced July 2015.

    Comments: 7 pages, 3 figures

    Journal ref: Phys. Rev. Lett. 115, 121802 (2015)

  21. arXiv:1504.04956  [pdf, other

    physics.ins-det hep-ex

    A facility to Search for Hidden Particles (SHiP) at the CERN SPS

    Authors: SHiP Collaboration, M. Anelli, S. Aoki, G. Arduini, J. J. Back, A. Bagulya, W. Baldini, A. Baranov, G. J. Barker, S. Barsuk, M. Battistin, J. Bauche, A. Bay, V. Bayliss, L. Bellagamba, G. Bencivenni, M. Bertani, O. Bezshyyko, D. Bick, N. Bingefors, A. Blondel, M. Bogomilov, A. Boyarsky, D. Bonacorsi, D. Bondarenko , et al. (211 additional authors not shown)

    Abstract: A new general purpose fixed target facility is proposed at the CERN SPS accelerator which is aimed at exploring the domain of hidden particles and make measurements with tau neutrinos. Hidden particles are predicted by a large number of models beyond the Standard Model. The high intensity of the SPS 400~GeV beam allows probing a wide variety of models containing light long-lived exotic particles w… ▽ More

    Submitted 20 April, 2015; originally announced April 2015.

    Comments: Technical Proposal

    Report number: CERN-SPSC-2015-016, SPSC-P-350, 8 April 2015

  22. arXiv:1503.07471  [pdf, other

    hep-ph hep-ex physics.ins-det

    Search for Sterile Neutrinos in the Muon Neutrino Disappearance Mode at FNAL

    Authors: A. Anokhina, A. Bagulya, M. Benettoni, P. Bernardini, R. Brugnera, M. Calabrese, A. Cecchetti, S. Cecchini, M. Chernyavskiy, F. Dal Corso, O. Dalkarov, A. Del Prete, G. De Robertis, M. De Serio, D. Di Ferdinando, S. Dusini, T. Dzhatdoev, R. A. Fini, G. Fiore, A. Garfagnini, M. Guerzoni, B. Klicek, U. Kose, K. Jakovcic, G. Laurenti , et al. (39 additional authors not shown)

    Abstract: The NESSiE Collaboration has been setup to undertake a conclusive experiment to clarify the {\em muon--neutrino disappearance} measurements at short baselines in order to put severe constraints to models with more than the three--standard neutrinos. To this aim the current FNAL--Booster neutrino beam for a Short--Baseline experiment was carefully evaluated by considering the use of magnetic spectr… ▽ More

    Submitted 2 February, 2017; v1 submitted 25 March, 2015; originally announced March 2015.

    Comments: 19 pages, published in EPJC

    Journal ref: Eur. Phys. J. C (2017) 77:23

  23. Limits on muon-neutrino to tau-neutrino oscillations induced by a sterile neutrino state obtained by OPERA at the CNGS beam

    Authors: OPERA Collaboration, N. Agafonova, A. Aleksandrov, A. Anokhina, S. Aoki, A. Ariga, T. Ariga, D. Bender, A. Bertolin, I. Bodnarchuk, C. Bozza, R. Brugnera, A. Buonaura, S. Buontempo, B. Büttner, M. Chernyavsky, A. Chukanov, L. Consiglio, N. D'Ambrosio, G. De Lellis, M. De Serio, P. Del Amo Sanchez, A. Di Crescenzo, D. Di Ferdinando, N. Di Marco , et al. (106 additional authors not shown)

    Abstract: The OPERA experiment, exposed to the CERN to Gran Sasso $ν_μ$ beam, collected data from 2008 to 2012. Four oscillated $ν_τ$ Charged Current interaction candidates have been detected in appearance mode, which are consistent with $ν_μ\to ν_τ$ oscillations at the atmospheric $Δm^2$ within the "standard" three-neutrino framework. In this paper, the OPERA $ν_τ$ appearance results are used to derive lim… ▽ More

    Submitted 14 March, 2015; v1 submitted 6 March, 2015; originally announced March 2015.

    Comments: 11 pages, 4 figures; reference to Planck result updated in the Introduction. Submitted to JHEP

  24. arXiv:1410.3980  [pdf, other

    hep-ph hep-ex physics.ins-det

    The NESSiE way to searches for sterile neutrinos at FNAL

    Authors: L. Stanco, A. Anokhina, A. Bagulya, M. Benettoni, P. Bernardini, R. Brugnera, M. Calabrese, A. Cecchetti, S. Cecchini, M. Chernyavskiy, P. Creti, F. Dal Corso, O. Dalkarov, A. Del Prete, G. De Robertis, M. De Serio, L. Degli Esposti, D. Di Ferdinando, S. Dusini, T. Dzhatdoev, C. Fanin, R. A. Fini, G. Fiore, A. Garfagnini, S. Golovanov , et al. (44 additional authors not shown)

    Abstract: Neutrino physics is nowadays receiving more and more attention as a possible source of information for the long-standing problem of new physics beyond the Standard Model. The recent measurement of the mixing angle $θ_{13}$ in the standard mixing oscillation scenario encourages us to pursue the still missing results on leptonic CP violation and absolute neutrino masses. However, puzzling measuremen… ▽ More

    Submitted 15 October, 2014; originally announced October 2014.

    Comments: Proceeding of ICHEP2014, 2-7 July 2014, Valencia (Spain), 12 pages, 6 figures

  25. arXiv:1404.2521  [pdf, other

    hep-ph hep-ex physics.ins-det

    Prospects for the measurement of muon-neutrino disappearance at the FNAL-Booster

    Authors: A. Anokhina, A. Bagulya, M. Benettoni, P. Bernardini, R. Brugnera, M. Calabrese, A. Cecchetti, S. Cecchini, M. Chernyavskiy, P. Creti, F. Dal Corso, O. Dalkarov, A. Del Prete, G. De Robertis, M. De Serio, L. Degli Esposti, D. Di Ferdinando, S. Dusini, T. Dzhatdoev, C. Fanin, R. A. Fini, G. Fiore, A. Garfagnini, S. Golovanov, M. Guerzoni , et al. (44 additional authors not shown)

    Abstract: Neutrino physics is nowadays receiving more and more attention as a possible source of information for the long-standing problem of new physics beyond the Standard Model. The recent measurement of the mixing angle $θ_{13}$ in the standard mixing oscillation scenario encourages us to pursue the still missing results on leptonic CP violation and absolute neutrino masses. However, puzzling measuremen… ▽ More

    Submitted 9 April, 2014; originally announced April 2014.

    Comments: 76 pages, 52 figures

  26. arXiv:1312.1227  [pdf, other

    hep-ph physics.ins-det

    The NESSiE Concept for Sterile Neutrinos

    Authors: L. Stanco, A. Anokhina, A. Bagulya, M. Benettoni, P. Bernardini, A. Bertolin, R. Brugnera, M. Calabrese, A. Cecchetti, S. Cecchini, M. Chernyavskiy, G. Collazuol, P. Creti, F. Dal Corso, O. Dalkarov, A. Del Prete, I. De Mitri, G. De Robertis, M. De Serio, L. Degli Esposti, D. Di Ferdinando, U. Dore, S. Dusini, T. Dzhatdoev, C. Fanin , et al. (56 additional authors not shown)

    Abstract: Neutrino physics is nowadays receiving more and more attention as a possible source of information for the long-standing problem of new physics beyond the Standard Model. The recent measurement of the third mixing angle theta13 in the standard mixing oscillation scenario encourages us to pursue the still missing results on leptonic CP violation and absolute neutrino masses. However, several puzzli… ▽ More

    Submitted 4 December, 2013; originally announced December 2013.

    Comments: 13 pages, Proceedings of XV Workshop on Neutrino Telescopes, 11-15 March 2013, Venice, Italy

  27. arXiv:1206.4646  [pdf

    cs.LG stat.ML

    Partial-Hessian Strategies for Fast Learning of Nonlinear Embeddings

    Authors: Max Vladymyrov, Miguel Carreira-Perpinan

    Abstract: Stochastic neighbor embedding (SNE) and related nonlinear manifold learning algorithms achieve high-quality low-dimensional representations of similarity data, but are notoriously slow to train. We propose a generic formulation of embedding algorithms that includes SNE and other existing algorithms, and study their relation with spectral methods and graph Laplacians. This allows us to define sever… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012