Skip to main content

Showing 1–7 of 7 results for author: Tomita, T M

.
  1. arXiv:2201.13001  [pdf, other

    cs.LG cs.AI cs.DS q-bio.NC stat.ML

    Deep Discriminative to Kernel Density Graph for In- and Out-of-distribution Calibrated Inference

    Authors: Jayanta Dey, Haoyin Xu, Will LeVine, Ashwin De Silva, Tyler M. Tomita, Ali Geisa, Tiffany Chu, Jacob Desman, Joshua T. Vogelstein

    Abstract: Deep discriminative approaches like random forests and deep neural networks have recently found applications in many important real-world scenarios. However, deploying these learning algorithms in safety-critical applications raises concerns, particularly when it comes to ensuring confidence calibration for both in-distribution and out-of-distribution data points. Many popular methods for in-distr… ▽ More

    Submitted 7 June, 2024; v1 submitted 31 January, 2022; originally announced January 2022.

  2. arXiv:2109.14501  [pdf, other

    stat.ML cs.AI cs.LG

    Towards a theory of out-of-distribution learning

    Authors: Jayanta Dey, Ali Geisa, Ronak Mehta, Tyler M. Tomita, Hayden S. Helm, Haoyin Xu, Eric Eaton, Jeffery Dick, Carey E. Priebe, Joshua T. Vogelstein

    Abstract: Learning is a process wherein a learning agent enhances its performance through exposure of experience or data. Throughout this journey, the agent may encounter diverse learning environments. For example, data may be presented to the leaner all at once, in multiple batches, or sequentially. Furthermore, the distribution of each data sample could be either identical and independent (iid) or non-iid… ▽ More

    Submitted 7 June, 2024; v1 submitted 29 September, 2021; originally announced September 2021.

  3. arXiv:2007.13843  [pdf, other

    stat.ML cs.IR cs.LG cs.SI

    Robust Similarity and Distance Learning via Decision Forests

    Authors: Tyler M. Tomita, Joshua T. Vogelstein

    Abstract: Canonical distances such as Euclidean distance often fail to capture the appropriate relationships between items, subsequently leading to subpar inference and prediction. Many algorithms have been proposed for automated learning of suitable distances, most of which employ linear methods to learn a global metric over the feature space. While such methods offer nice theoretical properties, interpret… ▽ More

    Submitted 21 August, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: Submitted to NeurIPS 2020

  4. arXiv:2004.12908  [pdf, other

    cs.AI cs.LG stat.ML

    A Simple Lifelong Learning Approach

    Authors: Joshua T. Vogelstein, Jayanta Dey, Hayden S. Helm, Will LeVine, Ronak D. Mehta, Tyler M. Tomita, Haoyin Xu, Ali Geisa, Qingyang Wang, Gido M. van de Ven, Chenyu Gao, Weiwei Yang, Bryan Tower, Jonathan Larson, Christopher M. White, Carey E. Priebe

    Abstract: In lifelong learning, data are used to improve performance not only on the present task, but also on past and future (unencountered) tasks. While typical transfer learning algorithms can improve performance on future tasks, their performance on prior tasks degrades upon learning new tasks (called forgetting). Many recent approaches for continual or lifelong learning have attempted to maintain perf… ▽ More

    Submitted 11 June, 2024; v1 submitted 27 April, 2020; originally announced April 2020.

  5. arXiv:1909.11799  [pdf, other

    cs.LG stat.ML

    Manifold Oblique Random Forests: Towards Closing the Gap on Convolutional Deep Networks

    Authors: Adam Li, Ronan Perry, Chester Huynh, Tyler M. Tomita, Ronak Mehta, Jesus Arroyo, Jesse Patsolic, Benjamin Falk, Joshua T. Vogelstein

    Abstract: Decision forests (Forests), in particular random forests and gradient boosting trees, have demonstrated state-of-the-art accuracy compared to other methods in many supervised learning scenarios. In particular, Forests dominate other methods in tabular data, that is, when the feature space is unstructured, so that the signal is invariant to a permutation of the feature indices. However, in structur… ▽ More

    Submitted 5 September, 2022; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: Updated manuscript based on review at SIMODS

    MSC Class: 68T05

  6. arXiv:1806.07300  [pdf, other

    cs.PF cs.DC

    Forest Packing: Fast, Parallel Decision Forests

    Authors: James Browne, Tyler M. Tomita, Disa Mhembere, Randal Burns, Joshua T. Vogelstein

    Abstract: Machine learning has an emerging critical role in high-performance computing to modulate simulations, extract knowledge from massive data, and replace numerical models with efficient approximations. Decision forests are a critical tool because they provide insight into model operation that is critical to interpreting learned results. While decision forests are trivially parallelizable, the travers… ▽ More

    Submitted 19 June, 2018; originally announced June 2018.

  7. arXiv:1506.03410  [pdf, other

    stat.ML cs.LG

    Sparse Projection Oblique Randomer Forests

    Authors: Tyler M. Tomita, James Browne, Cencheng Shen, Jaewon Chung, Jesse L. Patsolic, Benjamin Falk, Jason Yim, Carey E. Priebe, Randal Burns, Mauro Maggioni, Joshua T. Vogelstein

    Abstract: Decision forests, including Random Forests and Gradient Boosting Trees, have recently demonstrated state-of-the-art performance in a variety of machine learning settings. Decision forests are typically ensembles of axis-aligned decision trees; that is, trees that split only along feature dimensions. In contrast, many recent extensions to decision forests are based on axis-oblique splits. Unfortuna… ▽ More

    Submitted 3 October, 2019; v1 submitted 10 June, 2015; originally announced June 2015.

    Comments: 31 pages; submitted to Journal of Machine Learning Research for review

    MSC Class: 68T10 ACM Class: I.5.2

    Journal ref: Journal of Machine Learning Research 21(104), 1-39, 2020