Skip to main content

Showing 1–6 of 6 results for author: Stephens, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.16348  [pdf, other

    cs.CL cs.CY cs.HC

    Improving the TENOR of Labeling: Re-evaluating Topic Models for Content Analysis

    Authors: Zongxia Li, Andrew Mao, Daniel Stephens, Pranav Goel, Emily Walpole, Alden Dima, Juan Fung, Jordan Boyd-Graber

    Abstract: Topic models are a popular tool for understanding text collections, but their evaluation has been a point of contention. Automated evaluation metrics such as coherence are often used, however, their validity has been questioned for neural topic models (NTMs) and can overlook a models benefits in real world applications. To this end, we conduct the first evaluation of neural, supervised and classic… ▽ More

    Submitted 19 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 19 pages, 5 tables, 6 figures, Accepted to EACL Main Conference 2024

  2. arXiv:2306.11908  [pdf, other

    stat.ML cs.LG stat.ME

    Accelerating Generalized Random Forests with Fixed-Point Trees

    Authors: David Fleischer, David A. Stephens, Archer Yang

    Abstract: Generalized random forests arXiv:1610.01271 build upon the well-established success of conventional forests (Breiman, 2001) to offer a flexible and powerful non-parametric method for estimating local solutions of heterogeneous estimating equations. Estimators are constructed by leveraging random forests as an adaptive kernel weighting algorithm and implemented through a gradient-based tree-growing… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 22 pages, 5 figures

  3. arXiv:2108.05792  [pdf, other

    cs.RO eess.SY

    From market-ready ROVs to low-cost AUVs

    Authors: Jonatan Scharff Willners, Ignacio Carlucho, Tomasz Łuczyński, Sean Katagiri, Chandler Lemoine, Joshua Roe, Dylan Stephens, Shida Xu, Yaniel Carreno, Èric Pairet, Corina Barbalata, Yvan Petillot, Sen Wang

    Abstract: Autonomous Underwater Vehicles (AUVs) are becoming increasingly important for different types of industrial applications. The generally high cost of (AUVs) restricts the access to them and therefore advances in research and technological development. However, recent advances have led to lower cost commercially available Remotely Operated Vehicles (ROVs), which present a platform that can be enhanc… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

  4. arXiv:2103.12293  [pdf, other

    math.OC cs.LG stat.ML

    Stochastic Reweighted Gradient Descent

    Authors: Ayoub El Hanchi, David A. Stephens

    Abstract: Despite the strong theoretical guarantees that variance-reduced finite-sum optimization algorithms enjoy, their applicability remains limited to cases where the memory overhead they introduce (SAG/SAGA), or the periodic full gradient computation they require (SVRG/SARAH) are manageable. A promising approach to achieving variance reduction while avoiding these drawbacks is the use of importance sam… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

  5. arXiv:2103.12243  [pdf, other

    cs.LG math.OC stat.ML

    Adaptive Importance Sampling for Finite-Sum Optimization and Sampling with Decreasing Step-Sizes

    Authors: Ayoub El Hanchi, David A. Stephens

    Abstract: Reducing the variance of the gradient estimator is known to improve the convergence rate of stochastic gradient-based optimization and sampling algorithms. One way of achieving variance reduction is to design importance sampling strategies. Recently, the problem of designing such schemes was formulated as an online learning problem with bandit feedback, and algorithms with sub-linear static regret… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: Advances in Neural Information Processing Systems, Dec 2020, Vancouver, Canada

  6. arXiv:1812.00528  [pdf, ps, other

    cs.LG q-bio.PE stat.ML

    Modeling disease progression in longitudinal EHR data using continuous-time hidden Markov models

    Authors: Aman Verma, Guido Powell, Yu Luo, David Stephens, David L. Buckeridge

    Abstract: Modeling disease progression in healthcare administrative databases is complicated by the fact that patients are observed only at irregular intervals when they seek healthcare services. In a longitudinal cohort of 76,888 patients with chronic obstructive pulmonary disease (COPD), we used a continuous-time hidden Markov model with a generalized linear model to model healthcare utilization events. W… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/145