Showing 1–13 of 13 results for author: Hocking, T

  1. arXiv:2309.15225  [pdf, other]

    cs.LG q-bio.QM

    Cross-Validation for Training and Testing Co-occurrence Network Inference Algorithms

    Authors: Daniel Agyapong, Jeffrey Ryan Propster, Jane Marks, Toby Dylan Hocking

    Abstract: Microorganisms are found in almost every environment, including the soil, water, air, and inside other organisms, like animals and plants. While some microorganisms cause diseases, most of them help in biological processes such as decomposition, fermentation and nutrient cycling. A lot of research has gone into studying microbial communities in various environments and how their interactions and r…

    Submitted 26 September, 2023; originally announced September 2023.

  2. arXiv:2302.11062  [pdf, other]

    cs.LG cs.AI

    A Log-linear Gradient Descent Algorithm for Unbalanced Binary Classification using the All Pairs Squared Hinge Loss

    Authors: Kyle R. Rust, Toby D. Hocking

    Abstract: Receiver Operating Characteristic (ROC) curves are plots of true positive rate versus false positive rate which are used to evaluate binary classification algorithms. Because the Area Under the Curve (AUC) is a piecewise constant function of the predicted values, learning algorithms instead optimize convex relaxations which involve a sum over all pairs of labeled positive and negative examples. Naive learni…

    Submitted 21 February, 2023; originally announced February 2023.

  3. arXiv:2210.02580  [pdf, other]

    cs.LG stat.ML

    Functional Labeled Optimal Partitioning

    Authors: Toby D. Hocking, Jacob M. Kaufman, Alyssa J. Stenberg

    Abstract: Peak detection is a problem in sequential data analysis that involves differentiating regions with higher counts (peaks) from regions with lower counts (background noise). It is crucial to correctly predict areas that deviate from the background noise, in both the train and test sets of labels. Dynamic programming changepoint algorithms have been proposed to solve the peak detection problem by…

    Submitted 5 October, 2022; originally announced October 2022.

  4. arXiv:2107.01285  [pdf, other]

    stat.ML cs.LG

    Optimizing ROC Curves with a Sort-Based Surrogate Loss Function for Binary Classification and Changepoint Detection

    Authors: Jonathan Hillman, Toby Dylan Hocking

    Abstract: Receiver Operating Characteristic (ROC) curves are plots of true positive rate versus false positive rate which are useful for evaluating binary classification models, but difficult to use for learning since the Area Under the Curve (AUC) is non-convex. ROC curves can also be used in other problems that have false positive and true positive rates such as changepoint detection. We show that in this…

    Submitted 2 July, 2021; originally announced July 2021.

  5. A Greedy Graph Search Algorithm Based on Changepoint Analysis for Automatic QRS Complex Detection

    Authors: Atiyeh Fotoohinasab, Toby Hocking, Fatemeh Afghah

    Abstract: The electrocardiogram (ECG) signal is the most widely used non-invasive tool for the investigation of cardiovascular diseases. Automatic delineation of ECG fiducial points, in particular the R-peak, serves as the basis for ECG processing and analysis. This study proposes a new method of ECG signal analysis by introducing a new class of graphical models based on optimal changepoint detection models…

    Submitted 6 February, 2021; originally announced February 2021.

  6. arXiv:2102.01319  [pdf, other]

    eess.SP cs.LG

    A Graph-Constrained Changepoint Learning Approach for Automatic QRS-Complex Detection

    Authors: Atiyeh Fotoohinasab, Toby Hocking, Fatemeh Afghah

    Abstract: This study presents a new viewpoint on ECG signal analysis by applying a graph-based changepoint detection model to locate R-peak positions. This model is based on a new graph learning algorithm to learn the constraint graph given the labeled ECG data. The proposed learning algorithm starts with a simple initial graph and iteratively edits the graph so that the final graph has the maximum accuracy…

    Submitted 6 February, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: accepted in Asilomar 2020 conference

  7. Chatbots language design: the influence of language variation on user experience

    Authors: Ana Paula Chaves, Jesse Egbert, Toby Hocking, Eck Doerry, Marco Aurelio Gerosa

    Abstract: Chatbots are often designed to mimic social roles attributed to humans. However, little is known about the impact on user's perceptions of using language that fails to conform to the associated social role. Our research draws on sociolinguistic theory to investigate how a chatbot's language choices can adhere to the expected social role the agent performs within a given context. In doing so, we se…

    Submitted 26 January, 2021; originally announced January 2021.

    Journal ref: 7th International Conference on Human-Agent Interaction (HAI 2019), Kyoto, Japan

  8. arXiv:2006.13967  [pdf, other]

    stat.ML cs.LG

    Labeled Optimal Partitioning

    Authors: Toby Dylan Hocking, Anuraag Srivastava

    Abstract: In data sequences measured over space or time, an important problem is accurate detection of abrupt changes. In partially labeled data, it is important to correctly predict presence/absence of changes in positive/negative labeled regions, in both the train and test sets. One existing dynamic programming algorithm is designed for prediction in unlabeled test regions (and ignores the labels in the t…

    Submitted 24 June, 2020; originally announced June 2020.

  9. arXiv:2006.04920  [pdf, other]

    cs.LG stat.ML

    Survival regression with accelerated failure time model in XGBoost

    Authors: Avinash Barnwal, Hyunsu Cho, Toby Dylan Hocking

    Abstract: Survival regression is used to estimate the relation between time-to-event and feature variables, and is important in application domains such as medicine, marketing, risk management and sales management. Nonlinear tree based machine learning algorithms as implemented in libraries such as XGBoost, scikit-learn, LightGBM, and CatBoost are often more accurate in practice than linear models. However,…

    Submitted 21 August, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

  10. arXiv:2004.13558  [pdf, other]

    eess.SP cs.LG stat.ML

    A Graph-constrained Changepoint Detection Approach for ECG Segmentation

    Authors: Atiyeh Fotoohinasab, Toby Hocking, Fatemeh Afghah

    Abstract: Electrocardiogram (ECG) signal is the most commonly used non-invasive tool in the assessment of cardiovascular diseases. Segmentation of the ECG signal to locate its constitutive waves, in particular the R-peaks, is a key step in ECG processing and analysis. Over the years, several segmentation and QRS complex detection algorithms have been proposed with different features; however, their performa…

    Submitted 24 April, 2020; originally announced April 2020.

  11. arXiv:2003.02808  [pdf, other]

    cs.LG cs.DS stat.ML

    Linear time dynamic programming for the exact path of optimal models selected from a finite set

    Authors: Toby Hocking, Joseph Vargovich

    Abstract: Many learning algorithms are formulated in terms of finding model parameters which minimize a data-fitting loss function plus a regularizer. When the regularizer involves the l0 pseudo-norm, the resulting regularization path consists of a finite set of models. The fastest existing algorithm for computing the breakpoints in the regularization path is quadratic in the number of models, so it scales…

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: 14 pages

  12. arXiv:1710.04234  [pdf, other]

    stat.ML cs.DS cs.LG stat.AP

    Maximum Margin Interval Trees

    Authors: Alexandre Drouin, Toby Dylan Hocking, François Laviolette

    Abstract: Learning a regression function using censored or interval-valued output data is an important problem in fields such as genomics and medicine. The goal is to learn a real-valued prediction function, and the training output labels indicate an interval of possible values. Whereas most existing algorithms for this task are linear models, in this paper we investigate learning nonlinear tree models. We…

    Submitted 27 October, 2017; v1 submitted 11 October, 2017; originally announced October 2017.

    Comments: Accepted for presentation at the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA

  13. arXiv:1401.8008  [pdf, ps, other]

    stat.ML cs.LG

    Support vector comparison machines

    Authors: David Venuto, Toby Dylan Hocking, Lakjaree Sphanurattana, Masashi Sugiyama

    Abstract: In ranking problems, the goal is to learn a ranking function from labeled pairs of input points. In this paper, we consider the related comparison problem, where the label indicates which element of the pair is better, or if there is no significant difference. We cast the learning problem as a margin maximization, and show that it can be solved by converting it to a standard SVM. We use simulated…

    Submitted 23 July, 2020; v1 submitted 30 January, 2014; originally announced January 2014.