Skip to main content

Showing 1–8 of 8 results for author: Perry, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.16251  [pdf, other

    cs.CL cs.AI

    Speakerly: A Voice-based Writing Assistant for Text Composition

    Authors: Dhruv Kumar, Vipul Raheja, Alice Kaiser-Schatzlein, Robyn Perry, Apurva Joshi, Justin Hugues-Nuger, Samuel Lou, Navid Chowdhury

    Abstract: We present Speakerly, a new real-time voice-based writing assistance system that helps users with text composition across various use cases such as emails, instant messages, and notes. The user can interact with the system through instructions or dictation, and the system generates a well-formatted and coherent document. We describe the system architecture and detail how we address the various cha… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 Industry Track

  2. arXiv:2206.02013  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Causal Discovery in Heterogeneous Environments Under the Sparse Mechanism Shift Hypothesis

    Authors: Ronan Perry, Julius von Kügelgen, Bernhard Schölkopf

    Abstract: Machine learning approaches commonly rely on the assumption of independent and identically distributed (i.i.d.) data. In reality, however, this assumption is almost always violated due to distribution shifts between environments. Although valuable learning signals can be provided by heterogeneous data from changing distributions, it is also known that learning under arbitrary (adversarial) changes… ▽ More

    Submitted 15 October, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version. JvK and BS are shared last authors. 10 pages + Bibliography + Appendix (26 pages total)

  3. arXiv:2005.11890  [pdf, other

    stat.ML cs.LG stat.CO

    mvlearn: Multiview Machine Learning in Python

    Authors: Ronan Perry, Gavin Mischler, Richard Guo, Theodore Lee, Alexander Chang, Arman Koul, Cameron Franz, Hugo Richard, Iain Carmichael, Pierre Ablin, Alexandre Gramfort, Joshua T. Vogelstein

    Abstract: As data are generated more and more from multiple disparate sources, multiview data sets, where each sample has features in distinct views, have ballooned in recent years. However, no comprehensive package exists that enables non-specialists to use these methods easily. mvlearn is a Python library which implements the leading multiview machine learning methods. Its simple API closely follows that… ▽ More

    Submitted 25 May, 2021; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: 6 pages, 2 figures, 1 table

  4. arXiv:1910.08883  [pdf, other

    stat.ML cs.LG

    High-dimensional and universally consistent k-sample tests

    Authors: Sambit Panda, Cencheng Shen, Ronan Perry, Jelle Zorn, Antoine Lutz, Carey E. Priebe, Joshua T. Vogelstein

    Abstract: The k-sample testing problem involves determining whether $k$ groups of data points are each drawn from the same distribution. The standard method for k-sample testing in biomedicine is Multivariate analysis of variance (MANOVA), despite that it depends on strong, and often unsuitable, parametric assumptions. Moreover, independence testing and k-sample testing are closely related, and several univ… ▽ More

    Submitted 11 October, 2023; v1 submitted 19 October, 2019; originally announced October 2019.

  5. arXiv:1909.11799  [pdf, other

    cs.LG stat.ML

    Manifold Oblique Random Forests: Towards Closing the Gap on Convolutional Deep Networks

    Authors: Adam Li, Ronan Perry, Chester Huynh, Tyler M. Tomita, Ronak Mehta, Jesus Arroyo, Jesse Patsolic, Benjamin Falk, Joshua T. Vogelstein

    Abstract: Decision forests (Forests), in particular random forests and gradient boosting trees, have demonstrated state-of-the-art accuracy compared to other methods in many supervised learning scenarios. In particular, Forests dominate other methods in tabular data, that is, when the feature space is unstructured, so that the signal is invariant to a permutation of the feature indices. However, in structur… ▽ More

    Submitted 5 September, 2022; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: Updated manuscript based on review at SIMODS

    MSC Class: 68T05

  6. arXiv:1907.00325  [pdf, other

    cs.LG stat.ML

    Random Forests for Adaptive Nearest Neighbor Estimation of Information-Theoretic Quantities

    Authors: Ronan Perry, Ronak Mehta, Richard Guo, Eva Yezerets, Jesús Arroyo, Mike Powell, Hayden Helm, Cencheng Shen, Joshua T. Vogelstein

    Abstract: Information-theoretic quantities, such as conditional entropy and mutual information, are critical data summaries for quantifying uncertainty. Current widely used approaches for computing such quantities rely on nearest neighbor methods and exhibit both strong performance and theoretical guarantees in certain simple scenarios. However, existing approaches fail in high-dimensional settings and when… ▽ More

    Submitted 5 October, 2021; v1 submitted 30 June, 2019; originally announced July 2019.

  7. arXiv:1208.1969  [pdf, ps, other

    cs.CY

    An Internet Approach for Engineering Student Exercises

    Authors: Richard Perry

    Abstract: An approach for engineering student exercises using the Internet is described. In this approach, for a given exercise, each student receives the same problem, but with different data. The exercise content can be static or dynamic, and the dynamic form can be timeless or real-time. The implementation provides immediate feedback to the students, letting them know if their submitted answers are corre… ▽ More

    Submitted 9 August, 2012; originally announced August 2012.

    Comments: Mid-Atlantic ASEE Conference, October 15-16, 2010, Villanova University

  8. arXiv:1208.1906  [pdf, ps, other

    cs.SE

    Batch Spreadsheet for C Programmers

    Authors: Richard Perry

    Abstract: A computing environment is proposed, based on batch spreadsheet processing, which produces a spreadsheet display from plain text input files of commands, similar to the way documents are created using LaTeX. In this environment, besides the usual spreadsheet rows and columns of cells, variables can be defined and are stored in a separate symbol table. Cell and symbol formulas may contain cycles, a… ▽ More

    Submitted 9 August, 2012; originally announced August 2012.

    Comments: (2009) Batch Spreadsheet for C Programmers, International Conference on Scientific Computing (CSC'09)

    MSC Class: CS:se