Showing 1–20 of 20 results for author: Lea, C

Searching in archive cs.
  1. arXiv:2406.04240 [pdf, other]

    cs.LG cs.CL

    Hypernetworks for Personalizing ASR to Atypical Speech

    Authors: Max Müller-Eberstein, Dianna Yee, Karren Yang, Gautam Varma Mantena, Colin Lea

    Abstract: Parameter-efficient fine-tuning (PEFT) for personalizing automatic speech recognition (ASR) has recently shown promise for adapting general population models to atypical speech. However, these approaches assume a priori knowledge of the atypical speech disorder being adapted for -- the diagnosis of which requires expert knowledge that is not always available. Even given this knowledge, data scarci…

    Submitted 2 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2306.05446 [pdf, other]

    eess.AS cs.AI cs.CL cs.LG

    Latent Phrase Matching for Dysarthric Speech

    Authors: Colin Lea, Dianna Yee, Jaya Narain, Zifang Huang, Lauren Tooley, Jeffrey P. Bigham, Leah Findlater

    Abstract: Many consumer speech recognition systems are not tuned for people with speech disabilities, resulting in poor recognition and user experience, especially for severe speech differences. Recent studies have emphasized interest in personalized speech models from people with atypical speech patterns. We propose a query-by-example-based personalized phrase recognition system that is trained using small…

    Submitted 8 June, 2023; originally announced June 2023.

  3. arXiv:2302.09044 [pdf, other]

    cs.HC

    From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition

    Authors: Colin Lea, Zifang Huang, Lauren Tooley, Jaya Narain, Dianna Yee, Panayiotis Georgiou, Tien Dung Tran, Jeffrey P. Bigham, Leah Findlater

    Abstract: Consumer speech recognition systems do not work as well for many people with speech differences, such as stuttering, relative to the rest of the general population. However, what is not clear is the degree to which these systems do not work, how they can be improved, or how much people want to use them. In this paper, we first address these questions using results from a 61-person survey from people…

    Submitted 27 February, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: CHI 2023

  4. arXiv:2207.07712 [pdf, other]

    cs.HC

    Reflow: Automatically Improving Touch Interactions in Mobile Applications through Pixel-based Refinements

    Authors: Jason Wu, Titus Barik, Xiaoyi Zhang, Colin Lea, Jeffrey Nichols, Jeffrey P. Bigham

    Abstract: Touch is the primary way that users interact with smartphones. However, building mobile user interfaces where touch interactions work well for all users is a difficult problem, because users have different abilities and preferences. We propose a system, Reflow, which automatically applies small, personalized UI adaptations, called refinements -- to mobile app screens to improve touch efficiency. R…

    Submitted 15 July, 2022; originally announced July 2022.

  5. arXiv:2202.07750 [pdf, other]

    eess.AS cs.CL cs.SD

    Nonverbal Sound Detection for Disordered Speech

    Authors: Colin Lea, Zifang Huang, Dhruv Jain, Lauren Tooley, Zeinab Liaghat, Shrinath Thelapurath, Leah Findlater, Jeffrey P. Bigham

    Abstract: Voice assistants have become an essential tool for people with various disabilities because they enable complex phone- or tablet-based interactions without the need for fine-grained motor control, such as with touchscreens. However, these systems are not tuned for the unique characteristics of individuals with speech disorders, including many of those who have a motor-speech disorder, are deaf or…

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: Accepted at ICASSP 2022

  6. arXiv:2106.11759 [pdf, other]

    eess.AS cs.AI cs.CL cs.CV cs.LG cs.SD

    Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

    Authors: Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Sarah Wu, Darren Botten, Ashwini Palekar, Shrinath Thelapurath, Panayiotis Georgiou, Sachin Kajarekar, Jefferey Bigham

    Abstract: Dysfluencies and variations in speech pronunciation can severely degrade speech recognition performance, and for many individuals with moderate-to-severe speech disorders, voice operated systems do not work. Current speech recognition systems are trained primarily with data from fluent speakers and as a consequence do not generalize well to speech with dysfluencies such as sound or word repetition…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: 5 pages, 1 page reference, 2 figures

  7. arXiv:2102.12394 [pdf, other]

    eess.AS cs.SD

    SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter

    Authors: Colin Lea, Vikramjit Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey P. Bigham

    Abstract: The ability to automatically detect stuttering events in speech could help speech pathologists track an individual's fluency over time or help improve speech recognition systems for people with atypical speech patterns. Despite increasing interest in this area, existing public datasets are too small to build generalizable dysfluency detection systems and lack sufficient annotations. In this work,…

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: Accepted to ICASSP 2021

  8. arXiv:2102.03472 [pdf, other]

    cs.SI cs.LG

    Overcoming Bias in Community Detection Evaluation

    Authors: Jeancarlo Campos Leão, Alberto H. F. Laender, Pedro O. S. Vaz de Melo

    Abstract: Community detection is a key task to further understand the function and the structure of complex networks. Therefore, a strategy used to assess this task must be able to avoid biased and incorrect results that might invalidate further analyses or applications that rely on such communities. Two widely used strategies to assess this task are generally known as structural and functional. The structu…

    Submitted 5 February, 2021; originally announced February 2021.

    Comments: 16 pages, 4 figures. This paper has been submitted to Journal of Information and Data Management (JIDM). arXiv admin note: substantial text overlap with arXiv:1909.09903

  9. arXiv:2008.05023 [pdf, other]

    cs.CV

    Audio- and Gaze-driven Facial Animation of Codec Avatars

    Authors: Alexander Richard, Colin Lea, Shugao Ma, Juergen Gall, Fernando de la Torre, Yaser Sheikh

    Abstract: Codec Avatars are a recent class of learned, photorealistic face models that accurately represent the geometry and texture of a person in 3D (i.e., for virtual reality), and are almost indistinguishable from video. In this paper we describe the first approach to animate these parametric models in real-time which could be deployed on commodity virtual reality hardware using audio and/or eye trackin…

    Submitted 11 August, 2020; originally announced August 2020.

  10. arXiv:1909.09903 [pdf, other]

    cs.SI cs.DB cs.LG

    A Multi-Strategy Approach to Overcoming Bias in Community Detection Evaluation

    Authors: Jeancarlo Campos Leão, Alberto H. F. Laender, Pedro O. S. Vaz de Melo

    Abstract: Community detection is key to understand the structure of complex networks. However, the lack of appropriate evaluation strategies for this specific task may produce biased and incorrect results that might invalidate further analyses or applications based on such networks. In this context, the main contribution of this paper is an approach that supports a robust quality evaluation when detecting c…

    Submitted 21 September, 2019; originally announced September 2019.

    Comments: 12 pages, 6 figures, 3 tables. This paper has been submitted to the 34th Brazilian Symposium on Databases, 2019 (SBBD2019)

  11. arXiv:1810.02002 [pdf, other]

    cs.SI physics.soc-ph

    Improving Community Detection by Mining Social Interactions

    Authors: Jeancarlo Campos Leão, Michele Amaral Brandão, Pedro O. S. Vaz de Melo, Alberto H. F. Laender

    Abstract: Social relationships can be divided into different classes based on the regularity with which they occur and the similarity among them. Thus, rare and somewhat similar relationships are random and cause noise in a social network, thus hiding the actual structure of the network and preventing an accurate analysis of it. In this context, in this paper we propose a process to handle social network da…

    Submitted 4 October, 2018; v1 submitted 3 October, 2018; originally announced October 2018.

  12. arXiv:1611.05267 [pdf, other]

    cs.CV

    Temporal Convolutional Networks for Action Segmentation and Detection

    Authors: Colin Lea, Michael D. Flynn, Rene Vidal, Austin Reiter, Gregory D. Hager

    Abstract: The ability to identify and temporally segment fine-grained human actions throughout a video is crucial for robotics, surveillance, education, and beyond. Typical approaches decouple this problem by first extracting local spatiotemporal features from video frames and then feeding them into a temporal classifier that captures high-level temporal patterns. We introduce a new class of temporal models…

    Submitted 16 November, 2016; originally announced November 2016.

  13. arXiv:1608.08242 [pdf, other]

    cs.CV

    Temporal Convolutional Networks: A Unified Approach to Action Segmentation

    Authors: Colin Lea, Rene Vidal, Austin Reiter, Gregory D. Hager

    Abstract: The dominant paradigm for video-based action segmentation is composed of two steps: first, for each frame, compute low-level features using Dense Trajectories or a Convolutional Neural Network that encode spatiotemporal information locally, and second, input these features into a classifier that captures high-level temporal relationships, such as a Recurrent Neural Network (RNN). While often effec…

    Submitted 29 August, 2016; originally announced August 2016.

    Comments: Submitted to the ECCV workshop on "Brave new ideas for motion representations in videos" (http://bravenewmotion.github.io/)

  14. arXiv:1608.02307 [pdf, other]

    cs.CV q-bio.QM

    SANTIAGO: Spine Association for Neuron Topology Improvement and Graph Optimization

    Authors: William Gray Roncal, Colin Lea, Akira Baruah, Gregory D. Hager

    Abstract: Developing automated and semi-automated solutions for reconstructing wiring diagrams of the brain from electron micrographs is important for advancing the field of connectomics. While the ultimate goal is to generate a graph of neuron connectivity, most prior automated methods have focused on volume segmentation rather than explicit graph estimation. In these approaches, one of the key, commonly o…

    Submitted 7 August, 2016; originally announced August 2016.

    Comments: 13 pp

  15. arXiv:1606.06329 [pdf, other]

    cs.CV

    Recognizing Surgical Activities with Recurrent Neural Networks

    Authors: Robert DiPietro, Colin Lea, Anand Malpani, Narges Ahmidi, S. Swaroop Vedula, Gyusung I. Lee, Mija R. Lee, Gregory D. Hager

    Abstract: We apply recurrent neural networks to the task of recognizing surgical activities from robot kinematics. Prior work in this area focuses on recognizing short, low-level activities, or gestures, and has been based on variants of hidden Markov models and conditional random fields. In contrast, we work on recognizing both gestures and longer, higher-level activities, or maneuvers, and we model the map…

    Submitted 22 June, 2016; v1 submitted 20 June, 2016; originally announced June 2016.

    Comments: Conditionally accepted at MICCAI 2016

  16. arXiv:1602.02995 [pdf, other]

    cs.CV cs.RO

    Segmental Spatiotemporal CNNs for Fine-grained Action Segmentation

    Authors: Colin Lea, Austin Reiter, Rene Vidal, Gregory D. Hager

    Abstract: Joint segmentation and classification of fine-grained actions is important for applications of human-robot interaction, video surveillance, and human skill evaluation. However, despite substantial recent progress in large-scale action classification, the performance of state-of-the-art fine-grained action recognition approaches remains low. We propose a model for action segmentation which combines…

    Submitted 30 September, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

    Comments: Updated from the ECCV 2016 version. We fixed an important mathematical error and made the section on segmental inference clearer

  17. arXiv:1111.1599 [pdf]

    cs.CV

    Efficient Hierarchical Markov Random Fields for Object Detection on a Mobile Robot

    Authors: Colin S. Lea, Jason J. Corso

    Abstract: Object detection and classification using video is necessary for intelligent planning and navigation on a mobile robot. However, current methods can be too slow or not sufficient for distinguishing multiple classes. Techniques that rely on binary (foreground/background) labels incorrectly identify areas with multiple overlapping objects as single segment. We propose two Hierarchical Markov Random…

    Submitted 7 November, 2011; originally announced November 2011.

    Comments: 7 pages

  18. Approximate conditional distributions of distances between nodes in a two-dimensional sensor network

    Authors: Rodrigo S. C. Leao, Valmir C. Barbosa

    Abstract: When we represent a network of sensors in Euclidean space by a graph, there are two distances between any two nodes that we may consider. One of them is the Euclidean distance. The other is the distance between the two nodes in the graph, defined to be the number of edges on a shortest path between them. In this paper, we consider a network of sensors placed uniformly at random in a two-dimensio…

    Submitted 17 December, 2008; originally announced December 2008.

    Journal ref: Lecture Notes in Computer Science 5513 (2009), 324-338

  19. arXiv:cs/0505088 [pdf, ps, other]

    cs.DM

    6-cycle double covers of cubic graphs

    Authors: Rodrigo S. C. Leao, Valmir C. Barbosa

    Abstract: A cycle double cover (CDC) of an undirected graph is a collection of the graph's cycles such that every edge of the graph belongs to exactly two cycles. We describe a constructive method for generating all the cubic graphs that have a 6-CDC (a CDC in which every cycle has length 6). As an application of the method, we prove that all such graphs have a Hamiltonian cycle. A sense of direction is a…

    Submitted 17 April, 2009; v1 submitted 31 May, 2005; originally announced May 2005.

    Comments: This version fixes typos and minor technical problems, and updates references

    ACM Class: G.2.2

  20. Minimal chordal sense of direction and circulant graphs

    Authors: R. S. C. Leao, V. C. Barbosa

    Abstract: A sense of direction is an edge labeling on graphs that follows a globally consistent scheme and is known to considerably reduce the complexity of several distributed problems. In this paper, we study a particular instance of sense of direction, called a chordal sense of direction (CSD). In special, we identify the class of k-regular graphs that admit a CSD with exactly k labels (a minimal CSD).…

    Submitted 3 March, 2005; originally announced March 2005.

    ACM Class: G.2.1; G.2.2

    Journal ref: Lecture Notes in Computer Science 4162 (2006), 670-680