Showing 1–11 of 11 results for author: Elder, H

Searching in archive cs.
  1. arXiv:2405.13896  [pdf, other]

    cs.CV

    A General Framework for Jersey Number Recognition in Sports Video

    Authors: Maria Koshkina, James H. Elder

    Abstract: Jersey number recognition is an important task in sports video analysis, partly due to its importance for long-term player tracking. It can be viewed as a variant of scene text recognition. However, there is a lack of published attempts to apply scene text recognition models on jersey number data. Here we introduce a novel public jersey number recognition dataset for hockey and study how scene tex…

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2404.16831  [pdf, other]

    cs.CV

    The Third Monocular Depth Estimation Challenge

    Authors: Jaime Spencer, Fabio Tosi, Matteo Poggi, Ripudaman Singh Arora, Chris Russell, Simon Hadfield, Richard Bowden, GuangYuan Zhou, ZhengXin Li, Qiang Rao, Yiping Bao, Xiao Liu, Dohyeong Kim, Jinseong Kim, Myunghyun Kim, Mykola Lavreniuk, Rui Li, Qing Mao, Jiang Wu, Yu Zhu, Jinqiu Sun, Yanning Zhang, Suraj Patni, Aradhye Agarwal, Chetan Arora, et al. (16 additional authors not shown)

    Abstract: This paper discusses the results of the third edition of the Monocular Depth Estimation Challenge (MDEC). The challenge focuses on zero-shot generalization to the challenging SYNS-Patches dataset, featuring complex scenes in natural and indoor settings. As with the previous edition, methods can use any form of supervision, i.e. supervised or self-supervised. The challenge received a total of 19 su…

    Submitted 27 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: To appear in CVPRW2024

  3. arXiv:2207.12934  [pdf, other]

    cs.CV

    A Reliable Online Method for Joint Estimation of Focal Length and Camera Rotation

    Authors: Yiming Qian, James H. Elder

    Abstract: Linear perspective cues deriving from regularities of the built environment can be used to recalibrate both intrinsic and extrinsic camera parameters online, but these estimates can be unreliable due to irregularities in the scene, uncertainties in line segment estimation and background clutter. Here we address this challenge through four initiatives. First, we use the PanoContext panoramic image d…

    Submitted 26 July, 2022; originally announced July 2022.

  4. arXiv:2104.10068  [pdf, other]

    cs.CV

    Contrastive Learning for Sports Video: Unsupervised Player Classification

    Authors: Maria Koshkina, Hemanth Pidaparthy, James H. Elder

    Abstract: We address the problem of unsupervised classification of players in a team sport according to their team affiliation, when jersey colours and design are not known a priori. We adopt a contrastive learning approach in which an embedding network learns to maximize the distance between representations of players on different teams relative to players on the same team, in a purely unsupervised fashion…

    Submitted 3 May, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

  5. arXiv:2007.01711  [pdf, other]

    cs.CV

    Synergistic saliency and depth prediction for RGB-D saliency detection

    Authors: Yue Wang, Yuke Li, James H. Elder, Huchuan Lu, Runmin Wu, Lu Zhang

    Abstract: Depth information available from an RGB-D camera can be useful in segmenting salient objects when figure/ground cues from RGB channels are weak. This has motivated the development of several RGB-D saliency datasets and algorithms that use all four channels of the RGB-D data for both training and inference. Unfortunately, existing RGB-D saliency datasets are small, which may lead to overfitting and…

    Submitted 26 October, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

  6. arXiv:2005.02693  [pdf, other]

    cs.CL

    Shape of synth to come: Why we should use synthetic data for English surface realization

    Authors: Henry Elder, Robert Burke, Alexander O'Connor, Jennifer Foster

    Abstract: The Surface Realization Shared Tasks of 2018 and 2019 were Natural Language Generation shared tasks with the goal of exploring approaches to surface realization from Universal-Dependency-like trees to surface strings for several languages. In the 2018 shared task there was very little difference in the absolute performance of systems trained with and without additional, synthetically created data,…

    Submitted 6 May, 2020; originally announced May 2020.

  7. arXiv:2001.01788  [pdf, other]

    cs.CV

    MCMLSD: A Probabilistic Algorithm and Evaluation Framework for Line Segment Detection

    Authors: James H. Elder, Emilio J. Almazán, Yiming Qian, Ron Tal

    Abstract: Traditional approaches to line segment detection typically involve perceptual grouping in the image domain and/or global accumulation in the Hough domain. Here we propose a probabilistic algorithm that merges the advantages of both approaches. In a first stage lines are detected using a global probabilistic Hough approach. In the second stage each detected line is analyzed in the image domain to l…

    Submitted 6 January, 2020; originally announced January 2020.

  8. arXiv:1911.11981  [pdf, other]

    cs.CV

    Class-Conditional Domain Adaptation on Semantic Segmentation

    Authors: Yue Wang, Yuke Li, James H. Elder, Runmin Wu, Huchuan Lu

    Abstract: Semantic segmentation is an important sub-task for many applications, but pixel-level ground truth labeling is costly and there is a tendency to overfit the training data, limiting generalization. Unsupervised domain adaptation can potentially address these problems, allowing systems trained on labelled datasets from one or more source domains (including less expensive synthetic domains) to be ada…

    Submitted 27 November, 2019; v1 submitted 27 November, 2019; originally announced November 2019.

  9. arXiv:1905.10486  [pdf, other]

    cs.CL

    Designing a Symbolic Intermediate Representation for Neural Surface Realization

    Authors: Henry Elder, Jennifer Foster, James Barry, Alexander O'Connor

    Abstract: Generated output from neural NLG systems often contains errors such as hallucination, repetition or contradiction. This work focuses on designing a symbolic intermediate representation to be used in multi-stage neural generation with the intention of reducing the frequency of failed outputs. We show that surface realization from this intermediate representation is of high quality and when the full…

    Submitted 24 May, 2019; originally announced May 2019.

  10. arXiv:1810.04700  [pdf, other]

    cs.CL cs.AI

    End-to-End Content and Plan Selection for Data-to-Text Generation

    Authors: Sebastian Gehrmann, Falcon Z. Dai, Henry Elder, Alexander M. Rush

    Abstract: Learning to generate fluent natural language from structured data with neural networks has become a common approach for NLG. This problem can be challenging when the form of the structured data varies between examples. This paper presents a survey of several extensions to sequence-to-sequence models to account for the latent content selection process, particularly variants of copy attention and c…

    Submitted 10 October, 2018; originally announced October 2018.

    Comments: INLG 2018

  11. arXiv:1805.07731  [pdf, ps, other]

    cs.CL

    Generating High-Quality Surface Realizations Using Data Augmentation and Factored Sequence Models

    Authors: Henry Elder, Chris Hokamp

    Abstract: This work presents a new state of the art in reconstruction of surface realizations from obfuscated text. We identify the lack of sufficient training data as the major obstacle to training high-performing models, and solve this issue by generating large amounts of synthetic training data. We also propose preprocessing techniques which make the structure contained in the input features more accessi…

    Submitted 20 May, 2018; originally announced May 2018.