Skip to main content

Showing 1–47 of 47 results for author: Hilton, A

.
  1. arXiv:2406.14412  [pdf, other

    cs.CV

    Benchmarking Monocular 3D Dog Pose Estimation Using In-The-Wild Motion Capture Data

    Authors: Moira Shooter, Charles Malleson, Adrian Hilton

    Abstract: We introduce a new benchmark analysis focusing on 3D canine pose estimation from monocular in-the-wild images. A multi-modal dataset 3DDogs-Lab was captured indoors, featuring various dog breeds trotting on a walkway. It includes data from optical marker-based mocap systems, RGBD cameras, IMUs, and a pressure mat. While providing high-quality motion data, the presence of optical markers and limite… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 5 pages, 8 figures, including supplementary, CV4Animals Workshop 2024 (CVPRW)

  2. arXiv:2406.06499  [pdf, other

    cs.CV cs.HC

    NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative

    Authors: Asmar Nadeem, Faegheh Sardari, Robert Dawes, Syed Sameed Husain, Adrian Hilton, Armin Mustafa

    Abstract: Existing video captioning benchmarks and models lack coherent representations of causal-temporal narrative, which is sequences of events linked through cause and effect, unfolding over time and driven by characters or agents. This lack of narrative restricts models' ability to generate text descriptions that capture the causal and temporal dynamics inherent in video content. To address this gap, w… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2406.06187  [pdf, other

    cs.CV

    An Effective-Efficient Approach for Dense Multi-Label Action Detection

    Authors: Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton

    Abstract: Unlike the sparse label action detection task, where a single action occurs in each timestamp of a video, in a dense multi-label scenario, actions can overlap. To address this challenging task, it is necessary to simultaneously learn (i) temporal dependencies and (ii) co-occurrence action relationships. Recent approaches model temporal information by extracting multi-scale features through hierarc… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 14 pages. arXiv admin note: substantial text overlap with arXiv:2308.05051

  4. arXiv:2406.04251  [pdf, other

    cs.CV

    Gaussian Splatting with Localized Points Management

    Authors: Haosen Yang, Chenhao Zhang, Wenqing Wang, Marco Volino, Adrian Hilton, Li Zhang, Xiatian Zhu

    Abstract: Point management is a critical component in optimizing 3D Gaussian Splatting (3DGS) models, as the point initiation (e.g., via structure from motion) is distributionally inappropriate. Typically, the Adaptive Density Control (ADC) algorithm is applied, leveraging view-averaged gradient magnitude thresholding for point densification, opacity thresholding for pruning, and regular all-points opacity… ▽ More

    Submitted 13 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2406.03716  [pdf, other

    physics.atom-ph

    Demonstration of a Mobile Optical Clock Ensemble at Sea

    Authors: E. Ahern, J. W. Allison, C. Billington, N. Bourbeau Hébert, A. P. Hilton, E. Klantsataya, C. Locke, A. N. Luiten, M. Nelligan, R. F. Offer, C. Perrella, S. K. Scholten, B. White, B. M. Sparkes, R. Beard, J. D. Elgin, K. W. Martin

    Abstract: Atomic clocks have been at the leading edge of accuracy and precision since their inception in the 1950s. However, typically the most capable of these clocks have been confined to laboratories despite the fact that there are compelling reasons to apply them in the field and/or while in motion. These applications include synchronization of distributed critical infrastructure (e.g. data servers, com… ▽ More

    Submitted 21 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  6. arXiv:2405.10690  [pdf, other

    cs.CV

    CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing

    Authors: Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton

    Abstract: Weakly supervised audio-visual video parsing (AVVP) methods aim to detect audible-only, visible-only, and audible-visible events using only video-level labels. Existing approaches tackle this by leveraging unimodal and cross-modal contexts. However, we argue that while cross-modal learning is beneficial for detecting audible-visible events, in the weakly supervised scenario, it negatively impacts… ▽ More

    Submitted 7 July, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted at ECCV 2024

  7. arXiv:2403.10357  [pdf, other

    cs.CV cs.GR

    ANIM: Accurate Neural Implicit Model for Human Reconstruction from a single RGB-D image

    Authors: Marco Pesavento, Yuanlu Xu, Nikolaos Sarafianos, Robert Maier, Ziyan Wang, Chun-Han Yao, Marco Volino, Edmond Boyer, Adrian Hilton, Tony Tung

    Abstract: Recent progress in human shape learning, shows that neural implicit models are effective in generating 3D human surfaces from limited number of views, and even from a single RGB image. However, existing monocular approaches still struggle to recover fine geometric details such as face, hands or cloth wrinkles. They are also easily prone to depth ambiguities that result in distorted geometries alon… ▽ More

    Submitted 18 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR24; Project page: https://marcopesavento.github.io/ANIM/

  8. arXiv:2310.16754  [pdf, other

    cs.CV

    CAD -- Contextual Multi-modal Alignment for Dynamic AVQA

    Authors: Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa

    Abstract: In the context of Audio Visual Question Answering (AVQA) tasks, the audio visual modalities could be learnt on three levels: 1) Spatial, 2) Temporal, and 3) Semantic. Existing AVQA methods suffer from two major shortcomings; the audio-visual (AV) information passing through the network isn't aligned on Spatial and Temporal levels; and, inter-modal (audio and visual) Semantic information is often n… ▽ More

    Submitted 27 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  9. arXiv:2308.05051  [pdf, other

    cs.CV

    PAT: Position-Aware Transformer for Dense Multi-Label Action Detection

    Authors: Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton

    Abstract: We present PAT, a transformer-based network that learns complex temporal co-occurrence action dependencies in a video by exploiting multi-scale temporal features. In existing methods, the self-attention mechanism in transformers loses the temporal positional information, which is essential for robust action detection. To address this issue, we (i) embed relative positional encoding in the self-att… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  10. End-to-End Latency Optimization of Multi-view 3D Reconstruction for Disaster Response

    Authors: Xiaojie Zhang, Mingjun Li, Andrew Hilton, Amitangshu Pal, Soumyabrata Dey, Saptarshi Debroy

    Abstract: In order to plan rapid response during disasters, first responder agencies often adopt `bring your own device' (BYOD) model with inexpensive mobile edge devices (e.g., drones, robots, tablets) for complex video analytics applications, e.g., 3D reconstruction of a disaster scene. Unlike simpler video applications, widely used Multi-view Stereo (MVS) based 3D reconstruction applications (e.g., openM… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 2022 10th IEEE International Conference on Mobile Cloud Computing, Services, and Engineering (MobileCloud)

  11. arXiv:2303.14829  [pdf, other

    cs.CV

    SEM-POS: Grammatically and Semantically Correct Video Captioning

    Authors: Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa

    Abstract: Generating grammatically and semantically correct captions in video captioning is a challenging task. The captions generated from the existing methods are either word-by-word that do not align with grammatical structure or miss key information from the input videos. To address these issues, we introduce a novel global-local fusion network, with a Global-Local Fusion Block (GLFB) that encodes and f… ▽ More

    Submitted 4 April, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  12. Wavefront Curvature in Optical Atomic Beam Clocks

    Authors: A. Strathearn, R. F. Offer, A. P. Hilton, E. Klantsataya, A. N. Luiten, R. P. Anderson, B. M. Sparkes, T. M. Stace

    Abstract: Atomic clocks provide a reproducible basis for our understanding of time and frequency. Recent demonstrations of compact optical clocks, employing thermal atomic beams, have achieved short-term fractional frequency instabilities in the $10^{-16}$, competitive with the best international frequency standards available. However, a serious challenge inherent in compact clocks is the necessarily smalle… ▽ More

    Submitted 24 January, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: 13 pages, 7 figures

  13. arXiv:2208.10738  [pdf, other

    cs.CV

    Super-resolution 3D Human Shape from a Single Low-Resolution Image

    Authors: Marco Pesavento, Marco Volino, Adrian Hilton

    Abstract: We propose a novel framework to reconstruct super-resolution human shape from a single low-resolution input image. The approach overcomes limitations of existing approaches that reconstruct 3D human shape from a single image, which require high-resolution images together with auxiliary data such as surface normal or a parametric model to reconstruct high-detail shape. The proposed framework repres… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

  14. arXiv:2203.03291  [pdf, other

    eess.AS cs.SD eess.IV

    Visually Supervised Speaker Detection and Localization via Microphone Array

    Authors: Davide Berghi, Adrian Hilton, Philip J. B. Jackson

    Abstract: Active speaker detection (ASD) is a multi-modal task that aims to identify who, if anyone, is speaking from a set of candidates. Current audio-visual approaches for ASD typically rely on visually pre-extracted face tracks (sequences of consecutive face crops) and the respective monaural audio. However, their recall rate is often low as only the visible faces are included in the set of candidates.… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: Erratum: Due to a bug in the evaluation script, the correct average distance (aD) metric is here reported in yellow. The analysis remains unchanged from the original paper as the trend between the old and new measures are perfectly monotonic. The bug was caused by an incorrect normalization factor

    Journal ref: IEEE 23rd International Workshop on Multimedia Signal Processing (MMSP), 2021

  15. arXiv:2108.13739  [pdf, other

    cs.CV cs.LG cs.MM

    Super-Resolution Appearance Transfer for 4D Human Performances

    Authors: Marco Pesavento, Marco Volino, Adrian Hilton

    Abstract: A common problem in the 4D reconstruction of people from multi-view video is the quality of the captured dynamic texture appearance which depends on both the camera resolution and capture volume. Typically the requirement to frame cameras to capture the volume of a dynamic performance ($>50m^3$) results in the person occupying only a small proportion $<$ 10% of the field of view. Even with ultra h… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

  16. arXiv:2108.13697  [pdf, other

    cs.CV cs.AI cs.LG

    Attention-based Multi-Reference Learning for Image Super-Resolution

    Authors: Marco Pesavento, Marco Volino, Adrian Hilton

    Abstract: This paper proposes a novel Attention-based Multi-Reference Super-resolution network (AMRSR) that, given a low-resolution image, learns to adaptively transfer the most similar texture from multiple reference images to the super-resolution output whilst maintaining spatial coherence. The use of multiple reference images together with attention-based sampling is demonstrated to achieve significantly… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

  17. arXiv:2108.00249  [pdf, other

    cs.CV cs.AI cs.GR

    SyDog: A Synthetic Dog Dataset for Improved 2D Pose Estimation

    Authors: Moira Shooter, Charles Malleson, Adrian Hilton

    Abstract: Estimating the pose of animals can facilitate the understanding of animal motion which is fundamental in disciplines such as biomechanics, neuroscience, ethology, robotics and the entertainment industry. Human pose estimation models have achieved high performance due to the huge amount of training data available. Achieving the same results for animal pose estimation is challenging due to the lack… ▽ More

    Submitted 31 July, 2021; originally announced August 2021.

    Comments: 5 pages, 1 figure, Poster presentation at the Computer Vision for Animal Behavior Tracking and Modeling (CV4Animals:) Workshop in conjunction with CVPR 2021

  18. arXiv:2105.06828  [pdf

    physics.app-ph

    Spalling-induced liftoff and transfer of electronic films using a van der Waals release layer

    Authors: Eric W. Blanton, Michael J. Motala, Timothy A. Prusnick, Albert Hilton, Jeff L. Brown, Arkka Bhattacharyya, Sriram Krishnamoorthy, Kevin Leedy, Nicholas R. Glavin, Michael Snure

    Abstract: Heterogeneous integration strategies are increasingly being employed to achieve more compact and capable electronics systems for multiple applications including space, electric vehicles, and wearable and medical devices. To enable new integration strategies, the growth and transfer of thin electronic films and devices, including III-nitrides, metal oxides, and two-dimensional (2D) materials, using… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: 11 pages, 4 figures

  19. arXiv:2104.09283  [pdf, other

    cs.CV

    Multi-person Implicit Reconstruction from a Single Image

    Authors: Armin Mustafa, Akin Caliskan, Lourdes Agapito, Adrian Hilton

    Abstract: We present a new end-to-end learning framework to obtain detailed and spatially coherent reconstructions of multiple people from a single image. Existing multi-person methods suffer from two main drawbacks: they are often model-based and therefore cannot capture accurate 3D models of people with loose clothing and hair; or they require manual intervention to resolve occlusions or interactions. Our… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: To appear in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021

  20. arXiv:2104.09259  [pdf, other

    cs.CV

    Temporal Consistency Loss for High Resolution Textured and Clothed 3DHuman Reconstruction from Monocular Video

    Authors: Akin Caliskan, Armin Mustafa, Adrian Hilton

    Abstract: We present a novel method to learn temporally consistent 3D reconstruction of clothed people from a monocular video. Recent methods for 3D human reconstruction from monocular video using volumetric, implicit or parametric human shape models, produce per frame reconstructions giving temporally inconsistent output and limited performance when applied to video. In this paper, we introduce an approach… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: To appear in Dynavis Workshop, CVPR 2021

  21. arXiv:2009.14162  [pdf, other

    cs.CV

    Multi-View Consistency Loss for Improved Single-Image 3D Reconstruction of Clothed People

    Authors: Akin Caliskan, Armin Mustafa, Evren Imre, Adrian Hilton

    Abstract: We present a novel method to improve the accuracy of the 3D reconstruction of clothed human shape from a single image. Recent work has introduced volumetric, implicit and model-based shape learning frameworks for reconstruction of objects and people from one or more images. However, the accuracy and completeness for reconstruction of clothed people is limited due to the large variation in shape re… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: Accepted to Asian Conference on Computer Vision 2020 (ACCV)

  22. arXiv:2009.05235  [pdf, ps, other

    cs.CV

    Spectral Analysis Network for Deep Representation Learning and Image Clustering

    Authors: **ghua Wang, Adrian Hilton, Jianmin Jiang

    Abstract: Deep representation learning is a crucial procedure in multimedia analysis and attracts increasing attention. Most of the popular techniques rely on convolutional neural network and require a large amount of labeled data in the training procedure. However, it is time consuming or even impossible to obtain the label information in some tasks due to cost limitation. Thus, it is necessary to develop… ▽ More

    Submitted 11 September, 2020; originally announced September 2020.

    Journal ref: ICME2019

  23. arXiv:2004.01848  [pdf, other

    math.CO

    Bounds Related to The Edge-List Chromatic and Total Chromatic Numbers of a Simple Graph

    Authors: M. Henderson, A. J. W. Hilton, R. Mary Jeya Jothi

    Abstract: We show that for a simple graph $G$, $c'(G)\leqΔ(G)+2$ where $c'(G)$ is the choice index (or edge-list chromatic number) of $G$, and $Δ(G)$ is the maximum degree of $G$. As a simple corollary of this result, we show that the total chromatic number $χ_T(G)$ of a simple graph satisfies the inequality $χ_T(G)\leq\ Δ(G)+4$ and the total choice number $c_T(G)$ also satisfies this inequality. We als… ▽ More

    Submitted 8 March, 2022; v1 submitted 4 April, 2020; originally announced April 2020.

    MSC Class: 05C15

  24. arXiv:2003.06656  [pdf, other

    eess.AS cs.SD eess.IV

    Audio-Visual Spatial Aligment Requirements of Central and Peripheral Object Events

    Authors: Davide Berghi, Hanne Stenzel, Marco Volino, Adrian Hilton, Philip J. B. Jackson

    Abstract: Immersive audio-visual perception relies on the spatial integration of both auditory and visual information which are heterogeneous sensing modalities with different fields of reception and spatial resolution. This study investigates the perceived coherence of audiovisual object events presented either centrally or peripherally with horizontally aligned/misaligned sound. Various object events were… ▽ More

    Submitted 14 March, 2020; originally announced March 2020.

    Comments: Two-pages poster abstract

    Journal ref: IEEE VR 2020

  25. arXiv:2002.02839  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Transferrable AlGaN/GaN HEMTs to Arbitrary Substrates via a Two-dimensional Boron Nitride Release Layer

    Authors: Michael J. Motala, Eric Blanton, Al Hilton, Eric Heller, Chris Muratore, Katherine Burzynski, Jeff Brown, Kelson Chabak, Michael Durstock, Michael Snure, Nicholas Glavin

    Abstract: Mechanical transfer of high performing thin film devices onto arbitrary substrates represents an exciting opportunity to improve device performance, explore non-traditional manufacturing approaches, and paves the way for soft, conformal, and flexible electronics. Using a two-dimensional (2D) boron nitride (BN) release layer, we demonstrate the transfer of AlGaN/GaN high-electron mobility transisto… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 21 pages, 4 figures

  26. arXiv:1911.03926  [pdf, ps, other

    cs.PL

    Gemini: A Functional Programming Language for Hardware Description

    Authors: Aditya Srinivasan, Andrew D. Hilton

    Abstract: This paper presents Gemini, a functional programming language for hardware description that provides features such as parametric polymorphism, recursive datatypes, higher-order functions, and type inference for higher expressivity compared to modern hardware description languages. Gemini demonstrates the theory and implementation of novel type-theoretical concepts through its unique type system co… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

  27. Light-shift spectroscopy of optically trapped atomic ensembles

    Authors: Ashby P. Hilton, Andre N. Luiten, Philip S. Light

    Abstract: We develop a method for extracting the physical parameters of interest for a dipole trapped cold atomic ensemble. This technique uses the spatially dependent ac-Stark shift of the trap itself to project the atomic distribution onto a light-shift broadened transmission spectrum. We develop a model that connects the atomic distribution with the expected transmission spectrum. We then demonstrate the… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: Pre-print of journal article

  28. arXiv:1910.01241  [pdf, other

    cs.CV

    Learning Dense Wide Baseline Stereo Matching for People

    Authors: Akin Caliskan, Armin Mustafa, Evren Imre, Adrian Hilton

    Abstract: Existing methods for stereo work on narrow baseline image pairs giving limited performance between wide baseline views. This paper proposes a framework to learn and estimate dense stereo for people from wide baseline image pairs. A synthetic people stereo patch dataset (S2P2) is introduced to learn wide baseline dense stereo matching for people. The proposed framework not only learns human specifi… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: To appear in 3D Reconstruction in the Wild Workshop, ICCV 2019

  29. arXiv:1909.02693  [pdf, other

    physics.optics physics.ins-det

    Heterodyne fiber interferometer for frequency-noise reduction and rapid wide-band tunability of a conventional laser source

    Authors: Ashby P. Hilton, Philip S. Light, Lauris J. B. Talbot, Andre N. Luiten

    Abstract: Self-heterodyne fiber interferometers have been shown to be capable of stabilizing lasers to ultra-narrow linewidths and present an excellent alternative to high finesse cavities for frequency stabilization. In addition to suppressing frequency noise, these devices are highly tunable, and can be manipulated to produce high speed frequency sweeps over the entire range of the laser. We present an an… ▽ More

    Submitted 5 September, 2019; originally announced September 2019.

    Comments: Presubmission journal article

  30. arXiv:1908.03030  [pdf, other

    cs.CV

    Semantic Estimation of 3D Body Shape and Pose using Minimal Cameras

    Authors: Andrew Gilbert, Matthew Trumble, Adrian Hilton, John Collomosse

    Abstract: We aim to simultaneously estimate the 3D articulated pose and high fidelity volumetric occupancy of human performance, from multiple viewpoint video (MVV) with as few as two views. We use a multi-channel symmetric 3D convolutional encoder-decoder with a dual loss to enforce the learning of a latent embedding that enables inference of skeletal joint positions and a volumetric reconstruction of the… ▽ More

    Submitted 7 September, 2020; v1 submitted 8 August, 2019; originally announced August 2019.

  31. EdgeNet: Semantic Scene Completion from a Single RGB-D Image

    Authors: Aloisio Dourado, Teofilo Emidio de Campos, Hansung Kim, Adrian Hilton

    Abstract: Semantic scene completion is the task of predicting a complete 3D representation of volumetric occupancy with corresponding semantic labels for a scene from a single point of view. Previous works on Semantic Scene Completion from RGB-D data used either only depth or depth with colour by projecting the 2D image into the 3D volume resulting in a sparse data representation. In this work, we present a… ▽ More

    Submitted 6 September, 2020; v1 submitted 7 August, 2019; originally announced August 2019.

    Comments: 10 pages, 5 figures Accepted at ICPR 2020

    ACM Class: I.4.6; I.4.8

  32. arXiv:1907.09905  [pdf, other

    cs.CV

    U4D: Unsupervised 4D Dynamic Scene Understanding

    Authors: Armin Mustafa, Chris Russell, Adrian Hilton

    Abstract: We introduce the first approach to solve the challenging problem of unsupervised 4D visual scene understanding for complex dynamic scenes with multiple interacting people from multi-view video. Our approach simultaneously estimates a detailed model that includes a per-pixel semantically and temporally coherent reconstruction, together with instance-level segmentation exploiting photo-consistency,… ▽ More

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: To appear in IEEE International Conference in Computer Vision ICCV 2019

  33. arXiv:1907.08195  [pdf, other

    cs.CV

    Temporally Coherent General Dynamic Scene Reconstruction

    Authors: Armin Mustafa, Marco Volino, Hansung Kim, Jean-Yves Guillemaut, Adrian Hilton

    Abstract: Existing techniques for dynamic scene reconstruction from multiple wide-baseline cameras primarily focus on reconstruction in controlled environments, with fixed calibrated cameras and strong prior constraints. This paper introduces a general approach to obtain a 4D representation of complex dynamic scenes from multi-view wide-baseline static or moving cameras without prior knowledge of the scene… ▽ More

    Submitted 3 August, 2020; v1 submitted 18 July, 2019; originally announced July 2019.

    Comments: Submitted to IJCV 2019. arXiv admin note: substantial text overlap with arXiv:1603.03381

  34. arXiv:1902.05381  [pdf, other

    math.CO

    The simple graph threshold number $σ(r,s,a,t)$

    Authors: A. J. W. Hilton, A. Rajkumar

    Abstract: For $d \ge 1$, $s \ge 0$ a $(d, d+s)$-{\em graph} is a graph whose degrees all lie in the interval $\{d, d+1, \ldots, d + s\}$. For $r \ge 1$, $a \ge 0$, an $(r, r+a)$-{\em factor} of a graph $G$ is a spanning $(r, r+a)$-subgraph of $G$. An $(r, r+a)$-{\em factorization} of a graph $G$ is a decomposition of $G$ into edge-disjoint $(r, r+a)$-factors. A graph is $(r, r+a)$-{\em factorable} if it has… ▽ More

    Submitted 14 February, 2019; originally announced February 2019.

    Comments: 38 pages, 4 figures

    MSC Class: 05C70

  35. Dual-colour magic-wavelength trap for suppression of light shifts in atoms

    Authors: Ashby P. Hilton, Christopher Perrella, Andre N. Luiten, Philip S. Light

    Abstract: We present an optical approach to compensating for spatially varying ac-Stark shifts that appear on atomic ensembles subject to strong optical control or trap** fields. The introduction of an additional weak light field produces an intentional perturbation between atomic states that is tuned to suppress the influence of the strong field. The compensation field suppresses sensitivity in one of th… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

    Journal ref: Phys. Rev. Applied 11, 024065 (2019)

  36. arXiv:1807.01950  [pdf, other

    cs.CV

    Volumetric performance capture from minimal camera viewpoints

    Authors: Andrew Gilbert, Marco Volino, John Collomosse, Adrian Hilton

    Abstract: We present a convolutional autoencoder that enables high fidelity volumetric reconstructions of human performance to be captured from multi-view video comprising only a small set of camera views. Our method yields similar end-to-end reconstruction error to that of a probabilistic visual hull computed using significantly more (double or more) viewpoints. We use a deep prior implicitly learned by th… ▽ More

    Submitted 10 July, 2018; v1 submitted 5 July, 2018; originally announced July 2018.

  37. arXiv:1807.01511  [pdf, other

    cs.CV

    Deep Autoencoder for Combined Human Pose Estimation and body Model Upscaling

    Authors: Matthew Trumble, Andrew Gilbert, Adrian Hilton, John Collomosse

    Abstract: We present a method for simultaneously estimating 3D human pose and body shape from a sparse set of wide-baseline camera views. We train a symmetric convolutional autoencoder with a dual loss that enforces learning of a latent representation that encodes skeletal joint positions, and at the same time learns a deep representation of volumetric body shape. We harness the latter to up-scale input vol… ▽ More

    Submitted 4 July, 2018; originally announced July 2018.

  38. arXiv:1804.11276  [pdf, other

    cs.CV

    4D Temporally Coherent Light-field Video

    Authors: Armin Mustafa, Marco Volino, Jean-yves Guillemaut, Adrian Hilton

    Abstract: Light-field video has recently been used in virtual and augmented reality applications to increase realism and immersion. However, existing light-field methods are generally limited to static scenes due to the requirement to acquire a dense scene representation. The large amount of data and the absence of methods to infer temporal coherence pose major challenges in storage, compression and editing… ▽ More

    Submitted 30 April, 2018; originally announced April 2018.

    Comments: Published in 3D Vision (3DV) 2017

  39. High-efficiency cold-atom transport into a waveguide trap

    Authors: Ashby P. Hilton, Christopher Perrella, Fetah Benabid, Ben M. Sparkes, Andre N. Luiten, Philip S. Light

    Abstract: We have developed and characterized an atom-guiding technique that loads $3\times10^6$ cold rubidium atoms into hollow-core optical fibre, an order-of-magnitude larger than previously reported results. This result was possible because it was guided by a physically realistic simulation that could provide the specifications for loading efficiencies of 3% and a peak optical depth of 600. The simulati… ▽ More

    Submitted 30 October, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

    Journal ref: Phys. Rev. Applied 10, 044034 (2018)

  40. arXiv:1802.04735  [pdf, other

    cs.CV

    Semantic Scene Completion Combining Colour and Depth: preliminary experiments

    Authors: Andre Bernardes Soares Guedes, Teofilo Emidio de Campos, Adrian Hilton

    Abstract: Semantic scene completion is the task of producing a complete 3D voxel representation of volumetric occupancy with semantic labels for a scene from a single-view observation. We built upon the recent work of Song et al. (CVPR 2017), who proposed SSCnet, a method that performs scene completion and semantic labelling in a single end-to-end 3D convolutional network. SSCnet uses only depth maps as inp… ▽ More

    Submitted 13 February, 2018; originally announced February 2018.

    Comments: 5 pages, 2 figures

  41. arXiv:1708.07218  [pdf

    cs.SD

    Object-Based Audio Rendering

    Authors: Philip Jackson, Filippo Fazi, Frank Melchior, Trevor Cox, Adrian Hilton, Chris Pike, Jon Francombe, Andreas Franck, Philip Coleman, Dylan Menzies-Gow, James Woodcock, Yan Tang, Qingju Liu, Rick Hughes, Marcos Simon Galvez, Teo de Campos, Hansung Kim, Hanne Stenzel

    Abstract: Apparatus and methods are disclosed for performing object-based audio rendering on a plurality of audio objects which define a sound scene, each audio object comprising at least one audio signal and associated metadata. The apparatus comprises: a plurality of renderers each capable of rendering one or more of the audio objects to output rendered audio data; and object adapting means for adapting o… ▽ More

    Submitted 23 August, 2017; originally announced August 2017.

    Comments: This is a transcript of GB Patent Application No: GB1609316.3, filed in the UK by the University of Surrey on 23 May 2016. It describes an intelligent system for customising, personalising and perceptually monitoring the rendering of an object-based audio stream for an arbitrary connected system of loudspeakers to optimize the listening experience as the producer intended. 30 pages, 5 figures

  42. arXiv:1610.02027  [pdf

    physics.ins-det

    Drift-compensated Low-noise Frequency Synthesis Based on a cryoCSO for the KRISS-F1

    Authors: Myoung-Sun Heo, Sang Eon Park, Won-Kyu Lee, Sang-Bum Lee, Hyun-Gue Hong, Taeg Yong Kwon, Chang Yong Park, Dai-Hyuk Yu, G. Santarelli, Ashby Hilton, Andre N. Luiten, John G. Hartnett

    Abstract: In this paper we report on the implementation and stability analysis of a drift-compensated frequency synthesizer from a cryogenic sapphire oscillator (CSO) designed for a Cs/Rb atomic fountain clock. The synthesizer has two microwave outputs of 7 GHz and 9 GHz for Rb and Cs atom interrogation, respectively. The short-term stability of these microwave signals, measured using an optical frequency c… ▽ More

    Submitted 6 October, 2016; originally announced October 2016.

    Comments: 8 pages, 6 figures

  43. arXiv:1608.00571  [pdf

    cs.DC cs.OS cs.PL

    TREES: A CPU/GPU Task-Parallel Runtime with Explicit Epoch Synchronization

    Authors: Blake A. Hechtman, Andrew D. Hilton, Daniel J. Sorin

    Abstract: We have developed a task-parallel runtime system, called TREES, that is designed for high performance on CPU/GPU platforms. On platforms with multiple CPUs, Cilk's "work-first" principle underlies how task-parallel applications can achieve performance, but work-first is a poor fit for GPUs. We build upon work-first to create the "work-together" principle that addresses the specific strengths and w… ▽ More

    Submitted 1 August, 2016; originally announced August 2016.

  44. arXiv:1603.03381  [pdf, other

    cs.CV

    Temporally coherent 4D reconstruction of complex dynamic scenes

    Authors: Armin Mustafa, Hansung Kim, Jean-Yves Guillemaut, Adrian Hilton

    Abstract: This paper presents an approach for reconstruction of 4D temporally coherent models of complex dynamic scenes. No prior knowledge is required of scene structure or camera calibration allowing reconstruction from multiple moving cameras. Sparse-to-dense temporal correspondence is integrated with joint multi-view segmentation and reconstruction to obtain a complete 4D representation of static and dy… ▽ More

    Submitted 28 March, 2016; v1 submitted 10 March, 2016; originally announced March 2016.

    Comments: To appear in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016 . Video available at: https://www.youtube.com/watch?v=bm_P13_-DsQ

  45. arXiv:1509.09294  [pdf, other

    cs.CV

    General Dynamic Scene Reconstruction from Multiple View Video

    Authors: Armin Mustafa, Hansung Kim, Jean-Yves Guillemaut, Adrian Hilton

    Abstract: This paper introduces a general approach to dynamic scene reconstruction from multiple moving cameras without prior knowledge or limiting constraints on the scene structure, appearance, or illumination. Existing techniques for dynamic scene reconstruction from multiple wide-baseline camera views primarily focus on accurate reconstruction in controlled environments, where the cameras are fixed and… ▽ More

    Submitted 30 September, 2015; originally announced September 2015.

  46. arXiv:1107.2639  [pdf, ps, other

    math.CO

    Hall's Condition for Partial Latin Squares

    Authors: A. J. W. Hilton, E. R. Vaughan

    Abstract: Hall's Condition is a necessary condition for a partial latin square to be completable. Hilton and Johnson showed that for a partial latin square whose filled cells form a rectangle, Hall's Condition is equivalent to Ryser's Condition, which is a necessary and sufficient condition for completability. We give what could be regarded as an extension of Ryser's Theorem, by showing that for a partial… ▽ More

    Submitted 13 July, 2011; originally announced July 2011.

    Comments: 23 pages; 9 figures

    MSC Class: 05B15

  47. arXiv:1107.2634  [pdf, ps, other

    math.CO

    An analogue of Ryser's Theorem for partial Sudoku squares

    Authors: P. J. Cameron, A. J. W. Hilton, E. R. Vaughan

    Abstract: In 1956 Ryser gave a necessary and sufficient condition for a partial latin rectangle to be completable to a latin square. In 1990 Hilton and Johnson showed that Ryser's condition could be reformulated in terms of Hall's Condition for partial latin squares. Thus Ryser's Theorem can be interpreted as saying that any partial latin rectangle $R$ can be completed if and only if $R$ satisfies Hall's Co… ▽ More

    Submitted 14 July, 2011; v1 submitted 13 July, 2011; originally announced July 2011.

    Comments: 19 pages, 10 figures

    MSC Class: 05B15